Skip to content
Ercan Ermis
Ercan Ermis

notes for everyone

  • AWS
  • GCP
  • Docker
  • Linux
  • DevOps
  • Privacy Policy
  • Contact
Ercan Ermis

notes for everyone

AWS re:Invent 2025: The “Agentic” Era

Ercan, December 14, 2025February 2, 2026

If 2024 was about talking to LLMs, re:Invent 2025 was about letting them actually do the work. Here is the comprehensive breakdown of the most significant announcements.

1. The Amazon Nova 2 Model Family

AWS didn’t just update their models; they built a specialized fleet for different agentic roles:

  • Nova 2 Lite: Optimized for speed and cost. Equal or better than Gemini Flash 2.5 on 14/18 benchmarks.
  • Nova 2 Pro: The “Reasoning” heavy-lifter. Best for complex multi-step tasks and long-range planning.
  • Nova 2 Sonic: A speech-to-speech model for low-latency conversational AI.
  • Nova 2 Omni: The true multimodal star. It processes text, images, video, and speech simultaneously with a 1M token context window.
  • Nova Act: Generally Available and purpose-built for UI automation (browser-based tasks) with >90% reliability.

2. Custom Silicon: Graviton5 & Trainium3

The hardware story was about decoupling performance from cost:

  • Graviton5: 25% faster than Graviton4, with 192 cores and a 5x larger L3 cache. It introduces the Nitro Isolation Engine, using formal verification to provide mathematical proof of workload isolation.
  • Trainium3 & Trainium4: AWS announced Trn3 UltraServers and teased Trainium4, focusing on doubling the density of AI compute.

3. Frontier Agents: Your New Virtual Team

AWS launched three specialized “Frontier Agents” that work autonomously for days:

  • Kiro: An autonomous developer that learns your codebase and handles features from end-to-end.
  • AWS Security Agent: A 24/7 proactive security consultant that performs design reviews and automated penetration testing.
  • AWS DevOps Agent: An “Autonomous SRE” that resolves incidents and monitors application resilience proactively.

4. Infrastructure & Database Wins

  • Database Savings Plans: One flexible commitment for RDS, Aurora, DynamoDB, and more.
  • P6e-GB300 UltraServers: Generally Available, offering 1.5x GPU memory and FP4 compute for the heaviest inference tasks.
  • S3 Batch Operations: Now 10x faster, making massive data migrations nearly instantaneous.
Share on Social Media
xfacebooklinkedinreddit
AWS

Post navigation

Previous post
Next post
  • AI (0)
  • AWS (60)
    • Serverless (0)
  • Best (9)
  • DevOps (16)
  • Docker (10)
  • GCP (3)
  • Linux (13)
  • Uncategorized (9)

Recent Posts

  • I dropped my Google Pixel 9 XL Pro from 6th floor balcony to the street
  • I Built TrumpDaily to Track Donald Trump Without the Noise
  • AWS Monthly (Dec ’25): The Kiro Era Begins
  • AWS re:Invent 2025: The “Agentic” Era
  • When Spotify’s Share-to-Instagram Flow Turns Into a Free Billboard
  • AWS Monthly (Nov ’25) The Stateful Serverless Revolution
  • AWS Monthly (Oct ’25): Industrializing AI Training
  • When the Cloud Sneezes, the World Catches a Cold – Lessons from the us-east-1 Meltdown
  • AWS Monthly (Sep ’25): Vega OS & eBPF Observability
  • AWS Monthly (Aug ’25): Big Data, Zero Effort
  • AWS Monthly (July ’25): Kubernetes at the Edge of Sanity
  • AWS Monthly (June ’25): S3 Becomes Your Vector DB
  • AWS Monthly (May ’25): The Death of the War Room
  • Automating AWS CloudWatch Log Group Tagging with Python and Boto3
  • Automating AWS ECR Tagging with Python and Boto3
©2026 Ercan Ermis | WordPress Theme by SuperbThemes