What’s New from Google Cloud Next '25: A Quick Summary of Key Highlights

Written by Pitcha Gunteethong | May 2, 2025 2:00:00 AM

Catch the highlights summary Google Cloud Announced at Next '25 below:

⭐ New and Enhanced AI Models (Vertex AI Model Garden)

Gemini 2.5 Pro – High-performance model for deep reasoning and coding (now in public preview).
Gemini 2.5 Flash – Low-latency, cost-efficient model (coming soon).
Imagen 3 – Best text-to-image model with advanced inpainting.
Chirp 3 – Audio model with Instant Custom Voice (10-second input).
Lyria – First enterprise-ready text-to-music model.
Veo 2 – Enhanced video generation with editing and camera control.
Llama 4 (Meta) – Now available on Vertex AI.
AI2 Models – Open models from AI2 now in the Model Garden.

Vertex AI is now the only hyperscaler with generative models across video, image, speech, and music.

⭐ Tools for Model Management

Vertex AI Dashboards – Monitor usage, latency, and errors.
Model Customization – Fine-tune first- and third-party models securely.
Model Optimizer – Balances quality vs. cost for responses.
Live API – Enables real-time audio/video input for Gemini.
Global Endpoint – Ensures responsiveness with regional load balancing.

⭐ Multi-Agent Systems Enhancements

Agent Development Kit (ADK) – Open-source framework to build agent systems using Model Context Protocol (MCP).
Agent2Agent Protocol (A2A) – Open standard for agent communication across platforms.
Agent Garden – Pre-built agent samples to speed up development.
Agent Engine – Fully managed runtime for production deployment.
Google Maps Grounding – Adds geospatial awareness to agents.
Customer Engagement Suite – Emotion-aware agents, real-time video understanding, no-code agent builder.

⭐ Enterprise AI with Agentspace

Chrome Enterprise Integration – AI agents embedded in employee workflows.
Agent Gallery – Central hub for discovering enterprise agents.
Agent Designer – No-code tool to create task-based agents.
Idea Generation Agent – Autonomous ideation and evaluation.
Deep Research Agent – Synthesizes complex topics into reports.

⭐ Scientific & Research Applications

AlphaFold 3 – Protein folding at scale using Google Cloud.
WeatherNext Models – High-speed, accurate weather forecasting now available in Model Garden.

⭐ Next-Gen AI Hardware & Runtime

TPU v5p “Ironwood” – 7th-gen TPU coming in 2025, designed to efficiently power large inferential AI models.
Google Distributed Cloud + NVIDIA Blackwell – Gemini can now run locally in connected and air-gapped environments with support from Dell and NVIDIA.
Pathways on Cloud – Google DeepMind’s distributed AI runtime (used internally at Google) is now available on Google Cloud.

⭐ Optimized Inference & Serving

vLLM on TPU – PyTorch-based models using vLLM can now run on TPUs without code changes, supporting mixed TPU/GPU serving.
Dynamic Workload Scheduler – Adds support for Trillium, TPU v5e, NVIDIA B200 (A4), and H200 (A3 Ultra) via Flex Start mode; Calendar mode for TPUs coming soon.

⭐ Expanded GPU Offerings

A4 & A4X VMs – Powered by NVIDIA B200 and GB200 Blackwell GPUs; A4X in preview. Google Cloud is the first to offer both.
NVIDIA Vera Rubin GPUs – Coming soon, enabling up to 15 exaflops of FP4 inference performance per rack.

⭐ Accelerator Management at Scale

Cluster Director – Manages accelerator clusters with colocated VMs and topology-aware scheduling. Upcoming updates:
- Slurm integration
- Advanced observability (3600)
- Job continuity features
- Join preview for early access

⭐ Application Development Enhancements

Design & Management Tools
- Application Design Center (preview): Visual, collaborative canvas to design and deploy apps with inline code.
- Cloud Hub (preview): Central dashboard for app landscape insights, health, optimization, and support.
- App Hub: Now integrated with 20+ GCP products for app-centric visibility.
- Application Monitoring (preview): Auto-tags telemetry and enables app-aware alerts and dashboards.
- Cost Explorer (private preview): Granular visibility into app costs and usage efficiency.
- Gemini-Powered Developer Tools

Gemini Code Assist: Helps with dev tasks (migration, testing, docs) and integrates with Android Studio, Atlassian, Sentry, Snyk.
App Prototyping Agent: Converts app ideas into working prototypes (UI + backend) in Firebase Studio.
Gemini Cloud Assist: Embedded across design and operations tools (e.g., Observability, FinOps Hub, Security) to enhance infra design, troubleshooting, and cost optimization.
Developer Programs
Google Developer Program: Offers AI-assisted tools across all tiers. Premium ($75/month/seat) for enterprise teams is now in preview.

⭐ Compute Innovations

General-Purpose Compute
C4D & C4 VMs: High-performance VMs powered by AMD (C4D) and Intel (C4), with up to 288 vCPUs, DDR5 memory, Local SSD, and bare metal options.
Specialized Compute
H4D VMs (preview): HPC-optimized, delivering record-breaking per-core and memory bandwidth performance.
M4 VMs: Optimized for SAP HANA with 65% better price-performance over M3.
Z3 VMs (preview): Storage-optimized, up to 72TB Titanium SSDs with bare metal access.
Hybrid & Integrated Compute
Nutanix NC2 on GCP: Hybrid cloud platform now in public preview.
Google Cloud VMware Engine: Now supports 26 node shapes.
Infrastructure Upgrades
Titanium ML Adapter: Enhanced NIC integration for 3.2 Tbps GPU bandwidth.
Titanium Offload Processors: Connect GPU clusters to Jupiter fabric for scalable compute.
Managed Instance Groups (MIGs): Now manageable as a unit and support CUDs/reservation sharing with Vertex AI.

⭐ AI-Enhanced & Natural Language Capabilities

AlloyDB AI Upgrades:
- Natural language querying, vector search via ScaNN, and cross-attention reranking.
- New AI models: multimodal embeddings (text, images, video), Gemini Embeddings.
- Agentspace integration enables structured data search with AI agents.
- The new AI query engine supports natural language in SQL queries.

MCP Toolbox:
- Connects AI agents to enterprise databases using Model Context Protocol.

⭐ Database Compatibility & Migration

Firestore + MongoDB:
- MongoDB API compatibility with Firestore’s global scale and low latency.
Oracle on Google Cloud:
- Base Database Service and Exadata X11M (GA), now with CMEK support.
Database Migration Service (DMS):
- Now supports SQL Server → PostgreSQL (Cloud SQL & AlloyDB).

⭐ Performance, Availability & Management

Cloud SQL & AlloyDB on Axion (C4A): Uses Arm-based processors for better price-performance.
Database Center: GA release — unified, AI-powered database fleet management.
Spanner Updates: Vector search (GA), graph visualization (GA), Repeatable Read isolation (preview), MySQL migration tools.
Aiven for AlloyDB Omni: Fully managed AlloyDB Omni across AWS, Azure, and GCP (GA).

⭐ Bigtable, Memorystore & Firebase

Bigtable: Continuous materialized views (preview), Cassandra-compatible APIs & migration tools.
Memorystore for Valkey: Now GA, supports versions 7.2 & 8.0.
Firebase Data Connect: GA release — combines PostgreSQL’s reliability with GraphQL APIs and type-safe SDKs.

⭐ BigQuery Platform Innovations

Data Pipelines & Preparation:
- GA: Pipelines, Data preparation, Contribution analysis, Pipe syntax, Semantic search.
Data Quality & Detection:
- Preview: Anomaly detection, Knowledge engine powered by Gemini.
AI-Powered Enhancements:
- GA/Preview:
  - Natural language & real-world context querying via Gemini.
  - Intelligent SQL cells, scheduled analysis, native viz.
  - AI.GENERATE_TABLE, vector search index (ScaNN), TimesFM forecasting model.
  - Multimodal tables with ObjectRef type.
Governance & Cataloging:
- GA: Business glossary, Metastore, Universal catalog, Catalog metadata export, BigLake cataloging, Disaster recovery.
- Preview: Governance console, Iceberg table support.
Workload & Cost Management:
- GA: Advanced workload mgmt, Spend commit model.
- Preview: Fair slot sharing and better billing attribution.

⭐ AI & Natural Language for Analysts & Scientists

Colab Notebook Integration:
- Data science agent, intelligent SQL cells, native EDA, and AI code assist in DataFrames.
LLM & Model Integration:
- Supports Anthropic Claude, Llama, Mistral in BigQuery ML.
- TimesFM model, real-time Kafka pipelines, serverless Spark.
Location & Geospatial:
- Google Maps datasets + Earth Engine data now in BigQuery.

⭐ Looker Platform Updates

Conversational & AI Tools:
- Preview: Conversational Analytics (129), API (130), Code Interpreter.
- GA: Gemini features like formula/visualization assistants, slide generation.
Reporting Enhancements:
- New drag-and-drop reports with real-time collaboration.

DevOps for BI:
- SQL/LookML testing automation via Spectacles.dev.

⭐ Partner Ecosystem & Integrations

Notable Partner Launches:
- GrowthLoop: AI-driven marketing journeys.
- Informatica: Expanded governance/AI capabilities.
- Fivetran: Managed Data Lake Service with Iceberg/Delta.
- DBT: Integrated with BigQuery DataFrames & hosted on GCP.
- Datadog: Expanded BigQuery monitoring.

⭐ Firebase Updates

Firebase Studio (Preview):
A Gemini-powered, all-in-one cloud development environment for building full-stack AI apps. Includes Gemini Code Assist agents.
Genkit Framework:
Open-source AI framework now supports Python and Go. Firebase Studio includes templates to start building with it.
Vertex AI in Firebase:
Now supports Gemini Live API for conversational, multimodal app interactions (e.g., audio Q&A).
Firebase Data Connect (GA):
Production-ready PostgreSQL backend with GraphQL APIs and type-safe SDKs.
Firebase App Hosting (GA):
Git-centric hosting for full-stack web apps.
App Testing Agent (Preview):
Automatically generates and runs end-to-end tests in Firebase App Distribution to prep mobile apps for release.

⭐ Google Cloud Consulting

Agentspace Accelerator:
Prebuilt service to help integrate AI-powered search across organizational data.
Optimize with TPUs:
Helps migrate workloads to Google’s Tensor Processing Units for AI performance gains.
Oracle on Google Cloud:
Combines Oracle databases with Google Cloud’s AI & infra for improved performance.
Delivery Navigator (Preview):
Expanded to customers and partners—offering best practices and proven delivery methods for cloud implementations.

⭐ Network Infrastructure & Performance

Cloud WAN (GA):
Cross-Cloud Network backbone with 40% better performance and TCO savings.
400G Interconnect (Coming Soon):
4x bandwidth boost for Cloud and Cross-Cloud Interconnects.
30,000 GPU Support (Preview):
Enables massive AI clusters with non-blocking GPU configurations.
High-Performance RDMA Networking (GA):
Up to 3.2 Tbps GPU-to-GPU bandwidth with RDMA (186), and Zero-Trust RDMA security coming later in the year.

⭐ AI-Optimized Networking & Load Balancing

LLM Inference Optimizations:
Cloud Load Balancing now supports AI/LLM workloads across clouds/on-prem.
Inference Gateway & AI Security Integrations:
Combine with tools like Model Armor, NeMo Guardrails, and Palo Alto via Service Extensions.
WebAssembly-based Service Extensions (GA):
Plugin system now in Cloud Load Balancing; Cloud CDN support coming soon.

⭐ Content Delivery & Performance Enhancements

Cloud CDN Fast Cache Invalidation & TLS 1.3 0-RTT (Preview):
Faster content delivery and quicker resumed connections.

⭐ Service Management & Discovery

App Hub:
Streamlined service discovery, with cross-regional failover coming soon.
Private Service Connect for GKE (Coming 2025):
Publish multiple services in a single GKE cluster for broader access.

⭐ Security Enhancements

DNS Armor (Preview): Detects advanced DNS-based attacks like tunneling and DGAs.
Cloud Armor Hierarchical Policies: Granular protection enforcement.
Cloud NGFW Updates: New network types and firewall tags, plus Layer 7 domain filtering coming 2025.
Inline Network DLP (Preview): Real-time protection using third-party DLP (Symantec) via Secure Web Proxy and Load Balancer.
Network Security Integration (GA): Consistent hybrid/multi-cloud security policies without routing changes.
Imperva App Security: Now integrated with Cloud Load Balancing via Marketplace.

⭐ Security Innovations

Unified Security: A single platform combining threat detection, AI-driven security ops, Mandiant expertise, and more.
Security Agents:
- Alert Triage Agent: Automates alert investigation and decision-making.
  Malware Analysis Agent: Uses Code Insight to detect and analyze malicious code.
Google Security Operations: Enhanced data pipeline management for scale, cost, and compliance.
Cyber Insurance Expansion: Risk Protection Program now includes Beazley and Chubb.
Phishing Protection: Chrome Enterprise Premium uses Safe Browsing to defend against credential theft.
Mandiant Enhancements:
- Retainer: On-demand access to Mandiant’s services using prepaid credits.
- Partnerships: New recovery solution with Rubrik and Cohesity for cyberattack resilience.

⭐ Storage Innovations

Hyperdisk:
- Storage Pools: Now supports up to 5 PiB.
- Exapools: Industry-leading block storage at exabyte scale.
- ML: Integrates with GKE for fast data hydration.
Rapid Storage: Ultra-low latency cloud storage with 6 TB/s throughput and 20x faster data access.
Anywhere Cache: Reduces latency up to 70% by caching data near GPUs/TPUs.
Managed Lustre: High-performance file system with <1ms latency and massive throughput for AI.
Storage Intelligence: Uses LLMs to query metadata and provide actionable insights into data estates.

⭐ Startups

Lightspeed VC Partnership: Lightspeed-backed startups get up to $150K in Google Cloud credits.
Startup Perks Program: Early-stage startups get benefits from partners like Datadog, GitLab, NVIDIA, and more.
Vertex AI Credits: Additional $10K credits for using partner foundation models (e.g., Anthropic, Meta).

⭐ Google Workspace: AI-Powered Productivity

Help me Analyze: A feature in Google Sheets that automatically identifies insights from data, helping users make data-driven decisions.
Docs Audio Overview: Converts Google Docs into human-like audio summaries or podcast-style overviews of documents.
Workspace Flows: Automates daily tasks like managing approvals, organizing emails, researching customers, and summarizing agendas.

⭐Additional Launches

Cloud Run Updates:
- Cloud Run GPUs: Now generally available.
- High Availability Apps: Public preview of multi-region deployment.
- Kafka Consumer Worker Pools: Private preview for pull-based workloads.
SaaS Runtime: A lifecycle management service for building, deploying, and operating AI-infused SaaS services at scale.

View full post