We were thrilled to be in person at San Francisco's Moscone Center for the highly anticipated Google Cloud Next event, marking its first large-scale in-person gathering since 2019! The event started with a captivating keynote delivered by Google CEO Sundar Pichai, who delved into the remarkable advancements Google has made in its AI technologies to enhance their usefulness and responsibility. Subsequently, Google Cloud CEO Thomas Kurian shed light on how customers are leveraging our AI capabilities, infrastructure, and tools to revolutionize their business operations and provide unparalleled experiences to their customers.
Let's take a look at some of the exciting moments from the Keynotes and Spotlight sessions that took place on day one.
From the very first day of Google Cloud Next, it's clear that Google's commitment to AI is prominent. Google proudly unveiled its always-on AI assistant, Duet AI, which seamlessly integrates with Google Workspace and Google Cloud. We've also made significant updates to our AI platforms, introducing a plethora of new models and capabilities that empower you to create intelligent and easily manageable AI applications. Additionally, they have fine-tuned our existing services and infrastructure to fully leverage the power of AI, enabling you to train and deploy it more effectively.
What we announced:
Duet AI in Google Workspace is generally available, and Duet AI in Google Cloud is currently available in preview. In terms of the latter, users will have the capability to utilize Duet AI in various Google Cloud services such as BigQuery, Looker, Spanner, Database Migration Services, GKE, Mandiant Threat Intelligence, Chronicle Security Operations, and Security Command Center.
Vertex AI offers a comprehensive solution for creating generative AI applications, developing machine learning models, and implementing MLOps. Our recent updates include the introduction of new generative AI foundation models in Model Garden, such as Meta's Llama 2 and a pre-announcement of Anthropic's Claude 2. We have also made enhancements to our first-party models, PaLM 2, Codey, and Imagen. Additionally, we are excited to launch Colab Enterprise, a platform designed to streamline your data science workflows. With Vertex AI Extensions, you can now customize foundation models, integrate them with your data, and build powerful generative AI applications.
Experience lightning-fast and budget-friendly training for your AI workloads with the cutting-edge Cloud TPU v5e. Additionally, we are excited to announce that A3 VMs, powered by NVIDIA's H100 GPUs, are now generally available, empowering you with even greater performance capabilities.
As pioneers in the cloud industry, Google has introduced an innovative feature that sets us apart from other providers. Their cutting-edge technology, developed in collaboration with Google DeepMind SynthID, allows for the digital watermarking and verification of AI-generated images. This groundbreaking advancement empowers users to easily identify AI-generated content when browsing online.
To unlock the full potential of AI, it is crucial to have high-quality data for training. Recognizing this, Google has introduced a range of exciting enhancements and features to our data and data analytics solutions.
What they announced:
The new BigQuery Studio presents a single interface for data engineering, analytics, and predictive analysis. Additional features provide data teams with a simplified lakehouse experience, cross-cloud analytics, secure data sharing, and improved governance.
AlloyDB has recently introduced AlloyDB AI, an innovative feature that empowers users to create diverse AI experiences effortlessly. With this new addition, users can easily generate vector embeddings directly from their database and access data stored in various open-source gen AI tools like LangChain. Moreover, AlloyDB AI enables users to utilize both local models within AlloyDB and remote models in Vertex AI, providing a comprehensive AI solution for all your needs.
When it comes to AlloyDB, Google are committed to assisting you in seamlessly transitioning your legacy databases to the PostgreSQL-compatible database service. The new innovative Duet AI in Database Migration Service, currently in preview, utilizes AI-guided code conversion to automate the process of converting Oracle database code, including stored procedures, functions, triggers, packages, and custom PL/SQL code.
At the foundation of Google Cloud's extensive capabilities, including AI, lies a robust infrastructure and top-notch security offerings. Google's commitment to constant improvement allows us to continually enhance and build upon these core elements.
Key announcements
Google has recently unveiled a range of new services on our Google Distributed Cloud portfolio, including GDC Hosted and GDC Edge, all powered by state-of-the-art hardware.
The cutting-edge Cross-Cloud Network platform brings together a comprehensive range of products from Google Cloud's networking portfolio and its partners. This innovative platform is designed to streamline cloud connectivity, strengthen security measures, and enhance overall networking performance.
Mandiant Hunt for Chronicle offers expert-managed threat hunting services, leveraging the power of Chronicle data to uncover and mitigate attacker activity, ultimately minimizing the impact on businesses.
Google Compute Engine VMs benefit from Security Command Center's agentless vulnerability scanning, which is powered by Tenable. This powerful scanning tool can identify vulnerabilities in operating systems, software, and networks, ensuring enhanced security for your VMs.
Make sure you don't overlook the exciting developments in Google Cloud Firewall for network security, expanded processor support for Confidential Computing, the addition of Japan Regions support for Assured Workloads, and much more.
Google has made significant investments in our data centers and network over the past 25 years to support the advanced capabilities and wide-ranging applications of gen AI. With a global network of 38 cloud regions, the goal is to operate on carbon-free energy 24/7 by 2030, ensuring the most sophisticated and capable infrastructure for gen AI.
Their AI-optimized infrastructure has become the top choice for training and serving gen AI models. In fact, more than 70% of gen AI unicorns, such as AI21, Anthropic, Cohere, Jasper, MosaicML, Replit, Runway, and Typeface, are Google Cloud customers. Additionally, over half of all funded gen AI startups, including Copy.ai, CoRover, Elemental Cognition, Fiddler AI, Fireworks.ai, PromptlyAI, Quora, Synthesized, Writer, and many others, rely on Google's infrastructure.
To further support their customers, we are excited to announce key infrastructure advancements, including:
Cloud TPU v5e: Introducing the latest breakthrough in AI acceleration - the Cloud TPU v5e. This cutting-edge platform offers unparalleled cost-efficiency, versatility, and scalability. With the Cloud TPU v5e, customers can now seamlessly conduct large-scale AI training and inference on a single platform. This state-of-the-art technology scales to tens of thousands of chips and is meticulously optimized for maximum efficiency. In comparison to its predecessor, the Cloud TPU v4, the v5e delivers up to a 2x improvement in training performance per dollar and an impressive 2.5x improvement in inference performance per dollar.
A3 VMs with NVIDIA H100 GPU: Next month, organizations will have access to the powerful A3 VMs, which are equipped with NVIDIA's H100 GPU. These VMs have been specifically designed with advanced networking capabilities to cater to the demands of cutting-edge gen AI and large language model (LLM) innovations. With these high-performance VMs, organizations can achieve a remarkable three-fold improvement in training performance compared to the previous-generation A2.
GKE Enterprise: With the introduction of multi-cluster horizontal scaling, organizations can now experience remarkable productivity gains of 45% and reduce software deployment times by over 70%. The latest update brings autoscaling, workload orchestration, and automatic upgrades to the Cloud TPU v5e, making it an ideal choice for the most demanding and mission-critical AI/ML workloads.
Cross-Cloud Network: Cross-Cloud Network is a cutting-edge global networking platform that revolutionizes the way customers connect and secure their applications across multiple clouds. It provides an open, optimized infrastructure with ML-powered security, ensuring a zero trust environment. With its innovative design, Cross-Cloud Network makes it effortless for customers to access Google services seamlessly from any cloud, while also significantly reducing network latency by up to 35%.
Google Distributed Cloud: To cater to the specific requirements of organizations that aim to run workloads at the edge or within their own data center, Google is introducing enhancements to its GDC portfolio. These enhancements include next-generation hardware, advanced security capabilities, and integrations with Vertex AI. Additionally, Google is offering a new managed solution called AlloyDB Omni on GDC Hosted, bringing the power of AI to the edge as mentioned earlier.
On top of our world-class infrastructure, we deliver what we believe is the most comprehensive AI platform — Vertex AI — which enables customers to build, deploy and scale machine learning (ML) models. We have seen tremendous usage, with the number of gen AI customer projects growing more than 150 times from April-July this year. Customers have access to more than 100 foundation models, including third-party and popular open-source versions, in our Model Garden. They are all optimized for different tasks and different sizes, including text, chat, images, speech, software code, and more. We also offer industry specific models like Sec-PaLM 2 for cybersecurity, to empower global security providers like Broadcom and Tenable; and Med-PaLM 2 to assist leading healthcare and life sciences companies including Bayer Pharmaceuticals, HCA Healthcare, and Meditech.
Vertex AI Search and Conversation are now generally available, enabling organizations to create Search and Chat applications using their data in just minutes, with minimal coding and enterprise-grade management and security built in. In addition, Vertex AI Generative AI Studio provides user-friendly tools to tune and customize models, all with enterprise-grade controls for data security. These include developer tools like Text Embeddings API, which lets developers build sophisticated applications based on semantic understanding of text or images, and Reinforcement Learning from Human Feedback (RLHF), which incorporates human feedback to deeply customize and improve model performance.
Today, we’re excited to announce several new models and tooling in the Vertex AI platform:
PaLM 2, Imagen and Codey Upgrades: We’re updating PaLM 2 to 32k context windows so enterprises can easily process longer form documents like research papers and books. We’re also improving Imagen’s visual appeal, and extending support for new languages in Codey.
Tools for tuning: For PaLM 2 and Codey, we’re making adapter tuning generally available and in preview respectively, which can help improve LLM performance with as few as 100 examples. We're also introducing a new method of tuning for Imagen, called Style Tuning, so enterprises can create images aligned to their specific brand guidelines or other creative needs with a small amount of reference images.
New models: We’re announcing availability of Llama 2 and Code Llama from Meta, and Technology Innovative Institute’s Falcon LLM, a popular open-source model, as well as pre-announcing Claude 2 from Anthropic. In the case of Llama 2, we will be the only cloud provider offering both adapter tuning and RLHF.
Vertex AI extensions: Developers can access, build, and manage extensions that deliver real-time information, incorporate company data, and take action on the user's behalf. This opens up endless new possibilities for gen AI applications that can operate as an extension of your enterprise, enabled by the ability to access proprietary information and take action on third-party platforms like your CRM system or email.
Grounding: We are announcing an enterprise grounding service that works across Vertex AI foundation models, Search and Conversation that gives customers the ability to ground responses in their own enterprise data to deliver more accurate responses. We are also working with a few early customers to test grounding with the technology that powers Google Search.
Digital Watermarking on Vertex AI: Powered by Google DeepMind SynthID, this offers a state-of-the art technology that embeds the watermark directly into the image of pixels, making it invisible to the human eye and difficult to tamper with. Digital watermarking provides customers with a scalable approach to creating and identifying AI-generated images responsibly. We are the first hyperscale cloud provider to offer this technology for AI-generated images.
Colab Enterprise: This managed service combines the ease-of-use of Google’s Colab notebooks with enterprise-level security and compliance capabilities. Data scientists can use Colab Enterprise to collaboratively accelerate AI workflows with access to the full range of Vertex AI platform capabilities, integration with BigQuery, and even code completion and generation.
Equally important to discovering and training the right model is controlling your data. From the beginning, we designed Vertex AI to give you full control and segregation of your data, code, and IP, with zero data leakage. When you customize and train your model with Vertex AI — with private documents and data from your SaaS applications, databases, or other proprietary sources — you are not exposing that data to the foundation model. We take a snapshot of the model, allowing you to train and encapsulate it together in a private configuration, giving you complete control over your data. Your prompts and data, as well as user inputs at inference time, are not used to improve our models and are not accessible to other customers.
Moving on to Workspace, a productivity tool that has gained immense popularity worldwide, with over 3 billion users and more than 10 million paying customers relying on its capabilities daily. Recently, Duet AI was introduced, a game-changing feature that has revolutionized team productivity. With Duet AI, Users are empowered to enhance their content creation in platforms like Gmail and Google Docs, unleash their creativity with original images in Google Slides, transform ideas into actionable insights using Google Sheets, foster deeper connections in Google Meet, and so much more. The response has been overwhelming, with thousands of companies and over a million trusted testers embracing Duet AI as their ultimate collaboration partner. It not only acts as a coach and a source of inspiration but also ensures that every user and organization maintains complete control over their precious data.
At next there were a number of new enhancements introduced:
Duet AI in Google Meet: Duet AI will take notes during video calls, send meeting summaries, and even automatically translate captions in 18 languages. In addition, to ensure every meeting participant is clearly seen, heard, and understood, Duet AI in Meet announced studio look, studio lighting, and studio sound.
Duet AI in Google Chat: You'll be able to chat directly with Duet AI to ask questions about your content, get a summary of documents shared in a space, and catch up on missed conversations. We've also delivered a refreshed user interface, new shortcuts, and enhanced search to allow you to stay on top of conversations, as well as huddles in Chat which allow teams to start meetings from the place where they are already collaborating.
The top story in the world of Workspace was the general availability of Duet AI in Workspace, but that's not all that the Workspace family had to offer.
Step into the future with the revamped Google Chat! Experience the cutting-edge advancements like Duet AI in Chat, which not only extracts valuable insights from shared documents but also offers conversation summaries and much more. Discover all the exciting details right here.
Today, Duet AI in Google Workspace generally available, while expanding the preview capabilities of Duet AI in Google Cloud, with general availability coming later this year.
In addition to its capabilities within Workspace, Duet AI has expanded its reach to provide AI assistance across various Google Cloud products and services. It serves as a coding assistant for developers, offering faster coding solutions. It also acts as an expert adviser, helping operators troubleshoot application and infrastructure issues swiftly. Furthermore, Duet AI serves as a data analyst, providing quick and insightful data analysis, and as a security adviser, recommending best practices to prevent cyber threats.
The value of Duet AI in Google Cloud has already been realized by customers such as L’Oréal, who have achieved better and faster business decisions using their data. Early testing by Turing has shown engineering productivity gains of one-third.
The latest announcements regarding Duet AI in Google Cloud include advancements in:
Application and infrastructure operations: Operators have the incredible opportunity to engage in natural language conversations with Duet AI across various services, right within the Google Cloud Console. This seamless interaction allows operators to swiftly access valuable "how to" information regarding infrastructure configuration, deployment best practices, and receive expert recommendations for optimizing cost and performance. Say goodbye to time-consuming searches and hello to efficient and effective operations with Duet AI as your trusted guide.
Data Analytics: Duet AI in BigQuery revolutionizes data analytics by providing contextual support for crafting SQL queries and Python code. It goes beyond simple suggestions, offering comprehensive functions and code blocks, as well as explaining SQL statements in plain language. Moreover, Duet AI leverages your schema and metadata to generate personalized recommendations. These advanced capabilities empower data teams to focus on driving impactful outcomes for their business.
Accelerating and modernizing databases: Duet AI seamlessly integrates with Cloud Spanner, AlloyDB, and Cloud SQL to empower users in generating code that structures, modifies, or queries data using natural language. Its power extends to the Database Migration Service (DMS), automating the conversion of complex database code, including stored procedures, functions, triggers, and packages, which were previously challenging to convert using traditional translation technologies.
Security Operations: Moreover, Google is introducing Duet AI to enhance its suite of security products, including Chronicle Security Operations, Mandiant Threat Intelligence, and Security Command Center. This integration empowers security professionals to proactively prevent threats, streamline security workflows, and elevate their expertise.
Duet AI goes beyond merely providing recommendations and expert guidance. It taps into the power of PaLM 2 LLM models and leverages Google Cloud-specific content, including documentation, sample code, and best practices, to deliver contextual insights. With a strong focus on protecting customer security and privacy, as well as adhering to AI principles, Duet AI ensures that your data remains exclusively yours. Rest assured that your code, inputs to Duet AI, and recommendations generated by the system will never be utilized to train shared models or develop any products.
Google is at the forefront of a new era of digital transformation, driven by the power of gen AI. This revolutionary technology is already revolutionizing how businesses operate and how people interact with one another. It is transforming the healthcare industry, enabling doctors to provide better care for their patients. It is revolutionizing communication, making it easier for people to connect and collaborate. And it is even enhancing workplace safety, ensuring that workers are protected on the job. But this is just the beginning. Stay tuned for more updates and don't hesitate to reach out to our consultants to unlock the full potential of Gen AI capabilities.