Highlights:
- The two companies took the stage at Amazon’s annual customer conference, AWS re:Invent, this week, where they made several announcements about their continuing partnership.
- Nvidia’s NIM offers developers simple access to microservices for deploying high-performance AI model inference workloads across environments.
Nvidia Corp. is partnering with Amazon Web Services Inc. to push the boundaries of artificial intelligence, robotics, and quantum computing innovation.
The two companies shared the spotlight at Amazon’s annual customer conference, AWS re:Invent, this week, unveiling several announcements about their ongoing partnership.
The updates feature the integration of Nvidia’s NIM microservices with various AWS AI services, enabling lower-latency inference for AI developers. Additionally, they include the launch of Nvidia’s DGX Cloud on AWS and other advancements in AI technology.
The most significant update for developers is the broader availability of NIM microservices on AWS.
Nvidia’s NIM offers developers a suite of user-friendly microservices designed to simplify the deployment of high-performance AI model inference workloads across various environments, including the cloud, on-premises data centers, and workstations. With the latest update, these microservices are now accessible through the AWS Marketplace, the new Amazon Bedrock Marketplace, and Amazon SageMaker JumpStart, giving developers the flexibility to deploy them seamlessly from their preferred interface, the companies announced.
Users can also deploy these microservices across multiple AWS services, including Amazon Elastic Compute Cloud, Amazon SageMaker, and Amazon Elastic Kubernetes Service.
The NIM microservices are offered as prebuilt containers, providing a selection of inference engines such as Nvidia Triton Inference Server, Nvidia TensorRT, Nvidia TensorRT-LLM, and PyTorch. They support hundreds of AI models, including those from the Amazon Bedrock Marketplace, Nvidia’s AI foundation models, and custom models developed by customers.
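A deployed NIM container typically exposes an OpenAI-compatible chat-completions API. As a minimal sketch of what calling one from a developer workflow might look like, assuming a hypothetical local endpoint and an illustrative model name (neither is taken from the announcement):

```python
import json
import urllib.request

# Hypothetical local endpoint; a running NIM container typically serves
# an OpenAI-compatible chat-completions API at a URL like this.
NIM_ENDPOINT = "http://localhost:8000/v1/chat/completions"

def build_chat_request(model: str, prompt: str,
                       max_tokens: int = 128) -> urllib.request.Request:
    """Build an OpenAI-style chat-completions request for a NIM endpoint."""
    body = json.dumps({
        "model": model,                       # illustrative model name
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": max_tokens,
    }).encode("utf-8")
    return urllib.request.Request(
        NIM_ENDPOINT,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

req = build_chat_request("meta/llama3-8b-instruct",
                         "Summarize the release notes in one sentence.")
# urllib.request.urlopen(req)  # would send the request to a running container
```

Because the API follows the OpenAI schema, existing client code can usually be pointed at a NIM endpoint by changing only the base URL and model name.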
DGX Cloud Available on AWS
Alongside the NIM microservices, developers now have access to Nvidia DGX Cloud, a new infrastructure solution. Available through AWS Marketplace Private Offers, it provides a fully managed, high-performance computing platform for training, customizing, and deploying AI models.
DGX Cloud is a cloud-based AI supercomputing service that provides enterprises with access to Nvidia’s GPUs and the essential software for training advanced models used in generative AI and other applications.
Nvidia emphasized that one advantage of DGX Cloud is its flexible deployment terms. Furthermore, customers will have direct access to Nvidia’s experts, who will provide the technical support needed to scale their AI deployments.
The DGX Cloud platform currently offers access to Nvidia’s most powerful GPUs, the Nvidia H100 and H200, with plans to expand soon to include the next-generation Blackwell GPUs, set to launch in the new year.
AWS announced that the Blackwell chips will be available as part of the GB200 NVL supercomputing system, which will feature a new liquid cooling system designed to deliver superior performance and greater energy efficiency compared to other cloud platforms.
AI Templates, Robotics Models, and Drug Development Simulations
In other AI-related news, Nvidia announced the release of several new AI Blueprints for immediate deployment on AWS. These blueprints offer pre-configured AI agents for tasks like video search, container vulnerability analysis, and text summarization, which can be easily incorporated into existing developer workflows.
According to Nvidia, the AI Blueprints open up a range of possibilities. For example, developers can use the video search blueprint to rapidly build a visual AI agent capable of analyzing video in real time. This agent can then generate alerts for security teams, detect health and safety violations in the workplace, identify defective products on a production line, and more.
Nvidia is making significant strides in AI-powered robotics, driven by its longstanding belief that AI can enable robots to handle practical, real-world tasks. Its latest update is designed to accelerate the simulation of these applications.
Central to this effort is the Nvidia Omniverse platform. Nvidia announced a reference application now available on Omniverse, which enables the creation of realistic virtual environments and digital twins. Powered by high-performance Amazon EC2 G6e instances accelerated by Nvidia L40S GPUs, the application allows developers to simulate and test AI-powered robots in diverse environments with highly realistic physics, according to the company.
Nvidia and AWS are also working to advance AI’s role in pharmaceutical development. They announced that Nvidia’s BioNeMo NIM microservices and AI Blueprints for drug discovery are now integrated with AWS HealthOmics, a fully managed service for computing and storing biological data to support clinical diagnostics.
The partnership enhances AWS HealthOmics by enabling researchers to explore a broader range of AI models, the companies stated.
Advancing Quantum Computing
Nvidia announced a collaboration with AWS to accelerate quantum computing development. The chipmaker’s Nvidia CUDA-Q platform, designed for creating hybrid quantum-classical computing applications that bridge traditional and quantum systems, is being integrated with the Amazon Braket service.
Amazon Braket simplifies the process of setting up, monitoring, and running hybrid quantum-classical algorithms on quantum processors. With this integration, Nvidia stated that CUDA-Q users can access Amazon Braket’s quantum resources, while Braket users can leverage CUDA-Q’s GPU-accelerated workflows for development and simulation.
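As a rough illustration of the statevector math that GPU-accelerated simulators such as CUDA-Q scale up to many qubits, here is a toy pure-Python simulation of a two-qubit Bell circuit. This is illustrative only; it does not use the CUDA-Q or Amazon Braket APIs:

```python
import math

def apply_h(state, qubit):
    """Apply a Hadamard gate to `qubit` of a little-endian statevector."""
    s = 1 / math.sqrt(2)
    new = [0j] * len(state)
    for i, amp in enumerate(state):
        bit = (i >> qubit) & 1
        j = i ^ (1 << qubit)           # index with `qubit` flipped
        # H: |0> -> (|0>+|1>)/sqrt2,  |1> -> (|0>-|1>)/sqrt2
        new[i] += s * amp if bit == 0 else -s * amp
        new[j] += s * amp
    return new

def apply_cnot(state, control, target):
    """Flip the `target` bit of every basis state whose `control` bit is 1."""
    new = list(state)
    for i, amp in enumerate(state):
        if (i >> control) & 1:
            new[i ^ (1 << target)] = amp
    return new

state = [1 + 0j, 0j, 0j, 0j]     # start in |00>
state = apply_h(state, 0)        # superposition on qubit 0
state = apply_cnot(state, 0, 1)  # entangle: Bell state (|00> + |11>)/sqrt2
probs = [abs(a) ** 2 for a in state]
```

Each added qubit doubles the statevector, which is why large-scale circuit simulation of this kind is offloaded to GPU-accelerated platforms rather than run in plain Python.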