Eliuth Triana’s Post

Leading the global Amazon developer community optimizing GenAI with NVIDIA

🔥 The NVIDIA-AWS partnership is on fire! New assets are coming out almost every day! Thanks to the entire team for making this big effort! If you're working on deploying or optimizing large models with NVIDIA NIM and NeMo on AWS GPUs, follow me and send me your questions so we can help you keep up with the new material! Today's blog is Part 2 of a step-by-step guide on deploying Generative AI applications using NVIDIA NIM on Amazon EKS. In this post, we focus on creating a cluster with G5 instances accelerated by NVIDIA A10G GPUs and demonstrate how to use the Cluster Autoscaler together with the Horizontal Pod Autoscaler for efficient, scalable AI model inference. Stay tuned for more powerful tools and insights to help you deploy and scale your AI models seamlessly! 🚀 #GenerativeAI #AI #NVIDIA #AWS #EKS #CloudComputing #AIInference #ScalingAI

Timothy Ma

Principal Tech BD and Product Management at Amazon Web Services (AWS)

Deploying Generative AI Applications with NVIDIA NIM Microservices on Amazon Elastic Kubernetes Service (Amazon EKS) – Part 2 | Amazon Web Services

aws.amazon.com
