Machine Learning Lagos

Serving AI workloads on NVIDIA Dynamo

Criado por

Machine Learning Lagos

JAN

qui, 22 jan

05:00 PM - 06:00 PM (Coordinated Universal Time)

On-line

Inscreva-se para obter o link

O evento começa em

dias

horas

minutos

segundos

Garanta seus ingressos de graça!

Este evento é gratuito para todos.

Junte-se a 44 outras pessoas.

Preço do ingressoGrátis

Sobre

How do you choose the right serving strategy for your model? This presentation seeks not to just introduce Dynamo to the audience but to take a practical outlook by walking through techniques and real-world scenarios in serving AI workloads in production such as disaggregated serving, optimizing deployments against memory-bound bottlenecks.

Key learning objectives: Attendees will have an intuitive understanding of how Dynamo addresses memory-bound operations to get the best performance out of GPUs. They will understand the different architectural patterns in serving AI workloads in distributed environments and a framework to determine the choice of deployment based on technical and business constraints.

Target level: Beginner - Advanced

Evento por

Faça uma pergunta

44 participantes