Glass Image Background
Artificial Intelligence
cover image
Limited slots
AI Agent Evaluation: Practical guide to benchmark and improve AI Agents
Hosted by
host-profile-image
Mohammad Arshad

SEP

16

Tue, 16 Sep

03:30 PM - 04:30 PMfalse

Online

Register to get link
Hey, See you at the event!
Ticket PriceFree

Ticket PriceFree

About

AI agents look magical in demos but often fail in the real world—eroding trust, as seen with Air Canada’s chatbot error or Google Bard’s costly launch slip. This talk introduces a practical playbook for evaluating agents, from frameworks like RAGAS and TruLens to new ideas like Evaluation-Driven Development. The goal: to close the trust gap and shape the AI Quality Movement, where agents are not just impressive but truly reliable. Abdullah Mansoor https://www.linkedin.com/in/abdullahmansoor/
39 people attending
Attendees 0
Attendees 1
Attendees 2
Attendees 3
Attendees 4
See attendees

Location

AI Agent Evaluation: Practical guide to benchmark and improve AI Agents
Register to get event link
Online
This event is part of a community
community-profile-image
Artificial Intelligence
11,962 Members
Built with