Testing conversational AI is hard because human dialogue is unpredictable and diverse. This talk shows how LLM-as-Judge and generative AI redefine QA through automated evaluation and intelligently generated test data, enabling scalable, reliable, and cost-efficient testing of voice and chat systems.