Gaurav Goyal

About Speaker

Gaurav Goyal

Software Development Engineer in Test III
Safe Security

Gaurav build the systems that ensure software is not just functional, but resilient and scalable. 
His journey began in early 2021 as an intern, and over the past five years, He have scaled alongside a fast-paced startup environment—moving from writing automation scripts to architecting full-scale CI/CD infrastructures and leading a high-performing SDET team. He specialize in bridging the gap between Quality Engineering and DevOps, ensuring that “quality” isn’t just a phase, but a continuous part of the deployment pipeline.

Demo Session - Beyond the Prompt: A Unified Framework for Quantifying Trust for AI agents

Topic – Beyond the Prompt: A Unified Framework for Quantifying Trust for AI agents

As AI agents transition from experimental prototypes to core enterprise software, the industry faces a critical challenge: moving from “vibes-based” testing to measurable, repeatable proof of trustworthiness. This session introduces a comprehensive philosophy for testing autonomous, goal-driven AI systems that reason over context, take actions using tools, and produce outcomes within defined safety boundaries.

We will explore a multi-layered testing strategy that shifts away from evaluating just the model, instead treating the AI agent as a composed production system—inclusive of inputs, retrieved enterprise data (RAG), reasoning logic, and tool invocations. The talk will detail how to approach a Continuous Trust Framework and a Data Flywheel to ensure AI reliability across the entire development lifecycle.

Scroll to Top