90
min of engaging and interactive sessions
30
executives from Fortune 500 companies
1
hour of post-session conversation and networking
Workshop topic: “The invisible risk: Evaluating AI agents before they fail you”
Join Microsoft and Grid Dynamics for a private executive workshop focused on the AI Software Development Life Cycle (SDLC) in financial services. We’ll take a practical look at AI agent evaluation, an emerging discipline where expectations are high, risks are growing, and clear standards are still taking shape.
Building on this foundation, the workshop will focus on a challenge many banks, wealth managers, and insurers now face as AI agents move from pilots into production: how to define, measure, and sustain quality in highly regulated environments with strict model risk management, audit requirements, and compliance obligations.
We will unpack:
- Evaluation methodologies: deterministic, model-based, and human-in-the-loop
- Failure modes and quality drift: how to identify degradation in long-running, tool-augmented, and research-style agents
- Multi-agent orchestration: where and how to evaluate performance across complex workflows
- RAG systems and prompt reliability: discerning systemic flaws from design errors
You’ll gain insights from real financial services AI agent implementations, learn what works (and what doesn’t), and leave with actionable frameworks to help your teams deploy AI agents with confidence.
Following the main session, please join us for a delightful social hour featuring complimentary food and beverages.
Agenda
Arrival and Registration
Welcome and Opening Remarks
Keynote Presentation: “The Invisible Risk: evaluating AI agents before they fail you.”
Panel discussion: “Quality at scale – challenges in AI agent evaluation.”
Networking Break
Breakout session: “Client Perspective: risks, realities, and lessons learned.”
Summary & Q&A
Closing Remarks & Next Steps
Networking Reception
Speakers
Nikita Ivanov
VP of Technology, AI
Nikita is a technology executive and product-driven innovator with over 25 years of global leadership across AI, software engineering, and digital transformation. Product-centric leader with extensive experience in developing new software middleware and GenAI/LLM/RAG/AI Agentic products, managing software engineering & data science teams, and nurturing product evolution throughout all stages of its lifecycle. Frequent international speaker and Forbes Technology Council member, long-time startup mentor, advisor and investor, active open source contributor, Java/Scala founding community member. Recognized expert and authority on distributed high-performance data processing. Industry innovator in NLP and GenAI/Agentic, reinforcement learning and deterministic AI/ML. Global thought leader in In-Memory Computing. Expert in programming languages, design and compiler development. Trusted advisor to boards and executive teams on AI strategy, national digital transformation, and workforce innovation.
Pat Converse
Senior Director Hyperscaler Partnerships
Senior Technology Consulting Executive and digital innovator with 25+ years in tech and digital engineering consulting experience. Has led global tech consulting and advisory practices at Accenture/Avanade and several PE backed firms.Industry expertise includes Oil & Gas, manufacturing, Aerospace, Retail/CPG, and High Tech.
Intended audience
This workshop is designed for:
C-suite and senior technology leaders in BFSI (CTO, CIO, CDO, CAIO, CRO, CCO)
Heads of AI, Data, and Machine Learning
Senior leaders responsible for AI governance, risk, and production quality
Executives overseeing enterprise-scale AI platforms and agent-based systems
Why should you attend?
- Hear insights about AI risk management and governance from experienced professionals at Microsoft and Grid Dynamics
- Explore tested approaches to evaluating AI agents
- Understand how to effectively govern AI beyond the pilot phase
- Connect with peers facing the same challenges
- Engage in off-the-record executive dialogue

