Howie/validate sample by agent #44345

howieleung · 2025-12-09T18:42:24Z

The existing code has already collected print call content and validate if the content after ==> Result meets certain criteria.
Now change replace this validation by submitting all print contents to response.create and validate by AI.
This response.create call will be recorded but the input which is the print contents are sanitized. So if you modify the print statement in samples, you don't need to re-record and still able to replay the record with assertion passed.

Also, if responses said test fail, it is hard to check what content was in the print call. So I write it to temp file.

Copilot

Pull request overview

This PR refactors the sample testing infrastructure to use an AI agent for validating sample outputs. The main change replaces pattern-based output validation with an LLM-powered validation approach using Azure OpenAI.

Key Changes:

Converted SampleExecutor from a simple helper class to a decorator pattern with context manager support
Introduced agent-based validation to check if sample outputs indicate success or failure
Moved environment variable mapping to a separate function and refactored execution flow

sdk/ai/azure-ai-projects/tests/samples/test_samples.py

dargilco · 2025-12-11T00:40:56Z

Big change, don't forget to run 'black' tool. Thanks!

sdk/ai/azure-ai-projects/tests/samples/test_samples.py

Copilot AI review requested due to automatic review settings December 9, 2025 18:42

howieleung requested review from dargilco, glharper, kingernupur, nick863, trangevi and trrwilson as code owners December 9, 2025 18:42

github-actions bot added the AI Projects label Dec 9, 2025

Copilot started reviewing on behalf of howieleung December 9, 2025 18:43 View session

Copilot AI reviewed Dec 9, 2025

View reviewed changes

howieleung force-pushed the howie/validate-sample-by-agent branch 2 times, most recently from 0309437 to 3a9f439 Compare December 10, 2025 20:23

howieleung added 6 commits December 10, 2025 15:23

clean up

dc7114f

update

0d22624

update

288b2f2

restore some code

e83c7bd

clean up

3cb71f3

update

fa356ed

howieleung force-pushed the howie/validate-sample-by-agent branch from 510d630 to b55c649 Compare December 10, 2025 23:55

dargilco approved these changes Dec 11, 2025

View reviewed changes

sdk/ai/azure-ai-projects/tests/samples/test_samples.py Outdated Show resolved Hide resolved

sdk/ai/azure-ai-projects/tests/samples/test_samples.py Show resolved Hide resolved

howieleung force-pushed the howie/validate-sample-by-agent branch from b55c649 to 8d2ff2f Compare December 11, 2025 01:53

fix recording

99386f8

howieleung force-pushed the howie/validate-sample-by-agent branch from 8d2ff2f to 99386f8 Compare December 11, 2025 02:36

howieleung enabled auto-merge (squash) December 11, 2025 04:57

resolved comment

65570ab

howieleung force-pushed the howie/validate-sample-by-agent branch from 94f6871 to 65570ab Compare December 11, 2025 05:15

dargilco approved these changes Dec 11, 2025

View reviewed changes

howieleung merged commit 7772fd6 into main Dec 11, 2025
20 checks passed

howieleung deleted the howie/validate-sample-by-agent branch December 11, 2025 14:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Howie/validate sample by agent #44345

Howie/validate sample by agent #44345

Uh oh!

howieleung commented Dec 9, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

dargilco commented Dec 11, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Howie/validate sample by agent #44345

Howie/validate sample by agent #44345

Uh oh!

Conversation

howieleung commented Dec 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

dargilco commented Dec 11, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

howieleung commented Dec 9, 2025 •

edited

Loading