Optimize the selection NextSpeaker mechanism of RolePlayOrchestrator … #6688

JeffreySu · 2025-06-17T15:39:38Z

…to improve the accuracy of name hits

Improvements:

1. Enforce a standard JSON return format.
2. Introduce a one-time retry mechanism to reduce the error rate (currently, 100% of tests pass).
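
For illustration, a minimal sketch of what strict parsing of a JSON-format reply could look like. It assumes the model is asked to answer with an object carrying a `Speaker` string field; that key name and the helper `TryReadSpeaker` are hypothetical and not taken from the PR. Anything that is not such an object yields null, so a caller can decide to retry.

```csharp
using System;
using System.Text.Json;

// Hypothetical helper: returns the "Speaker" value, or null when the reply
// is not a JSON object with a string "Speaker" field.
static string? TryReadSpeaker(string reply)
{
    try
    {
        using var doc = JsonDocument.Parse(reply);
        var root = doc.RootElement;
        if (root.ValueKind == JsonValueKind.Object
            && root.TryGetProperty("Speaker", out var value)
            && value.ValueKind == JsonValueKind.String)
        {
            return value.GetString();
        }
    }
    catch (JsonException)
    {
        // Malformed JSON: treat as "no answer" so the caller can retry.
    }
    return null;
}

Console.WriteLine(TryReadSpeaker("{\"Speaker\":\"Alice\"}") ?? "<none>");
Console.WriteLine(TryReadSpeaker("From Alice:") ?? "<none>");
```

The second call shows the older free-text style of reply being rejected rather than guessed at.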

Checks

I've included any doc changes needed for https://microsoft.github.io/autogen/. See https://github.com/microsoft/autogen/blob/main/CONTRIBUTING.md to build and test documentation locally.
I've added tests (if relevant) corresponding to the changes introduced in this PR.
I've made sure all auto checks have passed.


LittleLittleCloud · 2025-06-17T16:42:38Z

dotnet/src/AutoGen.Core/Orchestrator/RolePlayOrchestrator.cs

-        var name = response.GetContent() ?? throw new ArgumentException("No name is returned.");
+        var responseMessageStr = response.GetContent() ?? throw new ArgumentException("No name is returned.");
+
+        RolePlayOrchestratorResponse? responseMessage;


nit:

var responseMessage = JsonSerializer.Deserialize<RolePlayOrchestratorResponse>(responseMessageStr) ?? throw new InvalidOperationException("Incorrect RolePlayOrchestratorResponse JSON format.");

LittleLittleCloud · 2025-06-17T16:48:13Z

dotnet/src/AutoGen.Core/Orchestrator/RolePlayOrchestrator.cs

+
+        var reaginCandidate = candidates.FirstOrDefault(x => x.Name!.ToUpper() == regainResponseMessage.Speaker!.ToUpper());
+
+        if (reaginCandidate != null)


reaginCandidate -> regainCandidate

LittleLittleCloud · 2025-06-17T16:49:37Z

dotnet/src/AutoGen.Core/Orchestrator/RolePlayOrchestratorResponse.cs

+namespace AutoGen.Core.Orchestrator;
+internal class RolePlayOrchestratorResponse
+{
+    internal string? Speaker { get; set; }


The property's access modifier doesn't also have to be internal here, does it?

public is also fine.
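
A small sketch of why the modifier can matter when the type is deserialized with System.Text.Json: by default that serializer binds only public properties, so a public property on an internal class stays hidden from other assemblies while still being populated. `SpeakerReply` is a hypothetical stand-in for `RolePlayOrchestratorResponse`, and the JSON key is assumed to match the property name exactly.

```csharp
using System;
using System.Text.Json;

var parsed = JsonSerializer.Deserialize<SpeakerReply>("{\"Speaker\":\"Alice\"}");
Console.WriteLine(parsed?.Speaker ?? "<null>");

// Hypothetical stand-in for RolePlayOrchestratorResponse.
internal class SpeakerReply
{
    // Public property on an internal type: not visible outside the assembly,
    // but bound by System.Text.Json, which skips non-public properties by default.
    public string? Speaker { get; set; }
}
```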

LittleLittleCloud · 2025-06-17T16:55:08Z

dotnet/src/AutoGen.Core/Orchestrator/RolePlayOrchestrator.cs

-Each message will start with 'From name:', e.g:
-From {agentNames.First()}:
-//your message//.");
+## Available Speaker Names


Maybe add a class-level summary on top?

/// <summary>
/// This orchestrator uses a robust two-step strategy to select the next speaker in a roleplay conversation:
/// 1. It first prompts the LLM to select the next speaker from the list of valid candidate names, requiring output in a strict JSON format.
/// 2. If the LLM's chosen name does not exactly match any candidate (e.g., due to hallucination, abbreviation, or formatting issues),
/// the orchestrator issues a second prompt to the LLM, instructing it to map the provided name to the closest valid candidate name from the original list.
/// This approach ensures that the selected speaker always corresponds to an authorized candidate and guards against LLM output errors.
/// </summary>
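
The two-step strategy described in that summary can be sketched roughly as follows. This is not the PR's implementation: `ResolveSpeaker` and the `remap` delegate (standing in for the second LLM prompt) are hypothetical, and the comparison uses `StringComparison.OrdinalIgnoreCase` instead of the `ToUpper()` comparison shown in the diff.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Step 1: case-insensitive match against the candidate names.
// Step 2 (single retry): ask `remap` for a corrected name and match again.
static string? ResolveSpeaker(string rawName, IReadOnlyList<string> candidates, Func<string, string> remap)
{
    string? Match(string name) =>
        candidates.FirstOrDefault(c => string.Equals(c, name.Trim(), StringComparison.OrdinalIgnoreCase));

    return Match(rawName) ?? Match(remap(rawName));
}

var names = new[] { "Alice", "Bob" };

// Stand-in for the second prompt; a real orchestrator would call the model here.
Func<string, string> fakeRemap = n => n.StartsWith("Al", StringComparison.OrdinalIgnoreCase) ? "Alice" : n;

Console.WriteLine(ResolveSpeaker("bob", names, fakeRemap) ?? "<no match>");
Console.WriteLine(ResolveSpeaker("Ali", names, fakeRemap) ?? "<no match>");
```

Returning null when the retry also fails leaves the decision (throw, or fall back to some default) to the caller.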

LittleLittleCloud

PR LGTM, minor changes requested before merging

codecov · 2025-06-17T17:05:20Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 79.71%. Comparing base (89927ca) to head (8912db2).

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #6688   +/-   ##
=======================================
  Coverage   79.71%   79.71%           
=======================================
  Files         232      232           
  Lines       17323    17323           
=======================================
  Hits        13809    13809           
  Misses       3514     3514

Flag	Coverage Δ
unittests	`79.71% <ø> (ø)`


JeffreySu added 5 commits June 17, 2025 23:37

Merge branch 'main' into main

4ee44f7

Merge branch 'main' of https://github.com/JeffreySu/autogen

ca2d4b8

Optimize the selection NextSpeaker mechanism of RolePlayOrchestrator …

8912db2

…to improve the accuracy of name hits

LittleLittleCloud self-requested a review June 17, 2025 16:39


LittleLittleCloud requested changes Jun 17, 2025


Merge branch 'main' into main

10c6c5a
