
Conversation

@amanjaiswal73892 (Collaborator) commented Sep 2, 2025

Introduces a new agent interface that makes any agent human-in-the-loop. At each step, the agent proposes candidate next actions, and a human either selects one of them or provides a natural-language hint to guide the generation of new proposals (see the sketch after the command examples below).

  • Includes a companion UI for human intervention: add one or more hints to request new candidate actions and review the interaction history so far.

  • Adds a stable human_guided_generic_agent.

  • Adds a draft human-in-the-loop (HITL) agent interface: Multi-candidate Generic Agent.

  • Updates Xray to show any added hints in the agent_info tab.

  • Registers a new script entry point, agentlab-mentor, for launching the HITL agent UI.

  • You can run the UI with the stable human-guided generic agent as follows:

agentlab-mentor --benchmark miniwob --task-name "miniwob.book-flight" --seed 7
  • Download the human hints:
agentlab-mentor --download-hints
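
For orientation, here is a minimal sketch of the per-step interaction loop. The method names get_candidate_generations and update_agent_state_from_selected_candidate come from the hunks quoted later in this thread; wait_for_human is a hypothetical stand-in for the HintLabeling UI round-trip, not the merged API.

# Illustrative sketch of one HITL step; not the exact merged implementation.
def get_action(self, obs):
    # 1. The wrapped agent proposes several candidate actions.
    candidates = self.subagent.get_candidate_generations(obs, hint=None, n_candidates=3)
    while True:
        # 2. A human either picks one candidate or types a natural-language hint.
        selected, hint = self.ui.wait_for_human(candidates)  # hypothetical helper
        if selected is not None:
            break
        # 3. A hint triggers a fresh round of proposals.
        candidates = self.subagent.get_candidate_generations(obs, hint=hint, n_candidates=3)
    # 4. Commit the chosen candidate to the subagent's internal state.
    self.subagent.update_agent_state_from_selected_candidate(selected)
    return selected["action"], selected["agent_info"]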

agentlab-mentor --help output:

Run HITL Generic Agent UI on a benchmark task

options:
  -h, --help            show this help message and exit
  --benchmark BENCHMARK
                        Benchmark name as registered in BrowserGym, e.g., miniwob, workarena_l1, webarena,
                        visualwebarena
  --task-name TASK_NAME
                        Exact task name within the benchmark (e.g., 'miniwob.book-flight')
  --seed SEED           Task seed to use for the selected task.
  --headless, --no-headless
                        Run the browser headless (default: True). Use --no-headless to show the browser.
  --download-hints [OUTPUT_CSV]
                        Extract hints from the default study directory and save to OUTPUT_CSV. If OUTPUT_CSV is
                        omitted, saves to 'extracted_hints.csv'. When provided, other args are ignored.

This pull request introduces a new human-in-the-loop (HITL) agent architecture for web automation tasks, enabling a human operator to select among multiple candidate actions proposed by an underlying agent. The changes modularize the agent design and add protocol definitions for multi-candidate agents.
Human-in-the-loop agent workflow and UI integration:

Generic multi-candidate agent implementation:

  • [Stable] Added MultipleProposalGenericAgent and its argument class in generic_human_guided_agent.py, providing a concrete agent that generates multiple candidate actions using LLM prompts, parses structured responses, and integrates with a hint-labeling UI for human selection.

Human-in-the-loop agent architecture and protocol:

  • [Draft] Added the MultiCandidateAgent protocol in base_multi_candidate_agent.py, defining a standard interface for agents that generate multiple candidate actions and update their internal state based on the selected candidate. Also introduced MultiCandidateAgentArgs for agent argument handling and naming conventions.

  • [Draft] Implemented the HumanInTheLoopAgent class in hilt_agent.py, which wraps any multi-candidate agent and presents candidate actions to a human via a UI, allowing hints and selection, and updating agent state accordingly. Includes error handling and UI integration.
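
Pieced together from the hunks quoted later in this thread, the protocol is roughly the following sketch; the Protocol base class and the exact type annotations are assumptions, not the merged code.

from typing import Protocol

from agentlab.agents.agent_args import AgentArgs


class MultiCandidateAgent(Protocol):
    def get_candidate_generations(
        self, obs: dict, hint: str | None, n_candidates: int
    ) -> list[dict]:
        """Propose n_candidates actions; each dict carries 'action' and 'agent_info'."""
        ...

    def update_agent_state_from_selected_candidate(self, output: dict):
        """Commit the human-selected candidate to the agent's internal state."""
        ...


class MultiCandidateAgentArgs(AgentArgs):
    def make_agent(self) -> MultiCandidateAgent: ...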

User interface and action visualization:

  • Introduced utilities (overlay_action, img_to_base_64) in both agent files to overlay proposed actions on screenshots and encode images for the UI, enhancing human interpretability of agent suggestions.
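
A minimal sketch of what such an encoder typically does, matching the signature quoted later in this thread (PNG-encode a PIL image or numpy screenshot array and return a base64 string); the actual implementation in this PR may differ.

import base64
import io

import numpy as np
from PIL import Image


def img_to_base_64(image: Image.Image | np.ndarray) -> str:
    # Accept raw screenshot arrays as well as PIL images.
    if isinstance(image, np.ndarray):
        image = Image.fromarray(image)
    buffer = io.BytesIO()
    image.save(buffer, format="PNG")
    return base64.b64encode(buffer.getvalue()).decode("utf-8")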

Description by Korbit AI

What change is being made?

Introduce a Human-in-the-Loop (HITL) Agent UI and interfaces for agent implementations that support multiple candidate actions and user guidance.

Why are these changes being made?

These changes are being made to facilitate a more interactive approach where a human can guide the decision-making process of an AI agent by selecting among multiple proposed actions. The implementation allows agents to propose several viable actions in complex environments, with a user interface enabling human users to provide hints and select preferred actions. This setup enhances the adaptability and effectiveness of agents in dynamic environments by leveraging human intuition and expertise.


@korbit-ai bot left a comment

Review by Korbit AI

Korbit automatically attempts to detect when you fix issues in new commits.
Category        Issue                                           Status
Functionality   Missing action match validation                 🧠 Incorrect
Design          Missing configuration interface definition      🧠 Not in standard
Performance     Inefficient Image Processing Loop               ✅ Fix detected
Documentation   Unclear State Update Parameter Specification    ✅ Fix detected
Documentation   Document factory function purpose and types     ✅ Fix detected
Functionality   Unused intervention round counter               ✅ Fix detected
Functionality   Optional Agent Name                             🧠 Not in scope
Documentation   Document complex interaction flow               🧠 Not in standard
Readability     Inconsistent Function Name Case                 ✅ Fix detected
Readability     Improper Type Hint Comment                      🧠 Incorrect
Files scanned:
src/agentlab/agents/hilt_agent/base_multi_candidate_agent.py
src/agentlab/agents/hilt_agent/hint_labelling.py
src/agentlab/agents/hilt_agent/multi_candidate_generic_agent.py
src/agentlab/agents/hilt_agent/hilt_agent.py
src/agentlab/agents/hilt_agent/generic_human_guided_agent.py
src/agentlab/agents/hilt_agent/hint_labelling_ui_files/hint_labeling_ui.html


Comment on lines 44 to 45
class MultiCandidateAgentArgs(AgentArgs):
    def make_agent(self) -> MultiCandidateAgent: ...

This comment was marked as resolved.

"""
...

def update_agent_state_from_selected_candidate(self, output: dict):

This comment was marked as resolved.

        self.ui = None

    @cost_tracker_decorator
    def get_action(self, obs):

This comment was marked as resolved.

    def __post_init__(self):
        """Prefix subagent name with 'MC-'."""
        super().__post_init__()
        if hasattr(self, 'agent_name') and self.agent_name:

This comment was marked as resolved.


    def __init__(
        self,
        subagent_args,  # Type: any object with MultiCandidateAgent interface

This comment was marked as resolved.

Comment on lines 178 to 181
def get_base_human_in_the_loop_genericagent(llm_config):
    from agentlab.agents.generic_agent.tmlr_config import BASE_FLAGS
    from agentlab.llm.llm_configs import CHAT_MODEL_ARGS_DICT
    from agentlab.agents.hilt_agent.hilt_agent import HumanInTheLoopAgentArgs

This comment was marked as resolved.

step_n_human_intervention_rounds += 1
suggestions = [{'action': c['action'], 'think': c['agent_info'].think} for c in candidates]
# List of images as base64 - create overlay screenshots for each suggested action
screenshots = [overlay_action(obs, choice["action"]) for choice in suggestions]

This comment was marked as resolved.

Comment on lines 102 to 107
choice_idx = None
for i, candidate in enumerate(suggestions):
    if candidate["action"] == selected_action:
        choice_idx = i
        break
selected_candidate = candidates[choice_idx]

This comment was marked as resolved.
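
The thread above concerns the lookup of the selected action. A defensive variant (illustrative only, not what was merged) would guard against a missing match instead of letting a None index raise a TypeError:

choice_idx = next(
    (i for i, candidate in enumerate(suggestions) if candidate["action"] == selected_action),
    None,
)
if choice_idx is None:
    raise ValueError(f"Selected action not found among suggestions: {selected_action!r}")
selected_candidate = candidates[choice_idx]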

Comment on lines 35 to 60
step_n_human_intervention_rounds = 0
step_hint = []

# Initialize UI once outside the loop
if self.ui is None:
    self.ui = HintLabeling(headless=False)
    # Show initial waiting state
    initial_inputs = HintLabelingInputs(
        goal=(
            obs.get("goal_object", [{}])[0].get("text", "")
            if obs.get("goal_object")
            else ""
        ),
        error_feedback="",
        screenshot=(img_to_base_64(obs["screenshot"]) if "screenshot" in obs else ""),
        screenshots=[],  # no overlay screenshots yet
        axtree=obs.get("axtree_txt", ""),
        history=[],
        hint="",
        suggestions=[],  # no suggestions yet
    )
    self.ui.update_context(initial_inputs)

# Generate first candidates
candidates = self.subagent.get_candidate_generations(obs, hint=None, n_candidates=3)
step_n_human_intervention_rounds += 1

This comment was marked as resolved.

recursix previously approved these changes Sep 2, 2025
    return img_to_base_64(act_img)


def img_to_base_64(image: Image.Image | np.ndarray) -> str:

Maybe this could be in a utils file somewhere where we reconcile to avoid duplicates, e.g. the _url version would call this one.
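
One way to read that suggestion (the helper name img_to_base_64_url and the module path are made up for illustration, reusing the img_to_base_64 sketched earlier):

# Hypothetical consolidation in e.g. src/agentlab/agents/hilt_agent/utils.py
def img_to_base_64_url(image) -> str:
    # The *_url variant would just wrap the single shared encoder in a data URL.
    return "data:image/png;base64," + img_to_base_64(image)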


from agentlab.agents.agent_args import AgentArgs
from agentlab.agents.hilt_agent.base_multi_candidate_agent import MultiCandidateAgent
from agentlab.agents.hilt_agent.hint_labelling import (

Linter doesn't resolve these imports — are we missing an __init__.py?

@recursix recursix merged commit 80a2d82 into main Sep 5, 2025
6 checks passed
@recursix recursix deleted the aj/hilt branch September 5, 2025 18:12
