Research Agent

A production-grade reflective research agent that creates comprehensive markdown reports using multi-agent orchestration.

Features

Multi-Agent Architecture: Specialized agents for planning, research, writing, and editing
ReAct Prompting: Planner uses Reasoning + Acting pattern for thoughtful research plans
Tavily-First Search: Primary web search with conditional ArXiv/Wikipedia enrichment
Human-in-the-Loop: Plan approval checkpoint before research execution
Reflective Writing: Quality-threshold loop between writer and editor agents
Rich CLI: Beautiful terminal interface with progress display

Architecture

High-Level Architecture

graph TB
    subgraph cli_layer [CLI Layer]
        CLI[CLI Entry Point]
    end
    
    subgraph future [Future - Not in Scope]
        FastAPI[FastAPI Endpoint]
    end
    
    subgraph orchestration [Orchestration Layer]
        Orchestrator[Orchestrator]
        PlannerAgent[Planner Agent<br>ReAct Prompting]
        HITL[Human Review<br>Plan Approval]
    end
    
    subgraph agents_layer [Agent Layer]
        ResearchAgent[Research Agent]
        WriterAgent[Writer Agent]
        EditorAgent[Editor Agent]
    end
    
    subgraph tools_layer [Tools Layer]
        Tavily[Tavily Search<br>Primary Discovery]
        Wikipedia[Wikipedia Extractor<br>Conditional]
        ArXiv[ArXiv Extractor<br>Conditional]
    end
    
    subgraph output [Output]
        Report[Markdown Report]
    end
    
    CLI --> Orchestrator
    FastAPI -.->|Future| Orchestrator
    Orchestrator --> PlannerAgent
    PlannerAgent --> HITL
    HITL -->|Approved/Modified| ResearchAgent
    PlannerAgent --> WriterAgent
    ResearchAgent --> Tavily
    Tavily -->|Wikipedia links| Wikipedia
    Tavily -->|ArXiv/paper links| ArXiv
    WriterAgent --> EditorAgent
    EditorAgent -->|Feedback Loop| WriterAgent
    EditorAgent --> Report

Human-in-the-Loop: Plan Approval

flowchart TD
    Planner[Planner Agent] --> Plan[Research Plan]
    Plan --> Display[Display Plan to Human]
    Display --> Decision{Human Decision}
    Decision -->|Approve| Execute[Execute Plan]
    Decision -->|Modify| Edit[Human Edits Plan]
    Decision -->|Reject| Replan[Planner Tries Again]
    Edit --> Execute
    Replan --> Planner

Link-Based Enrichment Flow

flowchart LR
    subgraph discovery [Discovery Phase]
        Query[User Query] --> Tavily[Tavily Search]
    end
    
    subgraph enrichment [Enrichment Phase]
        Tavily --> LinkAnalyzer{Analyze URLs}
        LinkAnalyzer -->|arxiv.org links| ArXiv[ArXiv Extractor]
        LinkAnalyzer -->|wikipedia.org links| Wikipedia[Wikipedia Extractor]
        LinkAnalyzer -->|Other links| WebContent[Web Content]
    end
    
    subgraph aggregation [Aggregation]
        ArXiv --> Results[Research Findings]
        Wikipedia --> Results
        WebContent --> Results
    end

Complete Workflow Sequence

sequenceDiagram
    participant User
    participant Orchestrator
    participant Planner as PlannerAgent
    participant Human as Human Review
    participant Researcher as ResearchAgent
    participant Writer as WriterAgent
    participant Editor as EditorAgent
    
    User->>Orchestrator: Submit question
    Orchestrator->>Planner: Create research plan
    
    loop ReAct Loop
        Planner->>Planner: Thought/Action/Observation
    end
    
    Planner-->>Orchestrator: ResearchPlan
    
    rect rgb(255, 245, 230)
        Note over Orchestrator,Human: Human-in-the-Loop Checkpoint
        Orchestrator->>Human: Display plan for review
        Human-->>Orchestrator: HumanPlanReview
        
        alt Rejected
            Orchestrator->>Planner: Replan with feedback
            Planner-->>Orchestrator: New ResearchPlan
            Orchestrator->>Human: Display new plan
        else Modified
            Note over Human: Human edits plan inline
        end
    end
    
    loop For each task in approved plan
        Orchestrator->>Researcher: Execute task
        Note over Researcher: Tavily + conditional enrichment
        Researcher-->>Orchestrator: ResearchFindings
    end
    
    Orchestrator->>Writer: Write report from findings
    Writer-->>Orchestrator: Draft report
    
    loop Quality Threshold Loop
        Orchestrator->>Editor: Review draft
        Editor-->>Orchestrator: EditorFeedback
        alt Not approved AND iterations < max
            Orchestrator->>Writer: Revise with feedback
            Writer-->>Orchestrator: Revised draft
        end
    end
    
    Orchestrator-->>User: FinalReport

Project Structure

research_agent/
├── src/
│   ├── agents/           # PydanticAI-based agents
│   │   ├── base.py       # Base agent abstraction
│   │   ├── planner.py    # ReAct research planner
│   │   ├── researcher.py # Tool-using researcher
│   │   ├── writer.py     # Report writer
│   │   └── editor.py     # Quality reviewer
│   ├── tools/            # Research tools
│   │   ├── tavily.py     # Web search (primary)
│   │   ├── arxiv.py      # Paper extraction
│   │   ├── wikipedia.py  # Article extraction
│   │   └── link_analyzer.py
│   ├── models/           # Pydantic models
│   ├── prompts/          # Agent prompts
│   ├── hitl/             # Human-in-the-loop
│   ├── cli/              # CLI entry point
│   ├── utils/            # Config & logging
│   └── orchestrator.py   # Workflow coordinator
├── tests/
├── main.py
├── pyproject.toml
└── .env.example

Installation

Clone the repository:

git clone <repository-url>
cd research_agent

Install dependencies with uv:

uv sync

Set up environment variables:

cp .env.example .env
# Edit .env with your API keys

Configuration

Set the following environment variables in .env:

Variable	Required	Default	Description
`OPENAI_API_KEY`	Yes	-	OpenAI API key
`TAVILY_API_KEY`	Yes	-	Tavily API key
`MODEL_NAME`	No	`gpt-4o`	OpenAI model to use
`MAX_REFLECTION_ITERATIONS`	No	`3`	Max writer/editor loops
`APPROVAL_THRESHOLD`	No	`7`	Score (1-10) for auto-approval
`LOG_LEVEL`	No	`INFO`	Logging level

Usage

CLI

Run the research agent:

uv run python main.py

Or use the installed script:

uv run research-agent

Programmatic

import asyncio
from src import Orchestrator
from src.utils.config import get_settings

async def main():
    settings = get_settings()
    orchestrator = Orchestrator(settings)
    
    report = await orchestrator.run(
        "What are the latest advances in transformer architectures?"
    )
    
    print(report.content)

asyncio.run(main())

Workflow

Planning Phase
- User submits a research question
- Planner Agent creates a research plan using ReAct
- Human reviews and approves/modifies/rejects the plan
Research Phase
- Researcher executes each task using Tavily
- ArXiv links are enriched with paper metadata
- Wikipedia links are enriched with article content
Writing Phase
- Writer creates a markdown report from findings
- Editor reviews against quality criteria
- Loop continues until approved or max iterations
Output
- Final markdown report with sources
- Option to save to file

Development

Running Tests

uv run pytest

Type Checking

uv run mypy src

Future Enhancements

State persistence for long-running processes
FastAPI endpoint for web UI
Additional HITL checkpoints
Caching layer for tool results
Rate limiting and retry logic

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
src		src
tests		tests
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
main.py		main.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Research Agent

Features

Architecture

High-Level Architecture

Human-in-the-Loop: Plan Approval

Link-Based Enrichment Flow

Complete Workflow Sequence

Project Structure

Installation

Configuration

Usage

CLI

Programmatic

Workflow

Development

Running Tests

Type Checking

Future Enhancements

License

About

Uh oh!

Releases

Packages

Languages

silacode/research-agent

Folders and files

Latest commit

History

Repository files navigation

Research Agent

Features

Architecture

High-Level Architecture

Human-in-the-Loop: Plan Approval

Link-Based Enrichment Flow

Complete Workflow Sequence

Project Structure

Installation

Configuration

Usage

CLI

Programmatic

Workflow

Development

Running Tests

Type Checking

Future Enhancements

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages