
⚡︎ BoxLang AI

|:------------------------------------------------------:|
| ⚡︎ B o x L a n g ⚡︎
| Dynamic : Modular : Productive
|:------------------------------------------------------:|
Copyright Since 2023 by Ortus Solutions, Corp
www.boxlang.io | www.ortussolutions.com

 

👋 Welcome

BoxLang AI Module

Welcome to the BoxLang AI Module 🚀 This module brings AI generation capabilities to your BoxLang applications through an easy-to-use, abstracted API, so you can interact with ANY AI provider in a consistent manner. Our core focus is productivity, fluency and ease of use. ✨

BoxLang AI eliminates the complexity of working with multiple AI providers by offering a unified interface. Whether you're using OpenAI, Claude, Gemini, Grok, DeepSeek, or Perplexity, your code remains the same; just change a configuration setting to switch providers. 🔄

✨ Key Features

  • 🔌 Multi-Provider Support - Seamlessly integrate with leading AI providers through a single API
  • 💬 Fluent Interface - Chainable, expressive syntax that makes AI integration intuitive
  • 📝 Flexible Messaging - Send simple strings, structured messages, or complex conversation arrays
  • 🤖 Flexible Agents - Easily create autonomous and composable AI Agents
  • ⚡ Async Support - Built-in asynchronous capabilities with futures for non-blocking operations
  • 🔒 Multi-Tenant Memory - Enterprise-grade user and conversation isolation across all memory types
  • 📚 Document Loaders - 12+ built-in loaders for documents, files, web content, databases, and more
  • 🧬 RAG Pipeline - Complete Retrieval-Augmented Generation workflow from documents to context injection
  • 🎯 Vector Memory - Semantic search and retrieval using ChromaDB, PostgreSQL, MySQL, TypeSense, and Weaviate
  • ⚙️ Configurable - Global defaults, per-request overrides, and comprehensive logging options
  • 🎯 Event-Driven - Intercept and extend AI processing with lifecycle events
  • 🏭 Production-Ready - Timeout controls, error handling, and debugging tools

📄 License

BoxLang is open source and licensed under the Apache 2 license. 🎉 You can also get a professionally supported version with enterprise features and support via our BoxLang +/++ Plans (www.boxlang.io/plans). 💼

🚀 Getting Started

You can easily get started with BoxLang AI by using the module installer:

install-bx-module bx-ai

If you would like to leverage it in your CommandBox-based web applications, make sure you add it to your server.json or run box install bx-ai.

Once installed, make sure you set up any of the supported AI providers and their API keys in your boxlang.json configuration file. After that, you can leverage the global functions (BIFs) in your BoxLang code. Here is a simple example:

// chat.bxs
answer = aiChat( "How amazing is BoxLang?" )
println( answer )

📚 New to AI concepts? Check out our Key Concepts Guide for terminology and fundamentals, or browse our FAQ for quick answers to common questions.

🤖 Providers

The following are the AI providers supported by this module. Please note that in order to interact with these providers, you will need an account with each of them and an API key. 🔑

🎯 Features

Here are some of the features of this module:

  • 🔌 Integration with multiple AI providers
  • 📦 Structured Output - Type-safe AI responses using BoxLang classes, structs, or JSON schemas
  • 🤖 AI Agents - Autonomous agents with memory, tools, and sub-agent orchestration
  • 🔒 Multi-Tenant Memory - Built-in user and conversation isolation for enterprise applications
  • 📚 Document Loaders - Built-in loaders for Text, Markdown, CSV, JSON, XML, PDF, Log, HTTP, Feed, SQL, Directory, and WebCrawler
  • 🧬 RAG (Retrieval-Augmented Generation) - Complete workflow: load documents → chunk → embed → store → retrieve → inject into AI context
  • 🎯 Vector Memory Systems - Semantic search with ChromaDB, PostgreSQL pgvector, MySQL vector, TypeSense, and Weaviate
  • 📝 Compose raw chat requests
  • 💬 Build message objects
  • 🛠️ Create AI service objects
  • 🔧 Create AI tool objects
  • 🔍 Generate embeddings for semantic search
  • ⛓️ Fluent API
  • ⚡ Asynchronous chat requests
  • 🌐 Global defaults
  • ✨ And much more

📊 Provider Support Matrix

Here are the providers currently supported. Feature support (real-time tools, embeddings, and structured output) varies per provider, so please keep checking back as we add more providers and features to this module. 🔄

  • Claude
  • Cohere
  • DeepSeek
  • Gemini [Coming Soon]
  • Grok
  • Groq
  • HuggingFace
  • Mistral
  • Ollama
  • OpenAI (✅ native structured output)
  • OpenRouter
  • Perplexity
  • Voyage (✅ specialized, embeddings-only)

Note:

  • OpenAI provides native structured output support with strict schema validation. Other providers use JSON mode with schema constraints, which provides excellent results but may occasionally require prompt refinement.
  • Voyage AI is a specialized embeddings-only provider with state-of-the-art models optimized for semantic search, RAG applications, and clustering. It does not support chat completions or structured output.
  • Cohere provides high-quality embeddings with excellent multilingual support (100+ languages), chat capabilities, real-time tool calling, and structured output via JSON schema validation.

📤 Return Formats

The AI module supports different return formats for the responses. You can specify the return format in the options struct when calling the aiChat() or aiChatAsync() functions, globally in the settings (see the Settings section below), or in the ChatRequest object. 🎯

| Format | Description |
|--------|-------------|
| single | Returns a single message as a string (the content of the first choice). This is the default format for BIFs. |
| all | Returns an array of all choice messages. Each message is a struct with role and content keys. |
| json | Returns the parsed JSON object from the content string. Automatically parses JSON responses. |
| xml | Returns the parsed XML document from the content string. Automatically parses XML responses. |
| raw | Returns the full raw response from the AI provider, including metadata. Useful for debugging. This is the default for pipelines. |
| structuredOutput | Used internally when .structuredOutput() is called. Returns a populated class/struct based on the schema. |
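
For example, to get a parsed struct back from a single call (a minimal sketch; the returnFormat key in the options struct mirrors the global setting shown in the Settings section below):

// Ask for JSON and receive a parsed struct instead of a string
data = aiChat(
    "Return a JSON object with keys 'language' and 'year' describing BoxLang",
    {},
    { returnFormat: "json" }
)
println( data.language )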

💬 Chats

Interact with AI models through simple and powerful chat interfaces 🎯 supporting both one-shot responses and streaming conversations. BoxLang AI provides fluent APIs for building everything from basic Q&A to complex multi-turn dialogues with system prompts, message history, and structured outputs. 💡

🤔 Why Use Chats?

  • ⚡ Simple & Fast - One-line chat interactions with aiChat()
  • 🔄 Streaming Support - Real-time token streaming with aiChatStream()
  • 💾 Memory Integration - Automatic conversation history with memory systems
  • 🎨 Flexible Messages - Support for text, images, files, and structured data
  • 🌊 Fluent API - Chain message builders for readable, maintainable code

💡 Quick Examples

Simple One-Shot Chat:

// Quick question-answer
response = aiChat( "What is BoxLang?" )
println( response )

// With custom model and options
response = aiChat(
    messages: "Explain quantum computing",
    model: "gpt-4",
    temperature: 0.7,
    maxTokens: 500
)

Multi-Turn Conversation with Memory:

// Create conversation with memory
memory = aiMemory( "windowed", maxMessages: 10 )

// First turn
response = aiChat(
    messages: "My name is Luis",
    memory: memory
)

// Second turn - AI remembers context
response = aiChat(
    messages: "What's my name?",
    memory: memory
)
println( response ) // "Your name is Luis"

Streaming Chat:

// Stream tokens as they arrive
aiChatStream(
    onChunk: ( chunk ) => {
        writeOutput( chunk )
        flush
    },
    messages: "Write a short story about a robot",
    model: "claude-3-5-sonnet-20241022"
)

Fluent Message Builder:

// Build complex message chains
request = aiChatRequest()
    .setModel( "gpt-4o" )
    .addSystemMessage( "You are a helpful coding assistant" )
    .addUserMessage( "How do I create a REST API in BoxLang?" )
    .addImage( "diagram.png" )
    .setTemperature( 0.7 )

response = request.send()

📚 Learn More

🔗 Pipelines

Build composable AI workflows 🎯 using BoxLang AI's powerful runnable pipeline system. Chain models, transformers, tools, and custom logic into reusable, testable components that flow data through processing stages. Perfect for complex AI workflows, data transformations, and multi-step reasoning. 💡

🤔 Why Use Pipelines?

  • 🔄 Composable - Chain any runnable components together with .to()
  • 🧪 Testable - Each pipeline stage is independently testable
  • ♻️ Reusable - Build once, use in multiple workflows
  • 🌊 Streaming - Full streaming support through entire pipeline
  • 🎯 Type-Safe - Input/output contracts ensure data flows correctly

💡 Quick Examples

Simple Transformation Pipeline:

// Chain model with transformers
pipeline = aiModel( "gpt-4o" )
    .to( aiTransform( data => data.toUpperCase() ) )
    .to( aiTransform( data => data.trim() ) )

result = pipeline.run( "hello world" )
println( result ) // "HELLO WORLD"

Multi-Stage AI Pipeline:

// Create reusable stages
summarizer = aiModel( "gpt-4o-mini" )
    .setSystemMessage( "Summarize in one sentence" )

translator = aiModel( "gpt-4o" )
    .setSystemMessage( "Translate to Spanish" )

formatter = aiTransform( text => {
    return { summary: text, timestamp: now() }
} )

// Compose full pipeline
pipeline = summarizer
    .to( translator )
    .to( formatter )

// Run through all stages
result = pipeline.run( "Long article text here..." )
println( result.summary ) // Spanish summary

Streaming Pipeline:

// Stream through entire pipeline
pipeline = aiModel( "claude-3-5-sonnet-20241022" )
    .to( aiTransform( chunk => chunk.toUpperCase() ) )

pipeline.stream(
    onChunk: ( chunk ) => writeOutput( chunk ),
    input: "Tell me a story"
)

Custom Runnable Component:

// Implement IAiRunnable for custom logic
class implements="IAiRunnable" {
    function run( input, params = {} ) {
        // Custom processing
        return processedData;
    }

    function stream( onChunk, input, params = {} ) {
        // Streaming support
        onChunk( processedChunk );
    }

    function to( nextRunnable ) {
        // Chain to next stage
        return createPipeline( this, nextRunnable );
    }
}

// Use in pipeline
customStage = new CustomRunnable()
pipeline = aiModel( "gpt-4o" ).to( customStage )

📚 Learn More

🤖 AI Agents

Build autonomous AI agents 🎯 that can use tools, maintain memory, and orchestrate complex workflows. BoxLang AI agents combine LLMs with function calling, memory systems, and orchestration patterns to create intelligent assistants that can interact with external systems and solve complex tasks. 💡

🤔 Why Use Agents?

  • 🛠️ Tool Integration - Agents can execute functions, call APIs, and interact with external systems
  • 🧠 Stateful Intelligence - Built-in memory keeps context across multi-turn interactions
  • 🔄 Self-Orchestration - Agents decide which tools to use and when
  • 🎯 Goal-Oriented - Give high-level instructions, agents figure out the steps
  • 🤝 Human-in-the-Loop - Optional approval workflows for sensitive operations

💡 Quick Examples

Simple Agent with Tools:

// Define tools the agent can use
weatherTool = aiTool()
    .setName( "get_weather" )
    .setDescription( "Get current weather for a location" )
    .setFunction( ( location ) => {
        return { temp: 72, condition: "sunny", location: location };
    } )

// Create agent with memory
agent = aiAgent()
    .setName( "Weather Assistant" )
    .setDescription( "Helpful weather assistant" )
    .setTools( [ weatherTool ] )
    .setMemory( aiMemory( "windowed" ) )

// Agent decides when to call tools
response = agent.run( "What's the weather in Miami?" )
println( response ) // Agent calls get_weather tool and responds

Autonomous Agent with Multiple Tools:

// Agent with database and email tools
agent = aiAgent()
    .setName( "Customer Support Agent" )
    .setTools( [
        aiTool( "query_orders", orderQueryFunction ),
        aiTool( "send_email", emailFunction ),
        aiTool( "create_ticket", ticketFunction )
    ] )
    .setMemory( aiMemory( "session" ) )
    .setMaxIterations( 5 ) // Prevent infinite loops

// Agent orchestrates multiple tool calls
agent.run( "Find order #12345, email the customer with status, and create a ticket if there's an issue" )

📚 Learn More

📦 Structured Output

Get type-safe, validated responses ✅ from AI providers by defining expected output schemas using BoxLang classes, structs, or JSON schemas. The module automatically converts AI responses into properly typed objects, eliminating manual parsing and validation. 🎯

🤔 Why Use Structured Output?

  • ✅ Type Safety - Get validated objects instead of parsing JSON strings
  • 🔒 Automatic Validation - Schema constraints ensure correct data types and required fields
  • 🎯 Better Reliability - Reduces hallucinations by constraining response format
  • 💻 Developer Experience - Work with native BoxLang objects immediately
  • 🧪 Testing & Caching - Use aiPopulate() to create objects from JSON for tests or cached responses

💡 Quick Examples

Using a Class:

class Person {
    property name="name" type="string";
    property name="age" type="numeric";
    property name="email" type="string";
}

result = aiChat( "Extract person info: John Doe, 30, john@example.com" )
    .structuredOutput( new Person() );

writeOutput( "Name: #result.getName()#, Age: #result.getAge()#" );

Using a Struct Template:

template = {
    "title": "",
    "summary": "",
    "tags": [],
    "sentiment": ""
};

result = aiChat( "Analyze this article: [long text]" )
    .structuredOutput( template );

writeOutput( "Tags: #result.tags.toList()#" );

Extracting Arrays:

class Task {
    property name="title" type="string";
    property name="priority" type="string";
    property name="dueDate" type="string";
}

tasks = aiChat( "Extract tasks from: Finish report by Friday (high priority), Review code tomorrow" )
    .structuredOutput( [ new Task() ] );

for( task in tasks ) {
    writeOutput( "#task.getTitle()# - Priority: #task.getPriority()#<br>" );
}

Multiple Schemas (Extract Different Types Simultaneously):

result = aiChat( "Extract person and company: John Doe, 30 works at Acme Corp, founded 2020" )
    .structuredOutputs( {
        "person": new Person(),
        "company": new Company()
    } );

writeOutput( "Person: #result.person.getName()#<br>" );
writeOutput( "Company: #result.company.getName()#<br>" );

🔧 Manual Population with aiPopulate()

Convert JSON responses or cached data into typed objects without making AI calls:

// From JSON string
jsonData = '{"name":"John Doe","age":30,"email":"john@example.com"}';
person = aiPopulate( new Person(), jsonData );

// From struct
data = { name: "Jane", age: 25, email: "jane@example.com" };
person = aiPopulate( new Person(), data );

// Populate array
tasksJson = '[{"title":"Task 1","priority":"high"},{"title":"Task 2","priority":"low"}]';
tasks = aiPopulate( [ new Task() ], tasksJson );

Perfect for:

  • 🧪 Testing with mock data
  • 💾 Using cached AI responses
  • 🔄 Converting existing JSON data to typed objects
  • ✅ Validating data structures

✅ Provider Support

All providers support structured output! 🎉 OpenAI offers native structured output with strict validation, while others use JSON mode with schema guidance (which works excellently in practice). 💪

📚 Learn More

🧠 Memory Systems

Build stateful, context-aware AI applications 🎯 with flexible memory systems that maintain conversation history, enable semantic search, and preserve context across interactions. BoxLang AI provides both traditional conversation memory and advanced vector-based memory for semantic understanding. 💡

🤔 Why Use Memory?

  • 💭 Context Retention - AI remembers previous messages and maintains coherent conversations
  • 💬 Stateful Applications - Build chat interfaces that remember user preferences and conversation history
  • 🔍 Semantic Search - Find relevant past conversations using vector embeddings
  • 💾 Flexible Storage - Choose from in-memory, file-based, database, session, or vector storage
  • ⚙️ Automatic Management - Memory handles message limits, summarization, and context windows

📋 Memory Types

Standard Memory 💬 (Conversation History):

| Type | Description | Best For |
|------|-------------|----------|
| Windowed | Keeps the last N messages | Quick chats, cost-conscious apps |
| Summary | Auto-summarizes old messages | Long conversations, context preservation |
| Session | Web session persistence | Multi-page web applications |
| File | File-based storage | Audit trails, long-term storage |
| Cache | CacheBox-backed | Distributed applications |
| JDBC | Database storage | Enterprise apps, multi-user systems |

Vector Memory 🔍 (Semantic Search):

| Type | Description | Best For |
|------|-------------|----------|
| BoxVector | In-memory vectors | Development, testing, small datasets |
| Hybrid | Recent + semantic | Best-of-both-worlds approach |
| Chroma | ChromaDB integration | Python-based infrastructure |
| Postgres | PostgreSQL pgvector | Existing PostgreSQL deployments |
| MySQL | MySQL 9 native vectors | Existing MySQL infrastructure |
| TypeSense | Fast typo-tolerant search | Low-latency search, autocomplete |
| Pinecone | Cloud vector database | Production, scalable semantic search |
| Qdrant | High-performance vectors | Large-scale deployments |
| Weaviate | GraphQL vector database | Complex queries, knowledge graphs |
| Milvus | Enterprise vector DB | Massive datasets, high throughput |

💡 Quick Examples

Windowed Memory (Multi-Tenant):

// Automatic per-user isolation
memory = aiMemory( "windowed",
    key: createUUID(),
    userId: "user123",
    config: { maxMessages: 10 }
)
agent = aiAgent( name: "Assistant", memory: memory )

agent.run( "My name is John" )
agent.run( "What's my name?" )  // "Your name is John"

Summary Memory (Preserves Full Context):

memory = aiMemory( "summary", {
    maxMessages: 30,
    summaryThreshold: 15,
    summaryModel: "gpt-4o-mini"
} )
agent = aiAgent( name: "Support", memory: memory )
// Long conversation - older messages summarized automatically

Vector Memory (Semantic Search + Multi-Tenant):

memory = aiMemory( "chroma",
    key: createUUID(),
    userId: "user123",
    conversationId: "support",
    config: {
        collection: "customer_support",
        embeddingProvider: "openai",
        embeddingModel: "text-embedding-3-small"
    }
)
// Retrieves semantically relevant past conversations
// Automatically filtered by userId/conversationId

Hybrid Memory (Recent + Semantic):

memory = aiMemory( "hybrid", {
    recentLimit: 5,       // Keep last 5 messages
    semanticLimit: 5,     // Add 5 semantic matches
    vectorProvider: "chroma"
} )
// Combines recency with relevance

📚 Learn More


📚 Document Loaders & RAG

BoxLang AI provides 12+ built-in document loaders for ingesting content from files, databases, web sources, and more. These loaders integrate seamlessly with vector memory systems to enable Retrieval-Augmented Generation (RAG) workflows.

🔄 RAG Workflow

graph LR
    LOAD[📄 Load Documents] --> CHUNK[✂️ Chunk Text]
    CHUNK --> EMBED[🧬 Generate Embeddings]
    EMBED --> STORE[💾 Store in Vector Memory]
    STORE --> QUERY[❓ User Query]
    QUERY --> RETRIEVE[🔍 Retrieve Relevant Docs]
    RETRIEVE --> INJECT[💉 Inject into Context]
    INJECT --> AI[🤖 AI Response]

    style LOAD fill:#4A90E2
    style EMBED fill:#BD10E0
    style STORE fill:#50E3C2
    style RETRIEVE fill:#F5A623
    style AI fill:#7ED321

📄 Available Loaders

| Loader | Type | Use Case | Example |
|--------|------|----------|---------|
| 📝 TextLoader | text | Plain text files | .txt, .log |
| 📘 MarkdownLoader | markdown | Markdown files | .md documents |
| 📊 CSVLoader | csv | CSV files | Data files, exports |
| 🗂️ JSONLoader | json | JSON files | Configuration, data |
| 🏷️ XMLLoader | xml | XML files | Config, structured data |
| 📄 PDFLoader | pdf | PDF documents | Reports, documentation |
| 📋 LogLoader | log | Log files | Application logs |
| 🌐 HTTPLoader | http | Web pages | Documentation, articles |
| 📰 FeedLoader | feed | RSS/Atom feeds | News, blogs |
| 💾 SQLLoader | sql | Database queries | Query results |
| 📁 DirectoryLoader | directory | File directories | Batch processing |
| 🕷️ WebCrawlerLoader | webcrawler | Website crawling | Multi-page docs |

✨ Quick Examples

Load a Single Document:

// Load a PDF document
docs = aiDocuments( "/path/to/document.pdf", "pdf" )
println( "#docs.len()# documents loaded" )

// Load with configuration
docs = aiDocuments(
    source = "/path/to/document.pdf",
    type   = "pdf",
    config = {
        sortByPosition: true,
        addMoreFormatting: true,
        startPage: 1,
        endPage: 10
    }
)

Load Multiple Documents:

// Load all markdown files from a directory
docs = aiDocuments(
    source = "/knowledge-base",
    type   = "directory",
    config = {
        recursive: true,
        extensions: ["md", "txt"],
        excludePatterns: ["node_modules", ".git"]
    }
)

Ingest into Vector Memory:

// Create vector memory
vectorMemory = aiMemory( "chroma", {
    collection: "docs",
    embeddingProvider: "openai",
    embeddingModel: "text-embedding-3-small"
} )

// Ingest documents with chunking and embedding
result = aiMemoryIngest(
    memory        = vectorMemory,
    source        = "/knowledge-base",
    type          = "directory",
    loaderConfig  = { recursive: true, extensions: ["md", "txt", "pdf"] },
    ingestOptions = { chunkSize: 1000, overlap: 200 }
)

println( "✅ Loaded #result.documentsIn# docs as #result.chunksOut# chunks" )
println( "💰 Estimated cost: $#result.estimatedCost#" )

RAG with Agent:

// Create agent with vector memory
agent = aiAgent(
    name: "KnowledgeAssistant",
    description: "AI assistant with access to knowledge base",
    memory: vectorMemory
)

// Query automatically retrieves relevant documents
response = agent.run( "What is BoxLang?" )
println( response )

📚 Learn More

🔌 MCP Client

Connect to Model Context Protocol (MCP) servers 🎯 and use their tools, prompts, and resources in your AI applications. BoxLang AI's MCP client provides seamless integration with the growing MCP ecosystem, allowing your agents to access databases, APIs, filesystems, and more through standardized interfaces. 💡

🤔 Why Use MCP Client?

  • 🌍 Ecosystem Access - Use any MCP server (filesystems, databases, APIs, tools)
  • 🔒 Secure Integration - Standardized permissions and authentication
  • 🎯 Tool Discovery - Automatically discover and use server capabilities
  • 🔄 Dynamic Resources - Access changing data sources (files, DB records, etc.)
  • 🤖 Agent Integration - Seamlessly add MCP tools to your AI agents

💡 Quick Examples

Connect to MCP Server:

// Connect to filesystem MCP server
mcpClient = aiMcpClient( "filesystem" )
    .setCommand( "npx" )
    .setArgs( [ "-y", "@modelcontextprotocol/server-filesystem", "/path/to/docs" ] )
    .connect()

// List available tools
tools = mcpClient.listTools()
println( tools ) // read_file, write_file, list_directory, etc.

Use MCP Tools in Agent:

// Connect to multiple MCP servers
filesystemMcp = aiMcpClient( "filesystem" )
    .setCommand( "npx" )
    .setArgs( [ "-y", "@modelcontextprotocol/server-filesystem", "/data" ] )
    .connect()

databaseMcp = aiMcpClient( "postgres" )
    .setCommand( "npx" )
    .setArgs( [ "-y", "@modelcontextprotocol/server-postgres", "postgresql://..." ] )
    .connect()

// Agent can use all MCP tools
agent = aiAgent()
    .setName( "Data Assistant" )
    .addMcpClient( filesystemMcp )
    .addMcpClient( databaseMcp )

// Agent automatically uses MCP tools
agent.run( "Read config.json and update the database with its contents" )

Access MCP Resources:

// List available resources
resources = mcpClient.listResources()

// Read resource content
content = mcpClient.readResource( "file:///docs/readme.md" )
println( content )

// Use prompts from server
prompts = mcpClient.listPrompts()
prompt = mcpClient.getPrompt( "code-review", { language: "BoxLang" } )

📚 Learn More

🖥️ MCP Server

Expose your BoxLang functions and data as MCP tools 🎯 for use by AI agents and applications. Build custom MCP servers that provide tools, prompts, and resources through the standardized Model Context Protocol, making your functionality accessible to any MCP client. 💡

🤔 Why Build MCP Servers?

  • 🔌 Universal Access - Any MCP client can use your tools
  • 🎯 Standardized Interface - No custom integration code needed
  • 🛠️ Expose Functionality - Make BoxLang functions available to AI agents
  • 📊 Share Resources - Provide data sources, templates, and prompts
  • 🏢 Enterprise Integration - Connect AI to internal systems safely

💡 Quick Examples

Simple MCP Server:

// Create server with tools
server = aiMcpServer( "my-tools" )
    .setDescription( "Custom BoxLang tools" )

// Register tool
server.registerTool(
    name: "calculate_tax",
    description: "Calculate tax for a given amount",
    function: ( amount, rate = 0.08 ) => {
        return amount * rate;
    },
    parameters: {
        amount: { type: "number", description: "Amount to calculate tax on" },
        rate: { type: "number", description: "Tax rate as decimal" }
    }
)

// Start server
server.start() // Listens on stdio by default

Advanced Server with Resources:

// Create server with tools, prompts, and resources
server = aiMcpServer( "enterprise-api" )
    .setDescription( "Internal enterprise tools" )

// Register multiple tools
server.registerTool( "query_orders", queryOrdersFunction, orderSchema )
server.registerTool( "create_invoice", createInvoiceFunction, invoiceSchema )
server.registerTool( "send_notification", notifyFunction, notifySchema )

// Provide templates as prompts
server.registerPrompt(
    name: "customer-email",
    description: "Generate customer email",
    template: ( orderNumber ) => {
        return "Write a professional email about order ##orderNumber#";
    }
)

// Expose data resources
server.registerResource(
    uri: "config://database",
    description: "Database configuration",
    getData: () => {
        return fileRead( "/config/database.json" );
    }
)

// Start with custom transport
server.start( transport: "http", port: 3000 )

Integration with BoxLang Web App:

// In your BoxLang app's Application.bx
component {
    function onApplicationStart() {
        // Start MCP server on app startup
        application.mcpServer = aiMcpServer( "myapp-api" )
            .registerTool( "search", variables.searchFunction )
            .registerTool( "create", variables.createFunction )
            .start( background: true )
    }

    function onApplicationEnd() {
        application.mcpServer.stop()
    }
}

📚 Learn More


⚙️ Settings

Here are the settings you can place in your boxlang.json file:

{
	"modules" : {
		"bxai" : {
			"settings": {
				// The default provider to use: openai, claude, deepseek, gemini, grok, mistral, ollama, openrouter, perplexity
				"provider" : "openai",
				// The default API Key for the provider
				"apiKey" : "",
				// The default request params to use when calling a provider
				// Ex: { temperature: 0.5, max_tokens: 100, model: "gpt-3.5-turbo" }
				"defaultParams" : {
					// model: "gpt-3.5-turbo"
				},
				// The default timeout of the ai requests
				"timeout" : 30,
				// If true, log request to the ai.log
				"logRequest" : false,
				// If true, log request to the console
				"logRequestToConsole" : false,
				// If true, log the response to the ai.log
				"logResponse" : false,
				// If true, log the response to the console
				"logResponseToConsole" : false,
				// The default return format of the AI response: single, all, json, xml, raw
				"returnFormat" : "single"
			}
		}
	}
}
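
These settings are global defaults; any of them can be overridden on a single call through the params and options arguments. A quick sketch (the param names follow the provider's API, as in the defaultParams example above):

// Override model and temperature for just this request
answer = aiChat( "Summarize BoxLang in one line", { model: "gpt-4o-mini", temperature: 0.2 } )

// Route a single request to a different provider
answer = aiChat( "Hello!", {}, { provider: "ollama" } )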

🦙 Ollama Configuration

Ollama allows you to run AI models locally on your machine. It's perfect for privacy, offline use, and cost savings. 💰

🔧 Setup Ollama

  1. 📥 Install: Download from https://ollama.ai
  2. ⬇️ Pull a model: ollama pull llama3.2 (or any supported model)
  3. ▶️ Start service: Ollama runs on http://localhost:11434 by default

📝 Configuration

{
	"modules": {
		"bxai": {
			"settings": {
				"provider": "ollama",
				"apiKey": "",  // Optional: for remote/secured Ollama instances
				"chatURL": "http://localhost:11434",  // Default local instance
				"defaultParams": {
					"model": "llama3.2"  // Any Ollama model you have pulled
				}
			}
		}
	}
}

🌟 Popular Ollama Models

  • 🦙 llama3.2 - Latest Llama model (recommended)
  • llama3.2:1b - Smaller, faster model
  • 💻 codellama - Code-focused model
  • 🎯 mistral - High-quality general model
  • 🔷 phi3 - Microsoft's efficient model
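
Once a model is pulled, chatting against the local instance looks like any other provider call. A small sketch, assuming the configuration above or a per-request provider override:

// Chat against the local Ollama instance
response = aiChat( "Why is running AI locally useful?", { model: "llama3.2" }, { provider: "ollama" } )
println( response )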

🛠️ Global Functions (BIFs)

| Function | Purpose | Parameters | Return Type | Async Support |
|----------|---------|------------|-------------|---------------|
| aiAgent() | Create autonomous AI agent | name, description, instructions, model, memory, tools, params, options | AiAgent Object | |
| aiChat() | Chat with AI provider | messages, params={}, options={} | String/Array/Struct | |
| aiChatAsync() | Async chat with AI provider | messages, params={}, options={} | BoxLang Future | ✅ |
| aiChatRequest() | Compose raw chat request | messages, params, options, headers | AiRequest Object | N/A |
| aiChatStream() | Stream chat responses from AI provider | messages, callback, params={}, options={} | void | N/A |
| aiChunk() | Split text into chunks | text, options={} | Array of Strings | N/A |
| aiDocuments() | Create fluent document loader | source, config={} | IDocumentLoader Object | N/A |
| aiEmbed() | Generate embeddings | input, params={}, options={} | Array/Struct | N/A |
| aiMemory() | Create memory instance | type, config={} | IAiMemory Object | N/A |
| aiMessage() | Build message object | message | ChatMessage Object | N/A |
| aiModel() | Create AI model wrapper | provider, apiKey | AiModel Object | N/A |
| aiPopulate() | Populate class/struct from JSON | target, data | Populated Object | N/A |
| aiService() | Create AI service provider | provider, apiKey | IService Object | N/A |
| aiTokens() | Estimate token count | text, options={} | Numeric | N/A |
| aiTool() | Create tool for real-time processing | name, description, callable | Tool Object | N/A |
| aiTransform() | Create data transformer | transformer | Transformer Runnable | N/A |
| MCP() | Create MCP client for Model Context Protocol servers | baseURL | MCPClient Object | N/A |
| mcpServer() | Get or create MCP server for exposing tools | name="default", description, version, cors | MCPServer Object | N/A |

Note on Return Formats: When using pipelines (runnable chains), the default return format is raw (full API response), giving you access to all metadata. Use .singleMessage(), .allMessages(), or .withFormat() to extract specific data. The aiChat() BIF defaults to single format (content string) for convenience. See the Pipeline Return Formats documentation for details.
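
As a rough sketch of the difference (treat this as illustrative; the method names come from the note above):

// BIF default: a plain content string
answer = aiChat( "Hello" )

// Pipeline default: the raw provider response with metadata
raw = aiModel( "gpt-4o" ).run( "Hello" )

// Narrow a pipeline back down to just the content string
answer = aiModel( "gpt-4o" ).singleMessage().run( "Hello" )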

💡 Quick Usage Examples

// Simple chat
result = aiChat( "Hello, world!" )

// Create an autonomous AI agent
agent = aiAgent(
    name: "MyAgent",
    description: "A helpful assistant",
    instructions: "Be concise and friendly"
)
response = agent.run( "What is BoxLang?" )

// Async chat with callback
future = aiChatAsync( "Hello!" ).then( r -> println(r) )

// Stream chat responses
aiChatStream( "Tell me a story", ( chunk ) => {
    print( chunk.choices?.first()?.delta?.content ?: "" )
} )

// Build complex request
request = aiChatRequest( messages, { model: "gpt-4" }, { provider: "openai" } )

// Fluent message building
msg = aiMessage().system( "Be helpful" ).user( "Hello" )

// AI Model wrapper
model = aiModel( "openai" ).bindTools( [tool1, tool2] )

// Service with custom settings
service = aiService( "openai", "my-key" ).defaults( { temperature: 0.7 } )

// Tool for function calling
tool = aiTool( "weather", "Get weather data", location => getWeather(location) )

// Load documents from files or directories
docs = aiDocuments( "/path/to/document.txt" )
docs = aiDocuments( "/path/to/folder", "directory", { recursive: true } )

// Create a loader for advanced configuration
loader = aiDocumentLoader( "/docs", "markdown" )
    .splitByHeaders( 2 )
    .removeCodeBlocks()
docs = loader.load()

// Ingest documents into memory with detailed reporting
result = aiMemoryIngest(
    memory = myVectorMemory,
    source = "/knowledge-base",
    type   = "directory",
    loaderConfig  = { recursive: true, extensions: ["md", "txt"] },
    ingestOptions = { chunkSize: 500, overlap: 50 }
)
println( "Ingested #result.documentsIn# docs as #result.chunksOut# chunks" )

// Multi-memory fan-out
result = aiMemoryIngest(
    memory = [ chromaMemory, pgVectorMemory ],
    source = "/docs",
    type   = "markdown"
)

// MCP client for Model Context Protocol servers
client = MCP( "http://localhost:3000" )
    .withTimeout( 5000 )
    .withBearerToken( "token" )
result = client.send( "searchDocs", { query: "syntax" } )

// MCP server for exposing tools to AI clients
mcpServer( "myApp" )
    .registerTool( aiTool( "search", "Search docs", ( query ) => searchDocs( query ) ) )
    .registerResource( uri: "docs://readme", name: "README", handler: () => fileRead( "/readme.md" ) )

This module exposes the following BoxLang global functions (BIFs) for you to interact with the AI providers:

💬 Chat Functions

  • aiChat( messages, struct params={}, struct options={} ) : This function will allow you to chat with the AI provider and get responses back. This is the easiest way to interact with the AI providers.
  • aiChatAsync( messages, struct params={}, struct options={} ) : This function will allow you to chat with the AI provider and get a BoxLang future back so you can build fluent asynchronous code pipelines.
  • aiChatStream( messages, callback, struct params={}, struct options={} ) : This function will allow you to stream responses from the AI provider in real-time. A callback function is invoked for each chunk of data received.
  • aiChatRequest( messages, struct params, struct options, struct headers ) - Composes a raw chat request that you can later send to an AI service. It returns a ChatRequest object.

🔢 Embedding Functions

  • aiEmbed( input, struct params={}, struct options={} ) : Generate embeddings for text input. Input can be a single string or an array of strings. Returns numerical vectors that capture semantic meaning, useful for semantic search, clustering, and recommendations.
  • aiDocuments( source, type="", struct config={} ) : Load documents from various sources (files, directories, web, databases) using built-in loaders. Supports configuration options for each loader type.
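
For instance, generating embeddings is a one-liner (a minimal sketch; the embedding provider and model come from your configuration):

// Embed a single string
vector = aiEmbed( "BoxLang is dynamic, modular and productive" )

// Embed a batch of strings in one call
vectors = aiEmbed( [ "first snippet", "second snippet" ] )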

✂️ Text Processing Functions

  • aiChunk( text, struct options={} ) : Split text into chunks for processing within AI token limits. Supports multiple chunking strategies (recursive, characters, words, sentences, paragraphs) with configurable chunk size and overlap.
  • aiTokens( text, struct options={} ) : Estimate token count for text using character-based or word-based methods. Useful for planning API usage and managing token budgets.
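
A quick sketch of both functions together (the option names here are illustrative, not exhaustive):

// Load a long document and split it into overlapping chunks
longText = fileRead( "/path/to/article.txt" )
chunks   = aiChunk( longText, { strategy: "sentences", chunkSize: 500, overlap: 50 } )

// Estimate how many tokens the original text would consume
estimate = aiTokens( longText )
println( "#chunks.len()# chunks, ~#estimate# tokens" )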

🤖 Agent Functions

  • aiAgent( name, description, instructions, model, memory, tools, params, options ) - Creates an autonomous AI agent that can maintain conversation memory, use tools, and execute tasks. Agents simplify complex AI workflows by managing state and context automatically.
  • aiMemory( type, config ) - Creates a memory instance for agents and pipelines. Available types:
    • windowed - Windowed memory keeping the last N messages (the default; configurable via maxMessages)
    • summary - Intelligently compresses old messages while preserving context
    • session - Web session-persisted memory
    • file - File-based persistent storage
    • cache - CacheBox-backed storage
    • jdbc - Database-backed storage
    • chroma - Vector memory with semantic search (ChromaDB)
    • postgres - PostgreSQL pgvector support
    • mysql - MySQL 9 native vector support
    • typesense - TypeSense fast typo-tolerant search
    • pinecone, qdrant, weaviate, milvus, boxvector - Additional vector stores (see the Memory Types tables above)
    • hybrid - Combines recent + semantic memory

🧰 Helper Functions

  • aiMessage( message ) - Allows you to build a message object that you can then send to the aiChat() or aiChatRequest() functions. It also lets you fluently build up messages.
  • aiModel( provider, apiKey ) - Creates an AI model wrapper that can be configured with tools and used in agents or pipelines. Provides a fluent API for model configuration.
  • aiService( provider, apiKey ) - Creates a reference to an AI Service provider that you can then use to interact with the AI service. This is useful if you want to create a service object and then use it multiple times. You can pass in optional provider and apiKey to override the global settings.
  • aiTool( name, description, callable ) - Creates a tool object that you can add to a chat request for real-time system processing. This is useful when you want a tool that can be reused across multiple chat requests against localized resources. You can then pass the tool to the aiChat() or aiChatRequest() functions.
  • MCP( baseURL ) - Creates a fluent client for consuming Model Context Protocol (MCP) servers. MCP provides standardized access to external tools, resources, and prompts that AI models can use.
  • mcpServer( name, description, version, cors ) - Gets or creates an MCP server instance for registering tools, resources, and prompts that can be exposed to AI clients. Servers are singletons by name, stored globally for access across requests. The description and version parameters allow you to provide additional metadata for the server instance. The cors parameter sets the allowed CORS origin (empty string by default for secure-by-default behavior).

📢 Events

The BoxLang AI module emits several events throughout the AI processing lifecycle that allow you to intercept, modify, or extend functionality. These events are useful for logging, debugging, custom providers, and response processing.

Event Reference Table

| Event | When Fired | Data Emitted | Use Cases |
|-------|------------|--------------|-----------|
| afterAIAgentRun | After agent completes execution | agent, response | Agent monitoring, result tracking |
| afterAIEmbed | After generating embeddings | embeddingRequest, service, result | Result processing, caching |
| afterAIModelInvoke | After model invocation completes | model, aiRequest, results | Performance tracking, validation |
| afterAIPipelineRun | After pipeline execution completes | sequence, result, executionTime | Pipeline monitoring, metrics |
| afterAIToolExecute | After tool execution completes | tool, results, executionTime | Tool performance tracking |
| beforeAIAgentRun | Before agent starts execution | agent, input, messages, params | Agent validation, preprocessing |
| beforeAIEmbed | Before generating embeddings | embeddingRequest, service | Request validation, preprocessing |
| beforeAIModelInvoke | Before model invocation starts | model, aiRequest | Request validation, cost estimation |
| beforeAIPipelineRun | Before pipeline execution starts | sequence, stepCount, steps, input | Pipeline validation, tracking |
| beforeAIToolExecute | Before tool execution starts | tool, name, arguments | Permission checks, validation |
| onAIAgentCreate | When agent is created | agent | Agent registration, configuration |
| onAIEmbedRequest | Before sending embedding request | dataPacket, embeddingRequest, provider | Request logging, modification |
| onAIEmbedResponse | After receiving embedding response | embeddingRequest, response, provider | Response processing, caching |
| onAIError | When AI operation error occurs | error, errorMessage, provider, operation, canRetry | Error handling, retry logic, alerts |
| onAiMemoryCreate | When memory instance is created | memory, type, config | Memory configuration, tracking |
| onAIMessageCreate | When message is created | message | Message validation, formatting |
| onAIModelCreate | When model wrapper is created | model, service | Model configuration, tracking |
| onAIProviderCreate | After provider is created | provider | Provider initialization, configuration |
| onAIProviderRequest | When provider is requested | provider, apiKey, service | Custom provider registration |
| onAIRateLimitHit | When rate limit (429) is encountered | provider, statusCode, retryAfter | Rate limit handling, provider switching |
| onAIRequest | Before sending HTTP request | dataPacket, aiRequest, provider | Request logging, modification, authentication |
| onAIRequestCreate | When request object is created | aiRequest | Request validation, modification |
| onAIResponse | After receiving HTTP response | aiRequest, response, rawResponse, provider | Response processing, logging, caching |
| onAITokenCount | When token usage data is available | provider, model, promptTokens, completionTokens, totalTokens | Cost tracking, budget enforcement |
| onAIToolCreate | When tool is created | tool, name, description | Tool registration, validation |
| onAITransformerCreate | When transformer is created | transform | Transform configuration, tracking |

Event Registration

Leverage the boxRegisterInterceptor() BIF, or if you are developing a module, use the interceptors structure.

boxRegisterInterceptor( "onAIRequest", myRequestHandler );
boxRegisterInterceptor( "onAIResponse", myResponseHandler );

🌐 GitHub Repository and Reporting Issues

Visit the GitHub repository for release notes. You can also file a bug report or improvement suggestion via Jira.


🧪 Testing

This module includes tests for all AI providers. To run the tests:

./gradlew test

Ollama Testing

For Ollama provider tests, you need to start the test Ollama service first:

# Start the Ollama test service
docker-compose up -d ollama-test

# Wait for it to be ready (this may take a few minutes for the first run)
# The service will automatically pull the qwen2.5:0.5b model

# Run the tests
./gradlew test --tests "ortus.boxlang.ai.providers.OllamaTest"

# Clean up when done
docker-compose down -v

You can also use the provided test script:

./test-ollama.sh

This will start the service, verify it's working, and run a basic test.

Note: The first time you run this, it will download the qwen2.5:0.5b model (~500MB), so it may take several minutes.

💖 Ortus Sponsors

BoxLang is a professional open-source project and it is completely funded by the community and Ortus Solutions, Corp. Ortus Patrons get many benefits, like a CFCasts account, a FORGEBOX Pro account, and so much more. If you are interested in becoming a sponsor, please visit our patronage page: https://patreon.com/ortussolutions

THE DAILY BREAD

"I am the way, and the truth, and the life; no one comes to the Father, but by me (JESUS)" Jn 14:1-12
