- 100% Offline - Works without internet after initial setup
- Free Forever - No subscriptions, no API keys, no hidden costs
- No Compilation - Powered by Mozilla llamafile (just download and run)
- Private & Secure - Your data never leaves your device
- Android Native - Optimized for mobile with proot isolation
- Multiple Models - Choose from tiny (270MB) to powerful (2GB+)
- Web Dashboard - Browser-based UI for easy management
- REST API - Full control via HTTP endpoints
- OpenAI Compatible - Drop-in replacement for OpenAI API
```bash
git clone https://github.com/mithun50/PocketAi.git
cd PocketAi
./setup.sh

# Activate environment (or restart terminal)
source ~/.pocketai_env

# Install a model (Qwen3 recommended for 2025)
pai install qwen3

# Start chatting!
pai chat
```

| Model | Size | RAM | Quality | Best For |
|---|---|---|---|---|
| qwen3 | 400MB | 512MB | ⭐⭐⭐ | Best for low RAM |
| llama3.2 | 700MB | 1GB | ⭐⭐⭐⭐ | Best balance |
| llama3.2-3b | 2.0GB | 2GB | ⭐⭐⭐⭐⭐ | Best quality |
| Model | Size | RAM | Quality | Best For |
|---|---|---|---|---|
| smollm2 | 270MB | 400MB | ⭐⭐ | Ultra-low RAM |
| qwen2 | 400MB | 512MB | ⭐⭐⭐ | Low RAM |
| qwen2-1b | 1.0GB | 1.2GB | ⭐⭐⭐⭐ | Daily use |
| gemma2b | 1.4GB | 2GB | ⭐⭐⭐⭐ | Google quality |
| qwen2-3b | 2.0GB | 3GB | ⭐⭐⭐⭐⭐ | Best quality |
| phi2 | 1.6GB | 3GB | ⭐⭐⭐⭐ | Coding tasks |
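As a rough guide to the tables above, here is a toy helper (not part of PocketAI) that picks a model fitting in a given amount of free RAM, using the RAM column values ordered roughly by capability:

```python
# Toy helper (not part of PocketAI): pick a model from the tables above
# that fits in the RAM you have free. Numbers mirror the "RAM" column,
# in MB, ordered roughly from least to most capable.
MODELS = [
    ("smollm2", 400),
    ("qwen2", 512),
    ("qwen3", 512),
    ("llama3.2", 1024),
    ("qwen2-1b", 1200),
    ("gemma2b", 2048),
    ("llama3.2-3b", 2048),
    ("phi2", 3072),
    ("qwen2-3b", 3072),
]


def pick_model(free_ram_mb: int) -> str:
    """Return the most capable model whose RAM requirement fits,
    falling back to the smallest model if nothing fits."""
    fitting = [name for name, ram in MODELS if ram <= free_ram_mb]
    return fitting[-1] if fitting else "smollm2"
```

For example, `pick_model(512)` chooses `qwen3`, while a device with 4GB free can run `qwen2-3b`.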
```bash
pai chat                  # Interactive chat
pai ask "What is AI?"     # Quick question
pai complete "Once..."    # Text completion
```

```bash
pai models                # List available models
pai models installed      # List installed models
pai install <model>       # Download a model
pai use <model>           # Switch active model
pai remove <model>        # Delete a model
```

```bash
pai server start          # Start API server (port 8080)
pai server stop           # Stop the server
pai server status         # Show server info
```

Use with any OpenAI-compatible client:
```bash
curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"messages": [{"role": "user", "content": "Hello"}]}'
```

```bash
pai api start             # Start REST API (port 8081)
pai api web               # Start API + Web Dashboard
pai api stop              # Stop API server
pai api status            # Show API endpoints
```

Open http://localhost:8081/ in your browser for the web dashboard.
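The same chat request as the curl example above, sketched with only the Python standard library (assumes the llamafile server from `pai server start` is running on port 8080; response parsing follows the standard OpenAI chat-completions schema):

```python
import json
import urllib.request

# Build the same chat request the curl example sends.
def build_request(message: str) -> urllib.request.Request:
    payload = {"messages": [{"role": "user", "content": message}]}
    return urllib.request.Request(
        "http://localhost:8080/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )


def chat(message: str) -> str:
    """Send one message and return the assistant's reply text."""
    with urllib.request.urlopen(build_request(message)) as resp:
        body = json.load(resp)
    # Standard OpenAI-style response shape.
    return body["choices"][0]["message"]["content"]


# print(chat("Hello"))  # requires the server to be running
```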
API Endpoints:
| Method | Endpoint | Description |
|---|---|---|
| GET | /api/health | Health check |
| GET | /api/status | System status |
| GET | /api/models | Available models |
| GET | /api/models/installed | Installed models |
| POST | /api/models/install | Install model |
| POST | /api/models/use | Switch model |
| POST | /api/chat | Send message |
| GET | /api/config | Get config |
| POST | /api/config | Set config |
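A minimal sketch of driving the REST API from Python. The endpoint paths come from the table above; the POST body field names are assumptions, since the payload schema isn't documented here (check `data/api_server.py` for the exact fields):

```python
import json
import urllib.request

BASE = "http://localhost:8081"  # default port for `pai api start` / `pai api web`


def build_call(method, path, payload=None):
    """Build a request for one of the endpoints in the table above.

    The JSON field names for POST bodies are assumptions -- check
    data/api_server.py for the exact schema.
    """
    data = json.dumps(payload).encode("utf-8") if payload is not None else None
    headers = {"Content-Type": "application/json"} if data else {}
    return urllib.request.Request(BASE + path, data=data, headers=headers, method=method)


# Examples (sending them requires the API server to be running):
health = build_call("GET", "/api/health")
chat = build_call("POST", "/api/chat", {"message": "Hello"})  # field name assumed
# urllib.request.urlopen(health).read()
```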
```bash
pai config                # Show current config
pai config set key val    # Change settings
pai config reset          # Reset to defaults
```

| Option | Default | Description |
|---|---|---|
| threads | 4 | CPU threads to use |
| ctx_size | 2048 | Context window size |
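For example (illustrative values; the valid keys are the options listed above):

```bash
pai config set threads 8      # more CPU threads for a faster device
pai config set ctx_size 4096  # larger context window (uses more RAM)
```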
```bash
pai status                # System information
pai doctor                # Diagnose issues
pai help                  # Show all commands
pai version               # Version info
```

```
pocketai/
├── bin/
│   └── pai                  # CLI entry point
├── core/
│   └── engine.sh            # Core engine (inference, models, API)
├── data/
│   ├── config               # User configuration
│   ├── llamafile            # LLM runtime engine
│   └── api_server.py        # REST API server
├── models/                  # Downloaded GGUF models
├── web/
│   └── index.html           # Web dashboard
├── docs/
│   ├── COMMANDS.md          # Command reference
│   ├── MODELS.md            # Model guide
│   └── TROUBLESHOOTING.md   # Problem solving
├── setup.sh                 # Installer
└── README.md
```
```
┌─────────────────────────────────────────────────────────────┐
│                        PocketAI                             │
├─────────────────────────────────────────────────────────────┤
│                                                             │
│  ┌─────────┐    ┌──────────┐    ┌─────────────────────┐     │
│  │   CLI   │───►│  Engine  │───►│   proot container   │     │
│  │  (pai)  │    │          │    │   (Alpine Linux)    │     │
│  └─────────┘    └──────────┘    └──────────┬──────────┘     │
│                                            │                │
│  ┌─────────┐    ┌──────────┐               ▼                │
│  │   Web   │───►│ REST API │          ┌──────────┐          │
│  │Dashboard│    │ (Python) │          │llamafile │          │
│  └─────────┘    └──────────┘          └────┬─────┘          │
│                                            │                │
│  ┌─────────┐                               ▼                │
│  │ OpenAI  │◄──────────────────── GGUF Model               │
│  │ Clients │                                                │
│  └─────────┘                                                │
│                                                             │
└─────────────────────────────────────────────────────────────┘
```
Components:
- pai CLI - User-friendly bash interface
- engine.sh - Core logic (model management, inference, API)
- api_server.py - REST API + Web dashboard server
- llamafile - Mozilla's portable LLM runtime
- proot - Lightweight Linux container for isolation
- GGUF models - Quantized models optimized for mobile
- Device: Android phone/tablet
- App: Termux from F-Droid
- Storage: 1GB+ free (varies by model)
- RAM: 512MB+ (more = better models)
```bash
pai doctor                # Diagnose all issues
```

| Issue | Solution |
|---|---|
| `pai: command not found` | Run `source ~/.pocketai_env` |
| No model active | Run `pai install qwen3` |
| Slow responses | Use a smaller model: `pai use smollm2` |
| Out of memory | Close other apps, use a smaller model |
| API offline | Run `pai api web`, not `pai api start` |
See TROUBLESHOOTING.md for more.
Contributions are welcome! Please feel free to submit a Pull Request.
- Fork the repository
- Create your feature branch (`git checkout -b feature/amazing`)
- Commit your changes (`git commit -m 'Add amazing feature'`)
- Push to the branch (`git push origin feature/amazing`)
- Open a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.
- Mozilla llamafile - Portable LLM runtime
- Termux - Android terminal emulator
- proot-distro - Linux containers for Termux
- Model providers: Qwen, Meta (Llama), HuggingFace, Google, Microsoft
- Author: Mithun
- GitHub: @mithun50
- Issues: GitHub Issues
Star this repo if you find it useful!
Made with love for the Android AI community