Davidqian123
:octocat: Exploring AI
  • Nexa AI Inc
  • Bay Area, United States

Llama and other large language models on iOS and macOS, offline, using the GGML library.

C 1,917 156 Updated Sep 19, 2025

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

C++ 76,931 8,303 Updated May 27, 2025

Suno AI's Bark model in C/C++ for fast text-to-speech generation

C++ 848 80 Updated Nov 16, 2024

Nexa AI PDF Chatbot using Nexa SDK

Python 2 2 Updated Oct 27, 2024

🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes.

Python 1 Updated Oct 14, 2024

🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes.

Python 320 61 Updated Sep 25, 2025

An AI-powered file management tool that ensures privacy by organizing local texts, images. Using Llama3.2 3B and Llava v1.6 models with the Nexa SDK, it intuitively scans, restructures, and organiz…

Python 2,680 253 Updated Oct 21, 2024

C++ implementation of Qwen-LM

C++ 609 60 Updated Dec 6, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 19,844 1,662 Updated Nov 26, 2025

An interactive AI character with voice input, voice output, and profile image generation—all running locally with Nexa SDK and powered by Llama3 Uncensored Model. Enjoy a private and immersive expe…

Python 11 1 Updated Oct 7, 2024

Android ChatBot with Octopus v2 - Function Calling Demo

Java 1 Updated Jul 30, 2024

Awesome LLMs on Device: A Comprehensive Survey

1,270 111 Updated Jan 12, 2025

Low-bit LLM inference on CPU/NPU with lookup table

C++ 896 74 Updated Jun 5, 2025

Android ChatBot with Octopus v2 - Function Calling Demo

Java 14 5 Updated Jul 30, 2024

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 156,843 13,796 Updated Nov 30, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 153,201 31,278 Updated Nov 29, 2025

LLM inference in C/C++

C++ 90,638 13,889 Updated Dec 1, 2025

AI for all: Build the large graph of the language models

Python 277 21 Updated Jun 3, 2024

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 40,744 3,166 Updated Nov 28, 2025

Universal LLM Deployment Engine with ML Compilation

Python 21,676 1,867 Updated Nov 26, 2025

Build dynamic, secure APIs with FastAPI: Features DB integration, real-time WebSocket, streaming, and efficient request handling with middleware, powered by Starlette and Pydantic.

Python 35 6 Updated Feb 25, 2024

Python Concurrency Web Crawler, and Redis

Python 4 Updated Feb 20, 2024

An audio streaming mobile application

Kotlin 1 Updated Sep 8, 2023

A Personalized Twitch Resources Recommendation Engine

Java 1 Updated Sep 8, 2023

A Cloud and React based App Purchase Platform

Go 1 Updated Sep 8, 2023

NFT Price Visualization

JavaScript 1 Updated Sep 8, 2023

Jupyter Notebook 1 Updated May 26, 2023

Java client library for Google Maps API Web Services

Java 1,769 962 Updated Nov 24, 2025