Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)
Survey: https://arxiv.org/pdf/2507.20198
A High-Efficiency System of Large Language Model Based Search Agents
Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)
Official PyTorch implementation of the paper "Dataset Distillation via the Wasserstein Metric" (ICCV 2025).
TinyML and Efficient Deep Learning Computing | MIT 6.S965/6.5940
Dynamic Attention Mask (DAM) generates adaptive sparse attention masks per layer and head for Transformer models, enabling long-context inference with lower compute and memory overhead, without fine-tuning.
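A minimal sketch of the general idea of per-head adaptive sparse attention masking, assuming a score-gap rule with a hypothetical threshold `tau`; this is not the DAM repository's actual implementation.

```python
# Sketch: keep only attention scores close to each row's maximum, per head.
# `tau` is an illustrative hyperparameter, not from the DAM repo.
import torch
import torch.nn.functional as F

def adaptive_sparse_attention(q, k, v, tau=4.0):
    """q, k, v: (batch, heads, seq, dim). Masks low-scoring entries per head/row."""
    d = q.size(-1)
    scores = q @ k.transpose(-2, -1) / d**0.5           # (B, H, L, L)
    row_max = scores.amax(dim=-1, keepdim=True)
    keep = scores >= (row_max - tau)                     # adaptive mask, differs per head and row
    scores = scores.masked_fill(~keep, float("-inf"))
    attn = F.softmax(scores, dim=-1)
    return attn @ v

q = k = v = torch.randn(1, 8, 128, 64)
out = adaptive_sparse_attention(q, k, v)
print(out.shape)  # torch.Size([1, 8, 128, 64])
```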
Official PyTorch implementation of the paper "Towards Adversarially Robust Dataset Distillation by Curvature Regularization" (AAAI 2025).
A deep learning framework that implements early-exit strategies in Convolutional Neural Networks (CNNs) using Deep Q-Learning (DQN). The agent dynamically selects the optimal exit point per input, improving computational efficiency for image classification on CIFAR-10.
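For context, a minimal early-exit sketch, assuming a simple confidence-threshold policy in place of the DQN agent the project uses; layer sizes, class names, and the `threshold` value are illustrative.

```python
# Sketch of an early-exit CNN: a cheap intermediate classifier can stop inference
# early when it is confident. The repo learns this decision with a DQN instead.
import torch
import torch.nn as nn
import torch.nn.functional as F

class EarlyExitCNN(nn.Module):
    def __init__(self, num_classes=10, threshold=0.9):
        super().__init__()
        self.block1 = nn.Sequential(nn.Conv2d(3, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2))
        self.exit1 = nn.Linear(32 * 16 * 16, num_classes)   # early classifier
        self.block2 = nn.Sequential(nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2))
        self.exit2 = nn.Linear(64 * 8 * 8, num_classes)      # final classifier
        self.threshold = threshold

    def forward(self, x):
        h = self.block1(x)
        logits1 = self.exit1(h.flatten(1))
        # Exit early if the intermediate prediction is confident enough.
        if F.softmax(logits1, dim=-1).max() > self.threshold:
            return logits1, "exit1"
        h = self.block2(h)
        return self.exit2(h.flatten(1)), "exit2"

model = EarlyExitCNN()
logits, exit_taken = model(torch.randn(1, 3, 32, 32))   # CIFAR-10-sized input
print(exit_taken, logits.shape)
```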
Fast, concise, LLM-first Generative UI language
A Transformer (GPT) implemented from scratch in C++, with complete mathematical derivations and optimized tensor operations; runs on modest hardware.
Ground-Truthing AI Energy Consumption: Validating CodeCarbon Against External Measurements
Symbolic Transformers: 2.2 MB models for logical reasoning. Achieves 47% accuracy with 566K parameters, roughly 220× smaller than GPT-2, suggesting that data quality can outweigh model size for symbolic AI. 🔬 Novel base-625 symbolic encoding | 🚀 Edge-deployable | 📊 Open research
🔬 Curiosity-Driven Quantized Mixture of Experts
Welcome to my digital garden where I cultivate thoughts on Machine Learning, Generative AI, Trustworthy AI, AI Systems, Efficient AI, and Paper Reviews.
An open and practical guide to Edge Language
This repo explains quantization: the process of reducing the precision of a model's parameters and/or activations (e.g., from 32-bit floating point to 8-bit integers) to make neural networks smaller, faster, and more energy-efficient with minimal accuracy loss.
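A minimal sketch of the idea, assuming symmetric per-tensor int8 quantization of a single weight matrix; real toolchains add per-channel scales, calibration, and quantized kernels.

```python
# Sketch: symmetric post-training int8 quantization of a float32 weight tensor.
import numpy as np

def quantize_int8(w):
    """Map float32 weights to int8 with a single symmetric scale."""
    scale = np.abs(w).max() / 127.0                      # largest magnitude maps to 127
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    return q.astype(np.float32) * scale

w = np.random.randn(256, 256).astype(np.float32)
q, scale = quantize_int8(w)
w_hat = dequantize(q, scale)
print("max abs error:", np.abs(w - w_hat).max())         # bounded by scale / 2
print("bytes: fp32", w.nbytes, "-> int8", q.nbytes)      # 4x smaller
```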
"TRM (Tiny Recursive Model) integration architecture for Symbion.space ecosystem"
MOCA-Net: a neural architecture combining sparse mixture-of-experts (MoE), external memory, and budget-aware computation. Evaluated on Stanford SST-2 with O(L) complexity and 96.40% accuracy. Built for efficient sequence modeling.
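A minimal sketch of top-k sparse MoE routing with a per-token expert budget, using a hypothetical `SparseMoE` module; it illustrates only the sparse-MoE and budget ingredients, not MOCA-Net's external memory or actual code.

```python
# Sketch: each token is routed to at most k experts (the compute budget).
import torch
import torch.nn as nn
import torch.nn.functional as F

class SparseMoE(nn.Module):
    def __init__(self, dim=128, num_experts=8, k=2):
        super().__init__()
        self.router = nn.Linear(dim, num_experts)
        self.experts = nn.ModuleList(nn.Linear(dim, dim) for _ in range(num_experts))
        self.k = k                                        # budget: experts evaluated per token

    def forward(self, x):                                 # x: (tokens, dim)
        gate = F.softmax(self.router(x), dim=-1)          # (tokens, num_experts)
        topv, topi = gate.topk(self.k, dim=-1)            # only k experts run per token
        topv = topv / topv.sum(dim=-1, keepdim=True)      # renormalize kept gate weights
        out = torch.zeros_like(x)
        for slot in range(self.k):
            for e, expert in enumerate(self.experts):
                mask = topi[:, slot] == e                 # tokens whose slot-th choice is expert e
                if mask.any():
                    out[mask] += topv[mask, slot].unsqueeze(-1) * expert(x[mask])
        return out

moe = SparseMoE()
y = moe(torch.randn(16, 128))
print(y.shape)   # torch.Size([16, 128])
```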