Hugging Face – Posts

Join the conversation

Join the community of Machine Learners and AI enthusiasts.

All HF Hub posts

danielhanchen

posted an update 2 days ago

Post

2721

New DeepSeek-R1-0528 1.65-bit Dynamic GGUF!

Run the model locally even easier! Will fit on a 192GB Macbook and run at 7 tokens/s.

DeepSeek-R1-0528 GGUFs: unsloth/DeepSeek-R1-0528-GGUF
Qwen3-8B DeepSeek-R1-0528 GGUFs: unsloth/DeepSeek-R1-0528-Qwen3-8B-GGUF

And read our Guide: https://docs.unsloth.ai/basics/deepseek-r1-0528

yeonseok-zeticai

posted an update 1 day ago

Post

1273

🚀 NEW DROP: run your own on-device LLM—in minutes, on any phone
Today we’re open-sourcing everything you need to put Qwen3-0.6B straight into a production-ready mobile app:

🎥 Watch Qwen3-0.6B chat in real time on any smartphones!

📊 TPS benchmarks – slides comparing token-per-second across heterogeneous mobile devices

💻 Plug-and-play source – Just Copy & Run the source to your project for Android (Kotlin & Java) and iOS (Swift).

🤞 Cross-platform, one pipeline – ZETIC.MLange auto-tunes kernels for every different devices, we’ve tested.

👨‍💻 Ready for production – swap in your own model, re-benchmark with one command, publish.

Get started
Just Sign-up and check the playground project, QWEN-0.6B
- https://mlange.zetic.ai/p/zetic-example/Qwen3-0.6B

We built this to show that cloud-free LLMs are ready today. Dive in, fork it, and tag ZETIC.ai when you launch your own on-device assistant, game NPC, or offline content generator—we’ll spotlight the best projects.

VirtualOasis

posted an update 1 day ago

Post

1860

I built an AI Website: ai-garden.netlify.app
It is a curated garden of AI resources.
It's my database for writing and research, organized by category and designed for quick access. Whether you're looking for learning materials, development tools, research papers, or industry news, everything's laid out in a clean, searchable format.

Feel free to suggest new resources or improvements - this garden grows better with community input. 🍻

3 replies

danaaubakirova

posted an update 1 day ago

Post

898

We just dropped SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics!

check out the blog: https://huggingface.co/blog/smolvla
read the technical report: SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics (2506.01844)
access the model weights: lerobot/smolvla_base

azettl

posted an update 2 days ago

Post

1518

Agents & MCP Hackathon Day 1

Before starting with the second night of the Agents & MCP Hackathon, I briefly wanted to share my progress from last night. Not much sleep, but lots of progress!

So I managed to build the first MVP version of my custom Gradio component, azettl/gradio_consilium_roundtable (https://pypi.org/project/gradio-consilium-roundtable/). This creates a visual roundtable component for AI consensus discussions. Displays AI participants as avatars positioned around an oval table (poker style!) with animated speech bubbles, thinking states, and real-time discussion updates.

Also, I managed to get a rough draft of the Gradio app + MCP server done, but not so much yet that I can share the space with you. You will be able to define your question the AI participants should discuss, decide on the protocol, do role assignments like having a devil's advocate on the table, and define the communication pattern. Lastly, you can decide which AI should be the moderator and how many rounds of discussions there should be. You can see my progress in the attached image.

Most of the options are just placeholders right now, and I will work on their implementation tonight. Hopefully, I can add an MVP tomorrow evening to the following space: https://huggingface.co/spaces/Agents-MCP-Hackathon/consilium_mcp.

I am also very interested in the cool stuff you all are building; please let me know in the comments. :)

1 reply

danieldk

posted an update about 10 hours ago

Post

221

We have been working on a project called kernels. kernels makes it possible to load compute kernels directly from the Hub! 🚀

We plan to give kernels a more proper introduction soon. But for those who have been following along, we are happy to announce a new release:

- New layer API with torch.compile support.
- Experimental support for loading Apple Silicon Metal Kernels.
- Generate wheels from Hub kernels for legacy deployments.

Full release notes here: https://github.com/huggingface/kernels/releases/tag/v0.6.0

attackerElvies

posted an update 1 day ago

Post

1419

HALOOO MY COMMUNITY

shivance

posted an update 1 day ago

Post

1376

The AI Memory Layer Will Change Everything ‼️

Why do even the smartest AIs like OpenAI's o3 and GPT-4o, Google's Gemini and Anthropic's Claude forget?

In this blog we unpack this challenge and explore how building a real memory into AI will redefine personalization and agent capabilities!

https://fullstackagents.substack.com/p/forget-me-not-the-ai-memory-layer

ProCreations

posted an update 2 days ago

Post

2186

60 followers,
yay

2 replies

codelion

posted an update 3 days ago

Post

3264

🧠 We just implemented Andrej Karpathy's "third paradigm" for LLM learning!

System Prompt Learning (SPL) enables LLMs to automatically learn problem-solving strategies from experience, rather than relying on static prompts.

🚀 How it works:
Your LLM builds a database of effective strategies, selects the best ones for each problem, and refines them over time based on success rates.

📊 Results across math benchmarks:
Arena Hard: 29% → 37.6% (+8.6%)
AIME24: 23.33% → 30% (+6.67%)
OptILLMBench: 61% → 65% (+4%)

The best part? All strategies are human-readable and the system gets progressively better at problem types you use frequently.

✨ Key benefits:
🔄 Cumulative learning over time
📖 Transparent, inspectable strategies
🔌 Works with any OpenAI-compatible API
⚡ Simple integration: just add "spl-" prefix to your model

Built as an open-source plugin in optillm. After 500 queries, our system developed 129 strategies and refined 97 of them!

This feels like a genuine step toward AI that learns from experience while staying completely interpretable.

🔗 GitHub: https://github.com/codelion/optillm/tree/main/optillm/plugins/spl
📖 Full article: https://huggingface.co/blog/codelion/system-prompt-learning
🐦 Original Karpathy tweet: https://x.com/karpathy/status/1921368644069765486

Have you experimented with advanced system prompting? What strategies would you want your LLM to learn?

Recently active users