I am bullish and biased, but the best way to use flash attention 3 or 4 is via 🤗 kernels:

```
from kernels import get_kernel

kernel_module = get_kernel("kernels-community/flash-attn3", version=1)
flash_attn_func = kernel_module.flash_attn_func
flash_attn_func(...)
```
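For code that has to run on machines where the kernel may not load (no `kernels` package, no network, unsupported GPU), one option is to guard the lookup. The sketch below is only an illustration: `load_flash_attn` is a hypothetical helper name, it uses nothing beyond the `get_kernel` call shown above, and the fallback message is an assumption about what a caller might do next.

```python
from typing import Callable, Optional


def load_flash_attn(
    repo_id: str = "kernels-community/flash-attn3",
    version: int = 1,
) -> Optional[Callable]:
    """Hypothetical helper: return flash_attn_func, or None if it cannot be loaded."""
    try:
        # Imported lazily so this file still runs where `kernels` is not installed.
        from kernels import get_kernel

        kernel_module = get_kernel(repo_id, version=version)
        return kernel_module.flash_attn_func
    except Exception as exc:
        # Covers a missing package, a failed download, or an unsupported device.
        print(f"flash-attn kernel unavailable ({type(exc).__name__})")
        return None


flash_attn_func = load_flash_attn()
if flash_attn_func is None:
    # Assumption: the caller has another attention path, e.g. PyTorch's
    # torch.nn.functional.scaled_dot_product_attention.
    print("Falling back to a different attention implementation")
```

Catching a broad `Exception` keeps the sketch short; real training code would probably want to log the error and fail loudly if the fast path is required.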

21 Comments

🎧 Eric Riddoch 5d

Should we be replacing uv.lock and pixi.lock with... lines of Python? I'm trying to think: would this help us reproduce training code from a B300 node to a B200 node and then even to an H100 node? Solving dependencies has been painful for us because build times take forever, and we may need to solve the env differently depending on the node we're running on, to properly leverage newer instruction sets in the more recent NVIDIA chips.

Mathieu Gosbee 4d

/me slaps his 4090 with a large trout y u not a 5090?

Shrirang Mahajan 5d

This is nice! Genuine question: when is the actual kernel compiled? When you import it or is there some sort of JIT sorcery to compile when the function is called?

Vishal Gupta 4d

The get_kernel API makes this look almost too easy. The bigger question is whether version pinning stays stable enough for reproducible training environments. That's usually where these conveniences break down at scale.

Jinwon Kim 5d

Cool! Is there a way to use this in an air-gapped environment?

Tarini Mohapatra 5d

Great! Can we get a notebook example for this?

Ahmad Raza Khan 4d

Open collaboration accelerates progress in machine learning. Your work on open models, evaluation, and tooling benefits both research and production.

Paolo Perrone 1d

What's the most advanced model architecture you've seen someone use in production via a Hugging Face kernel?

Treveur BRETAUDIERE 4d

That said, don't we all hate how we can't zoom in on photos with LinkedIn on a phone?

Muhammad Awais Chaudhry 1d

The CMake errors are a rite of passage at this point 😭 Didn't know kernels made this that clean. Adding this to my toolkit. Thanks for sharing!

Sayak Paul’s Post

Explore content categories