Skip to content

Conversation

@Datta0
Copy link
Collaborator

@Datta0 Datta0 commented Oct 23, 2025

Move hf quantizer patch to unsloth from unsloth zoo to make it run for non fast inference.
Import the patched Fbgemmfp8linear and fp8linear classes from transformers but patched by unsloth fp8 kernels

Needed for : unslothai/unsloth#3496

@danielhanchen danielhanchen merged commit 33302aa into unslothai:main Oct 27, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

2 participants