Skip to content

Conversation

@stephantul
Copy link
Contributor

This PR adds much faster distill time for larger vocab. On glove-sized vocab, this reduces inference time from 3 minutes to 1.5. The Progress bar also looks a lot nicer.

@stephantul stephantul requested a review from Pringled April 24, 2025 17:39
Copy link
Member

@Pringled Pringled left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🚤

@stephantul stephantul merged commit 0402b98 into main Apr 24, 2025
6 checks passed
@stephantul stephantul deleted the speed-inference branch April 24, 2025 17:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

3 participants