explosion/curated-transformers
Issues
is:issue state:open
Truncation of sequences that are beyond the model's maximum length
Labels: feat/tokenization (Feature: Tokenization/piecer), type/bug (Type: Bug), type/feature (Type: Feature)
Status: Open. #359 in explosion/curated-transformers · MootezSaaD opened on Jan 14, 2024

Add suggested PyTorch LLM optimizations
Labels: feat/generation (Feature: Generation), feat/model (Feature: models)
Status: Open. #356 in explosion/curated-transformers · danieldk opened on Dec 1, 2023

Move the old Falcon architecture to the extras/addons package
Labels: type/maintenance (Type: Maintenance)
Status: Open. #355 in explosion/curated-transformers · shadeMe opened on Oct 19, 2023 · Undecided

Add support for attention sinks
Labels: feat/layers (Feature: Layers), feat/model (Feature: models), type/feature (Type: Feature)
Status: Open. #350 in explosion/curated-transformers · danieldk opened on Oct 4, 2023 · Undecided

Support DeBERTa v2/3
Labels: feat/model (Feature: models), type/feature (Type: Feature)
Status: Open. #348 in explosion/curated-transformers · danieldk opened on Oct 3, 2023 · Undecided

Add an extras/contrib package
Labels: type/maintenance (Type: Maintenance)
Status: Open. #347 in explosion/curated-transformers · danieldk opened on Oct 3, 2023 · Undecided

Expose more outputs through the `Generator` interface
Labels: feat/generation (Feature: Generation), type/feature (Type: Feature)
Status: Open. #345 in explosion/curated-transformers · danieldk opened on Oct 3, 2023 · v2.0.0

Make `QkvMode` ADT-like
Labels: feat/layers (Feature: Layers), type/maintenance (Type: Maintenance)
Status: Open. #344 in explosion/curated-transformers · danieldk opened on Oct 3, 2023 · v2.0.0

Convert QKV projection splitting methods into Torch modules
Labels: feat/layers (Feature: Layers), type/maintenance (Type: Maintenance)
Status: Open. #343 in explosion/curated-transformers · danieldk opened on Oct 3, 2023 · v2.0.0

Option to only return the last hidden layer output from models
Labels: feat/model (Feature: models), type/feature (Type: Feature)
Status: Open. #342 in explosion/curated-transformers · danieldk opened on Oct 3, 2023 · v2.0.0

Add support for Mistral
Labels: feat/model (Feature: models), type/feature (Type: Feature)
Status: Open. #341 in explosion/curated-transformers · danieldk opened on Oct 3, 2023 · v2.1.0

Support for Encoder-Decoder-style architectures
Labels: feat/model (Feature: models), type/feature (Type: Feature)
Status: Open. #340 in explosion/curated-transformers · bilelomrani1 opened on Oct 2, 2023 · Undecided