🐛 Bug Description
When using `model.save_pretrained_torchao()`, the function incorrectly uses `AutoModel` instead of `AutoModelForCausalLM` to reload the 16-bit model.
This causes the saved `config.json` in the final `-torchao` directory to record the base model architecture (e.g., `Qwen3Model`) instead of the causal language modeling head architecture (e.g., `Qwen3ForCausalLM`).
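To check whether a given `-torchao` save was affected, you can inspect the `architectures` field of its `config.json`. The sketch below uses only the standard library; the helper names (`saved_architectures`, `has_causal_lm_head`) are illustrative, not part of Unsloth or Transformers:

```python
import json
import tempfile
from pathlib import Path

def saved_architectures(save_dir):
    """Read the `architectures` list from a checkpoint's config.json."""
    config = json.loads((Path(save_dir) / "config.json").read_text())
    return config.get("architectures", [])

def has_causal_lm_head(save_dir):
    """True if the saved config advertises a *ForCausalLM architecture."""
    return any(arch.endswith("ForCausalLM") for arch in saved_architectures(save_dir))

# Simulate a directory produced by the buggy code path: AutoModel reloads
# the base model, so config.json records "Qwen3Model" with no LM head.
buggy_dir = tempfile.mkdtemp()
Path(buggy_dir, "config.json").write_text(
    json.dumps({"architectures": ["Qwen3Model"]})
)
print(has_causal_lm_head(buggy_dir))  # False: LM head architecture was lost
```

A checkpoint saved after the fix below would instead report `["Qwen3ForCausalLM"]`, and the check would pass.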
🔁 Reproducing the Bug
You can see this in the `unsloth/save.py` file, inside the `unsloth_save_pretrained_torchao` function. The problematic lines are:

On line 2772:

```python
from transformers import AutoModel, AutoTokenizer, TorchAoConfig
```

And around line 2791:

```python
model = AutoModel.from_pretrained(...)
```
✅ The Fix
This bug is fixed by changing the function to use `AutoModelForCausalLM`:

1. Change the import to:

   ```python
   from transformers import AutoModelForCausalLM, AutoTokenizer, TorchAoConfig
   ```

2. Change the model loading line to:

   ```python
   model = AutoModelForCausalLM.from_pretrained(...)
   ```