Commit 861ef12
Bug fixes (#347)
* Update attention_sink.py
* Update gpt_oss.py
* prefer_nd_tiling
* Update patching_utils.py
* flex_attention_with_sink
* Compile Flex Attention
* Update mxfp4.py
* Update mxfp4.py
* Update mxfp4.py
* Update mxfp4.py
* Update gpt_oss.py
* bitsandbytes patch
* Update bitsandbytes.py
* Update gpt_oss.py
* Inplace ops
* Update gpt_oss.py
* has_static_cache
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update attention_sink.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update attention_sink.py
* Update attention_sink.py
* Update rl_replacements.py
* Update rl_replacements.py
* Update rl_replacements.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* torch compile
* Update attention_sink.py
* Update common.py
* Update common.py
* Patches
* Compiled mask creation
* Update attention_sink.py
* Update gpt_oss.py
* Update gpt_oss.py
* Revert
* Update gpt_oss.py
* Update gpt_oss.py
* Fix up
* Update attention_sink.py
* Update attention_sink.py
* Update utils.py
* Update attention_sink.py
* Update attention_sink.py
* Retry
* Update gpt_oss.py
* Update gpt_oss.py
* Fix Flex
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Bug fixes
* Update patching_utils.py
* Update patching_utils.py
* Update patching_utils.py
* Update rl_replacements.py
* Update patching_utils.py
* Update patching_utils.py
* Update patching_utils.py
* flash attn
* Update gpt_oss.py
* Update __init__.py
* Update attention_sink.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* dropout_p
* Update gpt_oss.py
* Update gpt_oss.py
* Update attention_sink.py
* Update gpt_oss.py
* Update gpt_oss.py
* fix
* Update attention_sink.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update loss_utils.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update loss_utils.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Versioning
* Update saving_utils.py
* Update saving_utils.py
* Update saving_utils.py
* Update saving_utils.py
* Update saving_utils.py
* Update saving_utils.py
* Update saving_utils.py
* Update saving_utils.py
* Fix Gemma 3
* Update misc.py
* Update rl_environments.py
* Update pyproject.toml
* Update rl_environments.py
* Update __init__.py
* Update empty_model.py
* Update empty_model.py
* Update empty_model.py
* Update empty_model.py
* Device type
* Update vllm_utils.py
* Update compiler.py
* Update empty_model.py
* Update vllm_utils.py
* Update empty_model.py
* Fixes
* Update empty_model.py
* Update empty_model.py
* Update __init__.py
* Update vllm_utils.py
* Update vllm_utils.py
* Update rl_environments.py
* Update cross_entropy_loss.py
* Update vllm_utils.py
* Update vllm_utils.py
* Update rl_environments.py
* Update vllm_utils.py
* Qwen3 VL vLLM (#324)
* qwen3 vl additional layers
* qwen3 fused vision qkv
* refactor for handling qwen 3 vl
* [WIP] fix backward pass issues
* out hidden size change
* Qwen 2.5 and qwen 3 conv3d->Linear vLLM changes
* Update __init__.py
* Update __init__.py
* Update __init__.py
* Update __init__.py
* Update __init__.py
* Update __init__.py
* Update __init__.py
* Update vllm_utils.py
* Update vllm_utils.py
* Update pyproject.toml
* Update vllm_utils.py
* Update vllm_utils.py
* Update vllm_utils.py
* Update vllm_utils.py
* Update vllm_utils.py
* Update vllm_utils.py
* Update __init__.py
* Update compiler.py
* Update __init__.py
* Update vllm_utils.py
* Update rl_replacements.py
* Update rl_replacements.py
* Update rl_replacements.py
* Fix CE compile
* Update loss_utils.py
* Update cross_entropy_loss.py
* Fix
* Deepseekocr fix: save single model shard (#346)
* DeepSeekOCR Fix: check for saftensors_list shard naming convention
* turned off shard padding length check bc deepseeks padding is different
* if you try to copy the index.json file and the same file alredy exists it wil throw and error.
---------
Co-authored-by: Datta Nimmaturi <venkatadattasainimmaturi@gmail.com>
Co-authored-by: DoubleMathew <mmathew23@gmail.com>1 parent 6690af3 commit 861ef12
File tree
8 files changed
+71
-21
lines changed- unsloth_zoo
- fused_losses
8 files changed
+71
-21
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
14 | 14 | | |
15 | 15 | | |
16 | 16 | | |
17 | | - | |
| 17 | + | |
18 | 18 | | |
19 | 19 | | |
20 | 20 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
95 | 95 | | |
96 | 96 | | |
97 | 97 | | |
98 | | - | |
99 | | - | |
100 | | - | |
101 | | - | |
102 | | - | |
103 | | - | |
104 | | - | |
| 98 | + | |
105 | 99 | | |
106 | 100 | | |
107 | 101 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
106 | 106 | | |
107 | 107 | | |
108 | 108 | | |
109 | | - | |
| 109 | + | |
| 110 | + | |
| 111 | + | |
| 112 | + | |
| 113 | + | |
110 | 114 | | |
111 | 115 | | |
112 | 116 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
171 | 171 | | |
172 | 172 | | |
173 | 173 | | |
174 | | - | |
175 | | - | |
176 | | - | |
| 174 | + | |
| 175 | + | |
| 176 | + | |
| 177 | + | |
| 178 | + | |
177 | 179 | | |
178 | 180 | | |
179 | 181 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
126 | 126 | | |
127 | 127 | | |
128 | 128 | | |
129 | | - | |
130 | 129 | | |
131 | 130 | | |
132 | 131 | | |
| |||
176 | 175 | | |
177 | 176 | | |
178 | 177 | | |
179 | | - | |
| 178 | + | |
180 | 179 | | |
181 | 180 | | |
182 | 181 | | |
| 182 | + | |
| 183 | + | |
| 184 | + | |
| 185 | + | |
| 186 | + | |
| 187 | + | |
| 188 | + | |
| 189 | + | |
| 190 | + | |
| 191 | + | |
| 192 | + | |
| 193 | + | |
| 194 | + | |
183 | 195 | | |
184 | 196 | | |
185 | 197 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
920 | 920 | | |
921 | 921 | | |
922 | 922 | | |
| 923 | + | |
| 924 | + | |
| 925 | + | |
| 926 | + | |
| 927 | + | |
| 928 | + | |
| 929 | + | |
| 930 | + | |
| 931 | + | |
| 932 | + | |
| 933 | + | |
| 934 | + | |
| 935 | + | |
| 936 | + | |
| 937 | + | |
| 938 | + | |
| 939 | + | |
| 940 | + | |
| 941 | + | |
| 942 | + | |
923 | 943 | | |
924 | 944 | | |
925 | 945 | | |
| |||
1170 | 1190 | | |
1171 | 1191 | | |
1172 | 1192 | | |
1173 | | - | |
| 1193 | + | |
| 1194 | + | |
1174 | 1195 | | |
1175 | 1196 | | |
1176 | 1197 | | |
| |||
1180 | 1201 | | |
1181 | 1202 | | |
1182 | 1203 | | |
1183 | | - | |
| 1204 | + | |
| 1205 | + | |
| 1206 | + | |
| 1207 | + | |
| 1208 | + | |
| 1209 | + | |
| 1210 | + | |
1184 | 1211 | | |
1185 | 1212 | | |
1186 | 1213 | | |
| |||
1282 | 1309 | | |
1283 | 1310 | | |
1284 | 1311 | | |
1285 | | - | |
| 1312 | + | |
| 1313 | + | |
1286 | 1314 | | |
1287 | 1315 | | |
1288 | 1316 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
74 | 74 | | |
75 | 75 | | |
76 | 76 | | |
77 | | - | |
| 77 | + | |
| 78 | + | |
78 | 79 | | |
79 | 80 | | |
80 | 81 | | |
81 | 82 | | |
82 | 83 | | |
83 | 84 | | |
84 | | - | |
| 85 | + | |
| 86 | + | |
85 | 87 | | |
86 | 88 | | |
87 | 89 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
28 | 28 | | |
29 | 29 | | |
30 | 30 | | |
| 31 | + | |
31 | 32 | | |
32 | 33 | | |
33 | 34 | | |
| |||
1463 | 1464 | | |
1464 | 1465 | | |
1465 | 1466 | | |
1466 | | - | |
1467 | | - | |
| 1467 | + | |
| 1468 | + | |
1468 | 1469 | | |
1469 | 1470 | | |
1470 | 1471 | | |
| |||
1482 | 1483 | | |
1483 | 1484 | | |
1484 | 1485 | | |
| 1486 | + | |
| 1487 | + | |
| 1488 | + | |
| 1489 | + | |
| 1490 | + | |
| 1491 | + | |
| 1492 | + | |
1485 | 1493 | | |
1486 | 1494 | | |
1487 | 1495 | | |
| |||
0 commit comments