Commit d7fb886
Bug fixes (#331)
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update __init__.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update compiler.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Dannightly (#304)
* gpt oss inference fix
* gpt oss fix bf16
* gpt oss fix bf16
* gpt oss fix bf16
* gpt oss fix bf16
* gpt oss fix bf16
* gpt oss fix bf16
---------
Co-authored-by: DoubleMathew <mmathew23@gmail.com>
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Fix Flex Attention autotuning
* Update patching_utils.py
* Update patching_utils.py
* Update patching_utils.py
* Update mxfp4.py
* Update mxfp4.py
* Update gpt_oss.py
* Update attention_sink.py
* Update patching_utils.py
* Update attention_sink.py
* Update gpt_oss.py
* prefer_nd_tiling
* Update patching_utils.py
* flex_attention_with_sink
* Compile Flex Attention
* Update mxfp4.py
* Update mxfp4.py
* Update mxfp4.py
* Update mxfp4.py
* Update gpt_oss.py
* bitsandbytes patch
* Update bitsandbytes.py
* Update gpt_oss.py
* Inplace ops
* Update gpt_oss.py
* has_static_cache
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update attention_sink.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update attention_sink.py
* Update attention_sink.py
* Update rl_replacements.py
* Update rl_replacements.py
* Update rl_replacements.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* torch compile
* Update attention_sink.py
* Update common.py
* Update common.py
* Patches
* Compiled mask creation
* Update attention_sink.py
* Update gpt_oss.py
* Update gpt_oss.py
* Revert
* Update gpt_oss.py
* Update gpt_oss.py
* Fix up
* Update attention_sink.py
* Update attention_sink.py
* Update utils.py
* Update attention_sink.py
* Update attention_sink.py
* Retry
* Update gpt_oss.py
* Update gpt_oss.py
* Fix Flex
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Bug fixes
* Update patching_utils.py
* Update patching_utils.py
* Update patching_utils.py
* Update rl_replacements.py
* Update patching_utils.py
* Update patching_utils.py
* Update patching_utils.py
* flash attn
* Update gpt_oss.py
* Update __init__.py
* Update attention_sink.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* dropout_p
* Update gpt_oss.py
* Update gpt_oss.py
* Update attention_sink.py
* Update gpt_oss.py
* Update gpt_oss.py
* fix
* Update attention_sink.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update loss_utils.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update loss_utils.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Update gpt_oss.py
* Versioning
* Update saving_utils.py
* Update saving_utils.py
* Update saving_utils.py
* Update saving_utils.py
* Update saving_utils.py
* Update saving_utils.py
* Update saving_utils.py
* Update saving_utils.py
* Fix Gemma 3
* Update misc.py
* Update rl_environments.py
* Update pyproject.toml
* Update rl_environments.py
* Update __init__.py
* Update empty_model.py
* Update empty_model.py
* Update empty_model.py
* Update empty_model.py
* Device type
* Update vllm_utils.py
* Update compiler.py
* Update empty_model.py
* Update vllm_utils.py
* Update empty_model.py
* Fixes
* Update empty_model.py
* Update empty_model.py
* Update __init__.py
---------
Co-authored-by: DoubleMathew <mmathew23@gmail.com>1 parent 677086d commit d7fb886
File tree
7 files changed
+104
-26
lines changed- unsloth_zoo
- fused_losses
- temporary_patches
7 files changed
+104
-26
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
14 | 14 | | |
15 | 15 | | |
16 | 16 | | |
17 | | - | |
| 17 | + | |
18 | 18 | | |
19 | 19 | | |
| 20 | + | |
20 | 21 | | |
21 | 22 | | |
22 | 23 | | |
| |||
101 | 102 | | |
102 | 103 | | |
103 | 104 | | |
| 105 | + | |
| 106 | + | |
| 107 | + | |
| 108 | + | |
| 109 | + | |
| 110 | + | |
| 111 | + | |
| 112 | + | |
| 113 | + | |
| 114 | + | |
| 115 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
2035 | 2035 | | |
2036 | 2036 | | |
2037 | 2037 | | |
2038 | | - | |
| 2038 | + | |
| 2039 | + | |
| 2040 | + | |
| 2041 | + | |
| 2042 | + | |
| 2043 | + | |
| 2044 | + | |
| 2045 | + | |
2039 | 2046 | | |
2040 | 2047 | | |
2041 | 2048 | | |
| |||
2189 | 2196 | | |
2190 | 2197 | | |
2191 | 2198 | | |
| 2199 | + | |
2192 | 2200 | | |
2193 | 2201 | | |
2194 | 2202 | | |
| |||
2197 | 2205 | | |
2198 | 2206 | | |
2199 | 2207 | | |
| 2208 | + | |
2200 | 2209 | | |
| 2210 | + | |
| 2211 | + | |
2201 | 2212 | | |
2202 | 2213 | | |
2203 | 2214 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
223 | 223 | | |
224 | 224 | | |
225 | 225 | | |
226 | | - | |
| 226 | + | |
227 | 227 | | |
228 | | - | |
| 228 | + | |
229 | 229 | | |
230 | | - | |
| 230 | + | |
| 231 | + | |
231 | 232 | | |
232 | 233 | | |
233 | 234 | | |
234 | 235 | | |
235 | 236 | | |
236 | 237 | | |
237 | | - | |
238 | | - | |
239 | | - | |
240 | | - | |
241 | | - | |
242 | | - | |
243 | | - | |
244 | | - | |
245 | | - | |
246 | | - | |
| 238 | + | |
| 239 | + | |
| 240 | + | |
| 241 | + | |
| 242 | + | |
| 243 | + | |
| 244 | + | |
| 245 | + | |
| 246 | + | |
| 247 | + | |
| 248 | + | |
| 249 | + | |
247 | 250 | | |
248 | | - | |
| 251 | + | |
| 252 | + | |
| 253 | + | |
| 254 | + | |
| 255 | + | |
| 256 | + | |
249 | 257 | | |
250 | | - | |
251 | | - | |
252 | | - | |
| 258 | + | |
| 259 | + | |
| 260 | + | |
| 261 | + | |
| 262 | + | |
| 263 | + | |
| 264 | + | |
253 | 265 | | |
254 | 266 | | |
255 | 267 | | |
| |||
302 | 314 | | |
303 | 315 | | |
304 | 316 | | |
305 | | - | |
| 317 | + | |
306 | 318 | | |
307 | 319 | | |
308 | 320 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
26 | 26 | | |
27 | 27 | | |
28 | 28 | | |
29 | | - | |
| 29 | + | |
30 | 30 | | |
31 | 31 | | |
32 | 32 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
23 | 23 | | |
24 | 24 | | |
25 | 25 | | |
26 | | - | |
| 26 | + | |
27 | 27 | | |
28 | 28 | | |
29 | 29 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
535 | 535 | | |
536 | 536 | | |
537 | 537 | | |
538 | | - | |
| 538 | + | |
539 | 539 | | |
540 | 540 | | |
541 | 541 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
39 | 39 | | |
40 | 40 | | |
41 | 41 | | |
| 42 | + | |
| 43 | + | |
42 | 44 | | |
43 | 45 | | |
44 | 46 | | |
| |||
59 | 61 | | |
60 | 62 | | |
61 | 63 | | |
62 | | - | |
63 | | - | |
| 64 | + | |
64 | 65 | | |
65 | 66 | | |
66 | 67 | | |
| |||
2360 | 2361 | | |
2361 | 2362 | | |
2362 | 2363 | | |
| 2364 | + | |
2363 | 2365 | | |
2364 | 2366 | | |
2365 | | - | |
| 2367 | + | |
| 2368 | + | |
| 2369 | + | |
| 2370 | + | |
| 2371 | + | |
| 2372 | + | |
| 2373 | + | |
| 2374 | + | |
| 2375 | + | |
| 2376 | + | |
| 2377 | + | |
| 2378 | + | |
| 2379 | + | |
| 2380 | + | |
| 2381 | + | |
| 2382 | + | |
| 2383 | + | |
| 2384 | + | |
| 2385 | + | |
| 2386 | + | |
| 2387 | + | |
| 2388 | + | |
| 2389 | + | |
| 2390 | + | |
| 2391 | + | |
| 2392 | + | |
| 2393 | + | |
| 2394 | + | |
| 2395 | + | |
| 2396 | + | |
| 2397 | + | |
| 2398 | + | |
| 2399 | + | |
| 2400 | + | |
| 2401 | + | |
| 2402 | + | |
| 2403 | + | |
| 2404 | + | |
| 2405 | + | |
| 2406 | + | |
| 2407 | + | |
2366 | 2408 | | |
2367 | 2409 | | |
2368 | 2410 | | |
| |||
2419 | 2461 | | |
2420 | 2462 | | |
2421 | 2463 | | |
| 2464 | + | |
2422 | 2465 | | |
2423 | 2466 | | |
2424 | 2467 | | |
| |||
0 commit comments