Fix llama.cpp quantize location and execution on Windows. #2894
base: main
Conversation
Thanks for the changes. A minor suggestion.
Also, on a side note: do you think searching for the binary in that folder is better than just explicitly listing out 6 possible paths?
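For context, a minimal sketch of the folder-scan alternative being asked about, assuming os.walk over the llama.cpp directory; the function name and candidate filenames are illustrative, not code from this PR:

```python
import os

def find_quantize_binary(root = "llama.cpp"):
    # Hypothetical alternative: walk the llama.cpp tree and return the first
    # file whose name matches a known quantize binary, instead of checking a
    # fixed list of paths.
    candidates = {"quantize", "quantize.exe", "llama-quantize", "llama-quantize.exe"}
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            if name in candidates:
                return os.path.join(dirpath, name)
    return None
```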
unsloth/save.py
Outdated
```python
possible_paths = [
    os.path.join(".", "llama.cpp", "quantize.exe"),
    os.path.join(".", "llama.cpp", "quantize"),
    os.path.join(".", "llama.cpp", "llama-quantize.exe"),
    os.path.join(".", "llama.cpp", "llama-quantize"),
    os.path.join(".", "llama.cpp", "build", "bin", "llama-quantize"),
    os.path.join(".", "llama.cpp", "build", "bin", "quantize"),
    os.path.join(".", "llama.cpp", "build", "bin", "Release", "llama-quantize.exe"),
    os.path.join(".", "llama.cpp", "build", "bin", "Release", "quantize.exe"),
]

for path in possible_paths:
    if os.path.exists(path):
        quantize_location = path
        break
```
Can we wrap this in a get_llamacpp_binary_location function or something and use the same thing in https://github.com/unslothai/unsloth/pull/2894/files#diff-46849d25980ee8d9337f4f8c30369faf36ceda3479272fd737ebf5ad9c703840R887-R897 ?
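A rough sketch of what that helper could look like, reusing the candidate list from the diff above; the function name follows the suggestion here, and the None fallback is an assumption about how callers would handle a missing binary:

```python
import os

def get_llamacpp_binary_location():
    # Same candidate locations as the diff: the llama.cpp root, build/bin,
    # and the Windows cmake Release output folder.
    possible_paths = [
        os.path.join(".", "llama.cpp", "quantize.exe"),
        os.path.join(".", "llama.cpp", "quantize"),
        os.path.join(".", "llama.cpp", "llama-quantize.exe"),
        os.path.join(".", "llama.cpp", "llama-quantize"),
        os.path.join(".", "llama.cpp", "build", "bin", "llama-quantize"),
        os.path.join(".", "llama.cpp", "build", "bin", "quantize"),
        os.path.join(".", "llama.cpp", "build", "bin", "Release", "llama-quantize.exe"),
        os.path.join(".", "llama.cpp", "build", "bin", "Release", "quantize.exe"),
    ]
    for path in possible_paths:
        if os.path.exists(path):
            return path
    return None
```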
Should be resolved. Confirmed successful execution with the update, and confirmed the failure case by deleting the binaries.
I think the current approach is optimal: we don't expect the file to be in locations other than those specified, and there's no need to waste time trawling the filesystem when the set of locations is known, especially since it's really only 3 paths with 2 different executable names.
Hey @Datta0, anything else needed here to get this PR in?
This PR fixes two specific issues with gguf quantization on Windows:

1. The script prepends a `./` prefix to the located quantization executable rather than relying on os.path functions to refer to the file properly. This bug produced a consistent exception on Windows: `'.' is not recognized as an internal or external command, operable program or batch file.` I have modified the quantize_location variable to use the os library and remove the prefix, and the issue was resolved for me in local testing (see the sketch after this description).
2. On Windows, the cmake build places the binary in the `llama.cpp/build/bin/Release` path, rather than `llama.cpp/build/bin`. This is a smaller consideration, as I wasn't able to get llama.cpp compilation working via the script anyway on Windows due to curl linking issues, but I figured I might as well add proper path detection based on the cmake outputs I was seeing. It would take significantly more work to fix the overall automatic compilation flow, but this at least saves a step of manual effort.

Closes: #1645
Can also be closed as no longer needed: #1646
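For illustration, a minimal sketch of the first fix described above, assuming a subprocess-style call; the file names and quantization type are placeholders, and the exact arguments depend on the llama.cpp version:

```python
import os
import subprocess

# Resolve the binary with os.path instead of prepending "./", which Windows
# cmd rejects with "'.' is not recognized as an internal or external command".
quantize_location = os.path.join("llama.cpp", "build", "bin", "Release", "llama-quantize.exe")

# Placeholder invocation: input GGUF, output GGUF, quantization type.
subprocess.run(
    [quantize_location, "model-F16.gguf", "model-Q4_K_M.gguf", "Q4_K_M"],
    check = True,
)
```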