3DCF / doc2dataset: token-efficient document layer with NumGuard numeric integrity and multi-framework exports for RAG & fine-tuning.
-
Updated
Dec 7, 2025 - Rust
3DCF / doc2dataset: token-efficient document layer with NumGuard numeric integrity and multi-framework exports for RAG & fine-tuning.
📄 Ingest documents into structured datasets for LLMs, ensuring numeric integrity and easy export across multiple frameworks with doc2dataset.
Add a description, image, and links to the numguard topic page so that developers can more easily learn about it.
To associate your repository with the numguard topic, visit your repo's landing page and select "manage topics."