 From quickly building POCs to scalable LLM Apps for the enterprise, LLMWare is packed with all the tools you need.

-`llmware` is an integrated framework with over 50+ models in Hugging Face for quickly developing LLM-based applications including Retrieval Augmented Generation (RAG) and Multi-Step Orchestration of Agent Workflows.
+`llmware` is an integrated framework with over 50+ models for quickly developing LLM-based applications including Retrieval Augmented Generation (RAG) and Multi-Step Orchestration of Agent Workflows.

 This project provides a comprehensive set of tools that anyone can use - from a beginner to the most sophisticated AI developer - to rapidly build industrial-grade, knowledge-based enterprise LLM applications.
@@ -47,7 +47,7 @@ from llmware.prompts import Prompt
 models = ModelCatalog().list_all_models()

 # to use any model in the ModelCatalog - "load_model" method and pass the model_name parameter
 # use local cpu model - smallest, fastest (use larger BLING models for higher accuracy)
-model = "llmware/bling-tiny-llama-v0"
+# use local cpu model - try the newest - RAG finetune of Phi-3 quantized and packaged in GGUF
+model = "bling-phi-3-gguf"

 contract_analysis_on_laptop(model)
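The hunk above swaps the default model name in the README's example. A minimal sketch of how the choice between the two names could be wrapped before calling `load_model`, assuming `llmware` is installed when the loader is actually called. The two model names come from the diff; the helper names `pick_model_name` and `load_selected_model` are hypothetical and not part of `llmware`.

```python
# Model names are taken from the diff above; the helpers are illustrative only.
OLD_DEFAULT = "llmware/bling-tiny-llama-v0"
NEW_DEFAULT = "bling-phi-3-gguf"


def pick_model_name(use_gguf: bool = True) -> str:
    """Return the catalog name to pass to load_model (hypothetical helper)."""
    return NEW_DEFAULT if use_gguf else OLD_DEFAULT


def load_selected_model(use_gguf: bool = True):
    """Load the chosen model through the ModelCatalog (requires llmware)."""
    # Imported lazily so the name selection can be used without llmware installed.
    from llmware.models import ModelCatalog

    return ModelCatalog().load_model(pick_model_name(use_gguf))
```

Keeping the name selection separate from the loader means the default can be changed, as this commit does, without touching the code that runs inference.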
@@ -590,15 +594,15 @@ if __name__ == "__main__":
 ## 🔥 What's New? 🔥

--**Web Services with Agent Calls for Financial Research** - end-to-end scenario - [video](https://youtu.be/l0jzsg1_Ik0?si=hmLhpT1iv_rxpkHo) and [example](examples/SLIM-Agents/web_services_slim_fx.py)
+-**Web Services with Agent Calls for Financial Research** - end-to-end scenario - [video](https://youtu.be/l0jzsg1_Ik0?si=hmLhpT1iv_rxpkHo) and [example](examples/Use_Cases/web_services_slim_fx.py)

--**Voice Transcription with WhisperCPP** - fast, accurate local transcription of voice files - [example](examples/Models/using-whisper-cpp-getting-started.py)
+-**Voice Transcription with WhisperCPP** - [getting_started](examples/Models/using-whisper-cpp-getting-started.py), [using_sample_files](examples/Models/using-whisper-cpp-sample-files.py), and [analysis_use_case](examples/Use_Cases/parsing_great_speeches.py)

 -**LLM to Answer Yes/No questions** - introducing slim-boolean model - [video](https://youtu.be/jZQZMMqAJXs?si=7HpkLqG39ohgNecx) and [example](examples/SLIM-Agents/using_slim_boolean_model.py)

--**Natural Language Query to CSV End to End example** - using slim-sql model - [video](https://youtu.be/z48z5XOXJJg?si=V-CX1w-7KRioI4Bi) and [example](examples/SLIM-Agents/text2sql-end-to-end-2.py) and now using Custom Tables on Postgres [example](https://github.com/llmware-ai/llmware/tree/main/examples/Structured_Tables/agent_with_custom_tables.py)
+-**Natural Language Query to CSV End to End example** - using slim-sql model - [video](https://youtu.be/z48z5XOXJJg?si=V-CX1w-7KRioI4Bi) and [example](examples/SLIM-Agents/text2sql-end-to-end-2.py) and now using Custom Tables on Postgres [example](https://github.com/llmware-ai/llmware/tree/main/examples/Use_Cases/agent_with_custom_tables.py)

 -**Multi-Model Agents with SLIM models** - multi-step Agents with SLIMs on CPU - [video](https://www.youtube.com/watch?v=cQfdaTcmBpY) - [example](examples/SLIM-Agents)
@@ -621,9 +625,11 @@ if __name__ == "__main__":
 ## 🔥 Top New Examples 🔥

-End-to-End Scenario - [**Function Calls with SLIM Extract and Web Services for Financial Research**](https://github.com/llmware-ai/llmware/tree/main/examples/SLIM-Agents/web_services_slim_fx.py)
+End-to-End Scenario - [**Function Calls with SLIM Extract and Web Services for Financial Research**](https://github.com/llmware-ai/llmware/tree/main/examples/Use_Cases/web_services_slim_fx.py)
+Analyzing Voice Files - [**Great Speeches with LLM Query and Extract**](https://github.com/llmware-ai/llmware/tree/main/examples/Use_Cases/parsing_great_speeches.py)
 New to LLMWare - [**Fast Start tutorial series**](https://github.com/llmware-ai/llmware/tree/main/fast_start)
 | 3. Hybrid Retrieval - Semantic + Text ([code](examples/Retrieval/dual_pass_with_custom_filter.py)) | Using 'dual pass' retrieval to combine best of semantic and text search |
-| 6. RAG with BLING ([code](examples/RAG/contract_analysis_on_laptop_with_bling_models.py) / [video](https://www.youtube.com/watch?v=8aV5p3tErP0)) | Using contract analysis as an example, experiment with RAG for complex document analysis and text extraction using `llmware`'s BLING ~1B parameter GPT model running on your laptop. |
-| 7. Master Service Agreement Analysis with DRAGON ([code](examples/RAG/msa_processing.py) / [video](https://www.youtube.com/watch?v=Cf-07GBZT68&t=2s)) | Analyzing MSAs using DRAGON YI 6B Model. |
-| 8. Streamlit Example ([code](examples/Getting_Started/ui_without_a_database.py)) |Upload pdfs, and run inference on llmware BLING models. |
+| 6. RAG with BLING ([code](examples/Use_Cases/contract_analysis_on_laptop_with_bling_models.py) / [video](https://www.youtube.com/watch?v=8aV5p3tErP0)) | Using contract analysis as an example, experiment with RAG for complex document analysis and text extraction using `llmware`'s BLING ~1B parameter GPT model running on your laptop. |
+| 7. Master Service Agreement Analysis with DRAGON ([code](examples/Use_Cases/msa_processing.py) / [video](https://www.youtube.com/watch?v=Cf-07GBZT68&t=2s)) | Analyzing MSAs using DRAGON YI 6B Model. |
+| 8. Streamlit Example ([code](examples/UI/simple_rag_ui_with_streamlit.py)) |Ask questions to Invoices with UI run inference. |
 | 9. Integrating LM Studio ([code](examples/Models/using-open-chat-models.py) / [video](https://www.youtube.com/watch?v=h2FDjUyvsKE&t=101s)) | Integrating LM Studio Models with LLMWare |
 | 10. Prompts With Sources ([code](examples/Prompts/prompt_with_sources.py)) | Attach wide range of knowledge sources directly into Prompts. |
 | 11. Fact Checking ([code](examples/Prompts/fact_checking.py)) | Explore the full set of evidence methods in this example script that analyzes a set of contracts. |
 - 💡 Making it easy to deploy fine-tuned open source models to build state-of-the-art RAG workflows
 - 💡 Private cloud - keeping documents, data pipelines, data stores, and models safe and secure
-- 💡 Model quantization, especially GGUF, and democratizing the game-changing use of 7B CPU-based LLMs
+- 💡 Model quantization, especially GGUF, and democratizing the game-changing use of 1-7B CPU-based LLMs
 - 💡 Developing small specialized RAG optimized LLMs between 1B-7B parameters
 - 💡 Industry-specific LLMs, embedding models and processes to support core knowledge-based use cases
 - 💡 Enterprise scalability - containerization, worker deployments and Kubernetes
@@ -775,11 +781,15 @@ Questions and discussions are welcome in our [github discussions](https://github
 ## 📣 Release notes and Change Log

-**Wednesday, May 1 - v0.2.12-WIP Update**
-- Working on support for Python 3.12 -> will deprecate faiss and replace with 'no-install' chromadb in Fast Start examples
+**Sunday, May 5 - v0.2.12-WIP Update**
+- Launched ["bling-phi-3"](https://huggingface.co/llmware/bling-phi-3) and ["bling-phi-3-gguf"](https://huggingface.co/llmware/bling-phi-3-gguf) in ModelCatalog - newest and most accurate BLING/DRAGON model
+- New long document summarization method using slim-summary-tool [example](https://github.com/llmware-ai/llmware/tree/main/examples/Prompts/document_summarizer.py)
+- New Office (Powerpoint, Word, Excel) sample files [example](https://github.com/llmware-ai/llmware/tree/main/examples/Parsing/parsing_microsoft_ir_docs.py)
+- Added support for Python 3.12
+- Deprecated faiss and replaced with 'no-install' chromadb in Fast Start examples
 - Refactored Datasets, Graph and Web Services classes
 - Updated Voice parsing with WhisperCPP into Library
-- Changes merged into main branch in repo - will be released as pypi 0.2.12 version targeted by Friday, May 3 EOD
+- Changes merged into main branch in repo - will be released as pypi 0.2.12 version targeted by Monday, May 6

 **Monday, April 29 - v0.2.11 Update**
 - Updates to gguf libs for Phi-3 and Llama-3
@@ -789,7 +799,7 @@ Questions and discussions are welcome in our [github discussions](https://github
 - Improved CUDA detection on Windows and safety checks for older Mac OS versions

 **Monday, April 22 - v0.2.10 Update**
-- Updates to Agent class to support Natural Language queries of Custom Tables on Postgres [example](https://github.com/llmware-ai/llmware/tree/main/examples/Structured_Tables/agent_with_custom_tables.py)
+- Updates to Agent class to support Natural Language queries of Custom Tables on Postgres [example](https://github.com/llmware-ai/llmware/tree/main/examples/Use_Cases/agent_with_custom_tables.py)
 - New Agent API endpoint implemented with LLMWare Inference Server and new Agent capabilities [example](https://github.com/llmware-ai/llmware/tree/main/examples/SLIM-Agents/agent_api_endpoint.py)
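
Several hunks in this diff only repoint example links to `examples/Use_Cases/`. A minimal sketch of how such link updates could be scripted, assuming a hand-maintained table of moved files. The four path pairs below are the moves visible in this diff; the names `MOVED_EXAMPLES` and `rewrite_example_links` are hypothetical and not part of the repository.

```python
# Old -> new example paths, limited to the moves visible in this diff.
MOVED_EXAMPLES = {
    "examples/SLIM-Agents/web_services_slim_fx.py":
        "examples/Use_Cases/web_services_slim_fx.py",
    "examples/Structured_Tables/agent_with_custom_tables.py":
        "examples/Use_Cases/agent_with_custom_tables.py",
    "examples/RAG/contract_analysis_on_laptop_with_bling_models.py":
        "examples/Use_Cases/contract_analysis_on_laptop_with_bling_models.py",
    "examples/RAG/msa_processing.py":
        "examples/Use_Cases/msa_processing.py",
}


def rewrite_example_links(text: str, moved: dict = MOVED_EXAMPLES) -> str:
    """Replace every occurrence of a moved example path with its new path."""
    for old_path, new_path in moved.items():
        text = text.replace(old_path, new_path)
    return text


if __name__ == "__main__":
    line = "[example](examples/SLIM-Agents/web_services_slim_fx.py)"
    print(rewrite_example_links(line))
```

The mapping is per file rather than per directory because some files under `examples/SLIM-Agents/` (for example `text2sql-end-to-end-2.py`) keep their location in this diff.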