Add MultiModalModel class for handling multi-modal data processing and inference #1194

AshAnand34 · 2025-05-26T18:21:48Z

This pull request introduces a new MultiModalModel class to the llmware/models.py file, designed to support multi-modal models that handle various data types like text and images. It includes methods for preprocessing, postprocessing, and performing inference across different model types, such as PyTorch, ONNX, OpenVINO, TensorFlow, and GGUF.

Related Issue: #1025

New `MultiModalModel` class implementation:

Class overview: Added the MultiModalModel class to manage multi-modal models, with attributes for model name, type, and optional preprocessors and postprocessors.
Preprocessing and postprocessing: Introduced methods add_preprocessor, add_postprocessor, preprocess, and postprocess to handle data transformations for specific data types.
Inference logic: Implemented the inference method to preprocess inputs, run the model, and postprocess outputs. The _run_model method supports multiple model types, including PyTorch, ONNX, OpenVINO, TensorFlow, and GGUF.

This addition significantly enhances the flexibility and extensibility of the codebase for handling multi-modal machine learning models.

…d inference

doberst · 2025-06-03T12:58:51Z

@AshAnand34 - love it .... great stuff - has been on our to-do list for a while - so really appreciate this contribution. Please give me a couple of days to go through some testing - and may add some conforming elements with other model classes. Well done!

doberst · 2025-06-18T20:47:59Z

@AshAnand34 - nice work .... we will build on this multi modal class further - it is a good contribution. 👍

Add MultiModalModel class for handling multi-modal data processing an…

c663576

…d inference

doberst merged commit eab5000 into llmware-ai:main Jun 18, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add MultiModalModel class for handling multi-modal data processing and inference #1194

Add MultiModalModel class for handling multi-modal data processing and inference #1194

Uh oh!

AshAnand34 commented May 26, 2025

doberst commented Jun 3, 2025

doberst commented Jun 18, 2025

Labels

2 participants

Add MultiModalModel class for handling multi-modal data processing and inference #1194

Add MultiModalModel class for handling multi-modal data processing and inference #1194

Uh oh!

Conversation

AshAnand34 commented May 26, 2025

New MultiModalModel class implementation:

doberst commented Jun 3, 2025

doberst commented Jun 18, 2025

Labels

2 participants

New `MultiModalModel` class implementation: