Ollama integration for OpenACS

This package provides integration with Ollama in OpenACS.

Main features

Complete wrapper for the Ollama API, including streaming response.
Implements the FtsEngineDriver contract. You can use semantic embedding-based indexing to search into your documents.
RAG implementation. Use the indexed data from your packages (file-storage, xowiki, forums...) to inquire the model.
RAG from web search. Optionally, one can use the query as a web search to provide additional context to the model.

Dependencies

NaviServer >= 5.0 with ns_http -response_data_callback feature
Postgres databse
PgVector extension must be available.
An Ollama instance accessible by the OpenACS installation.

Usage as an FTS driver

Install the package
Ensure search is mounted
set "ollama-driver" as value for the FtsEngineDriver parameter in the search package
if your existance already used full-text-search in the past with a different driver, you can bootstrap the new ollama index by calling the ollama::bootstrap_index proc. On a system with a lot of data, this could take a long time, so you may need to schedule this accordingly.

How does indexing work?

The search package converts every document implementing the FtsEngineDriver contract into a text representation. We split this text into slightly overlapping chunks and create an index entry for each of them. For every such entry, we generate the corresponding embedding via ollama.

Upon search, the search query is also transformed into an embedding, and used to retrieve all relevant entries by comparing the vector distance between query and indexed chunks. We use pg_vector for this.

How does RAG work?

Retrieval Augmented Generation is a popular technique used to inject on-the-fly topic knowledge into a conversation with an LLM. Instead of training a new model with domain knowledge, we use the query to perform an index search, then use the retrieved knowledge to generate a new question that includes this relevant information for the model to use when replying to us.

The way RAG is implemented in this package is the following: regardless of whether you are using the ollama-driver for full-text-search, searchable packages such as file-storage, xowiki, forums and so on that are mounted underneath an ollama instance will be indexed. They will be treated as a "knowledge base" for their parent package.

We then implemented a "chat-like" UI where questions to the selected model will be enhanced by our knowledge whenever relevant information is found.

As many models will output markdown when replying to the user, we have also integrated on-the-fly markdown-to-html conversion via the Showdown library.

Possible TODOs

images in RAG conversations
support for index bootsrap/reindexing in the UI
...and much more

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
catalog		catalog
sql/postgresql		sql/postgresql
tcl		tcl
www		www
README.md		README.md
ollama.info		ollama.info

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Ollama integration for OpenACS

Main features

Dependencies

Usage as an FTS driver

How does indexing work?

How does RAG work?

Possible TODOs

About

Uh oh!

Releases

Packages

Languages

Elettrotecnica/openacs-ollama

Folders and files

Latest commit

History

Repository files navigation

Ollama integration for OpenACS

Main features

Dependencies

Usage as an FTS driver

How does indexing work?

How does RAG work?

Possible TODOs

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages