Ontology Drives LLM Content with SHACL

Kurt Cagle

Putting a knowledge graph into an LLM is usually not feasible. However, you can put in an ontology (structure + taxonomy + rules) that then allows you to query (and update) a knowledge graph, which in turn can drive LLM content. In effect, the SHACL becomes your contract between the language model and the knowledge or context graph. This is a long article, but I cover a lot of ground, including why I believe that such an approach can both dramatically improve accuracy and provide a working environment for dynamic data that better feeds the LLM in its role not as database but as transformer. Thoughts on Queens and cabbages and sailing ships, or at least the role of Steampunk as a programming aesthetic, all of course in … The Ontologist!
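A minimal sketch of what such a contract might look like. Everything in the ex: namespace here is a hypothetical example, not from the article: the shape declares exactly which properties the language model is allowed to emit or request, and sh:closed rejects anything outside the contract.

```turtle
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix sh:  <http://www.w3.org/ns/shacl#> .
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .
@prefix ex:  <http://example.org/> .

# Hypothetical contract shape: any ex:Person the LLM produces or
# queries must conform to this before it touches the graph.
ex:PersonShape
    a sh:NodeShape ;
    sh:targetClass ex:Person ;
    sh:property [
        sh:path ex:name ;
        sh:datatype xsd:string ;
        sh:minCount 1 ;
        sh:maxCount 1 ;
    ] ;
    sh:property [
        sh:path ex:worksFor ;
        sh:class ex:Organization ;   # must point at a typed node, not a literal
    ] ;
    sh:closed true ;                 # reject properties the contract does not name
    sh:ignoredProperties ( rdf:type ) .
```

Under this reading, LLM output that fails validation never reaches the graph, and graph content that fails it never reaches the prompt.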

Denis O. Put another way, an empty input string is still an input string. The narrative that comes out will be driven by latent randomizers due to temperature, in a distribution pattern that is likely some form of noise (probably not fully white noise, depending on the associated random generator). In this case, what comes out is likely oracular: LLM as I Ching, if you will. Of course, this condition is usually trapped early by a guardrail in most live systems, probably with a fallback instruction like "Say something profound, wise, or funny" in the event of a supposed zero-string prompt.

Yes, but #SHACL isn’t mandatory. What’s mandatory is understanding what you are trying to achieve, IMHO. Regarding SHACL, as a knowledgeable practitioner, I use it to constrain #SPARQL insert operations associated with special folders mapped to named graphs in a #VirtuosoRDBMS instance. Basically, SHACL is integrated via a folder attribute. The filesystem hook is relevant and important, because AI agents look to it as the universal interface for both context building and utilization activities.
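Setting the Virtuoso-specific folder mechanics aside, the general pattern described here is that a middleware layer validates the payload against the shapes bound to the target graph before letting the insert through. A sketch with illustrative graph and resource IRIs:

```sparql
# Hypothetical guarded insert: the layer in front of the store
# validates this payload against the shapes bound to the staging
# graph, and only executes the update if validation passes.
PREFIX ex: <http://example.org/>

INSERT DATA {
  GRAPH <http://example.org/graphs/staging> {
    ex:alice a ex:Person ;
             ex:name "Alice" ;
             ex:worksFor ex:AcmeCorp .
  }
}
```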

SHACL provides valuable validation of declared constraints. But it primarily checks conformance to defined shapes and rules. What remains is the question of “behavioral” rules — lifecycle management, authorized transitions, cross-entity constraints — which often go beyond local validation and end up implemented in application logic.
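Some cross-entity rules can still be pushed into SHACL via SHACL-SPARQL constraints; it is the lifecycle and authorization rules that genuinely end up in application logic. A sketch of a cross-entity check, with hypothetical ex: terms (full IRIs are used inside the query to avoid the sh:prefixes machinery):

```turtle
@prefix sh: <http://www.w3.org/ns/shacl#> .
@prefix ex: <http://example.org/> .

# Hypothetical cross-entity rule: an order marked "shipped" must
# reference a shipment node that actually exists in the graph.
ex:OrderShape
    a sh:NodeShape ;
    sh:targetClass ex:Order ;
    sh:sparql [
        a sh:SPARQLConstraint ;
        sh:message "A shipped order must reference an existing shipment." ;
        sh:select """
            SELECT $this WHERE {
                $this <http://example.org/status> "shipped" .
                FILTER NOT EXISTS {
                    $this <http://example.org/shipment> ?s .
                }
            }
        """ ;
    ] .
```

What this still cannot express is the transition itself, e.g. that "shipped" may only follow "packed"; that check needs the previous state, which lives outside any single validation pass.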


So SHACL or similar standards, such as RDF Schema or OWL, serve as a shared context enabling seamless semantic interactions.

EXCEPT FOR ONE THING: any LLM can simply ignore all input context when generating the token trajectory, so a graph decoupled from internal LLM dynamics, or any fancy RAG, is no panacea.

Using an ontology plus SHACL as the contract between an LLM and a context graph is a strong way to prevent schema drift and reduce hallucinated structure. One clarification, though: SHACL can enforce conformance (structural validity against declared shapes), but that is not the same as establishing adequacy for reliance. A node can be perfectly shape-valid and still be stale, unauthorized, misattributed, or contextually unsafe to act upon.
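One way to layer that adequacy check on top of shape validity is to gate retrieval itself: only hand the LLM nodes with attributed, recent provenance. The predicates below are standard Dublin Core terms, but the cutoff policy and ex: vocabulary are illustrative, and the application would inject the actual cutoff timestamp:

```sparql
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX xsd:     <http://www.w3.org/2001/XMLSchema#>
PREFIX ex:      <http://example.org/>

# Only surface persons whose record is attributed and recently
# modified; shape-valid but stale or unattributed nodes drop out here.
SELECT ?person ?name WHERE {
    ?person a ex:Person ;
            ex:name ?name ;
            dcterms:modified ?ts ;
            dcterms:creator  ?who .
    FILTER (?ts >= "2025-01-01T00:00:00Z"^^xsd:dateTime)  # injected freshness cutoff
}
```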


Pleased to read your article :) I've learnt a lot! I completely like the "SHACL as a contract" concept! This is pretty useful for generating under constraints and validating afterward. I recently defended a thesis on SHACL + SML relation extraction: https://hal.science/view/index/docid/5446838 Depending on the context, using smaller models and better controlling cost, plus the sovereignty of the data, is also a subject.

Kurt Cagle, spot on. The LLM cannot reason without a Shape to constrain it. We are finding that SHACL is the perfect 'City Planning' tool for the Graph, but we still need a 'Building Code' for the data before it enters the city. We’re using strict XSD/Archetypes to force that constraint at the Packet level (Ingest). If the packet arrives with H=0 (Zero Entropy), SHACL has much less work to do. Great to see the focus returning to Constraints. https://www.linkedin.com/posts/axius-sdc_the-zero-entropy-data-packet-why-multilevel-activity-7428624538285907969-r0UU

I'm going to try https://github.com/Hawksight-AI/semantica and adapt it with SHACL to try this out.

SHACL as a contract layer between the LLM and the graph is a really clean mental model. I've seen teams burn months trying to get LLMs to write raw SPARQL reliably, when the real move is constraining the interface. The ontology-as-API pattern just makes more sense at scale.
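Constraining the interface in practice often means the LLM selects a named query template and fills slots, rather than emitting raw SPARQL. A hypothetical template in that style, where the template name and ex: vocabulary are illustrative:

```sparql
# Template "person_by_org": the LLM supplies only the value bound
# to ?org, never the query text, so the query shape is fixed by
# contract and cannot drift.
PREFIX ex: <http://example.org/>

SELECT ?person ?name WHERE {
    ?person a ex:Person ;
            ex:name ?name ;
            ex:worksFor ?org .    # slot: bound by the application, not the model
}
```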


