"Schema vocabulary and higher-order catamorphisms"

This title was summarized by AI from the post below.

6mo

Great thoughts in this post!! At the "schema vocabulary" levels, taxonomic composition maps beautifully to well-founded higher-order catamorphisms, and term lists as either sum types or traits-like patterns (e.g., facet templates). These schema layers form a cohesive axiomatic surface for the value layers. Very exciting to see these unifying correspondences happening across KM, category theory, and type theory!

Heather Hedden

6mo

Following on last month's Accidental Taxonomist Blog post on "Types of Metadata Schemas," my latest post is "Schema Vocabularies and Value Vocabularies," the topic of the panel I spoke on at the DCMI conference in Barcelona in October. https://lnkd.in/evheedJk

Schema Vocabularies and Value Vocabularies accidental-taxonomist.blogspot.com

To view or add a comment, sign in

More Relevant Posts

KMWorld Conference

312 followers
6mo
Report this post
Schema vocabularies, taxonomies, ontologies. @joebusch of Taxonomy Strategies explains how they build consistency across structured content. Don?t miss this deep dive: https://lnkd.in/gzFZ4FDS #TaxoBC
Like Comment
To view or add a comment, sign in
Tamás Diósi-Mákos
7mo
Report this post
I’ve recently been involved in document processing and knowledge management, and I have a mental image that helps with understanding the difference between vector and graph-based RAG systems. Imagine your local library as a knowledge hub. When a new book arrives, the team records its details and assigns subject headings to create a comprehensive bibliographic record. Digitally, the book’s content is transformed into embeddings, capturing its essence and storing it in a vector database that understands context. Your librarian, an LLM, has knowledge of millions of books but relies on its last training session. Occasionally, it might confidently mention information that doesn’t exist. This is where VectorRAG steps in. When a visitor poses a question, the system translates it into a vector, scouts the database for similar documents, and presents them to the LLM alongside the original query. This method anchors the LLM’s responses in real sources, minimising errors and ensuring the information is up-to-date. Vector-based retrieval excels at handling straightforward factual questions where semantic similarity matches relevance. However, when questions become complex, like \”Did any former Google employees start their own company?\” the system’s elegance falters. These multi-hop inquiries require linking information across various catalogue sections. As questions span multiple domains, involving six or nine sections, vector search accuracy diminishes. The system retrieves document chunks mentioning related terms independently, but vital connections remain obscured, and repeated information floods the results. To address these challenges, the library has other tools available that goes beyond mere vectors to search for concepts, not just strings. Inverted indexes allow for precise keyword matching, retrieving exact documents without semantic confusion. Knowledge graphs, the pinnacle of this evolution, encode real relationships rather than numerical proximity. Unlike vectors, which approximate similarity, graphs explicitly depict relationships. They maintain context, navigate explicit connections, and enable traceable multi-hop reasoning that vector similarity cannot achieve. Your library must be equipped to handle complex retrieval, transitioning from approximate to precise. Inverted indexes provide exact term matching, while knowledge graphs encode meaningful relationships. Together, they offer accuracy, explainability, and the ability to address complex questions across various domains. #graphrag #knowledgegraphs
8 Comments
Like Comment
To view or add a comment, sign in
Rodrigo Rocco
6mo
Report this post
Great Ahrefs' article (by Mateusz Makosiewicz): How to Earn LLM Citations to Build Traffic & Authority What the LLMs looks for when selecting citations? The article lists several factors that tend to correlate with being cited: > Freshness: newer or recently updated content tends to be preferred. > Domain authority: Sites with strong backlink profiles / high domain rating tend to get cited more. > Semantic relevance: Content that directly addresses the user’s query, with clear, extractable answers. > Structured & accessible formatting: clear headings, paragraphs, data in text rather than only in images, etc.
Like Comment
To view or add a comment, sign in
Andy Fitzgerald, PhD
6mo
Report this post
Hey Sanity friends! I know that _you_ know that taxonomy, vocabulary control, and content semantics isn't a set it and forget it operation. Just like your content structure, your content semantics is a living system, one that evolves as the needs of your business, users, and content do. Of course, those systems don't evolve on their own: that's where governance comes in! Truth be told, taxonomy governance is one of the reasons I built the Sanity Taxonomy Manager plugin to begin with. With the `4.1.0` update, you'll now see a new "Tagged Resources" view for individual concepts. This view will help you better understand how your taxonomy is performing by showing you which terms are not getting used ... and which are carrying way too much content—a sure sign those terms could be further subdivided. Naturally, this view also plays nice with the different content perspectives available in Studio, including any content releases you may have queued up. Check out the latest version on NPM—and do get in touch if you'd like help building and managing your own standards-based taxonomies in Sanity Studio.

3 Comments
Like Comment
To view or add a comment, sign in
Jean-Pierre Palomba-Marin
6mo
Report this post
Building powerful RAG pipelines with Docling and OpenSearch A technical blog post detailing how to build RAG pipelines by integrating the Docling document processing toolkit with OpenSearch for high-performance, metadata-aware vector retrieval. https://lnkd.in/dgnWG4hX

Building powerful RAG pipelines with Docling and OpenSearch - OpenSearch opensearch.org
Like Comment
To view or add a comment, sign in
Pierre de Lacaze
7mo
Report this post
Building an Agentic Deep-Thinking RAG Pipeline to Solve Complex Queries (Fareed Khan, October 2025, 68mn read) "A RAG system often fails not because the LLM lacks intelligence, but because its architecture is too simple. It tries to handle a cyclical, multi-step problem with a linear, one-shot approach. Many complex queries demand reasoning, reflection, and smart decisions about when to act, much like how we retrieve information when faced with a question. That’s where agent-driven actions within the RAG pipeline come into play. Let’s take a look at what a typical deep-thinking RAG pipeline looks like… 1. Plan: First, the agent decomposes the complex user query into a structured, multi-step research plan, deciding which tool (internal document search or web search) is needed for each step. 2. Retrieve: For each step, it executes an adaptive, multi-stage retrieval funnel, using a supervisor to dynamically choose the best search strategy (vector, keyword, or hybrid). 3. Refine: It then uses a high-precision cross-encoder to rerank the initial results and a distiller agent to compress the best evidence into a concise context. 3. Reflect: After each step, the agent summarizes its findings and updates its research history, building a cumulative understanding of the problem. 4. Critique: A policy agent then inspects this history, making a strategic decision to either continue to the next research step, revise its plan if it hits a dead end, or finish. 5. Synthesize: Once the research is complete, a final agent synthesizes all the gathered evidence from all sources into a single, comprehensive, and citable answer. In this blog, we are going to implement the entire deep thinking RAG pipeline and compare it with a basic RAG pipeline to demonstrate how it solves complex multi-hop queries." https://lnkd.in/eHuX3hhZ
1 Comment
Like Comment
To view or add a comment, sign in

635 followers

17 Posts

View Profile Follow

"Schema vocabulary and higher-order catamorphisms"

More Relevant Posts

Explore content categories