Retrieval-augmented generation (RAG) - OpenAI API Tutorial

From the course: Creating a Chat Tool Using OpenAI Models and Pinecone

Start my 1-month free trial Buy for my team

Retrieval-augmented generation (RAG)

“

- [Instructor] Retrieval-augmented generation. I bet these words sound highly technical and intense at first. Still retrieval-augmented generation is a common approach that AI engineers use to enhance AI systems and make them respond with highly relevant information conversationally and interact in a human-like way. So what do I mean by that? Retrieval-augmented generation, commonly referred to as RAG, is a framework in AI and natural language processing that merges two core steps, as the name states, retrieval and generation. So I'll break these down to help you understand how they work. In the retrieval phase, information gets pulled or retrieved from a large data source like a vector database to find contextually-relevant data based on a search query. This phase is important for putting together the information needed for the next step, generation. In the generation phase, a language model or text generation model like GPT steps in to generate a response using the retrieved…

- (Locked)
  
  Next steps
  
  1m 38s

Unlock the full course today

Join today to access over 25,200 courses taught by industry experts.

Retrieval-augmented generation (RAG) - OpenAI API Tutorial

From the course: Creating a Chat Tool Using OpenAI Models and Pinecone

Retrieval-augmented generation (RAG)

Practice while you learn with exercise files

Download courses and learn on the go

Contents

Explore Business Topics

Explore Creative Topics

Explore Technology Topics