From the course: Creating a Chat Tool Using OpenAI Models and Pinecone
Unlock the full course today
Join today to access over 25,200 courses taught by industry experts.
Retrieval-augmented generation (RAG) - OpenAI API Tutorial
From the course: Creating a Chat Tool Using OpenAI Models and Pinecone
Retrieval-augmented generation (RAG)
- [Instructor] Retrieval-augmented generation. I bet these words sound highly technical and intense at first. Still retrieval-augmented generation is a common approach that AI engineers use to enhance AI systems and make them respond with highly relevant information conversationally and interact in a human-like way. So what do I mean by that? Retrieval-augmented generation, commonly referred to as RAG, is a framework in AI and natural language processing that merges two core steps, as the name states, retrieval and generation. So I'll break these down to help you understand how they work. In the retrieval phase, information gets pulled or retrieved from a large data source like a vector database to find contextually-relevant data based on a search query. This phase is important for putting together the information needed for the next step, generation. In the generation phase, a language model or text generation model like GPT steps in to generate a response using the retrieved…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.