From the course: Advanced RAG Applications with Vector Databases
Unlock this course with a free trial
Join today to access over 25,600 courses taught by industry experts.
Introduction to the types of multimodality
From the course: Advanced RAG Applications with Vector Databases
Introduction to the types of multimodality
- [Instructor] Let's begin by exploring the answer to this question, what is multimodality? The core idea behind multimodal AI applications is that they deal with multiple types of data. There's a lot of buzz around the term, multimodal, AI right now, but what does it really mean? Let's take a look from the bottom up. The word multimodal comes from multi and modal. Multi meaning many and modal meaning types. The reason why multimodal AI is so popular right now is because it gives AI more human-like power. Humans have a multimodal interface with the world. Think of the senses. We have sight, hearing, taste, touch, and smell. When it comes to AI, the two modalities that are being emulated the most are closest to sight and hearing. While the term, multimodal, is still a highly debated term, some examples of multimodality can be classically agreed upon by the industry. These examples include images and text, images and audio, and video. Notice that these correspond to the sense I…
Contents
-
-
-
-
-
(Locked)
Introduction to the types of multimodality2m 23s
-
(Locked)
Ways to do multimodal RAG4m 13s
-
Introduction to multimodal embedding models3m 4s
-
(Locked)
Demo: Embedding and storing data40s
-
(Locked)
Demo: Query images with text3m 5s
-
(Locked)
Challenge: Find anomalies in your embeddings1m 24s
-
(Locked)
Solution: Find anomalies in your embeddings2m 3s
-
(Locked)
-