From the course: Advanced RAG Applications with Vector Databases

Introduction to the types of multimodality

- [Instructor] Let's begin by exploring the answer to this question: what is multimodality? The core idea behind multimodal AI applications is that they deal with multiple types of data. There's a lot of buzz around the term multimodal AI right now, but what does it really mean? Let's take a look from the bottom up. The word multimodal comes from multi and modal, multi meaning many and modal meaning types. Multimodal AI is so popular right now because it gives AI more human-like capabilities. Humans have a multimodal interface with the world. Think of the senses. We have sight, hearing, taste, touch, and smell. When it comes to AI, the two modalities being emulated the most are the ones closest to sight and hearing. While the meaning of multimodal is still highly debated, some examples of multimodality are generally agreed upon by the industry. These examples include images and text, images and audio, and video. Notice that these correspond to the sense I…
