From the course: AI Pricing and ROI: A Technical Breakdown
Choosing the right hardware for AI models
- [Instructor] Let's talk about hardware costs for AI hosting use cases. Here, I typically think through three important considerations: latency, throughput, and model location. Let's start with latency. Latency requirements depend on your users' expectations and range from milliseconds to days. If you're on social media and you're trying to find the next post, it should show up very quickly; otherwise, you'll close the app. If you're working on more of a business process, like batch document processing, a job might take hours or even days. It really depends on the use case. What's important to understand is that different use cases have different latency requirements. The next consideration is throughput, which is the number of requests per second that you need to handle. And finally, we have model location. Model location can mean a number of different things. For example, there…
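To make the link between latency and throughput concrete, here is a minimal sketch of how the two might feed a rough hardware count. It is not from the course: the function name `replicas_needed`, the `concurrency_per_replica` parameter, and the example numbers are all hypothetical, and the estimate relies on Little's Law (requests in flight = arrival rate × time per request) with no headroom for traffic spikes.

```python
import math


def replicas_needed(target_rps: float, latency_s: float,
                    concurrency_per_replica: int = 1) -> int:
    """Rough count of model replicas needed to sustain a request rate.

    Assumption (not from the course): each replica can serve
    `concurrency_per_replica` requests at once, and each request
    takes `latency_s` seconds from start to finish.
    """
    if target_rps < 0 or latency_s <= 0 or concurrency_per_replica < 1:
        raise ValueError("rate must be >= 0, latency > 0, concurrency >= 1")
    # Little's Law: average requests in flight = arrival rate * time in system.
    in_flight = target_rps * latency_s
    # Always provision at least one replica, even for zero traffic.
    return max(1, math.ceil(in_flight / concurrency_per_replica))


# Hypothetical interactive workload: 40 requests/s at 250 ms each.
print(replicas_needed(target_rps=40, latency_s=0.25, concurrency_per_replica=4))

# Hypothetical batch workload: 2 documents/s at 30 s each.
print(replicas_needed(target_rps=2, latency_s=30.0, concurrency_per_replica=8))
```

Under these assumptions, a slow batch job can need more replicas than a fast interactive one even at a much lower request rate, which is why latency and throughput are worth estimating together before pricing hardware.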
Contents
- Hosting and running your AI models (2m 37s)
- Running your own models or outsourcing (2m 31s)
- Choosing the right hardware for AI models (5m 6s)
- Logging and monitoring AI inference (7m 22s)
- Hiring the team for AI inference (2m 26s)
- Challenge: Running AI for your start-up (1m 23s)
- Solution: Running AI for your start-up (3m 27s)
- Challenge: Running AI for your enterprise (1m 9s)
- Solution: Running AI for your enterprise (3m 18s)