From the course: DataOps with Apache Iceberg using Spark, Nessie, and Dremio
What is DataOps?
Hey there. Let's talk about what DataOps is. First, let's start with a textbook definition: DataOps is an agile, process-oriented methodology that focuses on streamlining and automating data management to improve collaboration, quality, and delivery of data across teams and systems. What does that all mean? Bottom line, you're going to use lots of different tools, and you're going to automate the usage of those tools in order to achieve many different goals. Let's talk about what some of those goals are. They include improving communication: the idea of being able to generate documentation and metadata that allow people to understand not just what has been done, but what was created from what was done. They also include being able to understand and troubleshoot things effectively, which then influences data quality and reliability. Because bottom line is, if there are data quality issues…