From the course: Build with AI: Data Pipelines with Cursor, Neon, and Streamlit
Unlock this course with a free trial
Join today to access over 25,200 courses taught by industry experts.
Consolidate pipeline logic
From the course: Build with AI: Data Pipelines with Cursor, Neon, and Streamlit
Consolidate pipeline logic
- [Instructor] When we write code for our data pipeline, we want our code to be eventually unified and well-organized and easy to manage. Until now, we have basically implemented all the steps that we want for our data pipeline. We can get data on papers from the OpenAlex API, we can create the database table if needed, load the papers into the table with deduplication, and then run data tests on this pipeline. And this is all great, but we have implemented these steps separately and now they are implemented in a bunch of disconnected Python scripts. So what we want to do now is to reorganize our code in a way that's much cleaner and clearer. So I've written a prompt for my agent, and I'm asking to consolidate our logic into a pipeline class that is also a script, and it will go through all the steps in the pipeline that I mentioned, and I'm making sure that the agent uses existing logic instead of trying to rewrite…
Contents
-
-
-
-
OpenAlex quick start: Analyze data for Python data pipelines6m 59s
-
(Locked)
Data extraction layer: Get data from OpenAlex22m 40s
-
(Locked)
Neon database setup: Cloud PostgreSQL for data pipeline projects9m 50s
-
(Locked)
Design table schema and create a table in the database9m 50s
-
(Locked)
Process and load your data12m 23s
-
(Locked)
Data quality testing11m 21s
-
(Locked)
Consolidate pipeline logic8m 38s
-
(Locked)
Build a Streamlit dashboard7m 58s
-
(Locked)
Deploy the Streamlit dashboard7m 28s
-
-