scd2
Here are 11 public repositories matching this topic...
SCD2 implementation using pyspark
-
Updated
Mar 10, 2019 - Jupyter Notebook
A modern banking data pipeline built with Dagster and DBT!
-
Updated
Dec 24, 2025 - Python
Advanced Healthcare Claims Pipeline using Snowflake, Snowpipe, Streams, Tasks, SCD Type 2, and AWS S3. Automates ingestion, CDC, dimensional modeling, and data quality checks for healthcare patient and claims data.
-
Updated
Nov 10, 2025
This is a data engineering pipeline built on Databricks + Delta Lake + PySpark that ingests travel booking and customer master data, applies SCD Type 2 logic, and delivers analytics-ready tables. It includes data quality enforcement, dimension versioning, fact aggregation, and performance tuning.
-
Updated
Oct 8, 2025 - Jupyter Notebook
Vijay works in an IT company for last 5 years, he always needs extra money to spend on his monthly expenses so he decided to apply for the credit card in icici bank. The bank does a background check of vijay to know if he is elligible for the credit card or not.
-
Updated
Mar 1, 2022
Implements a data pipeline using DLT in Databricks (Delta Lake) and uses medallion layering in Delta Lake
-
Updated
Sep 2, 2025 - Python
🏥 Streamline healthcare claims processing with this Snowflake pipeline, featuring auto-ingestion, CDC, SCD Type 2, and data quality checks.
-
Updated
Jan 1, 2026
End-to-end ETL and data warehouse pipeline implementing star schema design, SCD Type 2 dimensions, and fact tables for analytical reporting. Built with SQL and structured for scalable analytics.
-
Updated
Dec 12, 2025
Improve this page
Add a description, image, and links to the scd2 topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the scd2 topic, visit your repo's landing page and select "manage topics."