Your dbt project just got smarter. Not a replacement. An extension. Here's what I mean: dbt is great at what it does. SQL transformations, testing, documentation. But it stops there. It doesn't know when your source data changes. It doesn't understand your full pipeline. It doesn't respond to incidents at 2 AM. What if your dbt models were part of something that did? That's what we've been building. Take your existing dbt project, bring it into Ascend, and suddenly: Your models run automatically when upstream data actually changes. Not on a schedule. Not after some orchestration layer polls and checks. Actually triggered by the data itself. Otto, our AI agent, understands your dbt models alongside everything else in your pipeline. It can debug issues, suggest optimizations, and generate new models that follow your existing patterns. When something breaks, you get intelligent triage that traces the problem back through your dbt models to the source. Not just "this model failed" but "this model failed because this column changed type in this upstream table." Smart Components extend beyond dbt's Simple and Incremental transforms. They track code fingerprints and data lineage to avoid unnecessary rebuilds. We've seen teams cut compute costs by 40-60% without changing their logic. The integration takes about 10 minutes. Point at your repo. Configure your data plane. Done. dbt isn't going anywhere. It's just getting more capable. Our very own Cody Peterson is holding an exciting hands-on lab this Wednesday -- come check it out @ https://lnkd.in/grPxMU-F #dbt #DataEngineering #AgenticDataEngineering
Enhance dbt with Ascend's AI-driven pipeline management
More Relevant Posts
-
𝐓𝐡𝐢𝐧𝐤𝐢𝐧𝐠 𝐚𝐛𝐨𝐮𝐭 𝐝𝐛𝐭 𝐦𝐨𝐝𝐞𝐥𝐬 𝐧𝐨𝐭 𝐣𝐮𝐬𝐭 𝐚𝐬 𝐥𝐨𝐠𝐢𝐜, 𝐛𝐮𝐭 𝐚𝐬 𝐢𝐧𝐭𝐞𝐫𝐟𝐚𝐜𝐞𝐬 As I’ve been spending more time revisiting how data models are used across an organization, I’ve started to think of them less as internal SQL logic and more as 𝐢𝐧𝐭𝐞𝐫𝐟𝐚𝐜𝐞𝐬 𝐭𝐡𝐚𝐭 𝐨𝐭𝐡𝐞𝐫 𝐭𝐞𝐚𝐦𝐬, 𝐝𝐚𝐬𝐡𝐛𝐨𝐚𝐫𝐝𝐬, 𝐚𝐧𝐝 𝐝𝐞𝐜𝐢𝐬𝐢𝐨𝐧𝐬 𝐫𝐞𝐥𝐲 𝐨𝐧. From that perspective, 𝐬𝐭𝐚𝐛𝐢𝐥𝐢𝐭𝐲 𝐛𝐞𝐜𝐨𝐦𝐞𝐬 𝐣𝐮𝐬𝐭 𝐚𝐬 𝐢𝐦𝐩𝐨𝐫𝐭𝐚𝐧𝐭 𝐚𝐬 𝐜𝐨𝐫𝐫𝐞𝐜𝐭𝐧𝐞𝐬𝐬. One concept that stood out to me is the idea of 𝐦𝐨𝐝𝐞𝐥 𝐜𝐨𝐧𝐭𝐫𝐚𝐜𝐭𝐬, treating the shape of a model (its columns, types, and expectations) as something downstream users can depend on, not something that can quietly change overnight. Closely related to that is 𝐯𝐞𝐫𝐬𝐢𝐨𝐧𝐢𝐧𝐠 𝐚𝐧𝐝 𝐝𝐞𝐩𝐫𝐞𝐜𝐚𝐭𝐢𝐨𝐧. Instead of changing a model in place and hoping nothing breaks, keeping an older version available while introducing a new one creates space for dashboards and consumers to migrate safely. It turns change into a managed process rather than a surprise. There’s also an access side to this. Not every model needs to be “public.” Being intentional about which models are meant to be reused versus which ones are internal helps keep the data layer easier to navigate and reduces the chance of people building critical reports on something that was never meant to be a stable interface. What I find interesting is how these ideas connect technical practices with business outcomes: 𝐟𝐞𝐰𝐞𝐫 𝐛𝐫𝐨𝐤𝐞𝐧 𝐝𝐚𝐬𝐡𝐛𝐨𝐚𝐫𝐝𝐬, 𝐦𝐨𝐫𝐞 𝐜𝐨𝐧𝐟𝐢𝐝𝐞𝐧𝐜𝐞 𝐢𝐧 𝐦𝐞𝐭𝐫𝐢𝐜𝐬, 𝐚𝐧𝐝 𝐬𝐦𝐨𝐨𝐭𝐡𝐞𝐫 𝐜𝐡𝐚𝐧𝐠𝐞𝐬 𝐚𝐬 𝐫𝐞𝐪𝐮𝐢𝐫𝐞𝐦𝐞𝐧𝐭𝐬 𝐞𝐯𝐨𝐥𝐯𝐞. 🔍📊 Looking forward to continuing to explore how treating data models as products and interfaces can make analytics work more dependable and easier to scale. 🚀 #dbt #AnalyticsEngineering #DataEngineering #DataModeling #SQL #ModernDataStack #TechCareers
To view or add a comment, sign in
-
It’s time for data engineers to stop wasting time on repetitive Data Vault coding. Building and maintaining a Data Vault in dbt can be slow, error-prone, and hard to keep consistent. AutomateDV solves this by generating production-ready, dbt-native Data Vault models automatically. Why it matters for data engineers: ✔️ Less repetitive coding – focus on problem-solving, not boilerplate SQL ✔️ Consistent, governed structures – reduce errors and tech debt ✔️ Adaptable to changing sources – metadata-driven automation keeps models up-to-date ✔️ Faster delivery – get analytics-ready data in dbt sooner AutomateDV helps you turn complex Data Vault development into a repeatable, reliable process, so you can spend more time building insights, not pipelines. Get started with our open-source Data Vault automation tool here: https://lnkd.in/ePr873Xy
To view or add a comment, sign in
-
-
How do you track DBT model logic versions? I'm building daily append-only snapshot of a table and need to know which version of the logic produced each day’s data. The challenge: changes can happen not just in the model itself, but also in upstream int models. Approaches I'm considering: 1) Manual var() in dbt_project.yml - simple integer, bump it when logic changes. Bleh... doesn't track upstream changes, easy to forget. 2) dbt_artifacts checksum - the package stores a SHA hash of each model’s SQL per run. Automatic but only tracks individual model, not upstream changes 3) Compile-time macro using graph.nodes - combine checksums from the model + key upstream models into one hash at compile time. Catches upstream changes automatically but produces a hash, not a readable version number. 4) dbt model version: in YAML - native feature, but static/manual - same limitation as approach 1. Leaning toward #3 but curious - has anyone solved this differently? #dbt #analyticsengineering
To view or add a comment, sign in
-
📈 Week 4 – Data Engineering Zoomcamp | Analytics Engineering with dbt Analytics Engineering sits at the intersection of data engineering and decision intelligence. It turns raw data into reliable, well-structured data products that power dashboards, forecasting, and AI models. This week, I built and experimented with a full transformation pipeline using dbt. Here are the key outcomes: • ELT + dimensional modeling: Raw → Staging → Intermediate → Mart (Star Schema) • Dependency-driven models with Jinja ref() and automated lineage documentation • Performance optimization: Parquet ingestion ~4x faster than CSV, with chunked loads and unique constraints to prevent duplicates • Incremental and materialized models for efficient processing of 30M+ rows (fct_trips, fct_monthly_zone_revenue) • Data tests as governance: schema validation, uniqueness, accepted values across layers • Operational readiness: dev/prod schema separation, disk management, and automated cleanup The result: reproducible, version-controlled, and tested analytics models ready for BI consumption — treating transformations as code, not ad-hoc SQL. As I continue my journey in Data & AI, I’m focusing not only on building pipelines, but on designing scalable and governed data platforms that deliver measurable business value. Because Data/AI leaders don’t just build models — they design systems. On to the next module 🚀 Homework solution: https://lnkd.in/gTbJRJ3x Full course: https://lnkd.in/gYrUAiHD #AnalyticsEngineering #dbt #DataEngineering #ELT #DataModeling #DataStrategy #LearnInPublic #DataEngineeringZoomcamp
To view or add a comment, sign in
-
-
🪤 The "5-Minute" Trap Building a data pipeline is the easy part. Managing it is where the real work begins. In my last post (linked below ⬇️), I touched on how we’re redefining the role of the data engineer. Today, I want to talk about the metric that actually matters: Maintenance and Management. We’ve all heard the "I can write that in Python in 5 minutes" argument. But creation time is just the start. The real "Total Cost of Ownership" lives in the months and years after that script is written. The 70% Difference: A recent Forrester study found that using Matillion reduces the time spent on maintenance and management of pipelines by 70%. Why? Because Matillion isn't just about moving data; it’s about: ● Built-in Version Control: No more "v2_final_final" scripts. ● Reliability: Native Git integration and enterprise-grade error handling. ● Efficiency: Optimized requests to sources and warehouses that keep your cloud costs in check. Enter Maia, the Agentic Advantage: Now, take that 70% foundation and add Maia, our Agentic AI. If a pipeline breaks or a schema changes, you don't spend your afternoon debugging. You ask Maia to fix it. Maia doesn't just "suggest" a fix, it can repair the pipeline on request with almost no effort from your side. Plus by using your context files, Maia ensures every repair and optimization follows your specific governance and architectural standards. It’s like having an expert assistant that never sleeps and knows your entire codebase by heart. The value isn't in how fast you can build a pipeline. It's in how rarely you have to think about it once it's built. Stop building "5-minute" technical debt. Start building productive data. https://lnkd.in/e3hKy2A3 Link to previous post: https://lnkd.in/eepu3mFQ #DataEngineering #GenerativeAI #Matillion #MaiaAI #DataOps #Efficiency
To view or add a comment, sign in
-
-
𝗪𝗵𝗮𝘁 𝗺𝗼𝗱𝗲𝗹𝗹𝗶𝗻𝗴 𝘄𝗶𝘁𝗵 𝗱𝗯𝘁 𝗰𝗵𝗮𝗻𝗴𝗲𝘀 🧠 Working with dbt makes this trade-off very clear. Instead of relying on views and BI-layer calculations, dbt encourages: • source freshness and testing • clean, minimal staging models • reusable intermediate models • intentional, well-defined data marts 🏗️ This shifts the work to the warehouse where it belongs. When transformations live in dbt: • logic runs once and is reused, • models are tested and documented, • lineage makes breakages traceable 🔍, • dashboards become consumers, not compensators. One lesson that keeps coming up: Patchwork to bad inputs is not sustainable 🩹. Yes, delivery pressure is real, especially when dashboards are expected first thing in the morning ☕ But every fix in the BI layer increases fragility, cost, and cognitive load. A good dbt project doesn’t eliminate pressure — it absorbs it. When upstream models are solid: • refreshes are cheaper 💸, • failures are easier to debug, • and dashboards become boring (in the best way). And boring dashboards usually mean a healthy pipeline. #Still refactoring. #Still adding tests. But very clear now on one thing: In dbt, where you model is just as important as how you model. ✨
To view or add a comment, sign in
-
𝗜𝘀 𝘁𝗲𝗰𝗵𝗻𝗶𝗰𝗮𝗹 𝗱𝗲𝗯𝘁 𝘀𝗹𝗼𝘄𝗶𝗻𝗴 𝗱𝗼𝘄𝗻 𝘆𝗼𝘂𝗿 𝗱𝗮𝘁𝗮 𝘁𝗲𝗮𝗺? Every day without proper transformation management means more brittle SQL scripts, more undocumented assumptions, and more time debugging instead of delivering insights. dbt Labs is the industry-standard framework that lets analytics engineers work like software engineers—with version control, automated testing, and comprehensive documentation. 𝗪𝗵𝗮𝘁 𝗼𝗿𝗴𝗮𝗻𝗶𝘇𝗮𝘁𝗶𝗼𝗻𝘀 𝗴𝗮𝗶𝗻 𝘄𝗶𝘁𝗵 𝗱𝗯𝘁: • Significantly faster development through modular, reusable code • Up to 70% reduction in warehouse compute costs via incremental models • 90% fewer data quality incidents with automated testing • Seamless collaboration through auto-generated docs and standard PR workflows 𝗠𝘆 𝗱𝗯𝘁 𝗶𝗺𝗽𝗹𝗲𝗺𝗲𝗻𝘁𝗮𝘁𝗶𝗼𝗻 𝘀𝗲𝗿𝘃𝗶𝗰��� 𝗶𝗻𝗰𝗹𝘂𝗱𝗲𝘀: ✓ Repository setup with Git workflows and dev/prod environments ✓ Refactoring existing SQL into modular dbt models (staging/intermediate/mart layers) ✓ dbt Cloud orchestration with scheduled jobs and monitoring ✓ dbt Core dev container for local development ✓ Comprehensive testing framework and data quality checks ✓ Full documentation with lineage graphs ✓ Team training and ongoing support The investment typically pays for itself in reduced warehouse costs and engineering time within the first year. Ready to modernize your data transformation pipeline? Let's talk about your current stack and build a roadmap that fits your team. #DataEngineering #Analytics #dbt #DataTransformation #AnalyticsEngineering
To view or add a comment, sign in
-
🤯 Data teams are under pressure to deliver trustworthy insights at scale, and traditional approaches to governance and analytics often slow them down. That’s why we’re excited to share our latest post on how 𝐝𝐛𝐭 + 𝐒𝐭𝐚𝐫𝐑𝐨𝐜𝐤𝐬 enables a 𝐃𝐚𝐭𝐚𝐎𝐩𝐬-𝐝𝐫𝐢𝐯𝐞𝐧 approach that unifies speed, reliability, and governance. In this article, we covered: ✅ Why “infrastructure-only” data thinking is no longer enough, and where DataOps fits in. ✅ How dbt can act as a data control plane for versioning, testing, and lineage. ✅ How StarRocks’ strengths in ad-hoc and real-time analytics complement dbt to deliver both governance and high-performance queries. 👉 Read the full post: https://lnkd.in/gdrAcpVG #DataOps #Analytics #DataGovernance #dbt #DataEngineering
To view or add a comment, sign in
-
Excited to share a major enhancement to Feast dbt integration. We've just released support for multiple entities per FeatureView, unlocking powerful new use cases for ML feature engineering https://lnkd.in/dGVnHBt2. What This Enables: - Transaction Fraud Detection: Features keyed by both user_id AND merchant_id - Recommendation Systems: User-item interaction features with composite keys - Multi-tenant Analytics: Features across multiple business dimensions - Marketplace Platforms: Features for buyer-seller-product combinations The Problem It Solves: Previously, you could only define features with a single entity key. But real-world ML problems often require multiple dimensions. Transaction features need user context AND merchant context. Product recommendations need user preferences AND item attributes. Now, your dbt models can seamlessly transform into Feast feature views with multiple entities—bridging the gap between your data transformations and ML feature serving. Simple Developer Experience: feast dbt import -m manifest.json -e user_id -e merchant_id --tag feast #MachineLearning #MLOps #FeatureStore #DataEngineering #dbt #OpenSource
To view or add a comment, sign in