Data Engineering Podcast Podcast Republic

Data Engineering Podcast

By Tobias Macey

Listen to a podcast, please open Podcast Republic app. Available on Google Play Store and Apple App Store.

Image by Tobias Macey

Open Website

Rate for this podcast

Subscribers: 813
Reviews: 0
Episodes: 512

Description

This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.

Episode	Date
Text to Data Products: Kaarvi’s End-to-End AI for Ingestion, Quality, and Dashboards Read the full episode description	Jun 08, 2026
Scaling Graph Analytics Without ETL: Inside PuppyGraph’s Architecture Read the full episode description	Jun 01, 2026
Maximizing GPU Utilization: Heterogeneous Pipelines with Ray and Kubernetes Read the full episode description	May 06, 2026
The AI-First Data Engineer: 10–50x Productivity and What Changes Next Read the full episode description	Apr 07, 2026
Treat Metering Like Finance: Building Data Platforms for Consumption Economics Read the full episode description	Mar 29, 2026
Beyond the PDF: Rowan Cockett on Reproducible, Composable Science Read the full episode description	Mar 22, 2026
Beyond Prompts: Practical Paths to Self‑Improving AI Read the full episode description	Mar 16, 2026
Orion at Gravity: Trustworthy AI Analysts for the Enterprise Read the full episode description	Mar 08, 2026
From Models to Momentum: Uniting Architects and Engineers with ER/Studio Read the full episode description	Mar 02, 2026
From Data Models to Mind Models: Designing AI Memory at Scale Read the full episode description	Feb 22, 2026
Prompt Management, Tracing, and Evals: The New Table Stakes for GenAI Ops Read the full episode description	Feb 15, 2026
From Legacy to AI-Ready: How MongoDB AMP Accelerates Modernization Read the full episode description	Feb 08, 2026
Branches, Diffs, and SQL: How Dolt Powers Agentic Workflows Read the full episode description	Feb 01, 2026
Logical First, Physical Second: A Pragmatic Path to Trusted Data Read the full episode description	Jan 25, 2026
Your Data, Your Lake: How Observe Uses Iceberg and Streaming ETL for Observability Read the full episode description	Jan 18, 2026
Semantic Operators Meet Dataframes: Building Context for Agents with FENIC Read the full episode description	Jan 12, 2026
Beyond Dashboards: How Data Teams Earn a Seat at the Table Read the full episode description	Jan 05, 2026
Unfreezing The Data Lake: The Future-Proof File Format Read the full episode description	Dec 29, 2025
From Context to Semantics: How Metadata Powers Agentic AI Read the full episode description	Dec 21, 2025
From Data Engineering to AI Engineering: Where the Lines Blur Read the full episode description	Dec 14, 2025
Malloy: Hierarchical Data, Semantic Models, and the Future of Analytics Read the full episode description	Dec 08, 2025
Blurring Lines: Data, AI, and the New Playbook for Team Velocity Read the full episode description	Nov 24, 2025
State, Scale, and Signals: Rethinking Orchestration with Durable Execution Read the full episode description	Nov 16, 2025
The AI Data Paradox: High Trust in Models, Low Trust in Data Read the full episode description	Nov 09, 2025
Bridging the AI–Data Gap: Collect, Curate, Serve Read the full episode description	Nov 02, 2025
Beyond the Perimeter: Practical Patterns for Fine‑Grained Data Access Read the full episode description	Oct 27, 2025
The True Costs of Legacy Systems: Technical Debt, Risk, and Exit Strategies Read the full episode description	Oct 18, 2025
Context Engineering as a Discipline: Building Governed AI Analytics Read the full episode description	Oct 11, 2025
The Data Model That Captures Your Business: Metric Trees Explained Read the full episode description	Oct 05, 2025
From GPUs-as-a-Service to Workloads-as-a-Service: Flex AI’s Path to High-Utilization AI Infra Read the full episode description	Sep 28, 2025
From RAG to Relational: How Agentic Patterns Are Reshaping Data Architecture Read the full episode description	Sep 18, 2025
Duck Lake: Simplifying the Lakehouse Ecosystem Read the full episode description	Sep 10, 2025
Aligning Business and Data: The Essential Role of Data Modeling Read the full episode description	Sep 01, 2025
From Academia to Industry: Bridging Data Engineering Challenges Read the full episode description	Aug 26, 2025
High Performance And Low Overhead Graphs With KuzuDB Read the full episode description	Aug 18, 2025
Bridging Data and Decision-Making: AI's Role in Modern Analytics Read the full episode description	Aug 12, 2025
From Bits to Tables: The Evolution of S3 Storage Read the full episode description	Aug 05, 2025
Revolutionizing Python Notebooks with Marimo Read the full episode description	Jul 28, 2025
Warehouse Native Incremental Data Processing With Dynamic Tables And Delayed View Semantics Read the full episode description	Jul 21, 2025
Streamlining Data Pipelines with MCP Servers and Vector Engines Read the full episode description	Jul 15, 2025
Foundational Data Engineering At Two Sigma Read the full episode description	Jul 06, 2025
Enabling Agents In The Enterprise With A Platform Approach Read the full episode description	Jun 29, 2025
Dagster's New Era: Modularizing Data Transformation in the Age of AI Read the full episode description	Jun 18, 2025
AI and the Lakehouse: How Starburst is Pioneering New Workflows Read the full episode description	Jun 11, 2025
Amazon S3: The Backbone of Modern Data Systems Read the full episode description	Jun 03, 2025
Scaling Data Operations With Platform Engineering Read the full episode description	May 29, 2025
From Data Discovery to AI: The Evolution of Semantic Layers Read the full episode description	May 21, 2025
Balancing Off-the-Shelf and Custom Solutions in Data Engineering Read the full episode description	May 13, 2025
StarRocks: Bridging Lakehouse and OLAP for High-Performance Analytics Read the full episode description	May 05, 2025
Exploring NATS: A Multi-Paradigm Connectivity Layer for Distributed Applications Read the full episode description	Apr 28, 2025
Advanced Lakehouse Management With The LakeKeeper Iceberg REST Catalog Read the full episode description	Apr 21, 2025
Simplifying Data Pipelines with Durable Execution Read the full episode description	Apr 12, 2025
Overcoming Redis Limitations: The Dragonfly DB Approach Read the full episode description	Mar 30, 2025
Bringing AI Into The Inner Loop of Data Engineering With Ascend Read the full episode description	Mar 24, 2025
Astronomer's Role in the Airflow Ecosystem: A Deep Dive with Pete DeJoy Read the full episode description	Mar 16, 2025
Accelerated Computing in Modern Data Centers With Datapelago Read the full episode description	Mar 08, 2025
The Future of Data Engineering: AI, LLMs, and Automation Read the full episode description	Feb 26, 2025
Evolving Responsibilities in AI Data Management Read the full episode description	Feb 16, 2025
CSVs Will Never Die And OneSchema Is Counting On It Read the full episode description	Jan 13, 2025
Breaking Down Data Silos: AI and ML in Master Data Management Read the full episode description	Jan 03, 2025
Building a Data Vision Board: A Guide to Strategic Planning Read the full episode description	Dec 23, 2024
How Orchestration Impacts Data Platform Architecture Read the full episode description	Dec 16, 2024
An Exploration Of The Impediments To Reusable Data Pipelines Read the full episode description	Dec 08, 2024
The Art of Database Selection and Evolution Read the full episode description	Dec 01, 2024
Bridging Code and UI in Data Orchestration with Kestra Read the full episode description	Nov 26, 2024
Streaming Data Into The Lakehouse With Iceberg And Trino At Going Read the full episode description	Nov 18, 2024
An Opinionated Look At End-to-end Code Only Analytical Workflows With Bruin Read the full episode description	Nov 11, 2024
Feldera: Bridging Batch and Streaming with Incremental Computation Read the full episode description	Nov 04, 2024
Accelerate Migration Of Your Data Warehouse with Datafold's AI Powered Migration Agent Read the full episode description	Oct 27, 2024
Bring Vector Search And Storage To The Data Lake With Lance Read the full episode description	Oct 20, 2024
The Role of Python in Shaping the Future of Data Platforms with DLT Read the full episode description	Oct 13, 2024
Build Your Data Transformations Faster And Safer With SDF Read the full episode description	Oct 06, 2024
Scaling Airbyte: Challenges and Milestones on the Road to 1.0 Read the full episode description	Sep 23, 2024
Enhancing Data Accessibility and Governance with Gravitino Read the full episode description	Sep 01, 2024
The Evolution of DataOps: Insights from DataKitchen's CEO Read the full episode description	Aug 04, 2024
Achieving Data Reliability: The Role of Data Contracts in Modern Data Management Read the full episode description	Jul 28, 2024
How Generative AI Is Impacting Data Engineering Teams Read the full episode description	Jul 21, 2024
The Role of Product Managers in Data-Centric Organizations Read the full episode description	Jul 13, 2024
Neon: A Serverless And Developer Friendly Postgres Read the full episode description	Jul 08, 2024
Improve Data Quality Through Engineering Rigor And Business Engagement With Synq Read the full episode description	Jun 30, 2024
Stitching Together Enterprise Analytics With Microsoft Fabric Read the full episode description	Jun 23, 2024
Being Data Driven At Stripe With Trino And Iceberg Read the full episode description	Jun 16, 2024
X-Ray Vision For Your Flink Stream Processing With Datorios Read the full episode description	Jun 09, 2024
Practical First Steps In Data Governance For Long Term Success Read the full episode description	Jun 02, 2024
Data Migration Strategies For Large Scale Systems Read the full episode description	May 27, 2024
Zenlytic Is Building You A Better Coworker With AI Agents Read the full episode description	May 19, 2024
Release Management For Data Platform Services And Logic Read the full episode description	May 12, 2024
Barking Up The Wrong GPTree: Building Better AI With A Cognitive Approach Read the full episode description	May 05, 2024
Build Your Second Brain One Piece At A Time Read the full episode description	Apr 28, 2024
Making Email Better With AI At Shortwave Read the full episode description	Apr 21, 2024
Designing A Non-Relational Database Engine Read the full episode description	Apr 14, 2024
Establish A Single Source Of Truth For Your Data Consumers With A Semantic Layer Read the full episode description	Apr 07, 2024
Adding Anomaly Detection And Observability To Your dbt Projects Is Elementary Read the full episode description	Mar 31, 2024
Ship Smarter Not Harder With Declarative And Collaborative Data Orchestration On Dagster+ Read the full episode description	Mar 24, 2024
Reconciling The Data In Your Databases With Datafold Read the full episode description	Mar 17, 2024
Version Your Data Lakehouse Like Your Software With Nessie Read the full episode description	Mar 10, 2024
When And How To Conduct An AI Program Read the full episode description	Mar 03, 2024
Find Out About The Technology Behind The Latest PFAD In Analytical Database Development Read the full episode description	Feb 25, 2024
Using Trino And Iceberg As The Foundation Of Your Data Lakehouse Read the full episode description	Feb 18, 2024
Data Sharing Across Business And Platform Boundaries Read the full episode description	Feb 11, 2024
Tackling Real Time Streaming Data With SQL Using RisingWave Read the full episode description	Feb 04, 2024
Build A Data Lake For Your Security Logs With Scanner Read the full episode description	Jan 29, 2024
Modern Customer Data Platform Principles Read the full episode description	Jan 22, 2024
Pushing The Limits Of Scalability And User Experience For Data Processing WIth Jignesh Patel Read the full episode description	Jan 07, 2024
Designing Data Platforms For Fintech Companies Read the full episode description	Jan 01, 2024
Troubleshooting Kafka In Production Read the full episode description	Dec 24, 2023
Adding An Easy Mode For The Modern Data Stack With 5X Read the full episode description	Dec 18, 2023
Run Your Own Anomaly Detection For Your Critical Business Metrics With Anomstack Read the full episode description	Dec 11, 2023
Designing Data Transfer Systems That Scale Read the full episode description	Dec 04, 2023
Addressing The Challenges Of Component Integration In Data Platform Architectures Read the full episode description	Nov 27, 2023
Unlocking Your dbt Projects With Practical Advice For Practitioners Read the full episode description	Nov 20, 2023
Enhancing The Abilities Of Software Engineers With Generative AI At Tabnine Read the full episode description	Nov 13, 2023
Shining Some Light In The Black Box Of PostgreSQL Performance Read the full episode description	Nov 06, 2023
Surveying The Market Of Database Products Read the full episode description	Oct 30, 2023
Defining A Strategy For Your Data Products Read the full episode description	Oct 23, 2023
Reducing The Barrier To Entry For Building Stream Processing Applications With Decodable Read the full episode description	Oct 15, 2023
Using Data To Illuminate The Intentionally Opaque Insurance Industry Read the full episode description	Oct 09, 2023
Building ETL Pipelines With Generative AI Read the full episode description	Oct 01, 2023
Powering Vector Search With Real Time And Incremental Vector Indexes Read the full episode description	Sep 25, 2023
Building Linked Data Products With JSON-LD Read the full episode description	Sep 17, 2023
An Overview Of The State Of Data Orchestration In An Increasingly Complex Data Ecosystem Read the full episode description	Sep 10, 2023
Eliminate The Overhead In Your Data Integration With The Open Source dlt Library Read the full episode description	Sep 04, 2023
Building An Internal Database As A Service Platform At Cloudflare Read the full episode description	Aug 28, 2023
Harnessing Generative AI For Creating Educational Content With Illumidesk Read the full episode description	Aug 20, 2023
Unpacking The Seven Principles Of Modern Data Pipelines Read the full episode description	Aug 14, 2023
Quantifying The Return On Investment For Your Data Team Read the full episode description	Aug 06, 2023
Strategies For A Successful Data Platform Migration Read the full episode description	Jul 31, 2023
Build Real Time Applications With Operational Simplicity Using Dozer Read the full episode description	Jul 24, 2023
Datapreneurs - How Todays Business Leaders Are Using Data To Define The Future Read the full episode description	Jul 17, 2023
Reduce Friction In Your Business Analytics Through Entity Centric Data Modeling Read the full episode description	Jul 09, 2023
How Data Engineering Teams Power Machine Learning With Feature Platforms Read the full episode description	Jul 03, 2023
Seamless SQL And Python Transformations For Data Engineers And Analysts With SQLMesh Read the full episode description	Jun 25, 2023
How Column-Aware Development Tooling Yields Better Data Models Read the full episode description	Jun 18, 2023
Build Better Tests For Your dbt Projects With Datafold And data-diff Read the full episode description	Jun 11, 2023
Reduce The Overhead In Your Pipelines With Agile Data Engine's DataOps Service Read the full episode description	Jun 04, 2023
A Roadmap To Bootstrapping The Data Team At Your Startup Read the full episode description	May 29, 2023
Keep Your Data Lake Fresh With Real Time Streams Using Estuary Read the full episode description	May 21, 2023
What Happens When The Abstractions Leak On Your Data Read the full episode description	May 15, 2023
Use Consistent And Up To Date Customer Profiles To Power Your Business With Segment Unify Read the full episode description	May 07, 2023
Realtime Data Applications Made Easier With Meroxa Read the full episode description	Apr 24, 2023
Building Self Serve Business Intelligence With AI And Semantic Modeling At Zenlytic Read the full episode description	Apr 16, 2023
An Exploration Of The Composable Customer Data Platform Read the full episode description	Apr 10, 2023
Mapping The Data Infrastructure Landscape As A Venture Capitalist Read the full episode description	Apr 03, 2023
Unlocking The Potential Of Streaming Data Applications Without The Operational Headache At Grainite Read the full episode description	Mar 25, 2023
Aligning Data Security With Business Productivity To Deploy Analytics Safely And At Speed Read the full episode description	Mar 19, 2023
Use Your Data Warehouse To Power Your Product Analytics With NetSpring Read the full episode description	Mar 10, 2023
Exploring The Nuances Of Building An Intentional Data Culture Read the full episode description	Mar 06, 2023
Building A Data Mesh Platform At PayPal Read the full episode description	Feb 27, 2023
The View Below The Waterline Of Apache Iceberg And How It Fits In Your Data Lakehouse Read the full episode description	Feb 19, 2023
Let The Whole Team Participate In Data With The Quilt Versioned Data Hub Read the full episode description	Feb 11, 2023
Reflecting On The Past 6 Years Of Data Engineering Read the full episode description	Feb 06, 2023
Let Your Business Intelligence Platform Build The Models Automatically With Omni Analytics Read the full episode description	Jan 30, 2023
Safely Test Your Applications And Analytics With Production Quality Data Using Tonic AI Read the full episode description	Jan 22, 2023
Building Applications With Data As Code On The DataOS Read the full episode description	Jan 16, 2023
Automate Your Pipeline Creation For Streaming Data Transformations With SQLake Read the full episode description	Jan 08, 2023
Increase Your Odds Of Success For Analytics And AI Through More Effective Knowledge Management With AlignAI Read the full episode description	Dec 29, 2022
Using Product Driven Development To Improve The Productivity And Effectiveness Of Your Data Teams Read the full episode description	Dec 29, 2022
Simple And Scalable Encryption Of Data In Use For Analytics And Machine Learning With Opaque Systems Read the full episode description	Dec 26, 2022
An Exploration Of Tobias' Experience In Building A Data Lakehouse From Scratch Read the full episode description	Dec 26, 2022
Making Sense Of The Technical And Organizational Considerations Of Data Contracts Read the full episode description	Dec 19, 2022
Revisit The Fundamental Principles Of Working With Data To Avoid Getting Caught In The Hype Cycle Read the full episode description	Dec 19, 2022
Convert Your Unstructured Data To Embedding Vectors For More Efficient Machine Learning With Towhee Read the full episode description	Dec 12, 2022
Run Your Applications Worldwide Without Worrying About The Database With Planetscale Read the full episode description	Dec 12, 2022
Business Intelligence In The Palm Of Your Hand With Zing Data Read the full episode description	Dec 05, 2022
Adopting Real-Time Data At Organizations Of Every Size Read the full episode description	Dec 05, 2022
Supporting And Expanding The Arrow Ecosystem For Fast And Efficient Data Processing At Voltron Data Read the full episode description	Nov 28, 2022
Analyze Massive Data At Interactive Speeds With The Power Of Bitmaps Using FeatureBase Read the full episode description	Nov 28, 2022
A Look At The Data Systems Behind The Gameplay For League Of Legends Read the full episode description	Nov 21, 2022
Tame The Entropy In Your Data Stack And Prevent Failures With Sifflet Read the full episode description	Nov 21, 2022
Build Data Products Without A Data Team Using AgileData Read the full episode description	Nov 14, 2022
Taking A Look Under The Hood At CreditKarma's Data Platform Read the full episode description	Nov 14, 2022
Build Better Data Products By Creating Data, Not Consuming It Read the full episode description	Nov 07, 2022
Clean Up Your Data Using Scalable Entity Resolution And Data Mastering With Zingg Read the full episode description	Nov 07, 2022
Expanding The Reach of Business Intelligence Through Ubiquitous Embedded Analytics With Sisense Read the full episode description	Oct 31, 2022
Analytics Engineering Without The Friction Of Complex Pipeline Development With Optimus and dbt Read the full episode description	Oct 30, 2022
How To Bring Agile Practices To Your Data Projects Read the full episode description	Oct 23, 2022
Going From Transactional To Analytical And Self-managed To Cloud On One Database With MariaDB Read the full episode description	Oct 23, 2022
Speeding Up The Time To Insight For Supply Chains And Logistics With The Pathway Database That Thinks Read the full episode description	Oct 16, 2022
An Exploration Of The Open Data Lakehouse And Dremio's Contribution To The Ecosystem Read the full episode description	Oct 16, 2022
Making The Open Data Lakehouse Affordable Without The Overhead At Iomete Read the full episode description	Oct 10, 2022
Investing In Understanding The Customer Journey At American Express Read the full episode description	Oct 10, 2022
Gain Visibility And Insight Into Your Supply Chains Through Operational Analytics Powered By Roambee Read the full episode description	Oct 03, 2022
Make Data Lineage A Ubiquitous Part Of Your Work By Simplifying Its Implementation With Alvin Read the full episode description	Oct 03, 2022
Power Your Real-Time Analytics Without The Headache Using Fivetran's Change Data Capture Integrations Read the full episode description	Sep 26, 2022
Build A Common Understanding Of Your Data Reliability Rules With Soda Core and Soda Checks Language Read the full episode description	Sep 26, 2022
Building A Shared Understanding Of Data Assets In A Business Through A Single Pane Of Glass With Workstream Read the full episode description	Sep 19, 2022
Operational Analytics To Increase Efficiency For Multi-Location Businesses With OpsAnalitica Read the full episode description	Sep 19, 2022
Building Data Pipelines That Run From Source To Analysis And Activation With Hevo Data Read the full episode description	Sep 12, 2022
Build Confidence In Your Data Platform With Schema Compatibility Reports That Span Systems And Domains Using Schemata Read the full episode description	Sep 12, 2022
A Reflection On Data Observability As It Reaches Broader Adoption Read the full episode description	Sep 05, 2022
Introduce Climate Analytics Into Your Data Platform Without The Heavy Lifting Using Sust Global Read the full episode description	Sep 05, 2022
An Exploration Of What Data Automation Can Provide To Data Engineers And Ascend's Journey To Make It A Reality Read the full episode description	Aug 29, 2022
Alumni Of AirBnB's Early Years Reflect On What They Learned About Building Data Driven Organizations Read the full episode description	Aug 28, 2022
An Exploration Of The Expectations, Ecosystem, and Realities Of Real-Time Data Applications Read the full episode description	Aug 22, 2022
Understanding The Role Of The Chief Data Officer Read the full episode description	Aug 22, 2022
Bringing Automation To Data Labeling For Machine Learning With Watchful Read the full episode description	Aug 14, 2022
Collecting And Retaining Contextual Metadata For Powerful And Effective Data Discovery Read the full episode description	Aug 14, 2022
Useful Lessons And Repeatable Patterns Learned From Data Mesh Implementations At AgileLab Read the full episode description	Aug 06, 2022
Optimize Your Machine Learning Development And Serving With The Open Source Vector Database Milvus Read the full episode description	Aug 06, 2022
Interactive Exploratory Data Analysis On Petabyte Scale Data Sets With Arkouda Read the full episode description	Jul 31, 2022
What "Data Lineage Done Right" Looks Like And How They're Doing It At Manta Read the full episode description	Jul 31, 2022
Writing The Book That Offers A Single Reference For The Fundamentals Of Data Engineering Read the full episode description	Jul 24, 2022
Re-Bundling The Data Stack With Data Orchestration And Software Defined Assets Using Dagster Read the full episode description	Jul 24, 2022
Making The Total Cost Of Ownership For External Data Manageable With Crux Read the full episode description	Jul 17, 2022
Joe Reis Flips The Script And Interviews Tobias Macey About The Data Engineering Podcast Read the full episode description	Jul 17, 2022
Charting the Path of Riskified's Data Platform Journey Read the full episode description	Jul 10, 2022
Maintain Your Data Engineers' Sanity By Embracing Automation Read the full episode description	Jul 10, 2022
Be Confident In Your Data Integration By Quickly Validating Matching Records With data-diff Read the full episode description	Jul 03, 2022
The View From The Lakehouse Of Architectural Patterns For Your Data Platform Read the full episode description	Jul 03, 2022
Bring Geospatial Analytics Across Disparate Datasets Into Your Toolkit With The Unfolded Platform Read the full episode description	Jun 27, 2022
Strategies And Tactics For A Successful Master Data Management Implementation Read the full episode description	Jun 27, 2022
Combining The Simplicity Of Spreadsheets With The Power Of Modern Data Infrastructure At Canvas Read the full episode description	Jun 19, 2022
Level Up Your Data Platform With Active Metadata Read the full episode description	Jun 19, 2022
Discover And De-Clutter Your Unstructured Data With Aparavi Read the full episode description	Jun 13, 2022
Hire And Scale Your Data Team With Intention Read the full episode description	Jun 13, 2022
Simplify Data Security For Sensitive Information With The Skyflow Data Privacy Vault Read the full episode description	Jun 06, 2022
Bringing The Modern Data Stack To Everyone With Y42 Read the full episode description	Jun 06, 2022
A Multipurpose Database For Transactions And Analytics To Simplify Your Data Architecture With Singlestore Read the full episode description	May 30, 2022
Data Cloud Cost Optimization With Bluesky Data Read the full episode description	May 30, 2022
Unlocking The Value Of Data Across The Organization Through User Friendly Data Tools With Prophecy Read the full episode description	May 23, 2022
Cloud Native Data Orchestration For Machine Learning And Data Engineering With Flyte Read the full episode description	May 23, 2022
Insights And Advice On Building A Data Lake Platform From Someone Who Learned The Hard Way Read the full episode description	May 16, 2022
Designing And Deploying IoT Analytics For Industrial Applications At Vopak Read the full episode description	May 16, 2022
Scaling Analysis of Connected Data And Modeling Complex Relationships With The TigerGraph Graph Database Read the full episode description	May 09, 2022
Exploring The Insights And Impact Of Dan Delorey's Distinguished Career In Data Read the full episode description	May 09, 2022
Leading The Charge For The ELT Data Integration Pattern For Cloud Data Warehouses At Matillion Read the full episode description	May 02, 2022
Evolving And Scaling The Data Platform at Yotpo Read the full episode description	May 02, 2022
Operational Analytics At Speed With Minimal Busy Work Using Incorta Read the full episode description	Apr 24, 2022
Gain Visibility Into Your Entire Machine Learning System Using Data Logging With WhyLogs Read the full episode description	Apr 24, 2022
Connecting To The Next Frontier Of Computing With Quantum Networks Read the full episode description	Apr 18, 2022
What Does It Really Mean To Do MLOps And What Is The Data Engineer's Role? Read the full episode description	Apr 16, 2022
DataOps As A Service For Your Data Integration Workflows With Rivery Read the full episode description	Apr 11, 2022
Synthetic Data As A Service For Simplifying Privacy Engineering With Gretel Read the full episode description	Apr 10, 2022
Accelerate Development Of Enterprise Analytics With The Coalesce Visual Workflow Builder Read the full episode description	Apr 03, 2022
Repeatable Patterns For Designing Data Platforms And When To Customize Them Read the full episode description	Apr 03, 2022
Eliminate The Bottlenecks In Your Key/Value Storage With SpeeDB Read the full episode description	Mar 27, 2022
Building A Data Governance Bridge Between Cloud And Datacenters For The Enterprise At Privacera Read the full episode description	Mar 27, 2022
Exploring Incident Management Strategies For Data Teams Read the full episode description	Mar 20, 2022
Accelerate Your Embedded Analytics With Apache Pinot Read the full episode description	Mar 20, 2022
Taking A Multidimensional Approach To Data Observability At Acceldata Read the full episode description	Mar 14, 2022
Accelerating Adoption Of The Modern Data Stack At 5X Data Read the full episode description	Mar 14, 2022
Move Your Database To The Data And Speed Up Your Analytics With DuckDB Read the full episode description	Mar 05, 2022
Developer Friendly Application Persistence That Is Fast And Scalable With HarperDB Read the full episode description	Mar 05, 2022
Reflections On Designing A Data Platform From Scratch Read the full episode description	Feb 28, 2022
Manage Your Unstructured Data Assets Across Cloud And Hybrid Environments With Komprise Read the full episode description	Feb 28, 2022
Build Your Python Data Processing Your Way And Run It Anywhere With Fugue Read the full episode description	Feb 21, 2022
Understanding The Immune System With Data At ImmunAI Read the full episode description	Feb 21, 2022
Bring Your Code To Your Streaming And Static Data Without Effort With The Deephaven Real Time Query Engine Read the full episode description	Feb 14, 2022
Build Your Own End To End Customer Data Platform With Rudderstack Read the full episode description	Feb 14, 2022
Scale Your Spatial Analysis By Building It In SQL With Syntax Extensions Read the full episode description	Feb 07, 2022
Scalable Strategies For Protecting Data Privacy In Your Shared Data Sets Read the full episode description	Feb 06, 2022
A Reflection On Learning A Lot More Than 97 Things Every Data Engineer Should Know Read the full episode description	Jan 31, 2022
Effective Pandas Patterns For Data Engineering Read the full episode description	Jan 31, 2022
Building And Managing Data Teams And Data Platforms In Large Organizations With Ashish Mrig Read the full episode description	Jan 23, 2022
The Importance Of Data Contracts As The Interface For Data Integration With Abhi Sivasailam Read the full episode description	Jan 23, 2022
Automated Data Quality Management Through Machine Learning With Anomalo Read the full episode description	Jan 15, 2022
An Introduction To Data And Analytics Engineering For Non-Programmers Read the full episode description	Jan 15, 2022
Open Source Reverse ETL For Everyone With Grouparoo Read the full episode description	Jan 08, 2022
Data Observability Out Of The Box With Metaplane Read the full episode description	Jan 08, 2022
Creating Shared Context For Your Data Warehouse With A Controlled Vocabulary Read the full episode description	Jan 02, 2022
A Reflection On The Data Ecosystem For The Year 2021 Read the full episode description	Jan 02, 2022
Exploring The Evolving Role Of Data Engineers Read the full episode description	Dec 27, 2021
Revisiting The Technical And Social Benefits Of The Data Mesh Read the full episode description	Dec 27, 2021
Fast And Flexible Headless Data Analytics With Cube.JS Read the full episode description	Dec 21, 2021
Building A System Of Record For Your Organization's Data Ecosystem At Metaphor Read the full episode description	Dec 20, 2021
Building Auditable Spark Pipelines At Capital One Read the full episode description	Dec 13, 2021
Deliver Personal Experiences In Your Applications With The Unomi Open Source Customer Data Platform Read the full episode description	Dec 12, 2021
Data Driven Hiring For Data Professionals With Alooba Read the full episode description	Dec 04, 2021
Experimentation and A/B Testing For Modern Data Teams With Eppo Read the full episode description	Dec 04, 2021
Creating A Unified Experience For The Modern Data Stack At Mozart Data Read the full episode description	Nov 27, 2021
Doing DataOps For External Data Sources As A Service at Demyst Read the full episode description	Nov 27, 2021
Exploring Processing Patterns For Streaming Data Integration In Your Data Lake Read the full episode description	Nov 20, 2021
Laying The Foundation Of Your Data Platform For The Era Of Big Complexity With Dagster Read the full episode description	Nov 20, 2021
Data Quality Starts At The Source Read the full episode description	Nov 14, 2021
Eliminate Friction In Your Data Platform Through Unified Metadata Using OpenMetadata Read the full episode description	Nov 10, 2021
Business Intelligence Beyond The Dashboard With ClicData Read the full episode description	Nov 06, 2021
Exploring The Evolution And Adoption of Customer Data Platforms and Reverse ETL Read the full episode description	Nov 05, 2021
Removing The Barrier To Exploratory Analytics with Activity Schema and Narrator Read the full episode description	Oct 29, 2021
Streaming Data Pipelines Made SQL With Decodable Read the full episode description	Oct 29, 2021
Data Exploration For Business Users Powered By Analytics Engineering With Lightdash Read the full episode description	Oct 23, 2021
Completing The Feedback Loop Of Data Through Operational Analytics With Census Read the full episode description	Oct 21, 2021
Bringing The Power Of The DataHub Real-Time Metadata Graph To Everyone At Acryl Data Read the full episode description	Oct 16, 2021
How And Why To Become Data Driven As A Business Read the full episode description	Oct 14, 2021
Make Your Business Metrics Reusable With Open Source Headless BI Using Metriql Read the full episode description	Oct 08, 2021
Adding Support For Distributed Transactions To The Redpanda Streaming Engine Read the full episode description	Oct 06, 2021
Building Real-Time Data Platforms For Large Volumes Of Information With Aerospike Read the full episode description	Oct 02, 2021
Delivering Your Personal Data Cloud With Prifina Read the full episode description	Sep 30, 2021
Digging Into Data Reliability Engineering Read the full episode description	Sep 26, 2021
Massively Parallel Data Processing In Python Without The Effort Using Bodo Read the full episode description	Sep 25, 2021
Declarative Machine Learning Without The Operational Overhead Using Continual Read the full episode description	Sep 19, 2021
An Exploration Of The Data Engineering Requirements For Bioinformatics Read the full episode description	Sep 19, 2021
Setting The Stage For The Next Chapter Of The Cassandra Database Read the full episode description	Sep 12, 2021
A View From The Round Table Of Gartner's Cool Vendors Read the full episode description	Sep 09, 2021
Designing And Building Data Platforms As A Product Read the full episode description	Sep 04, 2021
Presto Powered Cloud Data Lakes At Speed Made Easy With Ahana Read the full episode description	Sep 02, 2021
Do Away With Data Integration Through A Dataware Architecture With Cinchy Read the full episode description	Aug 28, 2021
Decoupling Data Operations From Data Infrastructure Using Nexla Read the full episode description	Aug 25, 2021
Let Your Analysts Build A Data Lakehouse With Cuelake Read the full episode description	Aug 21, 2021
Migrate And Modify Your Data Platform Confidently With Compilerworks Read the full episode description	Aug 18, 2021
Prepare Your Unstructured Data For Machine Learning And Computer Vision Without The Toil Using Activeloop Read the full episode description	Aug 15, 2021
Build Trust In Your Data By Understanding Where It Comes From And How It Is Used With Stemma Read the full episode description	Aug 10, 2021
Data Discovery From Dashboards To Databases With Castor Read the full episode description	Aug 07, 2021
Charting A Path For Streaming Data To Fill Your Data Lake With Hudi Read the full episode description	Aug 03, 2021
Adding Context And Comprehension To Your Analytics Through Data Discovery With SelectStar Read the full episode description	Jul 31, 2021
Building a Multi-Tenant Managed Platform For Streaming Data With Pulsar at Datastax Read the full episode description	Jul 28, 2021
Bringing The Metrics Layer To The Masses With Transform Read the full episode description	Jul 23, 2021
Strategies For Proactive Data Quality Management Read the full episode description	Jul 20, 2021
Low Code And High Quality Data Engineering For The Whole Organization With Prophecy Read the full episode description	Jul 16, 2021
Exploring The Design And Benefits Of The Modern Data Stack Read the full episode description	Jul 13, 2021
Democratize Data Cleaning Across Your Organization With Trifacta Read the full episode description	Jul 09, 2021
Stick All Of Your Systems And Data Together With SaaSGlue As Your Workflow Manager Read the full episode description	Jul 05, 2021
Leveling Up Open Source Data Integration With Meltano Hub And The Singer SDK Read the full episode description	Jul 03, 2021
A Candid Exploration Of Timeseries Data Analysis With InfluxDB Read the full episode description	Jun 29, 2021
Lessons Learned From The Pipeline Data Engineering Academy Read the full episode description	Jun 26, 2021
Make Database Performance Optimization A Playful Experience With OtterTune Read the full episode description	Jun 23, 2021
Bring Order To The Chaos Of Your Unstructured Data Assets With Unstruk Read the full episode description	Jun 18, 2021
Accelerating ML Training And Delivery With In-Database Machine Learning Read the full episode description	Jun 15, 2021
Taking A Tour Of The Google Cloud Platform For Data And Analytics Read the full episode description	Jun 12, 2021
Make Sure Your Records Are Reliable With The BookKeeper Distributed Storage Layer Read the full episode description	Jun 09, 2021
Build Your Analytics With A Collaborative And Expressive SQL IDE Using Querybook Read the full episode description	Jun 03, 2021
Making Data Pipelines Self-Serve For Everyone With Shipyard Read the full episode description	Jun 02, 2021
Paving The Road For Fast Analytics On Distributed Clouds With The Yellowbrick Data Warehouse Read the full episode description	May 28, 2021
Easily Build Advanced Similarity Search With The Pinecone Vector Database Read the full episode description	May 25, 2021
A Holistic Approach To Data Governance Through Self Reflection At Collibra Read the full episode description	May 21, 2021
Unlocking The Power of Data Lineage In Your Platform with OpenLineage Read the full episode description	May 18, 2021
Building Your Data Warehouse On Top Of PostgreSQL Read the full episode description	May 14, 2021
Making Analytical APIs Fast With Tinybird Read the full episode description	May 11, 2021
Making Spark Cloud Native At Data Mechanics Read the full episode description	May 07, 2021
The Grand Vision And Present Reality of DataOps Read the full episode description	May 04, 2021
Self Service Data Exploration And Dashboarding With Superset Read the full episode description	Apr 27, 2021
Moving Machine Learning Into The Data Pipeline at Cherre Read the full episode description	Apr 20, 2021
Exploring The Expanding Landscape Of Data Professions with Josh Benamram of Databand Read the full episode description	Apr 13, 2021
Put Your Whole Data Team On The Same Page With Atlan Read the full episode description	Apr 06, 2021
Data Quality Management For The Whole Team With Soda Data Read the full episode description	Mar 30, 2021
Real World Change Data Capture At Datacoral Read the full episode description	Mar 23, 2021
Managing The DoorDash Data Platform Read the full episode description	Mar 16, 2021
Leave Your Data Where It Is And Automate Feature Extraction With Molecula Read the full episode description	Mar 09, 2021
Bridging The Gap Between Machine Learning And Operations At Iguazio Read the full episode description	Mar 02, 2021
Self Service Open Source Data Integration With AirByte Read the full episode description	Feb 23, 2021
Building The Foundations For Data Driven Businesses at 5xData Read the full episode description	Feb 16, 2021
How Shopify Is Building Their Production Data Warehouse Using DBT Read the full episode description	Feb 09, 2021
System Observability For The Cloud Native Era With Chronosphere Read the full episode description	Feb 02, 2021
Making It Easier To Stick B2B Data Integration Pipelines Together With Hotglue Read the full episode description	Jan 26, 2021
Using Your Data Warehouse As The Source Of Truth For Customer Data With Hightouch Read the full episode description	Jan 19, 2021
Enabling Version Controlled Data Collaboration With TerminusDB Read the full episode description	Jan 11, 2021
Bringing Feature Stores and MLOps to the Enterprise at Tecton Read the full episode description	Jan 05, 2021
Off The Shelf Data Governance With Satori Read the full episode description	Dec 28, 2020
Low Friction Data Governance With Immuta Read the full episode description	Dec 21, 2020
Building A Self Service Data Platform For Alternative Data Analytics At YipitData Read the full episode description	Dec 15, 2020
Proven Patterns For Building Successful Data Teams Read the full episode description	Dec 07, 2020
Streaming Data Integration Without The Code at Equalum Read the full episode description	Nov 30, 2020
Keeping A Bigeye On The Data Quality Market Read the full episode description	Nov 23, 2020
Self Service Data Management From Ingest To Insights With Isima Read the full episode description	Nov 17, 2020
Building A Cost Effective Data Catalog With Tree Schema Read the full episode description	Nov 10, 2020
Add Version Control To Your Data Lake With LakeFS Read the full episode description	Nov 03, 2020
Cloud Native Data Security As Code With Cyral Read the full episode description	Oct 26, 2020
Better Data Quality Through Observability With Monte Carlo Read the full episode description	Oct 19, 2020
Rapid Delivery Of Business Intelligence Using Power BI Read the full episode description	Oct 12, 2020
Self Service Real Time Data Integration Without The Headaches With Meroxa Read the full episode description	Oct 05, 2020
Speed Up And Simplify Your Streaming Data Workloads With Red Panda Read the full episode description	Sep 29, 2020
Cutting Through The Noise And Focusing On The Fundamentals Of Data Engineering With The Data Janitor Read the full episode description	Sep 22, 2020
Distributed In Memory Processing And Streaming With Hazelcast Read the full episode description	Sep 15, 2020
Simplify Your Data Architecture With The Presto Distributed SQL Engine Read the full episode description	Sep 07, 2020
Building A Better Data Warehouse For The Cloud At Firebolt Read the full episode description	Sep 01, 2020
Metadata Management And Integration At LinkedIn With DataHub Read the full episode description	Aug 25, 2020
Exploring The TileDB Universal Data Engine Read the full episode description	Aug 17, 2020
Closing The Loop On Event Data Collection With Iteratively Read the full episode description	Aug 10, 2020
A Practical Introduction To Graph Data Applications Read the full episode description	Aug 04, 2020
Build More Reliable Distributed Systems By Breaking Them With Jepsen Read the full episode description	Jul 28, 2020
Making Wind Energy More Efficient With Data At Turbit Systems Read the full episode description	Jul 21, 2020
Open Source Production Grade Data Integration With Meltano Read the full episode description	Jul 13, 2020
DataOps For Streaming Systems With Lenses.io Read the full episode description	Jul 06, 2020
Data Collection And Management To Power Sound Recognition At Audio Analytic Read the full episode description	Jun 30, 2020
Bringing Business Analytics To End Users With GoodData Read the full episode description	Jun 23, 2020
Accelerate Your Machine Learning With The StreamSQL Feature Store Read the full episode description	Jun 15, 2020
Data Management Trends From An Investor Perspective Read the full episode description	Jun 08, 2020
Building A Data Lake For The Database Administrator At Upsolver Read the full episode description	Jun 02, 2020
Mapping The Customer Journey For B2B Companies At Dreamdata Read the full episode description	May 25, 2020
Power Up Your PostgreSQL Analytics With Swarm64 Read the full episode description	May 18, 2020
StreamNative Brings Streaming Data To The Cloud Native Landscape With Pulsar Read the full episode description	May 11, 2020
Enterprise Data Operations And Orchestration At Infoworks Read the full episode description	May 04, 2020
Taming Complexity In Your Data Driven Organization With DataOps Read the full episode description	Apr 28, 2020
Building Real Time Applications On Streaming Data With Eventador Read the full episode description	Apr 20, 2020
Making Data Collection In Your Code Easy With Rookout Read the full episode description	Apr 14, 2020
Building A Knowledge Graph Of Commercial Real Estate At Cherre Read the full episode description	Apr 07, 2020
The Life Of A Non-Profit Data Professional Read the full episode description	Mar 30, 2020
Behind The Scenes Of The Linode Object Storage Service Read the full episode description	Mar 23, 2020
Building A New Foundation For CouchDB Read the full episode description	Mar 17, 2020
Scaling Data Governance For Global Businesses With A Data Hub Architecture Read the full episode description	Mar 09, 2020
Easier Stream Processing On Kafka With ksqlDB Read the full episode description	Mar 02, 2020
Shining A Light on Shadow IT In Data And Analytics Read the full episode description	Feb 25, 2020
Data Infrastructure Automation For Private SaaS At Snowplow Read the full episode description	Feb 18, 2020
Data Modeling That Evolves With Your Business Using Data Vault Read the full episode description	Feb 09, 2020
The Benefits And Challenges Of Building A Data Trust Read the full episode description	Feb 03, 2020
Pay Down Technical Debt In Your Data Pipeline With Great Expectations Read the full episode description	Jan 27, 2020
Replatforming Production Dataflows Read the full episode description	Jan 20, 2020
Planet Scale SQL For The New Generation Of Applications With YugabyteDB Read the full episode description	Jan 13, 2020
Change Data Capture For All Of Your Databases With Debezium Read the full episode description	Jan 06, 2020
Building The DataDog Platform For Processing Timeseries Data At Massive Scale Read the full episode description	Dec 30, 2019
Building The Materialize Engine For Interactive Streaming Analytics In SQL Read the full episode description	Dec 23, 2019
Solving Data Lineage Tracking And Data Discovery At WeWork Read the full episode description	Dec 16, 2019
SnowflakeDB: The Data Warehouse Built For The Cloud Read the full episode description	Dec 09, 2019
Organizing And Empowering Data Engineers At Citadel Read the full episode description	Dec 03, 2019
Building A Real Time Event Data Warehouse For Sentry Read the full episode description	Nov 26, 2019
Escaping Analysis Paralysis For Your Data Platform With Data Virtualization Read the full episode description	Nov 18, 2019
Designing For Data Protection Read the full episode description	Nov 11, 2019
Automating Your Production Dataflows On Spark Read the full episode description	Nov 04, 2019
Build Maintainable And Testable Data Applications With Dagster Read the full episode description	Oct 28, 2019
Data Orchestration For Hybrid Cloud Analytics Read the full episode description	Oct 22, 2019
Keeping Your Data Warehouse In Order With DataForm Read the full episode description	Oct 15, 2019
Fast Analytics On Semi-Structured And Structured Data In The Cloud Read the full episode description	Oct 08, 2019
Ship Faster With An Opinionated Data Pipeline Framework Read the full episode description	Oct 01, 2019
Open Source Object Storage For All Of Your Data Read the full episode description	Sep 23, 2019
Navigating Boundless Data Streams With The Swim Kernel Read the full episode description	Sep 18, 2019
Building A Reliable And Performant Router For Observability Data Read the full episode description	Sep 10, 2019
Building A Community For Data Professionals at Data Council Read the full episode description	Sep 02, 2019
Building Tools And Platforms For Data Analytics Read the full episode description	Aug 26, 2019
A High Performance Platform For The Full Big Data Lifecycle Read the full episode description	Aug 19, 2019
Digging Into Data Replication At Fivetran Read the full episode description	Aug 12, 2019
Solving Data Discovery At Lyft Read the full episode description	Aug 05, 2019
Simplifying Data Integration Through Eventual Connectivity Read the full episode description	Jul 29, 2019
Straining Your Data Lake Through A Data Mesh Read the full episode description	Jul 22, 2019
Data Labeling That You Can Feel Good About With CloudFactory Read the full episode description	Jul 15, 2019
Scale Your Analytics On The Clickhouse Data Warehouse Read the full episode description	Jul 08, 2019
Stress Testing Kafka And Cassandra For Real-Time Anomaly Detection Read the full episode description	Jul 02, 2019
The Workflow Engine For Data Engineers And Data Scientists Read the full episode description	Jun 25, 2019
Maintaining Your Data Lake At Scale With Spark Read the full episode description	Jun 17, 2019
Managing The Machine Learning Lifecycle Read the full episode description	Jun 10, 2019
Evolving An ETL Pipeline For Better Productivity Read the full episode description	Jun 04, 2019
Data Lineage For Your Pipelines Read the full episode description	May 27, 2019
Build Your Data Analytics Like An Engineer With DBT Read the full episode description	May 20, 2019
Using FoundationDB As The Bedrock For Your Distributed Systems Read the full episode description	May 07, 2019
Running Your Database On Kubernetes With KubeDB Read the full episode description	Apr 29, 2019
Unpacking Fauna: A Global Scale Cloud Native Database Read the full episode description	Apr 22, 2019
Index Your Big Data With Pilosa For Faster Analytics Read the full episode description	Apr 15, 2019
Serverless Data Pipelines On DataCoral Read the full episode description	Apr 08, 2019
Why Analytics Projects Fail And What To Do About It Read the full episode description	Apr 01, 2019
Building An Enterprise Data Fabric At CluedIn Read the full episode description	Mar 25, 2019
A DataOps vs DevOps Cookoff In The Data Kitchen Read the full episode description	Mar 18, 2019
Customer Analytics At Scale With Segment Read the full episode description	Mar 04, 2019
Deep Learning For Data Engineers Read the full episode description	Feb 25, 2019
Speed Up Your Analytics With The Alluxio Distributed Storage System Read the full episode description	Feb 19, 2019
Machine Learning In The Enterprise Read the full episode description	Feb 11, 2019
Cleaning And Curating Open Data For Archaeology Read the full episode description	Feb 04, 2019
Managing Database Access Control For Teams With strongDM Read the full episode description	Jan 29, 2019
Building Enterprise Big Data Systems At LEGO Read the full episode description	Jan 21, 2019
TimescaleDB: The Timeseries Database Built For SQL And Scale - Episode 65 Read the full episode description	Jan 14, 2019
Performing Fast Data Analytics Using Apache Kudu - Episode 64 Read the full episode description	Jan 07, 2019
Simplifying Continuous Data Processing Using Stream Native Storage In Pravega with Tom Kaitchuck - Episode 63 Read the full episode description	Dec 31, 2018
Continuously Query Your Time-Series Data Using PipelineDB with Derek Nelson and Usman Masood - Episode 62 Read the full episode description	Dec 24, 2018
Advice On Scaling Your Data Pipeline Alongside Your Business with Christian Heinzmann - Episode 61 Read the full episode description	Dec 17, 2018
Putting Apache Spark Into Action with Jean Georges Perrin - Episode 60 Read the full episode description	Dec 10, 2018
Apache Zookeeper As A Building Block For Distributed Systems with Patrick Hunt - Episode 59 Read the full episode description	Dec 03, 2018
Set Up Your Own Data-as-a-Service Platform On Dremio with Tomer Shiran - Episode 58 Read the full episode description	Nov 26, 2018
Stateful, Distributed Stream Processing on Flink with Fabian Hueske - Episode 57 Read the full episode description	Nov 19, 2018
How Upsolver Is Building A Data Lake Platform In The Cloud with Yoni Iny - Episode 56 Read the full episode description	Nov 11, 2018
Self Service Business Intelligence And Data Sharing Using Looker with Daniel Mintz - Episode 55 Read the full episode description	Nov 05, 2018
Using Notebooks As The Unifying Layer For Data Roles At Netflix with Matthew Seal - Episode 54 Read the full episode description	Oct 29, 2018
Of Checklists, Ethics, and Data with Emily Miller and Peter Bull (Cross Post from Podcast.__init__) - Episode 53 Read the full episode description	Oct 22, 2018
Improving The Performance Of Cloud-Native Big Data At Netflix Using The Iceberg Table Format with Ryan Blue - Episode 52 Read the full episode description	Oct 15, 2018
Combining Transactional And Analytical Workloads On MemSQL with Nikita Shamgunov Read the full episode description	Oct 09, 2018
Building A Knowledge Graph From Public Data At Enigma With Chris Groskopf - Episode 50 Read the full episode description	Oct 01, 2018
A Primer On Enterprise Data Curation with Todd Walter - Episode 49 Read the full episode description	Sep 24, 2018
Take Control Of Your Web Analytics Using Snowplow With Alexander Dean - Episode 48 Read the full episode description	Sep 17, 2018
Keep Your Data And Query It Too Using Chaos Search with Thomas Hazel and Pete Cheslock - Episode 47 Read the full episode description	Sep 10, 2018
An Agile Approach To Master Data Management with Mark Marinelli - Episode 46 Read the full episode description	Sep 03, 2018
Protecting Your Data In Use At Enveil with Ellison Anne Williams - Episode 45 Read the full episode description	Aug 27, 2018
Graph Databases In Production At Scale Using DGraph with Manish Jain - Episode 44 Read the full episode description	Aug 20, 2018
Putting Airflow Into Production With James Meickle - Episode 43 Read the full episode description	Aug 13, 2018
Taking A Tour Of PostgreSQL with Jonathan Katz - Episode 42 Read the full episode description	Aug 06, 2018
Mobile Data Collection And Analysis Using Ona And Canopy With Peter Lubell-Doughtie - Episode 41 Read the full episode description	Jul 30, 2018
Ceph: A Reliable And Scalable Distributed Filesystem with Sage Weil - Episode 40 Read the full episode description	Jul 16, 2018
Building Data Flows In Apache NiFi With Kevin Doran and Andy LoPresto - Episode 39 Read the full episode description	Jul 08, 2018
Leveraging Human Intelligence For Better AI At Alegion With Cheryl Martin - Episode 38 Read the full episode description	Jul 02, 2018
Package Management And Distribution For Your Data Using Quilt with Kevin Moore - Episode 37 Read the full episode description	Jun 25, 2018
User Analytics In Depth At Heap with Dan Robinson - Episode 36 Read the full episode description	Jun 17, 2018
CockroachDB In Depth with Peter Mattis - Episode 35 Read the full episode description	Jun 11, 2018
ArangoDB: Fast, Scalable, and Multi-Model Data Storage with Jan Steeman and Jan Stücke - Episode 34 Read the full episode description	Jun 04, 2018
The Alooma Data Pipeline With CTO Yair Weinberger - Episode 33 Read the full episode description	May 28, 2018
PrestoDB and Starburst Data with Kamil Bajda-Pawlikowski - Episode 32 Read the full episode description	May 21, 2018
Brief Conversations From The Open Data Science Conference: Part 2 - Episode 31 Read the full episode description	May 14, 2018
Brief Conversations From The Open Data Science Conference: Part 1 - Episode 30 Read the full episode description	May 07, 2018
Metabase Self Service Business Intelligence with Sameer Al-Sakran - Episode 29 Read the full episode description	Apr 30, 2018
Octopai: Metadata Management for Better Business Intelligence with Amnon Drori - Episode 28 Read the full episode description	Apr 23, 2018
Data Engineering Weekly with Joe Crobak - Episode 27 Read the full episode description	Apr 15, 2018
Defining DataOps with Chris Bergh - Episode 26 Read the full episode description	Apr 08, 2018
ThreatStack: Data Driven Cloud Security with Pete Cheslock and Patrick Cable - Episode 25 Read the full episode description	Apr 01, 2018
MarketStore: Managing Timeseries Financial Data with Hitoshi Harada and Christopher Ryan - Episode 24 Read the full episode description	Mar 25, 2018
Stretching The Elastic Stack with Philipp Krenn - Episode 23 Read the full episode description	Mar 19, 2018
Database Refactoring Patterns with Pramod Sadalage - Episode 22 Read the full episode description	Mar 12, 2018
The Future Data Economy with Roger Chen - Episode 21 Read the full episode description	Mar 05, 2018
Honeycomb Data Infrastructure with Sam Stokes - Episode 20 Read the full episode description	Feb 26, 2018
Data Teams with Will McGinnis - Episode 19 Read the full episode description	Feb 19, 2018
TimescaleDB: Fast And Scalable Timeseries with Ajay Kulkarni and Mike Freedman - Episode 18 Read the full episode description	Feb 11, 2018
Pulsar: Fast And Scalable Messaging with Rajan Dhabalia and Matteo Merli - Episode 17 Read the full episode description	Feb 04, 2018
Dat: Distributed Versioned Data Sharing with Danielle Robinson and Joe Hand - Episode 16 Read the full episode description	Jan 29, 2018
Snorkel: Extracting Value From Dark Data with Alex Ratner - Episode 15 Read the full episode description	Jan 22, 2018
CRDTs and Distributed Consensus with Christopher Meiklejohn - Episode 14 Read the full episode description	Jan 15, 2018
Citus Data: Distributed PostGreSQL for Big Data with Ozgun Erdogan and Craig Kerstiens - Episode 13 Read the full episode description	Jan 08, 2018
Wallaroo with Sean T. Allen - Episode 12 Read the full episode description	Dec 25, 2017
SiriDB: Scalable Open Source Timeseries Database with Jeroen van der Heijden - Episode 11 Read the full episode description	Dec 18, 2017
Confluent Schema Registry with Ewen Cheslack-Postava - Episode 10 Read the full episode description	Dec 10, 2017
data.world with Bryon Jacob - Episode 9 Read the full episode description	Dec 03, 2017
Data Serialization Formats with Doug Cutting and Julien Le Dem - Episode 8 Read the full episode description	Nov 22, 2017
Buzzfeed Data Infrastructure with Walter Menendez - Episode 7 Read the full episode description	Nov 14, 2017
Astronomer with Ry Walker - Episode 6 Read the full episode description	Aug 06, 2017
Rebuilding Yelp's Data Pipeline with Justin Cunningham - Episode 5 Read the full episode description	Jun 18, 2017
ScyllaDB with Eyal Gutkind - Episode 4 Read the full episode description	Mar 18, 2017
Defining Data Engineering with Maxime Beauchemin - Episode 3 Read the full episode description	Mar 05, 2017
Dask with Matthew Rocklin - Episode 2 Read the full episode description	Jan 22, 2017
Pachyderm with Daniel Whitenack - Episode 1 Read the full episode description	Jan 14, 2017
Introducing The Show Read the full episode description	Jan 08, 2017

Data Engineering Podcast

By Tobias Macey

Category: Technology

Open in Apple Podcasts

Open RSS feed

Open Website

Description