Data Engineering Podcast

Find Out About The Technology Behind The Latest PFAD In Analytical Database Development



February 25th, 2024  •  56 mins  •  Download (36.2 MB)  •  Link with Timestamp

RSS Feed

Building a database engine requires a substantial amount of engineering effort and time investment. Over the decades of research and development into building these software systems there are a number of common components that are shared across implementations. When Paul Dix decided to re-write the InfluxDB engine he found the Apache Arrow ecosystem ready and waiting with useful building blocks to accelerate the process. In this episode he explains how he used the combination of Apache Arrow, Flight, Datafusion, and Parquet to lay the foundation of the newest version of his time-series database.