Software engineer with experience in environments ranging from 2 developers in a VC’s office to large multinational corporations and distributed open source projects (loves small companies). Focuses on systems (e.g. databases) and platform engineering, and has been both an architect and a manager/VP. He writes papers, gives talks, and blogs about software engineering and databases.
I am a Staff Engineer at InfluxData, where I work on the InfluxDB 3.0 time series database in Rust. My current focus is storage and query processing, which are built using the Apache DataFusion query engine and the Apache Arrow ecosystem. With Evan Kaplan, I coined the term FDAP Stack to describe modern systems composed from these reusable components.
I have a strong interest in open source software and the Apache Software Foundation, where I am a Member, as well as a committer and PMC member of Apache DataFusion and Apache Arrow. I spend a significant amount of my time contributing to and maintaining the code and community of DataFusion and arrow-rs, the Rust implementations of Arrow and Parquet.
2024-09-23
Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query Engine (talk)
Carnegie Mellon University: Database Building Blocks Seminar Series - Fall 2024
(slides, recording)
2024-06-19
Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query Engine. Andrew Lamb, Yijie Shen, Daniël Heres, Jayjeet Chakraborty, Mehmet Ozan Kabak, Chao Sun, and Liang-Chi Hsieh.
2024 International Conference on Management of Data (SIGMOD 2024), June 9-15, 2024, Santiago, Chile
(ACM DOI, PDF)
2012-08-27
The Vertica Analytic Database: C-Store 7 Years Later.
Andrew Lamb, Matt Fuller, Ramakrishna Varadarajan, Nga Tran, Ben Vandiver, Lyric Doshi, Chuck Bear.
38th International Conference on Very Large Data Bases, Proceedings of the VLDB Endowment, Vol. 5, No. 12 (VLDB 2012)
(PDF, PDF alternate)