An open API service aggregating public data about GitHub Sponsors.

julienledem

View JSON Representation

Apache Parquet co-creator. OpenLineage and Marquez (LFAI&Data) Apache Arrow, Iceberg, 🐖 PMC.

Funding Links: https://github.com/sponsors/julienledem

GitHub Sponsors Profile

I am a software architect, open source leader and entrepreneur who loves collaborating with others in Open Source projects. I started the Parquet project in collaboration with the Impala team at Cloudera back when I was at Twitter. I chaired the project for many years at the Apache foundation and Parquet is now the de-facto standard for data lakes. I later contributed to the creation of the Arrow project as a founding engineer at Dremio. Before that I received my initiation contributing to OpenSource in the Apache Pig project at Yahoo where I evolved from contributor to committer to PMC member and eventually chaired the project in 2013. More recently I started the OpenLineage project while being the CTO and co-founder of Datakin which was later acquired by Astronomer. OpenLineage came out of Marquez, the project we co-created at Wework on the data platform team.
I currently lead the OpenLineage project, facilitating discussions, ensuring all voices are heard and empowered and growing its ecosystem.

Featured Works

OpenLineage/OpenLineage

An Open Standard for lineage metadata collection

Language: Java - Stars: 1873
MarquezProject/marquez

Collect, aggregate, and visualize a data ecosystem's metadata

Language: Java - Stars: 1872
julienledem/brennus

Builder pattern to generate java classes

Language: Java - Stars: 17
julienledem/Jaqen

A type-safe heterogenous Map or a Named field Tuple depending how you looks at it.

Language: Scala - Stars: 1