holdenk
Holden Karau is a trans Canadian and an open source contributor. She is a Spark committer and co-author of Learning Spark, High Performance Spark, and Kubeflow for ML.
Funding Links: https://github.com/sponsors/holdenk
- Name: Holden Karau
- Location: San Francisco, CA, USA
- Company: Open Source Big Data Dev
- Kind: user
- Followers: 2489
- Following: 19
- Total stars: 2848
- Repositories count: 281
- Created at: 2022-11-02T17:57:59.866Z
- Updated at: 2025-03-28T08:38:31.898Z
- Last synced at: 2025-03-28T08:38:31.898Z
GitHub Sponsors Profile
I work on open source big data and ML tooling. Most of my work is on Apache Spark, but I also work on related projects like spark-testing-base and contribute to more ML-focused projects like Kubeflow.
- Current Sponsors: 5
- Past Sponsors: 0
- Total Sponsors: 5
- Minimum Sponsorship: $1.00
Featured Works
holdenk/spark-testing-base
Base classes to use when writing tests with Spark
Language: Scala - Stars: 1526
holdenk/learning-spark-examples
Examples for learning spark
Language: Java - Stars: 332
holdenk/elasticsearchspark
Elastic Search on Spark
Language: Scala - Stars: 112
holdenk/spark-validator
A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support.
Language: Scala - Stars: 109
holdenk/spark-structured-streaming-ml
Structured Streaming Machine Learning example with Spark 2.0
Language: Scala - Stars: 92
holdenk/sparkProjectTemplate.g8
Template for Spark Projects
Language: Scala - Stars: 101
Active Sponsors
Past Sponsors
Sponsor Breakdown
- User: 4