Ecosyste.ms sponsors
An open API service aggregating public data about GitHub Sponsors.
Holden Karau is a trans Canadian and an open source contributor. She is a Spark committer and co-author of Learning Spark, High Performance Spark, and Kubeflow for ML.
Funding Links: https://github.com/sponsors/holdenk
I work on open source big data and ML tooling. Most of my work is on Apache Spark, but I also work on related projects like spark-testing-base and contribute to more ML-focused projects like Kubeflow.
Base classes to use when writing tests with Spark
Language: Scala - Stars: 1524
Examples for learning spark
Language: Java - Stars: 333
Elastic Search on Spark
Language: Scala - Stars: 112
A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support.
Language: Scala - Stars: 108
Structured Streaming Machine Learning example with Spark 2.0
Language: Scala - Stars: 92
Template for Spark Projects
Language: Scala - Stars: 101