An open API service aggregating public data about GitHub Sponsors.

automunge


A tabular data preprocessing platform.

Funding Links: https://github.com/sponsors/Automunge

GitHub Sponsors Profile

Automunge is a Python library platform for preparing tabular data for machine learning. The tool includes options for automated normalization, numerical encoding, and infill of tabular data, and may also serve as a platform for feature engineering, where a user can assign sets of transformations to columns, either from our internal library or by defining their own. The library has several push-button methods for data pipelines, including "ML infill" for machine-learning-derived infill, feature importance evaluation, dimensionality reduction, distribution property evaluation, and simple data pipelines for streams of data. It's very useful!
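To make the three basic operations named above concrete (infill, normalization, and numerical encoding), here is a minimal standard-library sketch. It does not use Automunge's API: the helper names `infill_mean`, `zscore`, and `one_hot` and the toy table are hypothetical, and mean infill stands in for the library's more sophisticated "ML infill".

```python
from statistics import mean, pstdev


def infill_mean(values):
    """Replace None entries with the mean of the observed values."""
    observed = [v for v in values if v is not None]
    fill = mean(observed)
    return [fill if v is None else v for v in values]


def zscore(values):
    """Normalize to zero mean and unit (population) standard deviation."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        # A constant column carries no information; map it to zeros.
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


def one_hot(values):
    """Encode categories as 0/1 indicator columns, one per distinct category."""
    categories = sorted(set(values))
    return {c: [1 if v == c else 0 for v in values] for c in categories}


# Hypothetical toy table, stored column by column.
table = {
    "age": [20, None, 40],
    "color": ["red", "blue", "red"],
}

# Numeric column: fill the missing entry, then normalize.
prepared = {"age": zscore(infill_mean(table["age"]))}

# Categorical column: expand into one indicator column per category.
for category, column in one_hot(table["color"]).items():
    prepared[f"color_{category}"] = column
```

In this sketch the result holds one normalized `age` column plus the indicator columns `color_blue` and `color_red`. A real preprocessing pipeline would also record the fitted statistics (the mean, the standard deviation, the category list) so that later data can be transformed consistently.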

Featured Works

Automunge/AutoMunge

Tabular feature encoding pipelines for machine learning with options for string parsing, missing data infill, and stochastic perturbations.

Language: Jupyter Notebook - Stars: 164