reparo

- Name: reparo
- Version: 0.0.6
- Home page: https://github.com/SigmoidAI/reparo
- Summary: Reparo is a python sci-kit learn inspired package for Missing Value Imputation.
- Upload time: 2023-02-05 14:46:32
- Author: SigmoidAI - Stojoc Vladimir
- License: MIT
- Keywords: ml, machine learning, feature engineering, python, data science
- Requirements: No requirements were recorded.
- Travis-CI: No Travis.
- Coveralls test coverage: No coveralls.

# reparo

Reparo is a Python package, inspired by scikit-learn, for missing value imputation. It contains feature transformers that eliminate missing values (NaNs) from your data before it is passed to machine learning algorithms.

This version of reparo provides the following missing value imputation methods:
1) Cold-Deck Imputation (CDI).
2) Hot-Deck Imputation (HotDeckImputation).
3) Fuzzy-Rough Nearest Neighbor for Imputation (FRNNI).
4) K-Nearest Neighbors Imputation (KNNI).
5) Single Center Imputation from Multiple Chained Equation (SICE).
6) Predictive Mean Matching (PMM).
7) Multivariate Imputation by Chained Equation (MICE).

All of these methods work like regular sklearn transformers: each one implements the fit, transform and fit_transform functions.
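To make the fit / transform / fit_transform contract concrete, here is a minimal sketch of a transformer that follows it. `ToyMeanImputer` is a hypothetical class written only for illustration; it is not part of reparo and does not reproduce any of its algorithms. It uses plain Python lists and assumes every column has at least one observed value.

```python
import math


class ToyMeanImputer:
    """Illustrative only: fills NaNs with per-column means learned in fit()."""

    def fit(self, X, y=None):
        # y is accepted for interface compatibility but is ignored.
        n_cols = len(X[0])
        self.means_ = []
        for j in range(n_cols):
            # Assumption: each column has at least one non-NaN value.
            observed = [row[j] for row in X if not math.isnan(row[j])]
            self.means_.append(sum(observed) / len(observed))
        return self

    def transform(self, X):
        # Replace each NaN with the mean stored for its column during fit().
        return [
            [self.means_[j] if math.isnan(v) else v for j, v in enumerate(row)]
            for row in X
        ]

    def fit_transform(self, X, y=None):
        # Convenience wrapper: learn the statistics and apply them in one call.
        return self.fit(X, y).transform(X)


X = [[1.0, 10.0], [math.nan, 20.0], [3.0, math.nan]]
imputer = ToyMeanImputer()
X_filled = imputer.fit_transform(X)
print(X_filled)
```

The point of the pattern is that fit only learns state from the data, while transform only applies that state, so the same fitted object can be reused on other data.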

Additionally, every reparo transformer has an apply function, which lets you apply a transformation to a pandas DataFrame.

# How to use reparo
To use a transformer from reparo, import it from the package as follows:

```python
from reparo import MICE
```

The class names are given in parentheses in the list above.

Next, create an object of the chosen algorithm (k-Nearest Neighbors Imputation is used as an example here).

```python
from reparo import KNNI
method = KNNI()
```

First, fit the transformer by passing it a feature matrix (X) and the target array (y). The y argument is not actually used, because using it would cause data leakage.

```python
method.fit(X, y)
```

After fitting the model, you can use it to transform new data with the transform function. Pass only the feature matrix (X) to transform.

```python
X_transformed = method.transform(X)
```
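As a rough illustration of fitting on one dataset and then transforming new data, the sketch below implements a simplified k-nearest-neighbours imputation in plain Python. `ToyKNNImputer` and `_distance` are hypothetical names; this is not reparo's KNNI implementation, and the distance measure and donor selection are assumptions chosen for brevity. It assumes the fitted data contains at least one fully observed row.

```python
import math


def _distance(a, b):
    """Mean squared difference over the features observed in both rows."""
    shared = [(x - z) ** 2 for x, z in zip(a, b)
              if not math.isnan(x) and not math.isnan(z)]
    return sum(shared) / len(shared) if shared else math.inf


class ToyKNNImputer:
    def __init__(self, k=2):
        self.k = k

    def fit(self, X, y=None):
        # Keep only fully observed rows as donors; y is ignored.
        self.donors_ = [row for row in X if not any(math.isnan(v) for v in row)]
        return self

    def transform(self, X):
        result = []
        for row in X:
            if not any(math.isnan(v) for v in row):
                result.append(list(row))
                continue
            # Pick the k donors closest to this row on its observed features.
            nearest = sorted(self.donors_, key=lambda d: _distance(row, d))[: self.k]
            # Fill each missing value with the mean of that feature over the donors.
            result.append([
                sum(d[j] for d in nearest) / len(nearest) if math.isnan(v) else v
                for j, v in enumerate(row)
            ])
        return result


X_train = [[1.0, 2.0], [2.0, 4.0], [10.0, 20.0], [math.nan, 5.0]]
X_new = [[1.5, math.nan], [math.nan, 19.0]]

knn = ToyKNNImputer(k=2).fit(X_train)
X_new_filled = knn.transform(X_new)
print(X_new_filled)
```

The values used to fill `X_new` come only from the rows seen during fit, which is what allows a fitted transformer to be reused on unseen data.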

You can also fit and transform the data in a single step using the fit_transform function.

```python
X_transformed = method.fit_transform(X)
```

You can also apply a transformation directly to a pandas DataFrame, choosing the columns that you want to change.

```python
new_df = method.apply(df, 'target', ['col1', 'col2'])
```
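The idea of imputing only selected columns of a DataFrame can be sketched with pandas alone. `apply_to_columns` below is a hypothetical helper, not reparo's apply function, and the meaning given to the `target` argument (a column that is never modified) is an assumption. The fill step uses a simple column mean as a stand-in for a real imputation method.

```python
import pandas as pd


def apply_to_columns(df, target, columns, fill):
    """Hypothetical helper: impute only `columns`, never touching `target`."""
    out = df.copy()  # work on a copy so the caller's DataFrame is unchanged
    features = [c for c in columns if c != target]
    out[features] = fill(out[features])
    return out


df = pd.DataFrame({
    "col1": [1.0, None, 3.0],
    "col2": [None, 4.0, 6.0],
    "target": [0, 1, 0],
})

# Stand-in imputation: replace NaNs with each column's mean.
new_df = apply_to_columns(
    df, "target", ["col1", "col2"], lambda part: part.fillna(part.mean())
)
print(new_df)
```

Returning a new DataFrame instead of modifying the input mirrors the `new_df = ...` usage shown above and keeps the original data available for comparison.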

With <3 from Sigmoid.
We are open to feedback. Please send your impressions to vladimir.stojoc@gmail.com



            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/SigmoidAI/reparo",
    "name": "reparo",
    "maintainer": "",
    "docs_url": null,
    "requires_python": "",
    "maintainer_email": "",
    "keywords": "ml,machine learning,feature engineering,python,data science",
    "author": "SigmoidAI - Stojoc Vladimir",
    "author_email": "vladimir.stojoc@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/47/82/81c864af65b63ff1923c3f951d8b76c246c007ca0f474fe4f86b7c5bb925/reparo-0.0.6.tar.gz",
    "platform": null,
    "description": "\n# reparo\n\nReparo is a python sci-kit learn inspired package for Missing Value Imputation. It contains a some feature transformers to eliminate Missing Values (NaNs) from your data for Machine Learning Algorithms.\n\nThis version of reparo has the next methods of missing value imputation:\n1) Cold-Deck Imputation (CDI).\n2) Hot-Deck Imputation (HotDeckImputation).\n3) Fuzzy-Rough Nearest Neighbor for Imputation (FRNNI).\n4) K-Nearest Neighbors Imputation (KNNI).\n5) Single Center Imputation from Multiple Chained Equation (SICE).\n6) Predictive Mean Matching (PMM).\n7) Multivariate Imputation by Chained Equation (MICE).\n\nAll these methods work like normal sklearn transformers. They have fit, transform and fit_transform functions implemented.\n\nAdditionally every reparo transformer has an apply function which allows to apply an transformation on a pandas Data Frame.\n\n# How to use reparo\nTo use a transformer from reparo you should just import the transformer from reparo in the following framework:\n\n```python\nfrom reparo import MICE\n```\n\nclass names are written above in parantheses.\n\nNext create a object of this algorithm (I will use k-Nearest Neighbors Imputation as an example).\n\n```python\nmethod = KNNI()\n```\n\nFirstly you should fit the transformer, passing to it a feature matrix (X) and the target array (y). y argument is not really used (as it causes data leackage)\n\n```python\nmethod.fit(X, y)\n```\n\nAfter you fit the model, you can use it for transforming new data, using the transform function. 
To transform function you should pass only the feature matrix (X).\n\n```python\nX_transformed = method.transform(X)\n```\n\nAlso you can fit and transform the data at the same time using the fit_transform function.\n\n```python\nX_transformed = method.fit_transform(X)\n```\n\nAlso you can apply a transformation directly on a pandas DataFrame, choosing the columns that you want to change.\n\n```python\nnew_df = method.apply(df, 'target', ['col1', 'col2'])\n```\n\nWith <3 from Sigmoid.\nWe are open for feedback. Please send your impression to vladimir.stojoc@gmail.com\n\n\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "Reparo is a python sci-kit learn inspired package for Missing Value Imputation.",
    "version": "0.0.6",
    "split_keywords": [
        "ml",
        "machine learning",
        "feature engineering",
        "python",
        "data science"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "69dde4331e397ca53f9350f18c58e4719cfb85bb59daf60a688dc426670774f4",
                "md5": "5ff6318accc9695f8ec119b9944fbc3e",
                "sha256": "01f68b9b9c7815bc66906a5416d5932ea145440efad37b94290fdb1cabf3922e"
            },
            "downloads": -1,
            "filename": "reparo-0.0.6-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "5ff6318accc9695f8ec119b9944fbc3e",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": null,
            "size": 16431,
            "upload_time": "2023-02-05T14:46:30",
            "upload_time_iso_8601": "2023-02-05T14:46:30.591407Z",
            "url": "https://files.pythonhosted.org/packages/69/dd/e4331e397ca53f9350f18c58e4719cfb85bb59daf60a688dc426670774f4/reparo-0.0.6-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "478281c864af65b63ff1923c3f951d8b76c246c007ca0f474fe4f86b7c5bb925",
                "md5": "412af3337238cbee4eec505f650816a1",
                "sha256": "d99a7b489e17a2735dbef6b71b8616b63d0350e9d8ab5ef2725d0f3a4d4e5223"
            },
            "downloads": -1,
            "filename": "reparo-0.0.6.tar.gz",
            "has_sig": false,
            "md5_digest": "412af3337238cbee4eec505f650816a1",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": null,
            "size": 11985,
            "upload_time": "2023-02-05T14:46:32",
            "upload_time_iso_8601": "2023-02-05T14:46:32.079514Z",
            "url": "https://files.pythonhosted.org/packages/47/82/81c864af65b63ff1923c3f951d8b76c246c007ca0f474fe4f86b7c5bb925/reparo-0.0.6.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-02-05 14:46:32",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "github_user": "SigmoidAI",
    "github_project": "reparo",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "requirements": [],
    "lcname": "reparo"
}
        