recipies


Namerecipies JSON
Version 1.0 PyPI version JSON
download
home_pageNone
SummaryA modular preprocessing package for Pandas Dataframe
upload_time2024-10-16 16:02:23
maintainerNone
docs_urlNone
authorNone
requires_pythonNone
licenseMIT license
keywords recipies pandas dataframe polars preprocessing recipys
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            ![logo](https://github.com/rvandewater/ReciPys/blob/development/docs/figures/recipys_logo.png?raw=true)
# 🥧ReciPys🐍
[![CI](https://github.com/rvandewater/recipys/actions/workflows/ci.yml/badge.svg)](https://github.com/rvandewater/recipys/actions/workflows/ci.yml)
[![Black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)
![Platform](https://img.shields.io/badge/platform-linux--64%20|%20win--64%20|%20osx--64-lightgrey)
[![License](https://img.shields.io/badge/license-MIT-green)](LICENSE)
[![PyPI version shields.io](https://img.shields.io/pypi/v/recipies.svg)](https://pypi.python.org/pypi/recipies/)
[![arXiv](https://img.shields.io/badge/arXiv-2306.05109-b31b1b.svg)](http://arxiv.org/abs/2306.05109)

The ReciPys package is a preprocessing framework operating on Pandas dataframes. 
The operation of this package is inspired by the R-package [recipes](https://recipes.tidymodels.org/).
This package allows the user to apply a number of extensible operations for imputation, feature generation/extraction, 
scaling, and encoding. 
It operates on modified Dataframe objects from the established data science package Pandas.
## Installation 
You can install ReciPys from pip using:
```
pip install recipies
```
> Note that the package is called `recipies` and not `recipys` on pip due to a name clash with an existing package.
> 
You can install ReciPys from source to ensure you have the latest version:
```
conda env update -f environment.yml
conda activate recipys
pip install -e .
```
> Note that the last command installs the package called `recipies`.

## Usage
To define preprocessing operations, one has to supply _roles_ to the different columns of the Dataframe. 
This allows the user to create groups of columns which have a particular function.
Then, we provide several "steps" that can be applied to the datasets, among which: Historical accumulation, 
Resampling the time resolution, A number of imputation methods, and a wrapper for any 
[Scikit-learn](https://github.com/scikit-learn/scikit-learn) preprocessing step.
We believe to have covered any basic preprocessing needs for prepared datasets.
Any missing step can be added by following the step interface.

# 📄Paper

If you use this code in your research, please cite the following publication:

```
@article{vandewaterYetAnotherICUBenchmark2023,
	title = {Yet Another ICU Benchmark: A Flexible Multi-Center Framework for Clinical ML},
	shorttitle = {Yet Another ICU Benchmark},
	url = {http://arxiv.org/abs/2306.05109},
	language = {en},
	urldate = {2023-06-09},
	publisher = {arXiv},
	author = {van de Water, Robin and Schmidt, Hendrik and Elbers, Paul and Thoral, Patrick and Arnrich, Bert and Rockenschaub, Patrick},
	month = jun,
	year = {2023},
	note = {arXiv:2306.05109 [cs]},
	keywords = {Computer Science - Machine Learning},
}
```
This paper can also be found on arxiv: https://arxiv.org/pdf/2306.05109.pdf





            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "recipies",
    "maintainer": null,
    "docs_url": null,
    "requires_python": null,
    "maintainer_email": null,
    "keywords": "recipies, pandas, dataframe, polars, preprocessing, recipys",
    "author": null,
    "author_email": "Robin van de Water <robin.vandewater@hpi.de>",
    "download_url": null,
    "platform": null,
    "description": "![logo](https://github.com/rvandewater/ReciPys/blob/development/docs/figures/recipys_logo.png?raw=true)\r\n# \ud83e\udd67ReciPys\ud83d\udc0d\r\n[![CI](https://github.com/rvandewater/recipys/actions/workflows/ci.yml/badge.svg)](https://github.com/rvandewater/recipys/actions/workflows/ci.yml)\r\n[![Black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)\r\n![Platform](https://img.shields.io/badge/platform-linux--64%20|%20win--64%20|%20osx--64-lightgrey)\r\n[![License](https://img.shields.io/badge/license-MIT-green)](LICENSE)\r\n[![PyPI version shields.io](https://img.shields.io/pypi/v/recipies.svg)](https://pypi.python.org/pypi/recipies/)\r\n[![arXiv](https://img.shields.io/badge/arXiv-2306.05109-b31b1b.svg)](http://arxiv.org/abs/2306.05109)\r\n\r\nThe ReciPys package is a preprocessing framework operating on Pandas dataframes. \r\nThe operation of this package is inspired by the R-package [recipes](https://recipes.tidymodels.org/).\r\nThis package allows the user to apply a number of extensible operations for imputation, feature generation/extraction, \r\nscaling, and encoding. \r\nIt operates on modified Dataframe objects from the established data science package Pandas.\r\n## Installation \r\nYou can install ReciPys from pip using:\r\n```\r\npip install recipies\r\n```\r\n> Note that the package is called `recipies` and not `recipys` on pip due to a name clash with an existing package.\r\n> \r\nYou can install ReciPys from source to ensure you have the latest version:\r\n```\r\nconda env update -f environment.yml\r\nconda activate recipys\r\npip install -e .\r\n```\r\n> Note that the last command installs the package called `recipies`.\r\n\r\n## Usage\r\nTo define preprocessing operations, one has to supply _roles_ to the different columns of the Dataframe. \r\nThis allows the user to create groups of columns which have a particular function.\r\nThen, we provide several \"steps\" that can be applied to the datasets, among which: Historical accumulation, \r\nResampling the time resolution, A number of imputation methods, and a wrapper for any \r\n[Scikit-learn](https://github.com/scikit-learn/scikit-learn) preprocessing step.\r\nWe believe to have covered any basic preprocessing needs for prepared datasets.\r\nAny missing step can be added by following the step interface.\r\n\r\n# \ud83d\udcc4Paper\r\n\r\nIf you use this code in your research, please cite the following publication:\r\n\r\n```\r\n@article{vandewaterYetAnotherICUBenchmark2023,\r\n\ttitle = {Yet Another ICU Benchmark: A Flexible Multi-Center Framework for Clinical ML},\r\n\tshorttitle = {Yet Another ICU Benchmark},\r\n\turl = {http://arxiv.org/abs/2306.05109},\r\n\tlanguage = {en},\r\n\turldate = {2023-06-09},\r\n\tpublisher = {arXiv},\r\n\tauthor = {van de Water, Robin and Schmidt, Hendrik and Elbers, Paul and Thoral, Patrick and Arnrich, Bert and Rockenschaub, Patrick},\r\n\tmonth = jun,\r\n\tyear = {2023},\r\n\tnote = {arXiv:2306.05109 [cs]},\r\n\tkeywords = {Computer Science - Machine Learning},\r\n}\r\n```\r\nThis paper can also be found on arxiv: https://arxiv.org/pdf/2306.05109.pdf\r\n\r\n\r\n\r\n\r\n",
    "bugtrack_url": null,
    "license": "MIT license",
    "summary": "A modular preprocessing package for Pandas Dataframe",
    "version": "1.0",
    "project_urls": null,
    "split_keywords": [
        "recipies",
        " pandas",
        " dataframe",
        " polars",
        " preprocessing",
        " recipys"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "22c2ae501e6f52c04b5a6451f8b1a7d51ffbc85eaa4342fab84d25d8e86eb701",
                "md5": "3a5dda493cfe9ebb1e2d85f5c2e6e847",
                "sha256": "749def6ad30cf529a02fff0ff1474bf1eb6daf604d82980a913c9fb5b63e8d93"
            },
            "downloads": -1,
            "filename": "recipies-1.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "3a5dda493cfe9ebb1e2d85f5c2e6e847",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": null,
            "size": 16318,
            "upload_time": "2024-10-16T16:02:23",
            "upload_time_iso_8601": "2024-10-16T16:02:23.119028Z",
            "url": "https://files.pythonhosted.org/packages/22/c2/ae501e6f52c04b5a6451f8b1a7d51ffbc85eaa4342fab84d25d8e86eb701/recipies-1.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-10-16 16:02:23",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "recipies"
}
        
Elapsed time: 0.41539s