#### Introduction
To mimic the py\_mob package (https://pypi.org/project/py-mob) for binary outcomes, the freq\_mob is a collection of python functions that would generate the monotonic binning and perform the variable transformation for frequency outcomes such that the Pearson correlation between the transformed $X$ and $Log(Y)$ is equal to 1. In case of frequency count models with $Log()$ link function, the transformation is derived as $F(x)_i = Log \frac{\sum_i Y / \sum_i Exposure}{\sum Y / \sum Exposure}$ in the training sample, where $Exposure$ is the number of cases and $i$ refers to the $ith$ bin groupped by $x$ values.
Should you have any question or suggestion about the freq\_mob package, please feel free to drop me a line.
#### Core Functions
```
freq_mob
|-- qtl_bin() : An iterative discretization based on quantiles of X.
|-- cnt_bin() : A revised iterative discretization for records with Y > 0.
|-- iso_bin() : A discretization algorthm driven by the isotonic regression between X and Y.
|-- rng_bin() : A revised iterative discretization based on the value range of X.
|-- kmn_bin() : A discretization algorthm based on the kmeans clustering of X.
|-- gbm_bin() : A discretization algorthm based on the gradient boosting machine.
|-- view_bin() : Displays the binning outcome in a tabular form.
|-- cal_newx() : Applies the variable transformation to a numeric vector based on the binning outcome.
`-- mi_score() : Calculates the mutual information score between X and Y.
```
### Authors
[WenSui Liu](mailto:liuwensui@gmail.com) is a seasoned data scientist with 15-year experience in the financial service industry.
[Joyce Liu](mailto:joyce.jl.liu@gmail.com) is a college student majoring in Mathematics with a strong passion for data science.
Raw data
{
"_id": null,
"home_page": "https://github.com/statcompute/freq_mob",
"name": "freq-mob",
"maintainer": "",
"docs_url": null,
"requires_python": "",
"maintainer_email": "",
"keywords": "",
"author": "WenSui Liu",
"author_email": "liuwensui@gmail.com",
"download_url": "https://files.pythonhosted.org/packages/fa/6d/c175682e7966f26fe74a75bda7b6553429d6a52579c4eb3ab6983bba275a/freq_mob-0.2.2.tar.gz",
"platform": null,
"description": "#### Introduction\n\nTo mimic the py\\_mob package (https://pypi.org/project/py-mob) for binary outcomes, the freq\\_mob is a collection of python functions that would generate the monotonic binning and perform the variable transformation for frequency outcomes such that the Pearson correlation between the transformed $X$ and $Log(Y)$ is equal to 1. In case of frequency count models with $Log()$ link function, the transformation is derived as $F(x)_i = Log \\frac{\\sum_i Y / \\sum_i Exposure}{\\sum Y / \\sum Exposure}$ in the training sample, where $Exposure$ is the number of cases and $i$ refers to the $ith$ bin groupped by $x$ values. \n\n\nShould you have any question or suggestion about the freq\\_mob package, please feel free to drop me a line. \n\n#### Core Functions\n\n```\nfreq_mob\n |-- qtl_bin() : An iterative discretization based on quantiles of X. \n |-- cnt_bin() : A revised iterative discretization for records with Y > 0.\n |-- iso_bin() : A discretization algorthm driven by the isotonic regression between X and Y. \n |-- rng_bin() : A revised iterative discretization based on the value range of X. \n |-- kmn_bin() : A discretization algorthm based on the kmeans clustering of X. \n |-- gbm_bin() : A discretization algorthm based on the gradient boosting machine. \n |-- view_bin() : Displays the binning outcome in a tabular form. \n |-- cal_newx() : Applies the variable transformation to a numeric vector based on the binning outcome.\n `-- mi_score() : Calculates the mutual information score between X and Y.\n```\n\n### Authors\n\n[WenSui Liu](mailto:liuwensui@gmail.com) is a seasoned data scientist with 15-year experience in the financial service industry. \n\n[Joyce Liu](mailto:joyce.jl.liu@gmail.com) is a college student majoring in Mathematics with a strong passion for data science.\n",
"bugtrack_url": null,
"license": "",
"summary": "Monotonic Optimal Binning for Frequency Models",
"version": "0.2.2",
"project_urls": {
"Homepage": "https://github.com/statcompute/freq_mob"
},
"split_keywords": [],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "3895ae08f3cf09e4a1d6974b74935c121185d744b10beecd84ce14ddc8aa70a3",
"md5": "5934007aa1b6ecacc1c1539b696ef77f",
"sha256": "6a1e398d30b4381aff8bdcca13c2b44f88bce2f29c9c257126558b1f22683e11"
},
"downloads": -1,
"filename": "freq_mob-0.2.2-py3-none-any.whl",
"has_sig": false,
"md5_digest": "5934007aa1b6ecacc1c1539b696ef77f",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 6863,
"upload_time": "2023-08-12T19:25:06",
"upload_time_iso_8601": "2023-08-12T19:25:06.443711Z",
"url": "https://files.pythonhosted.org/packages/38/95/ae08f3cf09e4a1d6974b74935c121185d744b10beecd84ce14ddc8aa70a3/freq_mob-0.2.2-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "fa6dc175682e7966f26fe74a75bda7b6553429d6a52579c4eb3ab6983bba275a",
"md5": "0b93ba1d535cc389126dcf4d42e4b56a",
"sha256": "d5005e2deb7731397fad75d4d78d0dc8eb690c7601ffc11dad3ce228765715da"
},
"downloads": -1,
"filename": "freq_mob-0.2.2.tar.gz",
"has_sig": false,
"md5_digest": "0b93ba1d535cc389126dcf4d42e4b56a",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 6437,
"upload_time": "2023-08-12T19:25:09",
"upload_time_iso_8601": "2023-08-12T19:25:09.218510Z",
"url": "https://files.pythonhosted.org/packages/fa/6d/c175682e7966f26fe74a75bda7b6553429d6a52579c4eb3ab6983bba275a/freq_mob-0.2.2.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2023-08-12 19:25:09",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "statcompute",
"github_project": "freq_mob",
"travis_ci": false,
"coveralls": false,
"github_actions": false,
"lcname": "freq-mob"
}