# survival-datasets
A simple data loader for the most common datasets in Survival Analysis. Currently the following are included:
* Veterans Lung Cancer (https://scikit-survival.readthedocs.io/en/stable/api/generated/sksurv.datasets.load_veterans_lung_cancer.html)
* German Breast Cancer Study Group (GBSG2) (https://scikit-survival.readthedocs.io/en/stable/api/generated/sksurv.datasets.load_gbsg2.html)
* AIDS dataset (https://scikit-survival.readthedocs.io/en/stable/api/generated/sksurv.datasets.load_aids.html)
* NHANES (https://shap.readthedocs.io/en/latest/generated/shap.datasets.nhanesi.html)
* SUPPORT Study to Understand Prognoses Preferences Outcomes and Risks of Treatment (from DeepSurv paper, https://arxiv.org/abs/1606.00931)
* METABRIC The Molecular Taxonomy of Breast Cancer International Consortium (from DeepSurv paper, https://arxiv.org/abs/1606.00931)
* WHAS500 Worcester Heart Attack Study (https://scikit-survival.readthedocs.io/en/stable/api/datasets.html)
* FLCHAIN (https://scikit-survival.readthedocs.io/en/stable/api/datasets.html)
* SEER (from Kaggle, https://www.kaggle.com/code/jnegrini/breast-cancer-dataset)
## Requirements
* Python 3.8 or later
* scikit-survival 0.17.2 or later
* pandas 1.4.3 or later
* numpy 1.22.4 or later
* shap 0.41 or later
* pyarrow 11.0 or later
## Installation
Simply install via pip:
```
pip install survival-datasets
```
## Examples
Import the datasets module from the package and load your dataset of choice:
```
from survdata import datasets
if __name__ == "__main__":
X, y = datasets.load_seer_dataset()
```
Raw data
{
"_id": null,
"home_page": "https://github.com/thecml/survival-datasets",
"name": "survival-datasets",
"maintainer": "",
"docs_url": null,
"requires_python": "",
"maintainer_email": "",
"keywords": "Survival Analysis,Datasets",
"author": "Christian Marius Lillelund",
"author_email": "chr1000@gmail.com",
"download_url": "https://files.pythonhosted.org/packages/eb/ba/72b67768f01b31cd4098c03d1c9545572ddb33adaa8202b41bdf2c97e8b8/survival-datasets-0.1.5.tar.gz",
"platform": null,
"description": "# survival-datasets\r\n\r\nA simple data loader for the most common datasets in Survival Analysis. Currently the following are included: \r\n* Veterans Lung Cancer (https://scikit-survival.readthedocs.io/en/stable/api/generated/sksurv.datasets.load_veterans_lung_cancer.html)\r\n* German Breast Cancer Study Group (GBSG2) (https://scikit-survival.readthedocs.io/en/stable/api/generated/sksurv.datasets.load_gbsg2.html)\r\n* AIDS dataset (https://scikit-survival.readthedocs.io/en/stable/api/generated/sksurv.datasets.load_aids.html)\r\n* NHANES (https://shap.readthedocs.io/en/latest/generated/shap.datasets.nhanesi.html)\r\n* SUPPORT Study to Understand Prognoses Preferences Outcomes and Risks of Treatment (from DeepSurv paper, https://arxiv.org/abs/1606.00931)\r\n* METABRIC The Molecular Taxonomy of Breast Cancer International Consortium (from DeepSurv paper, https://arxiv.org/abs/1606.00931)\r\n* WHAS500 Worcester Heart Attack Study (https://scikit-survival.readthedocs.io/en/stable/api/datasets.html)\r\n* FLCHAIN (https://scikit-survival.readthedocs.io/en/stable/api/datasets.html)\r\n* SEER (from Kaggle, https://www.kaggle.com/code/jnegrini/breast-cancer-dataset)\r\n\r\n## Requirements\r\n\r\n* Python 3.8 or later\r\n* scikit-survival 0.17.2 or later\r\n* pandas 1.4.3 or later\r\n* numpy 1.22.4 or later\r\n* shap 0.41 or later\r\n* pyarrow 11.0 or later\r\n\r\n## Installation\r\n\r\nSimply install via pip:\r\n```\r\npip install survival-datasets\r\n```\r\n\r\n## Examples\r\n\r\nImport the datasets module from the package and load your dataset of choice:\r\n```\r\nfrom survdata import datasets\r\n\r\nif __name__ == \"__main__\":\r\n X, y = datasets.load_seer_dataset()\r\n```\r\n",
"bugtrack_url": null,
"license": "",
"summary": "Data loader for common datasets in Survival Analysis.",
"version": "0.1.5",
"project_urls": {
"Homepage": "https://github.com/thecml/survival-datasets"
},
"split_keywords": [
"survival analysis",
"datasets"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "eb3b7f7fbfbedca2c991fe122eac753dee27450a2c54a8c9bcf06f94a9992bf2",
"md5": "93b75113836e3096fa90e72655b7f862",
"sha256": "87f5720816eb2e385b06f6815ff1d1bd8f27efd63b059d3ad51c222467b4ff68"
},
"downloads": -1,
"filename": "survival_datasets-0.1.5-py3-none-any.whl",
"has_sig": false,
"md5_digest": "93b75113836e3096fa90e72655b7f862",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 263421,
"upload_time": "2023-07-06T13:12:15",
"upload_time_iso_8601": "2023-07-06T13:12:15.566714Z",
"url": "https://files.pythonhosted.org/packages/eb/3b/7f7fbfbedca2c991fe122eac753dee27450a2c54a8c9bcf06f94a9992bf2/survival_datasets-0.1.5-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "ebba72b67768f01b31cd4098c03d1c9545572ddb33adaa8202b41bdf2c97e8b8",
"md5": "9610756215ef40a5b4d4981e9bcd2455",
"sha256": "41ce2ce91ef84f111d2dd34d8d579cbc72489600d7a3b15ca22ed32437a35017"
},
"downloads": -1,
"filename": "survival-datasets-0.1.5.tar.gz",
"has_sig": false,
"md5_digest": "9610756215ef40a5b4d4981e9bcd2455",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 263138,
"upload_time": "2023-07-06T13:12:17",
"upload_time_iso_8601": "2023-07-06T13:12:17.568542Z",
"url": "https://files.pythonhosted.org/packages/eb/ba/72b67768f01b31cd4098c03d1c9545572ddb33adaa8202b41bdf2c97e8b8/survival-datasets-0.1.5.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2023-07-06 13:12:17",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "thecml",
"github_project": "survival-datasets",
"travis_ci": false,
"coveralls": false,
"github_actions": false,
"lcname": "survival-datasets"
}