# <img src="https://raw.githubusercontent.com/pepkit/geofetch/master/docs/img/geofetch_logo.svg?sanitize=true" alt="geofetch logo" height="70">
**geofetch** is a command-line tool that downloads sequencing data and metadata from GEO and SRA and creates metadata tables in [standard PEP format](https://pep.databio.org/). `geofetch` is hosted on [PyPI](https://pypi.org/project/geofetch/). You can convert the downloaded data into unmapped `bam` or `fastq` files with the included `sraconvert` command.
## Key geofetch features:
- Works with GEO and SRA metadata
- Combines samples from different projects
- Standardizes output metadata
- Filters type and size of processed files (from GEO) before downloading them
- Easy to use
- Fast execution time
- Can search GEO to find relevant data
- Can be used either as a command-line tool or from within Python using an API
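
As a rough illustration of the Python usage mentioned in the last bullet, the sketch below fetches metadata for a single GEO series. It assumes that `geofetch` exposes a `Geofetcher` class with a `get_projects` method accepting `just_metadata`; check the documentation for the exact signature in your installed version. The helper names `is_gse_accession` and `fetch_metadata` are hypothetical, and the sketch deliberately handles only GSE accessions, although geofetch itself accepts other inputs.

```python
import re
from typing import Any, Dict

# GEO series accessions are "GSE" followed by digits, e.g. GSE95654.
_GSE_PATTERN = re.compile(r"GSE\d+")


def is_gse_accession(accession: str) -> bool:
    """Return True if the string looks like a GEO series accession."""
    return _GSE_PATTERN.fullmatch(accession.strip().upper()) is not None


def fetch_metadata(accession: str) -> Dict[str, Any]:
    """Fetch metadata only (no data files) for one GEO series.

    Hypothetical helper: the Geofetcher arguments used here are assumed
    from the geofetch documentation and may differ between versions.
    """
    if not is_gse_accession(accession):
        raise ValueError(f"Not a GSE accession: {accession!r}")
    # Imported lazily so the validation helper works without geofetch installed.
    from geofetch import Geofetcher

    geofetcher = Geofetcher(processed=True)
    return geofetcher.get_projects(accession.strip().upper(), just_metadata=True)
```

Calling `fetch_metadata("GSE95654")` requires network access and an installed `geofetch`; the result is expected to be a dictionary of PEP projects.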
## Docs

---

**Documentation**: <a href="https://pep.databio.org/geofetch/" target="_blank">https://pep.databio.org/geofetch/</a>

**Source Code**: <a href="https://github.com/pepkit/geofetch/" target="_blank">https://github.com/pepkit/geofetch/</a>

---
## Installation
To install `geofetch` use this command:
```
pip install geofetch
```
or install the latest version from the GitHub repository:
```
pip install git+https://github.com/pepkit/geofetch.git
```
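
To confirm that the installation succeeded, one option is to query the installed package metadata from Python. This is a minimal sketch using only the standard library; the helper name `installed_version` is hypothetical and not part of geofetch.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str = "geofetch") -> Optional[str]:
    """Return the installed version of `package`, or None if it is not installed."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


# Prints the version string if geofetch is installed, otherwise None.
print(installed_version())
```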
## All GEO projects (GSE + GSM) in PEP format
All GEO projects are available in PEPhub under the `geo` namespace: <https://pephub.databio.org/geo/>.
You can search for GEO projects using the search bar, or download an archive of all GEO PEPs from the archive section of the namespace:
[https://pephub.databio.org/geo?view=archive](https://pephub.databio.org/geo?view=archive)
## How to cite:
https://doi.org/10.1093/bioinformatics/btad069
```bibtex
@article{10.1093/bioinformatics/btad069,
author = {Khoroshevskyi, Oleksandr and LeRoy, Nathan and Reuter, Vincent P and Sheffield, Nathan C},
title = "{GEOfetch: a command-line tool for downloading data and standardized metadata from GEO and SRA}",
journal = {Bioinformatics},
volume = {39},
number = {3},
pages = {btad069},
year = {2023},
month = {03},
abstract = "{The Gene Expression Omnibus has become an important source of biological data for secondary analysis. However, there is no simple, programmatic way to download data and metadata from Gene Expression Omnibus (GEO) in a standardized annotation format. To address this, we present GEOfetch—a command-line tool that downloads and organizes data and metadata from GEO and SRA. GEOfetch formats the downloaded metadata as a Portable Encapsulated Project, providing universal format for the reanalysis of public data. GEOfetch is available on Bioconda and the Python Package Index (PyPI).}",
issn = {1367-4811},
doi = {10.1093/bioinformatics/btad069},
url = {https://doi.org/10.1093/bioinformatics/btad069},
eprint = {https://academic.oup.com/bioinformatics/article-pdf/39/3/btad069/49407404/btad069.pdf},
}
```
Raw data
{
"_id": null,
"home_page": "https://github.com/pepkit/geofetch/",
"name": "geofetch",
"maintainer": null,
"docs_url": null,
"requires_python": null,
"maintainer_email": null,
"keywords": "project, bioinformatics, sequencing, ngs, workflow, GUI",
"author": "Oleksandr Khoroshevskyi, Nathan Sheffield, Vince Reuter, Nathan LeRoy",
"author_email": null,
"download_url": "https://files.pythonhosted.org/packages/6e/57/8b1193e57f176ae024abbdd670627c06b558111d225010357c544d74af8d/geofetch-0.12.8.tar.gz",
"platform": null,
"bugtrack_url": null,
"license": "BSD2",
"summary": "Downloads data and metadata from GEO and SRA and creates standard PEPs.",
"version": "0.12.8",
"project_urls": {
"Homepage": "https://github.com/pepkit/geofetch/"
},
"split_keywords": [
"project",
" bioinformatics",
" sequencing",
" ngs",
" workflow",
" gui"
],
"urls": [
{
"comment_text": null,
"digests": {
"blake2b_256": "b472d6a0b14d6013b83f3aeba449a3816b9bcf1b15bd7b9b07497b5841cd3253",
"md5": "861cda80dbdaab5c12345e3269a53eea",
"sha256": "198d329e80ab886b966a200b9fa8b9188837ae10de083763239bab2421a3c3fd"
},
"downloads": -1,
"filename": "geofetch-0.12.8-py3-none-any.whl",
"has_sig": false,
"md5_digest": "861cda80dbdaab5c12345e3269a53eea",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 43718,
"upload_time": "2025-07-11T17:25:35",
"upload_time_iso_8601": "2025-07-11T17:25:35.703542Z",
"url": "https://files.pythonhosted.org/packages/b4/72/d6a0b14d6013b83f3aeba449a3816b9bcf1b15bd7b9b07497b5841cd3253/geofetch-0.12.8-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": null,
"digests": {
"blake2b_256": "6e578b1193e57f176ae024abbdd670627c06b558111d225010357c544d74af8d",
"md5": "ec7061a11a06338afc82f3627476417e",
"sha256": "645278be8811b8ae4817da63ac213f481e16ed4ab733bb38515c11d5c9beb6fd"
},
"downloads": -1,
"filename": "geofetch-0.12.8.tar.gz",
"has_sig": false,
"md5_digest": "ec7061a11a06338afc82f3627476417e",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 49612,
"upload_time": "2025-07-11T17:25:37",
"upload_time_iso_8601": "2025-07-11T17:25:37.117746Z",
"url": "https://files.pythonhosted.org/packages/6e/57/8b1193e57f176ae024abbdd670627c06b558111d225010357c544d74af8d/geofetch-0.12.8.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2025-07-11 17:25:37",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "pepkit",
"github_project": "geofetch",
"travis_ci": false,
"coveralls": false,
"github_actions": true,
"lcname": "geofetch"
}
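
The `sha256` digests listed in the metadata above can be used to check a manually downloaded release file. The following is a minimal sketch using only the standard library; the helper names `sha256_of_file` and `matches_digest` are hypothetical, and the local file path is an assumption.

```python
import hashlib
from pathlib import Path
from typing import Union


def sha256_of_file(path: Union[str, Path], chunk_size: int = 1 << 20) -> str:
    """Compute the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_digest(path: Union[str, Path], expected_sha256: str) -> bool:
    """Return True if the file's SHA-256 digest equals the expected hex string."""
    return sha256_of_file(path) == expected_sha256.strip().lower()


# Example (assumes geofetch-0.12.8.tar.gz was downloaded to the current directory):
# matches_digest(
#     "geofetch-0.12.8.tar.gz",
#     "645278be8811b8ae4817da63ac213f481e16ed4ab733bb38515c11d5c9beb6fd",
# )
```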