# ggr-cwl-ipynb-gen
Jupyter notebook generator to download and execute the processing files for GGR related datasets.
At this point, is not intented to cover all use cases, but to serve as a quick generator of all
related files and scripts to pre-process genomic data generated at the [Duke-GCB Sequencing Core](https://genome.duke.edu/cores-and-services/sequencing-and-genomic-technologies) in [HARDAC](https://genome.duke.edu/cores-and-services/computational-solutions/compute-environments-genomics).
Example of usage:
```
$ ggr_cwl_ipynb_gen \
--root-dir /path/to/rootdir \
--metadata examples/Hong_3979_170316B1.xlsx \
--out /path/to/output_dir \
--force
```
The information in the example metadata and configuration file should reveal what is needed to download and pre-process the samples.
For a full list of options:
```
$ ggr_cwl_ipynb_gen -h
usage: Generator of Jupyter notebooks to execute CWL pre-processing pipelines
[-h] -o OUT -m METADATA [-f] [-n] [--metadata-sep SEP]
[--project-name PROJECT_NAME]
[--data-from {dukeds,sftp,miseq,other,local}] [-c CONF_FILE] [-u USER]
[-e USER_DUKE_EMAIL] [-r ROOT_DIR] [-v VERSION]
optional arguments:
-h, --help show this help message and exit
-o OUT, --out OUT Jupyter notebook output file name
-m METADATA, --metadata METADATA
Metadata file with samples information
-f, --force Force to overwrite output file
-n, --no-upload Avoids uploading generated data to database when
specified
--metadata-sep SEP Separator for metadata file (when different than Excel
spread sheet)
--project-name PROJECT_NAME
Project name (by default, basename of metadata file
name)
--data-from {dukeds,sftp,miseq,other,local}
Choices: dukeds, sftp, miseq, other, local
-c CONF_FILE, --conf-file CONF_FILE
YAML configuration file (see examples)
-u USER, --user USER HARDAC User used in SLURM (default: ${USER})
-e USER_DUKE_EMAIL, --user-duke-email USER_DUKE_EMAIL
Email(s) notified when execution is finished (default:
${USER}@duke.edu)
-r ROOT_DIR, --root-dir ROOT_DIR
Root directory where all subfolders and files will be
created (semi-required: either defined here or in
conf-file)
-v VERSION, --version VERSION
Print version of the program and exit
```
Raw data
{
"_id": null,
"home_page": "https://github.com/ReddyLab/ggr-cwl-ipynb-gen",
"name": "ggr-cwl-ipynb-gen",
"maintainer": null,
"docs_url": null,
"requires_python": ">=3.1",
"maintainer_email": null,
"keywords": "cwl, bioinformatics, development",
"author": "Alejandro Barrera",
"author_email": "alejandro.barrera@duke.edu",
"download_url": "https://files.pythonhosted.org/packages/b7/81/a487340c8812aefa01c5db85ee763a6e8378acc0c5edd7ab9709849cd7b6/ggr_cwl_ipynb_gen-0.5.1.tar.gz",
"platform": null,
"description": "# ggr-cwl-ipynb-gen\nJupyter notebook generator to download and execute the processing files for GGR related datasets.\nAt this point, is not intented to cover all use cases, but to serve as a quick generator of all\nrelated files and scripts to pre-process genomic data generated at the [Duke-GCB Sequencing Core](https://genome.duke.edu/cores-and-services/sequencing-and-genomic-technologies) in [HARDAC](https://genome.duke.edu/cores-and-services/computational-solutions/compute-environments-genomics).\n\nExample of usage:\n```\n$ ggr_cwl_ipynb_gen \\\n --root-dir /path/to/rootdir \\\n --metadata examples/Hong_3979_170316B1.xlsx \\\n --out /path/to/output_dir \\\n --force\n```\nThe information in the example metadata and configuration file should reveal what is needed to download and pre-process the samples.\n\nFor a full list of options:\n```\n$ ggr_cwl_ipynb_gen -h\nusage: Generator of Jupyter notebooks to execute CWL pre-processing pipelines\n [-h] -o OUT -m METADATA [-f] [-n] [--metadata-sep SEP]\n [--project-name PROJECT_NAME]\n [--data-from {dukeds,sftp,miseq,other,local}] [-c CONF_FILE] [-u USER]\n [-e USER_DUKE_EMAIL] [-r ROOT_DIR] [-v VERSION]\n\noptional arguments:\n -h, --help show this help message and exit\n -o OUT, --out OUT Jupyter notebook output file name\n -m METADATA, --metadata METADATA\n Metadata file with samples information\n -f, --force Force to overwrite output file\n -n, --no-upload Avoids uploading generated data to database when\n specified\n --metadata-sep SEP Separator for metadata file (when different than Excel\n spread sheet)\n --project-name PROJECT_NAME\n Project name (by default, basename of metadata file\n name)\n --data-from {dukeds,sftp,miseq,other,local}\n Choices: dukeds, sftp, miseq, other, local\n -c CONF_FILE, --conf-file CONF_FILE\n YAML configuration file (see examples)\n -u USER, --user USER HARDAC User used in SLURM (default: ${USER})\n -e USER_DUKE_EMAIL, --user-duke-email USER_DUKE_EMAIL\n Email(s) notified when execution is finished (default:\n ${USER}@duke.edu)\n -r ROOT_DIR, --root-dir ROOT_DIR\n Root directory where all subfolders and files will be\n created (semi-required: either defined here or in\n conf-file)\n -v VERSION, --version VERSION\n Print version of the program and exit\n```\n",
"bugtrack_url": null,
"license": null,
"summary": "IPython notebook generator for GGR CWL processing pipelines of genomic data",
"version": "0.5.1",
"project_urls": {
"Bug Tracker": "https://github.com/ReddyLab/ggr-cwl-ipynb-gen/issues",
"Homepage": "https://github.com/ReddyLab/ggr-cwl-ipynb-gen",
"Source": "https://github.com/ReddyLab/ggr-cwl-ipynb-gen/"
},
"split_keywords": [
"cwl",
" bioinformatics",
" development"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "9956df3a83c7953f86bb26bc66bd615acf37c4c65545c6b41a7b340074b0f979",
"md5": "ab541e3624cfdb39d0f8dd08b7c41540",
"sha256": "9881f9b3a496dec396948fe48cf80b5bd1ccc7c8000db4f5f2bbc2d6033d292b"
},
"downloads": -1,
"filename": "ggr_cwl_ipynb_gen-0.5.1-py3-none-any.whl",
"has_sig": false,
"md5_digest": "ab541e3624cfdb39d0f8dd08b7c41540",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">=3.1",
"size": 28317,
"upload_time": "2024-11-05T17:17:46",
"upload_time_iso_8601": "2024-11-05T17:17:46.876718Z",
"url": "https://files.pythonhosted.org/packages/99/56/df3a83c7953f86bb26bc66bd615acf37c4c65545c6b41a7b340074b0f979/ggr_cwl_ipynb_gen-0.5.1-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "b781a487340c8812aefa01c5db85ee763a6e8378acc0c5edd7ab9709849cd7b6",
"md5": "de9704ec2f51c6f95979e37fb6682d04",
"sha256": "1de68dcd860ae85bd09e92acf2eb8732413a3e7b25b1f2172cf745113740c1b6"
},
"downloads": -1,
"filename": "ggr_cwl_ipynb_gen-0.5.1.tar.gz",
"has_sig": false,
"md5_digest": "de9704ec2f51c6f95979e37fb6682d04",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3.1",
"size": 18079,
"upload_time": "2024-11-05T17:17:48",
"upload_time_iso_8601": "2024-11-05T17:17:48.410144Z",
"url": "https://files.pythonhosted.org/packages/b7/81/a487340c8812aefa01c5db85ee763a6e8378acc0c5edd7ab9709849cd7b6/ggr_cwl_ipynb_gen-0.5.1.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-11-05 17:17:48",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "ReddyLab",
"github_project": "ggr-cwl-ipynb-gen",
"travis_ci": false,
"coveralls": false,
"github_actions": false,
"requirements": [],
"lcname": "ggr-cwl-ipynb-gen"
}