hecatomb

Name	hecatomb
Version	1.3.3
home_page	https://github.com/shandley/hecatomb
Summary	Viral metagenomics framework for short and longreads
upload_time	2024-12-16 01:50:17
maintainer	None
docs_url	None
author	Michael Roach
requires_python	>=3.9
license	None

![](hecatombLogo.png)

[![](https://img.shields.io/static/v1?label=CLI&message=Snaketool&color=blueviolet)](https://github.com/beardymcjohnface/Snaketool)
![Anaconda-Server Badge](https://anaconda.org/bioconda/hecatomb/badges/license.svg)
![Anaconda-Server Badge](https://anaconda.org/bioconda/hecatomb/badges/latest_release_date.svg)
[![Documentation Status](https://readthedocs.org/projects/hecatomb/badge/?version=latest&style=flat-square)](https://hecatomb.readthedocs.io/en/latest/?badge=latest)
[![install with bioconda](https://img.shields.io/badge/Install%20with-conda-brightgreen.svg?style=flat-square)](http://bioconda.github.io/recipes/hecatomb/README.html)
![](https://img.shields.io/conda/dn/bioconda/hecatomb?label=Conda%20downloads&style=flat-square)
[![install with PyPI](https://img.shields.io/badge/Install%20with-PyPI-brightgreen.svg?style=flat-square)](https://pypi.org/project/hecatomb/)
[![Unit tests](https://github.com/shandley/hecatomb/actions/workflows/unit-tests.yaml/badge.svg)](https://github.com/shandley/hecatomb/actions/workflows/unit-tests.yaml)
[![Env builds](https://github.com/shandley/hecatomb/actions/workflows/build-hecatomb-envs.yaml/badge.svg)](https://github.com/shandley/hecatomb/actions/workflows/build-hecatomb-envs.yaml)

---

A [hecatomb](https://en.wiktionary.org/wiki/hecatomb) is a great sacrifice or an extensive loss. 
Hecatomb the software empowers an analyst to make data-driven decisions to *'sacrifice'* false-positive viral reads from 
metagenomes to enrich for true-positive viral reads. 
This process frequently results in a great loss of suspected viral sequences / contigs.

## Contents

- [Documentation](#documentation)
- [Citation](#citation)
- [Quick Start Guide](#quick-start-guide)
- [Inputs](#inputs)
- [Dependencies](#dependencies)
- [Links](#links)

## Documentation

[Complete documentation is hosted at Read the Docs](https://hecatomb.readthedocs.io)

## Citation

[Hecatomb is currently on BioRxiv!](https://www.biorxiv.org/content/10.1101/2022.05.15.492003v1)

## Quick start guide

### Install Hecatomb

__option 1: PIP__

```bash
# Optional: create a virtual environment with conda or venv
conda create -n hecatomb python=3.10

# activate
conda activate hecatomb

# Install
pip install hecatomb
```

__option 2: Conda__

```bash
# Create the conda env and install hecatomb in one step
conda create -n hecatomb -c conda-forge -c bioconda hecatomb

# activate
conda activate hecatomb
```

__Check installation__

```bash
hecatomb --help
```

### Install databases and envs

__Download the databases__

```bash
# 8 threads = 8 downloads at a time
hecatomb install --threads 8
```

__Optional: prebuild envs__

These are automatically built when running hecatomb, but manually pre-building is useful if your cluster nodes are isolated from the internet.

```shell
hecatomb test build_envs
```

### Run test dataset

```bash
# locally: using 32 threads and 64 GB RAM by default
hecatomb test --threads 32

# HPC: using a profile named 'slurm'
hecatomb test --profile slurm
```

### Snakemake profiles (for running on HPCs)

Hecatomb is powered by [Snakemake](https://snakemake.readthedocs.io/en/stable/#) and greatly benefits from the use of 
Snakemake profiles for HPC Clusters.
[More information and examples for setting up Snakemake profiles for Hecatomb are in the documentation](https://hecatomb.readthedocs.io/en/latest/profiles/).
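
As a rough illustration of what such a profile can look like, the sketch below writes a minimal Snakemake v7 profile for a Slurm cluster. The `PROFILE_DIR` variable, the `sbatch` options, and the `mem_mb` and `time` resource names are assumptions made for this example; check them against your cluster and the Hecatomb documentation before relying on them.

```bash
# Sketch only: write a minimal Snakemake v7 profile for Slurm.
# PROFILE_DIR is a hypothetical variable; Snakemake also looks for named
# profiles under ~/.config/snakemake/<name>/.
profile_dir="${PROFILE_DIR:-./slurm}"
mkdir -p "$profile_dir"

# Keys mirror Snakemake v7 command line options (without the leading --).
# The sbatch flags and default resources are placeholders to adapt.
cat > "$profile_dir/config.yaml" <<'EOF'
cluster: "sbatch --cpus-per-task={threads} --mem={resources.mem_mb} --time={resources.time} --parsable"
default-resources: [mem_mb=2000, time=60]
jobs: 100
latency-wait: 60
use-conda: true
rerun-incomplete: true
EOF

echo "Wrote $profile_dir/config.yaml"
```

A profile written this way could then be selected by name or directory, for example with `hecatomb test --profile slurm`.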

__NOTE: Hecatomb currently uses Snakemake version 7. 
The recent version 8 for Snakemake has some breaking changes, including some changes to the command line interface for cluster execution.
Any new Snakemake v8 profiles might not work with Hecatomb.
Please open an issue if you need help setting up a profile.__
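
If you are unsure which Snakemake series an environment provides, a small check like the one below may help. It is only a sketch: the function names are made up for this example, and it simply inspects the major component of a version string.

```bash
# Print the major component of a version string such as "7.32.4".
major_version() {
    printf '%s\n' "${1%%.*}"
}

# Report whether a Snakemake version string falls in the v7 series.
check_snakemake_version() {
    local major
    major="$(major_version "$1")"
    if [ "$major" -eq 7 ]; then
        echo "ok: Snakemake $1 is in the v7 series"
    else
        echo "warning: Snakemake $1 is not v7; v7-style profiles may not apply"
    fi
}

# Example with a literal version string; on a real system you could pass
# "$(snakemake --version)" instead.
check_snakemake_version "7.32.4"
```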

## Inputs

### Parsing samples with `--reads`

You can pass either a directory of reads or a TSV file to `--reads`. 
Note that Hecatomb expects paired read file names to include common R1/R2 tags. 
 - __Directory:__ Hecatomb will infer sample names and various R1/2 tag combinations from the filenames.
 - __TSV file:__ Hecatomb expects 2 or 3 columns, with column 1 being the sample name and columns 2 and 3 the reads files.

[More information and examples are available here](https://gist.github.com/beardymcjohnface/bb161ba04ae1042299f48a4849e917c8#file-readme-md)
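
The TSV layout described above can be generated from a directory of paired reads. The following is a minimal sketch that assumes files are named `<sample>_R1.fastq.gz` and `<sample>_R2.fastq.gz` and that no header row is needed; the sample names and the temporary directory are placeholders created only so the example runs on its own.

```bash
# Sketch: build a 3-column TSV (sample, R1, R2) from a directory of paired reads.
# The _R1/_R2 naming and the .fastq.gz suffix are assumptions for this example.
reads_dir="$(mktemp -d)"
touch "$reads_dir/sampleA_R1.fastq.gz" "$reads_dir/sampleA_R2.fastq.gz" \
      "$reads_dir/sampleB_R1.fastq.gz" "$reads_dir/sampleB_R2.fastq.gz"

samples_tsv="$reads_dir/samples.tsv"
for r1 in "$reads_dir"/*_R1.fastq.gz; do
    sample="$(basename "$r1" _R1.fastq.gz)"
    r2="${r1%_R1.fastq.gz}_R2.fastq.gz"
    # Skip samples whose R2 mate is missing rather than writing a broken row.
    [ -f "$r2" ] || continue
    printf '%s\t%s\t%s\n' "$sample" "$r1" "$r2"
done > "$samples_tsv"

cat "$samples_tsv"
```

The resulting file can be passed to `--reads` in place of the directory.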

### Longread support with `--longreads`

Pass the `--longreads` argument to tell Hecatomb that you are using longreads.

### Library preprocessing with `--trim`

Hecatomb uses [Trimnami](https://github.com/beardymcjohnface/Trimnami) for read trimming, which supports many different
trimming methods. Current options are `fastp` (default), `prinseq`, `roundAB`, `filtlong` (for longreads), 
`cutadapt` (FASTA input), and `notrim` (skip trimming). See Trimnami's documentation for more information.
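
When scripting runs over mixed datasets, the choice of `--trim` value can be derived from the input type. The helper below is a sketch based only on the options listed above; the read-type labels (`short`, `long`, `fasta`) and the function name are hypothetical.

```bash
# Sketch: map a read type to one of the --trim options listed above.
# The read-type labels are hypothetical names chosen for this example.
pick_trimmer() {
    case "$1" in
        short) echo "fastp" ;;      # documented default
        long)  echo "filtlong" ;;   # listed for longreads
        fasta) echo "cutadapt" ;;   # listed for FASTA input
        *)
            echo "unknown read type: $1" >&2
            return 1
            ;;
    esac
}

trimmer="$(pick_trimmer long)"
echo "--trim $trimmer"
```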

### Configuration

You can configure advanced parameters for Hecatomb.
Copy the default config: `hecatomb config`.
Edit the config file in your favourite text editor: `nano hecatomb.out/hecatomb.config.yaml`.

## Dependencies

The only dependency you need to get up and running with Hecatomb is [conda](https://docs.conda.io/en/latest/) or 
the Python package manager [pip](https://pypi.org/project/pip/).
Hecatomb relies on [conda](https://docs.conda.io/en/latest/) to ensure portability and ease of installation of its dependencies.
All of Hecatomb's dependencies are installed during installation or runtime, so you don't have to worry about a thing!

## Links

[Hecatomb @ PyPI](https://pypi.org/project/hecatomb/)

[Hecatomb @ bioconda](https://bioconda.github.io/recipes/hecatomb/README.html)

[Hecatomb @ bio.tools](https://bio.tools/hecatomb)

[Hecatomb @ WorkflowHub](https://workflowhub.eu/workflows/235)

[Hecatomb RRID:SCR_025002](https://scicrunch.org/resources/data/record/nlx_144509-1/SCR_025002/resolver)

Raw data

{
    "_id": null,
    "home_page": "https://github.com/shandley/hecatomb",
    "name": "hecatomb",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.9",
    "maintainer_email": null,
    "keywords": null,
    "author": "Michael Roach",
    "author_email": "beardymcjohnface@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/b8/5a/33086cb3436426fc88a4f511059d4e7b1e78c676c183ada88bfadc8a0126/hecatomb-1.3.3.tar.gz",
    "platform": null,
    "bugtrack_url": null,
    "license": null,
    "summary": "Viral metagenomics framework for short and longreads",
    "version": "1.3.3",
    "project_urls": {
        "Homepage": "https://github.com/shandley/hecatomb"
    },
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "9a132677e5d4c737ce094d4af2cfc10a4f954db42d3d6d1673c0db4b16122339",
                "md5": "23fcdfd0056d3a578cc8627a5a075113",
                "sha256": "cdd9a593c887708636ca78120c97378cabd0601957685d47dc91e486a5b825f6"
            },
            "downloads": -1,
            "filename": "hecatomb-1.3.3-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "23fcdfd0056d3a578cc8627a5a075113",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.9",
            "size": 98565897,
            "upload_time": "2024-12-16T01:50:03",
            "upload_time_iso_8601": "2024-12-16T01:50:03.146126Z",
            "url": "https://files.pythonhosted.org/packages/9a/13/2677e5d4c737ce094d4af2cfc10a4f954db42d3d6d1673c0db4b16122339/hecatomb-1.3.3-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "b85a33086cb3436426fc88a4f511059d4e7b1e78c676c183ada88bfadc8a0126",
                "md5": "844327fd25991789fbcd2bc6c1c58336",
                "sha256": "64a6d492522a310e7a8c1543ab8c026efbc7d41b9a80fdaa9ff1ded73a19d6e4"
            },
            "downloads": -1,
            "filename": "hecatomb-1.3.3.tar.gz",
            "has_sig": false,
            "md5_digest": "844327fd25991789fbcd2bc6c1c58336",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.9",
            "size": 98546805,
            "upload_time": "2024-12-16T01:50:17",
            "upload_time_iso_8601": "2024-12-16T01:50:17.544802Z",
            "url": "https://files.pythonhosted.org/packages/b8/5a/33086cb3436426fc88a4f511059d4e7b1e78c676c183ada88bfadc8a0126/hecatomb-1.3.3.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-12-16 01:50:17",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "shandley",
    "github_project": "hecatomb",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "lcname": "hecatomb"
}
