pathogena

Name	pathogena JSON
Version	2.0.1 JSON
	download
home_page	None
Summary	The command line and Python client for EIT Pathogena.
upload_time	2024-11-07 16:43:33
maintainer	None
docs_url	None
author	None
requires_python	>=3.10
license	None
keywords	pathogen pathogena eit gpas
VCS
bugtrack_url
requirements	No requirements were recorded.
Travis-CI	No Travis.
coveralls test coverage	No coveralls.

            # EIT Pathogena Client

The command line interface for the EIT Pathogena platform.

The client enables privacy-preserving sequence data submission and retrieval of analytical output files. Prior to
upload, sample identifiers are anonymised and human host sequences are removed. A computer with Linux or MacOS is
required to use the client. When running human read removal prior to upload a computer with a modern multi-core
processor and at least 16GB of RAM is recommended.

## Install

There are two recommended methods for installing the Pathogena Client, either by using the popular package and
environment manager Conda or by using our publicly available Docker container which we build at release time.

### Installing Miniconda

If a Conda package manager is already installed, skip to [Installing the client](#installing-or-updating-the-client-with-miniconda),
otherwise the following instructions have been taken from the [Miniconda install process documentation](https://docs.anaconda.com/miniconda/miniconda-install/)

#### Installing Miniconda on Linux

In a terminal console, install Miniconda with the following instructions and accepting default options:

    ```bash
    mkdir -p ~/miniconda3
    wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh -O ~/miniconda3/miniconda.sh
    bash ~/miniconda3/miniconda.sh -b -u -p ~/miniconda3
    rm -rf ~/miniconda3/miniconda.sh
    ```

#### Installing Miniconda on MacOS

The client requires the Conda platform to be using `x86_64` when creating the environment.

- If your Mac has an Apple processor, using Terminal, firstly run:
    ```bash
    mkdir -p ~/miniconda3
    curl https://repo.anaconda.com/miniconda/Miniconda3-latest-MacOSX-arm64.sh -o ~/miniconda3/miniconda.sh
    bash ~/miniconda3/miniconda.sh -b -u -p ~/miniconda3
    rm -rf ~/miniconda3/miniconda.sh
    ```

- Initialise Miniconda using either of the following commands depending on your Shell (Bash|ZSH)
    ```bash
    ~/miniconda3/bin/conda init bash
    ~/miniconda3/bin/conda init zsh
    ```

### Installing or updating the client with Miniconda

The client has at least one dependency that requires `bioconda`, which itself
depends on `conda-forge`. Note that for the `conda create` step (see below), installation can be very slow,
so please leave it running. For more verbose output, you can add the `-v` or `-vv` flags, though
it is not recommended to show the full debug output with `-vvv` as this has been seen to lead to OOM errors.

#### Linux

```bash
conda create -y -n pathogena -c conda-forge -c bioconda hostile==1.1.0
conda activate pathogena
pip install --upgrade pathogena
```

#### MacOS

Please note the additional argument `--platform osx-64` in this command, compared to the above.

```bash
conda create --platform osx-64 -y -n pathogena -c conda-forge -c bioconda hostile==1.1.0
conda activate pathogena
pip install --upgrade pathogena
```

A simple test to verify installation would be to run a version check:

```bash
pathogena --version
```
## `pathogena auth`

```text
Usage: pathogena auth [OPTIONS]

  Authenticate with EIT Pathogena.

Options:
  --host                          API hostname (for development)
  --check-expiry                   Check for a current token and print the
                                  expiry if exists
  -h, --help                      Show this message and exit.
```

Most actions with the EIT Pathogena CLI require that the user have first authenticated with the EIT Pathogena server
with their login credentials. Upon successfully authentication, a bearer token is stored in the user's home directory
and will be used on subsequent CLI usage.

The token is valid for 7 days and a new token can be retrieved at anytime.

### Usage

Running `pathogena auth` will ask for your username and password for EIT Pathogena, your password will not be shown
in the terminal session.

```bash
$ pathogena auth

14:04:31 INFO: EIT Pathogena client version 2.0.0rc1
14:04:31 INFO: Authenticating with portal.eit-pathogena.com
Enter your username: pathogena-user@eit.org
Enter your password:
14:04:50 INFO: Authenticated (/Users/jdhillon/.config/pathogena/tokens/portal.eit-pathogena.com.json)
```

#### Troubleshooting Authentication

##### How do I get an account for EIT Pathogena?

Creating a Personal Account:

Navigate to EIT Pathogena and click on “Sign Up”. Follow the instructions to create a user account.

Shortly after filling out the form you'll receive a verification email. Click the link in the email to verify your
account and email address. If you don’t receive the email, please contact pathogena.support@eit.org.

You are now ready to start using EIT Pathogena.

##### What happens when my token expires?

If you haven't already retrieved a token, you will receive the following error message.

```bash No token file
$ pathogena upload tests/data/illumina-2.csv

12:46:42 INFO: EIT Pathogena client version 2.0.0rc1
12:46:43 INFO: Getting credit balance for portal.eit-pathogena.com
12:46:43 ERROR: FileNotFoundError: Token not found at /Users/jdhillon/.config/pathogena/tokens/portal.eit-pathogena.com.json, have you authenticated?
```

If your token is invalid or expired, you will receive the following message

```text Invalid token
14:03:26 INFO: EIT Pathogena client version 2.0.0rc1
14:03:26 ERROR: AuthorizationError: Authorization checks failed! Please re-authenticate with `pathogena auth` and
try again.
```

##### How can I check my token expiry before long running processes?

You can check the expiry of your token with the following command:

```bash
$ pathogena auth --check-expiry
14:05:52 INFO: EIT Pathogena client version 2.0.0rc1
14:05:52 INFO: Current token for portal.eit-pathogena.com expires at 2024-08-13 14:04:50.672085
```
## `pathogena balance`

```bash balance help
pathogena balance -h
15:55:36 INFO: EIT Pathogena client version 2.0.0
Usage: pathogena balance [OPTIONS]

  Check your EIT Pathogena account balance.

Options:
  --host TEXT  API hostname (for development)
  -h, --help   Show this message and exit.
```

Credits are required to upload samples and initiate the analysis process. Users can check their credit balance in the
header of the Pathogena Portal or by using the `pathogena balance` command when logged in.

### Usage

```bash balance usage
pathogena balance
15:56:56 INFO: EIT Pathogena client version 2.0.0
15:56:56 INFO: Getting credit balance for portal.eit-pathogena.com
15:56:57 INFO: Your remaining account balance is 1000 credits
```
## `pathogena upload`

```text
Usage: pathogena upload [OPTIONS] UPLOAD_CSV

  Validate, decontaminate and upload reads to EIT Pathogena. Creates a mapping
  CSV file which can be used to download output files with original sample
  names.

Options:
  --threads INTEGER               Number of alignment threads used during decontamination
  --save                          Retain decontaminated reads after upload completion
  --host                           API hostname (for development)
  --skip-fastq-check              Skip checking FASTQ files for validity
  --skip-decontamination          Run decontamination prior to upload
  --output-dir DIRECTORY          Output directory for the cleaned FastQ files,
                                  defaults to the current working directory.
  -h, --help                      Show this message and exit.
```

> Where samples may contain human reads we strongly recommend using the provided decontamination functionality. This is
best practice to minimise the risk of personally identifiable information being uploaded to the cloud.

The upload command performs metadata validation and client-side removal of human reads for each of your samples,
before uploading sequences to EIT Pathogena for analysis.

To generate a CSV file to use with this command see the [build-csv](./build-csv.md) documentation. 

### Credits

Credits are required to upload samples and initiate the analysis process. Users can check their credit balance in the
header of the Pathogena Portal or by using the `pathogena balance` command. More information can be found in the
`pathogena balance` section.

Each sample for Mycobacterium genomic sequencing will require 10 credits. During the upload command process,
a balance check is performed to ensure the user has enough credits for the number of samples in the batch. Credits are
then deducted when sample files are successfully uploaded and ready for processing.

### Human Read Removal

A 4GB human genome index is downloaded the first time you run `pathogena upload`. If for any reason this is interrupted,
run the upload command again. Upload will not proceed until the index has been downloaded and passed an integrity
check. You may optionally download the index ahead of time using the command `pathogena download-index`.

By default, the upload command will first run `pathogena decontaminate` to attempt to remove human reads prior to
uploading the input samples to EIT Pathogena, this option can be overridden but only do so if you're aware of the risks
stated above.

To retain the decontaminated FASTQ files uploaded to EIT Pathogena, include the optional `--save` flag. To perform
decontamination without uploading anything, use the `pathogena decontaminate` command.

During upload, a mapping CSV is created (e.g. `a5w2e8.mapping.csv`) linking your local sample names with their randomly
generated remote names. Keep this file safe, as it is useful for downloading and relinking results later, it cannot be
recreated after this step without re-uploading the same samples again.

### Usage

```bash Upload with running human read removal
pathogena upload my-first-batch.csv
15:41:57 INFO: EIT Pathogena client version 2.0.0
15:41:57 INFO: Getting credit balance for portal.eit-pathogena.com
15:41:59 INFO: Your remaining account balance is 1000 credits
15:41:59 INFO: Performing FastQ checks and gathering total reads
15:41:59 INFO: Calculating read count in: /Users/jdhillon/samples/ERR4809187_1.fastq.gz
15:42:00 INFO: Calculating read count in: /Users/jdhillon/samples/ERR4809187_2.fastq.gz
15:42:02 INFO: 3958206.0 reads in FASTQ file
15:42:02 INFO: Removing human reads from ILLUMINA FastQ files and storing in /Users/jdhillon/code/pathogena/client
15:42:02 INFO: Hostile version 1.1.0. Mode: paired short read (Bowtie2)
15:42:02 INFO: Found cached standard index human-t2t-hla-argos985-mycob140
15:42:02 INFO: Cleaning...
15:43:39 INFO: Cleaning complete
15:43:39 INFO: The mapping file gx5y5p.mapping.csv has been created.
15:43:39 INFO: You can monitor the progress of your batch in EIT Pathogena here: "..."
15:43:39 INFO: Uploading my-first-sample
15:45:27 INFO:   Uploaded 66433ffc-3c10-4576-8502-56b4805c7ecc_1.fastq.gz
15:45:27 INFO: Uploading my-first-sample
15:49:20 INFO:   Uploaded 66433ffc-3c10-4576-8502-56b4805c7ecc_2.fastq.gz
15:49:21 INFO: Upload complete. Created gx5y5p.mapping.csv (keep this safe)
15:49:21 INFO: Getting credit balance for portal.eit-pathogena.com
15:49:23 INFO: Your remaining account balance is 990 credits
```

```bash Upload without human read removal
pathogena upload --skip-decontamination my-first-batch.csv
15:41:57 INFO: EIT Pathogena client version 2.0.0
15:41:57 INFO: Getting credit balance for portal.eit-pathogena.com
15:41:59 INFO: Your remaining account balance is 1000 credits
15:41:59 INFO: Performing FastQ checks and gathering total reads
15:41:59 INFO: Calculating read count in: /Users/jdhillon/samples/ERR4809187_1.fastq.gz
15:42:00 INFO: Calculating read count in: /Users/jdhillon/samples/ERR4809187_2.fastq.gz
15:42:02 INFO: 3958206.0 reads in FASTQ file
15:42:02 INFO: Removing human reads from ILLUMINA FastQ files and storing in /Users/jdhillon/code/pathogena/client
15:43:39 INFO: The mapping file gx5y5p.mapping.csv has been created.
15:43:39 INFO: You can monitor the progress of your batch in EIT Pathogena here: "..."
15:43:39 INFO: Uploading my-first-sample
15:45:27 INFO:   Uploaded 66433ffc-3c10-4576-8502-56b4805c7ecc_1.fastq.gz
15:45:27 INFO: Uploading my-first-sample
15:49:20 INFO:   Uploaded 66433ffc-3c10-4576-8502-56b4805c7ecc_2.fastq.gz
15:49:21 INFO: Upload complete. Created gx5y5p.mapping.csv (keep this safe)
15:49:21 INFO: Getting credit balance for portal.eit-pathogena.com
15:49:23 INFO: Your remaining account balance is 990 credits
```
## `pathogena decontaminate`

```text
Usage: pathogena decontaminate [OPTIONS] INPUT_CSV

  Decontaminate reads from a CSV file.

Options:
  --output-dir DIRECTORY  Output directory for the cleaned FastQ files,
                          defaults to the current working directory.
  --threads INTEGER       Number of alignment threads used during
                          decontamination
  --skip-fastq-check      Skip checking FASTQ files for validity
  -h, --help              Show this message and exit.
 ```

This command will attempt to remove human reads from a given input CSV file, in the same structure as the input CSV that
would be used for uploading to EIT Pathogena, an [example can be found here](assets/example-input.csv).

By default, the processed files will be output in the same directory that the command is run in, but you can choose a
different directory with the `--output-dir` argument.

### Usage

```bash
$ pathogena decontaminate tests/data/illumina.csv
15:24:39 INFO: EIT Pathogena client version 2.0.0rc1
15:24:39 INFO: Performing FastQ checks and gathering total reads
15:24:39 INFO: Calculating read count in: /Users/jdhillon/code/pathogena/client/tests/data/reads/tuberculosis_1_1.fastq
15:24:39 INFO: Calculating read count in: /Users/jdhillon/code/pathogena/client/tests/data/reads/tuberculosis_1_2.fastq
15:24:39 INFO: 2.0 reads in FASTQ file
15:24:39 INFO: Removing human reads from ILLUMINA FastQ files and storing in /Users/jdhillon/code/pathogena/client
15:24:39 INFO: Hostile version 1.1.0. Mode: paired short read (Bowtie2)
15:24:39 INFO: Found cached standard index human-t2t-hla-argos985-mycob140
15:24:39 INFO: Cleaning...
15:24:39 INFO: Cleaning complete
15:24:39 INFO: Human reads removed from input samples and can be found here: /Users/jdhillon/code/pathogena/client
```
## `pathogena download`

```text
$ pathogena download -h
16:07:34 INFO: EIT Pathogena client version 2.0.0rc1
Usage: pathogena download [OPTIONS] SAMPLES

  Download input and output files associated with sample IDs or a mapping CSV
  file created during upload.

Options:
  --filenames TEXT        Comma-separated list of output filenames to download
  --inputs                Also download decontaminated input FASTQ file(s)
  --output-dir DIRECTORY  Output directory for the downloaded files.
  --rename / --no-rename  Rename downloaded files using sample names when
                          given a mapping CSV
  --host TEXT             API hostname (for development)
  -h, --help              Show this message and exit.
```

The download command retrieves the output (and/or input) files associated with a batch of samples given a mapping CSV
generated during upload, or one or more sample GUIDs. When a mapping CSV is used, by default downloaded file names are
prefixed with the sample names provided at upload. Otherwise, downloaded files are prefixed with the sample GUID.

### Usage

```bash
# Download the main reports for all samples in a5w2e8.mapping.csv
pathogena download a5w2e8.mapping.csv

# Download the main and speciation reports for all samples in a5w2e8.mapping.csv
pathogena download a5w2e8.mapping.csv --filenames main_report.json,speciation_report.json

# Download the main report for one sample
pathogena download 3bf7d6f9-c883-4273-adc0-93bb96a499f6

# Download the final assembly for one M. tuberculosis sample
pathogena download 3bf7d6f9-c883-4273-adc0-93bb96a499f6 --filenames final.fasta

# Download the main report for two samples
pathogena download 3bf7d6f9-c883-4273-adc0-93bb96a499f6,6f004868-096b-4587-9d50-b13e09d01882

# Save downloaded files to a specific directory
pathogena download a5w2e8.mapping.csv --output-dir results

# Download only input fastqs
pathogena download a5w2e8.mapping.csv --inputs --filenames ""
```

The complete list of `--filenames` available for download varies by sample, and can be found in the Downloads section of
sample view pages in EIT Pathogena.
## `pathogena validate`

```text
$ pathogena validate -h
16:00:13 INFO: EIT Pathogena client version 2.0.0rc1
Usage: pathogena validate [OPTIONS] UPLOAD_CSV

  Validate a given upload CSV.

Options:
  --host TEXT  API hostname (for development)
  -h, --help   Show this message and exit.
```

The `validate` command will check that a Batch can be created from a given CSV and if your user account has permission
to upload the samples, the individual FastQ files are then checked for validity. These checks are already performed
by default with the `upload` command but using this can ensure validity without commiting to the subsequent upload
if you're looking to check a CSV during writing it.
## `pathogena query-raw`

```text
pathogena query-raw -h
15:36:39 INFO: EIT Pathogena client version 2.0.0rc1
Usage: pathogena query-raw [OPTIONS] SAMPLES

  Fetch metadata for one or more SAMPLES in JSON format.
  SAMPLES should be command separated list of GUIDs or path to mapping CSV.

Options:
  --host TEXT  API hostname (for development)
  -h, --help   Show this message and exit.
```

The `query-raw` command fetches either the raw metadata of one more samples given a mapping CSV
generated during upload, or one or more sample GUIDs.

### Usage

```bash
# Query all available metadata in JSON format
pathogena query-raw a5w2e8.mapping.csv
```
## `pathogena query-status`

```text
pathogena query-status -h
15:36:39 INFO: EIT Pathogena client version 2.0.0rc1
Usage: pathogena query-status [OPTIONS] SAMPLES

  Fetch processing status for one or more SAMPLES in JSON format.
  SAMPLES should be command separated list of GUIDs or path to mapping CSV.

Options:
  --host TEXT  API hostname (for development)
  -h, --help   Show this message and exit.
```

The `query-status` command fetches the current processing status of one or more samples in a mapping CSV
generated during upload, or one or more sample GUIDs.

### Usage

```bash
# Query the processing status of all samples in a5w2e8.mapping.csv
pathogena query-status a5w2e8.mapping.csv

# Query the processing status of a single sample
pathogena query-status 3bf7d6f9-c883-4273-adc0-93bb96a499f6
```
## `pathogena autocomplete`

This command will output the steps required to enable auto-completion in either a Bash or ZSH shell, follow the output
to enable autocompletion, this will need to be executed on every new shell session, instructions are provided on how to
make this permanent depending on your environment. More information and instructions for other shells can be found in
the [Click documentation](https://click.palletsprojects.com/en/8.1.x/shell-completion/).

### Usage

```bash
$ pathogena autocomplete
Run this command to enable autocompletion:
    eval "$(_PATHOGENA_COMPLETE=bash_source pathogena)"
Add this to your ~/.bashrc file to enable this permanently:
    command -v pathogena > /dev/null 2>&1 && eval "$(_PATHOGENA_COMPLETE=bash_source pathogena)"
```

Tab completion can optionally be enabled by adding the lines output by the command to your shell source files.
This will enable the ability to press tab after writing `pathogena ` to list possible sub-commands. It can also be used
for sub-command options, if `--` is entered prior to pressing tab.

## Support

For technical support, please open an issue or contact pathogena.support@eit.org

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "pathogena",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.10",
    "maintainer_email": "EIT Pathogena Devs <pathogena.support@eit.org>, Jay Dhillon <jdhillon@eit.org>",
    "keywords": "pathogen, pathogena, eit, gpas",
    "author": null,
    "author_email": "Jay Dhillon <jdhillon@eit.org>",
    "download_url": "https://files.pythonhosted.org/packages/24/b6/5345e95cc94058930a06467e87359619d2dc30c2e9274f75df1f50c18ac2/pathogena-2.0.1.tar.gz",
    "platform": null,
    "description": "# EIT Pathogena Client\n\nThe command line interface for the EIT Pathogena platform.\n\nThe client enables privacy-preserving sequence data submission and retrieval of analytical output files. Prior to\nupload, sample identifiers are anonymised and human host sequences are removed. A computer with Linux or MacOS is\nrequired to use the client. When running human read removal prior to upload a computer with a modern multi-core\nprocessor and at least 16GB of RAM is recommended.\n\n## Install\n\nThere are two recommended methods for installing the Pathogena Client, either by using the popular package and\nenvironment manager Conda or by using our publicly available Docker container which we build at release time.\n\n### Installing Miniconda\n\nIf a Conda package manager is already installed, skip to [Installing the client](#installing-or-updating-the-client-with-miniconda),\notherwise the following instructions have been taken from the [Miniconda install process documentation](https://docs.anaconda.com/miniconda/miniconda-install/)\n\n#### Installing Miniconda on Linux\n\nIn a terminal console, install Miniconda with the following instructions and accepting default options:\n\n    ```bash\n    mkdir -p ~/miniconda3\n    wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh -O ~/miniconda3/miniconda.sh\n    bash ~/miniconda3/miniconda.sh -b -u -p ~/miniconda3\n    rm -rf ~/miniconda3/miniconda.sh\n    ```\n\n#### Installing Miniconda on MacOS\n\nThe client requires the Conda platform to be using `x86_64` when creating the environment.\n\n- If your Mac has an Apple processor, using Terminal, firstly run:\n    ```bash\n    mkdir -p ~/miniconda3\n    curl https://repo.anaconda.com/miniconda/Miniconda3-latest-MacOSX-arm64.sh -o ~/miniconda3/miniconda.sh\n    bash ~/miniconda3/miniconda.sh -b -u -p ~/miniconda3\n    rm -rf ~/miniconda3/miniconda.sh\n    ```\n\n- Initialise Miniconda using either of the following commands depending on your Shell (Bash|ZSH)\n    ```bash\n    ~/miniconda3/bin/conda init bash\n    ~/miniconda3/bin/conda init zsh\n    ```\n\n### Installing or updating the client with Miniconda\n\nThe client has at least one dependency that requires `bioconda`, which itself\ndepends on `conda-forge`. Note that for the `conda create` step (see below), installation can be very slow,\nso please leave it running. For more verbose output, you can add the `-v` or `-vv` flags, though\nit is not recommended to show the full debug output with `-vvv` as this has been seen to lead to OOM errors.\n\n#### Linux\n\n```bash\nconda create -y -n pathogena -c conda-forge -c bioconda hostile==1.1.0\nconda activate pathogena\npip install --upgrade pathogena\n```\n\n#### MacOS\n\nPlease note the additional argument `--platform osx-64` in this command, compared to the above.\n\n```bash\nconda create --platform osx-64 -y -n pathogena -c conda-forge -c bioconda hostile==1.1.0\nconda activate pathogena\npip install --upgrade pathogena\n```\n\nA simple test to verify installation would be to run a version check:\n\n```bash\npathogena --version\n```\n## `pathogena auth`\n\n```text\nUsage: pathogena auth [OPTIONS]\n\n  Authenticate with EIT Pathogena.\n\nOptions:\n  --host                          API hostname (for development)\n  --check-expiry                   Check for a current token and print the\n                                  expiry if exists\n  -h, --help                      Show this message and exit.\n```\n\nMost actions with the EIT Pathogena CLI require that the user have first authenticated with the EIT Pathogena server\nwith their login credentials. Upon successfully authentication, a bearer token is stored in the user's home directory\nand will be used on subsequent CLI usage.\n\nThe token is valid for 7 days and a new token can be retrieved at anytime.\n\n### Usage\n\nRunning `pathogena auth` will ask for your username and password for EIT Pathogena, your password will not be shown\nin the terminal session.\n\n```bash\n$ pathogena auth\n\n14:04:31 INFO: EIT Pathogena client version 2.0.0rc1\n14:04:31 INFO: Authenticating with portal.eit-pathogena.com\nEnter your username: pathogena-user@eit.org\nEnter your password:\n14:04:50 INFO: Authenticated (/Users/jdhillon/.config/pathogena/tokens/portal.eit-pathogena.com.json)\n```\n\n#### Troubleshooting Authentication\n\n##### How do I get an account for EIT Pathogena?\n\nCreating a Personal Account:\n\nNavigate to EIT Pathogena and click on \u201cSign Up\u201d. Follow the instructions to create a user account.\n\nShortly after filling out the form you'll receive a verification email. Click the link in the email to verify your\naccount and email address. If you don\u2019t receive the email, please contact pathogena.support@eit.org.\n\nYou are now ready to start using EIT Pathogena.\n\n##### What happens when my token expires?\n\nIf you haven't already retrieved a token, you will receive the following error message.\n\n```bash No token file\n$ pathogena upload tests/data/illumina-2.csv\n\n12:46:42 INFO: EIT Pathogena client version 2.0.0rc1\n12:46:43 INFO: Getting credit balance for portal.eit-pathogena.com\n12:46:43 ERROR: FileNotFoundError: Token not found at /Users/jdhillon/.config/pathogena/tokens/portal.eit-pathogena.com.json,\u00a0have you authenticated?\n```\n\nIf your token is invalid or expired, you will receive the following message\n\n```text Invalid token\n14:03:26 INFO: EIT Pathogena client version 2.0.0rc1\n14:03:26 ERROR: AuthorizationError: Authorization checks failed! Please re-authenticate with `pathogena auth` and\ntry again.\n```\n\n##### How can I check my token expiry before long running processes?\n\nYou can check the expiry of your token with the following command:\n\n```bash\n$ pathogena auth --check-expiry\n14:05:52 INFO: EIT Pathogena client version 2.0.0rc1\n14:05:52 INFO: Current token for portal.eit-pathogena.com expires at 2024-08-13 14:04:50.672085\n```\n## `pathogena balance`\n\n```bash balance help\npathogena balance -h\n15:55:36 INFO: EIT Pathogena client version 2.0.0\nUsage: pathogena balance [OPTIONS]\n\n  Check your EIT Pathogena account balance.\n\nOptions:\n  --host TEXT  API hostname (for development)\n  -h, --help   Show this message and exit.\n```\n\nCredits are required to upload samples and initiate the analysis process. Users can check their credit balance in the\nheader of the Pathogena Portal or by using the `pathogena balance` command when logged in.\n\n### Usage\n\n```bash balance usage\npathogena balance\n15:56:56 INFO: EIT Pathogena client version 2.0.0\n15:56:56 INFO: Getting credit balance for portal.eit-pathogena.com\n15:56:57 INFO: Your remaining account balance is 1000 credits\n```\n## `pathogena upload`\n\n```text\nUsage: pathogena upload [OPTIONS] UPLOAD_CSV\n\n  Validate, decontaminate and upload reads to EIT Pathogena. Creates a mapping\n  CSV file which can be used to download output files with original sample\n  names.\n\nOptions:\n  --threads INTEGER               Number of alignment threads used during decontamination\n  --save                          Retain decontaminated reads after upload completion\n  --host                           API hostname (for development)\n  --skip-fastq-check              Skip checking FASTQ files for validity\n  --skip-decontamination          Run decontamination prior to upload\n  --output-dir DIRECTORY          Output directory for the cleaned FastQ files,\n                                  defaults to the current working directory.\n  -h, --help                      Show this message and exit.\n```\n\n> Where samples may contain human reads we strongly recommend using the provided decontamination functionality. This is\nbest practice to minimise the risk of personally identifiable information being uploaded to the cloud.\n\nThe upload command performs metadata validation and client-side removal of human reads for each of your samples,\nbefore uploading sequences to EIT Pathogena for analysis.\n\nTo generate a CSV file to use with this command see the [build-csv](./build-csv.md) documentation. \n\n### Credits\n\nCredits are required to upload samples and initiate the analysis process. Users can check their credit balance in the\nheader of the Pathogena Portal or by using the `pathogena balance` command. More information can be found in the\n`pathogena balance` section.\n\nEach sample for Mycobacterium genomic sequencing will require 10 credits. During the upload command process,\na balance check is performed to ensure the user has enough credits for the number of samples in the batch. Credits are\nthen deducted when sample files are successfully uploaded and ready for processing.\n\n### Human Read Removal\n\nA 4GB human genome index is downloaded the first time you run `pathogena upload`. If for any reason this is interrupted,\nrun the upload command again. Upload will not proceed until the index has been downloaded and passed an integrity\ncheck. You may optionally download the index ahead of time using the command `pathogena download-index`.\n\nBy default, the upload command will first run `pathogena decontaminate` to attempt to remove human reads prior to\nuploading the input samples to EIT Pathogena, this option can be overridden but only do so if you're aware of the risks\nstated above.\n\nTo retain the decontaminated FASTQ files uploaded to EIT Pathogena, include the optional `--save` flag. To perform\ndecontamination without uploading anything, use the `pathogena decontaminate` command.\n\nDuring upload, a mapping CSV is created (e.g. `a5w2e8.mapping.csv`) linking your local sample names with their randomly\ngenerated remote names. Keep this file safe, as it is useful for downloading and relinking results later, it cannot be\nrecreated after this step without re-uploading the same samples again.\n\n### Usage\n\n```bash Upload with running human read removal\npathogena upload my-first-batch.csv\n15:41:57 INFO: EIT Pathogena client version 2.0.0\n15:41:57 INFO: Getting credit balance for portal.eit-pathogena.com\n15:41:59 INFO: Your remaining account balance is 1000 credits\n15:41:59 INFO: Performing FastQ checks and gathering total reads\n15:41:59 INFO: Calculating read count in: /Users/jdhillon/samples/ERR4809187_1.fastq.gz\n15:42:00 INFO: Calculating read count in: /Users/jdhillon/samples/ERR4809187_2.fastq.gz\n15:42:02 INFO: 3958206.0 reads in FASTQ file\n15:42:02 INFO: Removing human reads from ILLUMINA FastQ files and storing in /Users/jdhillon/code/pathogena/client\n15:42:02 INFO: Hostile version 1.1.0. Mode: paired short read (Bowtie2)\n15:42:02 INFO: Found cached standard index human-t2t-hla-argos985-mycob140\n15:42:02 INFO: Cleaning...\n15:43:39 INFO: Cleaning complete\n15:43:39 INFO: The mapping file gx5y5p.mapping.csv has been created.\n15:43:39 INFO: You can monitor the progress of your batch in EIT Pathogena here: \"...\"\n15:43:39 INFO: Uploading my-first-sample\n15:45:27 INFO:   Uploaded 66433ffc-3c10-4576-8502-56b4805c7ecc_1.fastq.gz\n15:45:27 INFO: Uploading my-first-sample\n15:49:20 INFO:   Uploaded 66433ffc-3c10-4576-8502-56b4805c7ecc_2.fastq.gz\n15:49:21 INFO: Upload complete. Created gx5y5p.mapping.csv (keep this safe)\n15:49:21 INFO: Getting credit balance for portal.eit-pathogena.com\n15:49:23 INFO: Your remaining account balance is 990 credits\n```\n\n```bash Upload without human read removal\npathogena upload --skip-decontamination my-first-batch.csv\n15:41:57 INFO: EIT Pathogena client version 2.0.0\n15:41:57 INFO: Getting credit balance for portal.eit-pathogena.com\n15:41:59 INFO: Your remaining account balance is 1000 credits\n15:41:59 INFO: Performing FastQ checks and gathering total reads\n15:41:59 INFO: Calculating read count in: /Users/jdhillon/samples/ERR4809187_1.fastq.gz\n15:42:00 INFO: Calculating read count in: /Users/jdhillon/samples/ERR4809187_2.fastq.gz\n15:42:02 INFO: 3958206.0 reads in FASTQ file\n15:42:02 INFO: Removing human reads from ILLUMINA FastQ files and storing in /Users/jdhillon/code/pathogena/client\n15:43:39 INFO: The mapping file gx5y5p.mapping.csv has been created.\n15:43:39 INFO: You can monitor the progress of your batch in EIT Pathogena here: \"...\"\n15:43:39 INFO: Uploading my-first-sample\n15:45:27 INFO:   Uploaded 66433ffc-3c10-4576-8502-56b4805c7ecc_1.fastq.gz\n15:45:27 INFO: Uploading my-first-sample\n15:49:20 INFO:   Uploaded 66433ffc-3c10-4576-8502-56b4805c7ecc_2.fastq.gz\n15:49:21 INFO: Upload complete. Created gx5y5p.mapping.csv (keep this safe)\n15:49:21 INFO: Getting credit balance for portal.eit-pathogena.com\n15:49:23 INFO: Your remaining account balance is 990 credits\n```\n## `pathogena decontaminate`\n\n```text\nUsage: pathogena decontaminate [OPTIONS] INPUT_CSV\n\n  Decontaminate reads from a CSV file.\n\nOptions:\n  --output-dir DIRECTORY  Output directory for the cleaned FastQ files,\n                          defaults to the current working directory.\n  --threads INTEGER       Number of alignment threads used during\n                          decontamination\n  --skip-fastq-check      Skip checking FASTQ files for validity\n  -h, --help              Show this message and exit.\n ```\n\nThis command will attempt to remove human reads from a given input CSV file, in the same structure as the input CSV that\nwould be used for uploading to EIT Pathogena, an [example can be found here](assets/example-input.csv).\n\nBy default, the processed files will be output in the same directory that the command is run in, but you can choose a\ndifferent directory with the `--output-dir` argument.\n\n### Usage\n\n```bash\n$ pathogena decontaminate tests/data/illumina.csv\n15:24:39 INFO: EIT Pathogena client version 2.0.0rc1\n15:24:39 INFO: Performing FastQ checks and gathering total reads\n15:24:39 INFO: Calculating read count in: /Users/jdhillon/code/pathogena/client/tests/data/reads/tuberculosis_1_1.fastq\n15:24:39 INFO: Calculating read count in: /Users/jdhillon/code/pathogena/client/tests/data/reads/tuberculosis_1_2.fastq\n15:24:39 INFO: 2.0 reads in FASTQ file\n15:24:39 INFO: Removing human reads from ILLUMINA FastQ files and storing in /Users/jdhillon/code/pathogena/client\n15:24:39 INFO: Hostile version 1.1.0. Mode: paired short read (Bowtie2)\n15:24:39 INFO: Found cached standard index human-t2t-hla-argos985-mycob140\n15:24:39 INFO: Cleaning...\n15:24:39 INFO: Cleaning complete\n15:24:39 INFO: Human reads removed from input samples and can be found here: /Users/jdhillon/code/pathogena/client\n```\n## `pathogena download`\n\n```text\n$ pathogena download -h\n16:07:34 INFO: EIT Pathogena client version 2.0.0rc1\nUsage: pathogena download [OPTIONS] SAMPLES\n\n  Download input and output files associated with sample IDs or a mapping CSV\n  file created during upload.\n\nOptions:\n  --filenames TEXT        Comma-separated list of output filenames to download\n  --inputs                Also download decontaminated input FASTQ file(s)\n  --output-dir DIRECTORY  Output directory for the downloaded files.\n  --rename / --no-rename  Rename downloaded files using sample names when\n                          given a mapping CSV\n  --host TEXT             API hostname (for development)\n  -h, --help              Show this message and exit.\n```\n\nThe download command retrieves the output (and/or input) files associated with a batch of samples given a mapping CSV\ngenerated during upload, or one or more sample GUIDs. When a mapping CSV is used, by default downloaded file names are\nprefixed with the sample names provided at upload. Otherwise, downloaded files are prefixed with the sample GUID.\n\n### Usage\n\n```bash\n# Download the main reports for all samples in a5w2e8.mapping.csv\npathogena download a5w2e8.mapping.csv\n\n# Download the main and speciation reports for all samples in a5w2e8.mapping.csv\npathogena download a5w2e8.mapping.csv --filenames main_report.json,speciation_report.json\n\n# Download the main report for one sample\npathogena download 3bf7d6f9-c883-4273-adc0-93bb96a499f6\n\n# Download the final assembly for one M. tuberculosis sample\npathogena download 3bf7d6f9-c883-4273-adc0-93bb96a499f6 --filenames final.fasta\n\n# Download the main report for two samples\npathogena download 3bf7d6f9-c883-4273-adc0-93bb96a499f6,6f004868-096b-4587-9d50-b13e09d01882\n\n# Save downloaded files to a specific directory\npathogena download a5w2e8.mapping.csv --output-dir results\n\n# Download only input fastqs\npathogena download a5w2e8.mapping.csv --inputs --filenames \"\"\n```\n\nThe complete list of `--filenames` available for download varies by sample, and can be found in the Downloads section of\nsample view pages in EIT Pathogena.\n## `pathogena validate`\n\n```text\n$ pathogena validate -h\n16:00:13 INFO: EIT Pathogena client version 2.0.0rc1\nUsage: pathogena validate [OPTIONS] UPLOAD_CSV\n\n  Validate a given upload CSV.\n\nOptions:\n  --host TEXT  API hostname (for development)\n  -h, --help   Show this message and exit.\n```\n\nThe `validate` command will check that a Batch can be created from a given CSV and if your user account has permission\nto upload the samples, the individual FastQ files are then checked for validity. These checks are already performed\nby default with the `upload` command but using this can ensure validity without commiting to the subsequent upload\nif you're looking to check a CSV during writing it.\n## `pathogena query-raw`\n\n```text\npathogena query-raw -h\n15:36:39 INFO: EIT Pathogena client version 2.0.0rc1\nUsage: pathogena query-raw [OPTIONS] SAMPLES\n\n  Fetch metadata for one or more SAMPLES in JSON format.\n  SAMPLES should be command separated list of GUIDs or path to mapping CSV.\n\nOptions:\n  --host TEXT  API hostname (for development)\n  -h, --help   Show this message and exit.\n```\n\nThe `query-raw` command fetches either the raw metadata of one more samples given a mapping CSV\ngenerated during upload, or one or more sample GUIDs.\n\n### Usage\n\n```bash\n# Query all available metadata in JSON format\npathogena query-raw a5w2e8.mapping.csv\n```\n## `pathogena query-status`\n\n```text\npathogena query-status -h\n15:36:39 INFO: EIT Pathogena client version 2.0.0rc1\nUsage: pathogena query-status [OPTIONS] SAMPLES\n\n  Fetch processing status for one or more SAMPLES in JSON format.\n  SAMPLES should be command separated list of GUIDs or path to mapping CSV.\n\nOptions:\n  --host TEXT  API hostname (for development)\n  -h, --help   Show this message and exit.\n```\n\nThe `query-status` command fetches the current processing status of one or more samples in a mapping CSV\ngenerated during upload, or one or more sample GUIDs.\n\n### Usage\n\n```bash\n# Query the processing status of all samples in a5w2e8.mapping.csv\npathogena query-status a5w2e8.mapping.csv\n\n# Query the processing status of a single sample\npathogena query-status 3bf7d6f9-c883-4273-adc0-93bb96a499f6\n```\n## `pathogena autocomplete`\n\nThis command will output the steps required to enable auto-completion in either a Bash or ZSH shell, follow the output\nto enable autocompletion, this will need to be executed on every new shell session, instructions are provided on how to\nmake this permanent depending on your environment. More information and instructions for other shells can be found in\nthe [Click documentation](https://click.palletsprojects.com/en/8.1.x/shell-completion/).\n\n### Usage\n\n```bash\n$ pathogena autocomplete\nRun this command to enable autocompletion:\n    eval \"$(_PATHOGENA_COMPLETE=bash_source pathogena)\"\nAdd this to your ~/.bashrc file to enable this permanently:\n    command -v pathogena > /dev/null 2>&1 && eval \"$(_PATHOGENA_COMPLETE=bash_source pathogena)\"\n```\n\nTab completion can optionally be enabled by adding the lines output by the command to your shell source files.\nThis will enable the ability to press tab after writing `pathogena ` to list possible sub-commands. It can also be used\nfor sub-command options, if `--` is entered prior to pressing tab.\n\n## Support\n\nFor technical support, please open an issue or contact pathogena.support@eit.org\n",
    "bugtrack_url": null,
    "license": null,
    "summary": "The command line and Python client for EIT Pathogena.",
    "version": "2.0.1",
    "project_urls": null,
    "split_keywords": [
        "pathogen",
        " pathogena",
        " eit",
        " gpas"
    ],
    "urls": [
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "6785090afa327326edda5e38f6eb39df30dd06b053f49cb5669d6a448ef6b5c5",
                "md5": "eefded6cf34ba963259d6c233d12ed90",
                "sha256": "10e616cabb156dd6bb56e521c895203e3a92ae1a306229754a16cf79bd9724fe"
            },
            "downloads": -1,
            "filename": "pathogena-2.0.1-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "eefded6cf34ba963259d6c233d12ed90",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.10",
            "size": 28804,
            "upload_time": "2024-11-07T16:43:30",
            "upload_time_iso_8601": "2024-11-07T16:43:30.871554Z",
            "url": "https://files.pythonhosted.org/packages/67/85/090afa327326edda5e38f6eb39df30dd06b053f49cb5669d6a448ef6b5c5/pathogena-2.0.1-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "24b65345e95cc94058930a06467e87359619d2dc30c2e9274f75df1f50c18ac2",
                "md5": "51910bea773b36ecee2845d4ff64fb0b",
                "sha256": "262ac953e835851735a7400122bb524b2da4b4eedb91fc81bcc92735823214e9"
            },
            "downloads": -1,
            "filename": "pathogena-2.0.1.tar.gz",
            "has_sig": false,
            "md5_digest": "51910bea773b36ecee2845d4ff64fb0b",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.10",
            "size": 85982,
            "upload_time": "2024-11-07T16:43:33",
            "upload_time_iso_8601": "2024-11-07T16:43:33.207258Z",
            "url": "https://files.pythonhosted.org/packages/24/b6/5345e95cc94058930a06467e87359619d2dc30c2e9274f75df1f50c18ac2/pathogena-2.0.1.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-11-07 16:43:33",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "pathogena"
}

None