# mgnify-pipelines-toolkit

- Version: 0.1.2
- Summary: Collection of scripts and tools for MGnify pipelines
- Upload time: 2024-04-26 11:44:28
- Requires Python: >=3.9
- License: Apache Software License 2.0
- Keywords: bioinformatics, pipelines, metagenomics

This Python package contains a collection of scripts and tools for use in MGnify pipelines. Scripts stored here are mainly:

- One-off production scripts that perform specific tasks in pipelines
- Scripts that have few dependencies
- Scripts that don't have existing containers built to run them
- Scripts for which building a dedicated container would be too heavyweight a solution to deploy in pipelines

This package is built and uploaded to PyPI and bioconda. It bundles the scripts and makes them executable from the command line once the package is installed.

## How to install

This package is available on both [PyPI](https://pypi.org/project/mgnify-pipelines-toolkit/) and bioconda.

To install from PyPI with pip:

`pip install mgnify-pipelines-toolkit`

To install from bioconda with conda/mamba:

`conda install -c bioconda mgnify-pipelines-toolkit`

You should then be able to run the bundled scripts from the command line. For example, to run the `get_subunits.py` script:

`get_subunits -i ${easel_coords} -n ${meta.id}`

(the `${...}` placeholders are variables interpolated by the calling pipeline at runtime)

## Adding a new script to the package

### New script requirements

There are a few requirements for your script:

- It needs to have a named main function of some kind. See the `main()` function in `mgnify_pipelines_toolkit/analysis/shared/get_subunits.py` for an example
- Because this package is meant to be run from the command line, make sure your script can parse arguments using a tool like `argparse` or `click`
- A small number of dependencies. This requirement is subjective, but if your script only requires a handful of common packages like `Biopython`, `numpy`, or `pandas`, it's fine. However, if the script has a more extensive list of dependencies, a container is probably a better fit.
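A script meeting these requirements can be sketched as follows; the option names and behaviour here are illustrative, not taken from the package:

```python
import argparse


def parse_args(argv=None):
    # Declare the command-line interface so the script can run as a console entry point
    parser = argparse.ArgumentParser(description="Example MGnify toolkit script")
    parser.add_argument("-i", "--input", required=True, help="Path to the input file")
    parser.add_argument("-n", "--name", required=True, help="Sample or run name")
    return parser.parse_args(argv)


def main():
    # Named entry-point function, referenced later from [project.scripts] in pyproject.toml
    args = parse_args()
    print(f"Processing {args.input} for sample {args.name}")


if __name__ == "__main__":
    main()
```

Keeping the parsing in a separate `parse_args` function also makes the script easy to unit test without spawning a subprocess.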

### How to add a new script

To add a new Python script, first copy it into the `mgnify_pipelines_toolkit` directory in this repository, specifically into the subdirectory that makes the most sense. If none of the subdirectories fits your script, create a new one. If your script doesn't have a `main()`-style function yet, write one.

Then, open `pyproject.toml`, which needs a few additions. First, add any missing dependencies (with pinned versions) to the `dependencies` field.

Then, if you created a new subdirectory for your script, go to the `packages` line under `[tool.setuptools]` and add the new subdirectory following the same syntax.
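For example, after adding pinned dependencies and a hypothetical new `annotation` subdirectory, the relevant sections of `pyproject.toml` might look like this (the version pins and the subpackage name are illustrative):

```toml
[project]
dependencies = [
    "biopython==1.83",
    "pandas==2.2.2",
]

[tool.setuptools]
packages = [
    "mgnify_pipelines_toolkit",
    "mgnify_pipelines_toolkit.analysis.shared",
    "mgnify_pipelines_toolkit.analysis.annotation",
]
```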

Then, scroll down to the `[project.scripts]` section. Here you will create an alias command for running your script from the command line. In the example line:

`get_subunits = "mgnify_pipelines_toolkit.analysis.shared.get_subunits:main"`

- `get_subunits` is the alias
- `mgnify_pipelines_toolkit.analysis.shared.get_subunits` links the alias to the script at the path `mgnify_pipelines_toolkit/analysis/shared/get_subunits.py`
- `:main` specifies that the function named `main()` is called when the alias is run.

When you have set up this command, executing `get_subunits` on the command line will be equivalent to running:

`from mgnify_pipelines_toolkit.analysis.shared.get_subunits import main; main()`

You should then write at least one unit test for your addition; this package currently uses `pytest` for that purpose. A GitHub Actions workflow runs all of the unit tests whenever a commit is pushed to any branch.
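A minimal `pytest` test file might look like the following; the `reverse_complement` helper is a stand-in for whatever function your script actually exposes (illustrative, not part of the package):

```python
# test_example.py -- pytest collects files named test_*.py and runs
# every function whose name starts with test_, reporting failed asserts.


def reverse_complement(seq: str) -> str:
    # Stand-in for a function you would import from your toolkit script
    complement = {"A": "T", "T": "A", "C": "G", "G": "C"}
    return "".join(complement[base] for base in reversed(seq))


def test_reverse_complement():
    assert reverse_complement("ATCG") == "CGAT"


def test_reverse_complement_is_an_involution():
    # Applying the function twice should return the original sequence
    assert reverse_complement(reverse_complement("GATTACA")) == "GATTACA"
```

Running `pytest` from the repository root will discover and run any such file automatically.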

Finally, you will need to bump the version in the `version` line of `pyproject.toml`.

At the moment, these are the only steps required to set up your script in this package (though this is subject to change).

### Building and uploading to PyPI

Building and pushing the package is automated by GitHub Actions and runs only on a new release. Bioconda should then automatically pick up the new PyPI release and add it to their recipes, though it's worth keeping an eye on their [automated pull requests](https://github.com/bioconda/bioconda-recipes/pulls) just in case.

            
