ictasks


Nameictasks JSON
Version 0.2.4 PyPI version JSON
download
home_pageNone
SummaryA utility to run multiple small tasks in a single HPC job submission.
upload_time2025-09-13 12:18:08
maintainerNone
docs_urlNone
authorNone
requires_python>=3.10
licenseNone
keywords slurm hpc tasks mpi
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # ictasks

`ictasks` is a library to support running collections of small tasks in parallel, from a laptop through to a HPC cluster.

It is being developed to support projects and workflow standarisation at the Irish Centre for High End Computing (ICHEC).

There are many libraries like this, this one focuses on:

* being small and simple - easy for ICHEC researchers to understand all elements of it
* having low dependencies - can be installed with `pip`
* having an easy to use configuration format with `yaml`
* having verbose and structured reporting of task related metadata for analysis

# Features #

The library allows a few different ways to specify tasks.

1) As a Python package

You can create a collection of `Task` instances and launch them with the `session.run()` function. 

2) Using a `tasklist.dat` file and the command line interface (CLI):

``` shell
ictasks taskfarm --tasklist $PATH_TO_TASKLIST
```

See the `test/data/tasklist.dat` file for an example input with two small tasks.

3) Using a `yaml` config file, giving task and environment info.

``` yaml
job_id: my_job
task_distribution:
    cores_per_node: 1

tasks:
    items:
    - id: 0
      launch_cmd: "echo 'hello from task 0'"
    - id: 1
      launch_cmd: "echo 'hello from task 1'"
```

we can run this with:

``` shell
ictasks taskfarm --config my_config.yaml
```

Launching the run will launch the task in that directory and output a status file `task.json`. By default all processors on the machine (or compute node) will be assigned tasks.


## task_distribution ##

There are 5 variables available in `task_distribution`

 - `cores_per_node=0`
 - `threads_per_core=0`
 - `cores_per_task=1`
 - `gpus_per_node=0`
 - `gpus_per_task=0`

If left unspecified, both `cores_per_task` and `gpus_per_task` will be automatically detected and set.

The number of active `worker`s will be set depending on these resource limitations, with gpu limits being hit first if needed. These workers can then execute tasks in parallel.

### gpus in your tasks ##

When taskfarming with gpus, there will be a `gpu_ids.txt` file in each task directory which contains a gpu_id per line, the user's program must then use this file to specify which gpu(s) to use. The function `ictasks.task.get_gpu_ids` can be used to retrieve the ids in this file as a Python list. See [an example here](./test/mocks/gpu_script.py).


# Installing #

The package can be installed from PyPI:

``` shell
pip install ictasks
```

# Contact #

Our Gitlab hosting arrangement doesn't allow us to easily accept external contributions or feedback, however they are still welcome.

In future if there is interest 


If you have any feedback on this library 

# License #

This package is Coypright of the Irish Centre for High End Computing. It can be used under the terms of the GNU Public License (GPL v3+). See the included `LICENSE.txt` file for details.

If you are an ICHEC collaborator or user of the National Service different licensing terms can be offered - please get in touch to discuss.






            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "ictasks",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.10",
    "maintainer_email": null,
    "keywords": "Slurm, HPC, tasks, MPI",
    "author": null,
    "author_email": "\"James Grogan, Irish Centre for High End Computing\" <james.grogan@ichec.ie>",
    "download_url": "https://files.pythonhosted.org/packages/0a/b3/6dcbca9751b75f665df784c18368c3e055caff9ec568489a38c444b3bde1/ictasks-0.2.4.tar.gz",
    "platform": null,
    "description": "# ictasks\n\n`ictasks` is a library to support running collections of small tasks in parallel, from a laptop through to a HPC cluster.\n\nIt is being developed to support projects and workflow standarisation at the Irish Centre for High End Computing (ICHEC).\n\nThere are many libraries like this, this one focuses on:\n\n* being small and simple - easy for ICHEC researchers to understand all elements of it\n* having low dependencies - can be installed with `pip`\n* having an easy to use configuration format with `yaml`\n* having verbose and structured reporting of task related metadata for analysis\n\n# Features #\n\nThe library allows a few different ways to specify tasks.\n\n1) As a Python package\n\nYou can create a collection of `Task` instances and launch them with the `session.run()` function. \n\n2) Using a `tasklist.dat` file and the command line interface (CLI):\n\n``` shell\nictasks taskfarm --tasklist $PATH_TO_TASKLIST\n```\n\nSee the `test/data/tasklist.dat` file for an example input with two small tasks.\n\n3) Using a `yaml` config file, giving task and environment info.\n\n``` yaml\njob_id: my_job\ntask_distribution:\n    cores_per_node: 1\n\ntasks:\n    items:\n    - id: 0\n      launch_cmd: \"echo 'hello from task 0'\"\n    - id: 1\n      launch_cmd: \"echo 'hello from task 1'\"\n```\n\nwe can run this with:\n\n``` shell\nictasks taskfarm --config my_config.yaml\n```\n\nLaunching the run will launch the task in that directory and output a status file `task.json`. By default all processors on the machine (or compute node) will be assigned tasks.\n\n\n## task_distribution ##\n\nThere are 5 variables available in `task_distribution`\n\n - `cores_per_node=0`\n - `threads_per_core=0`\n - `cores_per_task=1`\n - `gpus_per_node=0`\n - `gpus_per_task=0`\n\nIf left unspecified, both `cores_per_task` and `gpus_per_task` will be automatically detected and set.\n\nThe number of active `worker`s will be set depending on these resource limitations, with gpu limits being hit first if needed. These workers can then execute tasks in parallel.\n\n### gpus in your tasks ##\n\nWhen taskfarming with gpus, there will be a `gpu_ids.txt` file in each task directory which contains a gpu_id per line, the user's program must then use this file to specify which gpu(s) to use. The function `ictasks.task.get_gpu_ids` can be used to retrieve the ids in this file as a Python list. See [an example here](./test/mocks/gpu_script.py).\n\n\n# Installing #\n\nThe package can be installed from PyPI:\n\n``` shell\npip install ictasks\n```\n\n# Contact #\n\nOur Gitlab hosting arrangement doesn't allow us to easily accept external contributions or feedback, however they are still welcome.\n\nIn future if there is interest \n\n\nIf you have any feedback on this library \n\n# License #\n\nThis package is Coypright of the Irish Centre for High End Computing. It can be used under the terms of the GNU Public License (GPL v3+). See the included `LICENSE.txt` file for details.\n\nIf you are an ICHEC collaborator or user of the National Service different licensing terms can be offered - please get in touch to discuss.\n\n\n\n\n\n",
    "bugtrack_url": null,
    "license": null,
    "summary": "A utility to run multiple small tasks in a single HPC job submission.",
    "version": "0.2.4",
    "project_urls": {
        "Homepage": "https://git.ichec.ie/performance/toolshed/ictasks",
        "Repository": "https://git.ichec.ie/performance/toolshed/ictasks"
    },
    "split_keywords": [
        "slurm",
        " hpc",
        " tasks",
        " mpi"
    ],
    "urls": [
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "83ec67c278152a40fdf48c42db655e8cb2982d93c7554c50d3d09033d27c76fe",
                "md5": "5088b5ab5d34824cbe8b1db647824ddd",
                "sha256": "e8eff08754c5a2ec2668ae959e6c4277cf528478871b692a5e1567bd6526ed7c"
            },
            "downloads": -1,
            "filename": "ictasks-0.2.4-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "5088b5ab5d34824cbe8b1db647824ddd",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.10",
            "size": 30350,
            "upload_time": "2025-09-13T12:18:06",
            "upload_time_iso_8601": "2025-09-13T12:18:06.403359Z",
            "url": "https://files.pythonhosted.org/packages/83/ec/67c278152a40fdf48c42db655e8cb2982d93c7554c50d3d09033d27c76fe/ictasks-0.2.4-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "0ab36dcbca9751b75f665df784c18368c3e055caff9ec568489a38c444b3bde1",
                "md5": "a223ab713712495903397b4b67833821",
                "sha256": "2f771fc55fc5a6a56835e1181ede6e7d9967f7f58d5ffbdc6debc7f0e11c7aec"
            },
            "downloads": -1,
            "filename": "ictasks-0.2.4.tar.gz",
            "has_sig": false,
            "md5_digest": "a223ab713712495903397b4b67833821",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.10",
            "size": 28722,
            "upload_time": "2025-09-13T12:18:08",
            "upload_time_iso_8601": "2025-09-13T12:18:08.050290Z",
            "url": "https://files.pythonhosted.org/packages/0a/b3/6dcbca9751b75f665df784c18368c3e055caff9ec568489a38c444b3bde1/ictasks-0.2.4.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-09-13 12:18:08",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "ictasks"
}
        
Elapsed time: 2.19888s