# pshmem

- Version: 1.1.0
- Summary: Parallel shared memory and locking with MPI
- Home page: https://github.com/tskisner/pshmem
- Author: Theodore Kisner
- License: BSD
- Requires Python: >=3.8.0
- Uploaded: 2024-03-17 21:56:20
# MPI design patterns with shared memory

This is a small package that implements parallel design patterns using MPI one-sided and
shared memory constructs.

## Installation and Requirements

This package needs a recent version of the `mpi4py` package in order to be useful.
However, the classes also accept `None` for the communicator, in which case a trivial
local implementation is used.  The code uses other widely available packages (such as
numpy) and requires a recent Python 3 installation.  You can install the code from a
git checkout with:

    pip install .

Or:

    python3 setup.py install

Or install it directly from GitHub.
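
As noted above, `mpi4py` is optional at runtime: passing `None` for the communicator
selects a trivial local implementation.  Here is a minimal sketch of that serial
fallback, assuming the same `MPIShared` API used in the example below (the shape,
dtype, and offset are arbitrary illustration values):

```python
import numpy as np

from pshmem import MPIShared

# With comm=None there is no MPI; the "shared" array is just a
# process-local numpy buffer, but the same API applies.
with MPIShared((4, 4), np.float64, None) as shm:
    shm.set(np.ones((2, 2), dtype=np.float64), offset=(1, 1), fromrank=0)
    print(shm.data)
```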

## MPIShared Class

This class implements a pattern where a shared array is allocated on each node.
Processes can update pieces of the shared array with the synchronous `set()` method.
During this call, the data from the source process is first replicated to all nodes,
and then one process on each node copies that piece into the shared array.

All processes on all nodes can freely read data from the node-local copy of the shared
array.

### Example

You can use `MPIShared` as a context manager or by explicitly creating and freeing
memory.  Here is an example of creating a shared memory object that is replicated across
nodes:

```python
import numpy as np
from mpi4py import MPI

from pshmem import MPIShared

comm = MPI.COMM_WORLD

with MPIShared((3, 5), np.float64, comm) as shm:
    # A copy of the data exists on every node and is initialized to zero.
    # There is a numpy array "view" of that memory available with slice notation
    # or by accessing the "data" member:
    if comm.rank == 0:
        # You can get a summary of the data by printing it:
        print("String representation:\n")
        print(shm)
        print("\n===== Initialized Data =====")
    for p in range(comm.size):
        if p == comm.rank:
            print("rank {}:\n".format(p), shm.data, flush=True)
        comm.barrier()

    set_data = None
    set_offset = None
    if comm.rank == 0:
        set_data = np.arange(6, dtype=np.float64).reshape((2, 3))
        set_offset = (1, 1)

    # The set() method is collective, but the inputs only matter on one rank
    shm.set(set_data, offset=set_offset, fromrank=0)

    # You can also use the usual '[]' notation.  However, this call must do an
    # additional pre-communication to detect which process the data is coming from.
    # And this line is still collective and must be called on all processes:
    shm[set_offset] = set_data

    # This updated data has now been replicated to the shared memory on all nodes.
    if comm.rank == 0:
        print("======= Updated Data =======")
    for p in range(comm.size):
        if p == comm.rank:
            print("rank {}:\n".format(p), shm.data, flush=True)
        comm.barrier()

    # You can read from the node-local copy of the data from all processes,
    # using either the "data" member or slice access:
    if comm.rank == comm.size - 1:
        print("==== Read-only access ======")
        print("rank {}: shm[2, 3] = {}".format(comm.rank, shm[2, 3]), flush=True)
        print("rank {}: shm.data = \n{}".format(comm.rank, shm.data), flush=True)

```

Putting the above code into a file `test.py` and running it on 4 processes gives:

```
mpirun -np 4 python3 test.py

String representation:

<MPIShared
  replicated on 1 nodes, each with 4 processes (4 total)
  shape = (3, 5), dtype = float64
  [ [0. 0. 0. 0. 0.] [0. 0. 0. 0. 0.] [0. 0. 0. 0. 0.] ]
>

===== Initialized Data =====
rank 0:
 [[0. 0. 0. 0. 0.]
 [0. 0. 0. 0. 0.]
 [0. 0. 0. 0. 0.]]
rank 1:
 [[0. 0. 0. 0. 0.]
 [0. 0. 0. 0. 0.]
 [0. 0. 0. 0. 0.]]
rank 2:
 [[0. 0. 0. 0. 0.]
 [0. 0. 0. 0. 0.]
 [0. 0. 0. 0. 0.]]
rank 3:
 [[0. 0. 0. 0. 0.]
 [0. 0. 0. 0. 0.]
 [0. 0. 0. 0. 0.]]
======= Updated Data =======
rank 0:
 [[0. 0. 0. 0. 0.]
 [0. 0. 1. 2. 0.]
 [0. 3. 4. 5. 0.]]
rank 1:
 [[0. 0. 0. 0. 0.]
 [0. 0. 1. 2. 0.]
 [0. 3. 4. 5. 0.]]
rank 2:
 [[0. 0. 0. 0. 0.]
 [0. 0. 1. 2. 0.]
 [0. 3. 4. 5. 0.]]
rank 3:
 [[0. 0. 0. 0. 0.]
 [0. 0. 1. 2. 0.]
 [0. 3. 4. 5. 0.]]
==== Read-only access ======
rank 3: shm[2, 3] = 5.0
rank 3: shm.data =
[[0. 0. 0. 0. 0.]
 [0. 0. 1. 2. 0.]
 [0. 3. 4. 5. 0.]]
```

Note that if you are not using a context manager, you should be careful to close and
delete the object when you are done with it:

```python
shm = MPIShared((3, 5), np.float64, comm=comm)
# Do stuff
shm.close()
del shm
```

## MPILock Class

This class implements a mutex lock across an arbitrary communicator.  A memory buffer
on a single process acts as a waiting list to which processes can add themselves (using
one-sided calls).  Ownership of the lock is transferred by passing a token between
processes, in the order in which it was requested.

### Example

A typical use case is serializing some operation across a large number of processes
that reside on different nodes.  For example, perhaps we are making requests to an
external network from a computing center and do not want all processes to do so
simultaneously.  Or perhaps we are writing to a shared data file that does not support
parallel writes, and we have a sub-communicator of writing processes which take turns
updating the filesystem.  We can instantiate a lock on any communicator, so it is
possible to split the world communicator into groups and serialize some operation just
within each group:

```python
from mpi4py import MPI

from pshmem import MPILock

with MPILock(MPI.COMM_WORLD) as mpilock:
    mpilock.lock()
    # Do something here.  Only one process at a time will do this.
    mpilock.unlock()
```
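
As a sketch of the group pattern described above (assuming only the `MPILock` API shown
here; the group size of four is an arbitrary example), one can split the world
communicator and serialize only within each group:

```python
from mpi4py import MPI

from pshmem import MPILock

world = MPI.COMM_WORLD

# Split the world communicator into groups of (up to) four ranks each.
group = world.Split(color=world.rank // 4, key=world.rank)

with MPILock(group) as mpilock:
    mpilock.lock()
    # Only one process per group runs this region at a time; processes in
    # different groups are not serialized against each other.
    print("world rank {} (group rank {}) holds the lock".format(
        world.rank, group.rank), flush=True)
    mpilock.unlock()

group.Free()
```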

## Tests

After installation, you can run some tests with:

    mpirun -np 4 python3 -c 'import pshmem.test; pshmem.test.run()'

If you have mpi4py available but would like to explicitly disable the use of MPI in the
tests, you can set an environment variable:

    MPI_DISABLE=1 python3 -c 'import pshmem.test; pshmem.test.run()'

            
