bamboost


Namebamboost JSON
Version 0.10.3 PyPI version JSON
download
home_pageNone
SummaryA light weight scientific data management system using HDF5
upload_time2025-10-08 10:45:11
maintainerNone
docs_urlNone
authorNone
requires_python>=3.10
licenseNone
keywords database database-manager hdf5
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            <a id="readme-top"></a>


<br />
<div align="center">
  <a href="https://gitlab.com/cmbm-ethz/bamboost">
	<img src="https://gitlab.com/cmbm-ethz/bamboost/-/raw/main/assets/bamboost_icon.png?ref_type=heads" width="150" alt="Logo"/><br/>
  </a>

<h3 align="center">BAMBOOST</h3>

  <p align="center">
    Bamboost is a Python library built for datamanagement using
    the HDF5 file format.
    bamboost stands for a <span style="font-weight: bold;">lightweight</span> shelf which will <span style="font-weight: bold">boost</span> your efficiency and which
    will totally break if you load it heavily. Just kidding, bamboo can fully carry pandas. <br/>
    🐼🐼🐼🐼
    <br />

[![Docs][docs-shield]][docs-url]
[![Pipeline][pipeline-shield]][pipeline-url]
[![Coverage][coverage-shield]][coverage-url]
[![Pypi][pypi-shield]][pypi-url]
[![PyPI_downloads][pypi-downloads-shield]][pypi-downloads-url]
[![License][license-shield]][license-url]


<a href="https://bamboost.ch/docs"><strong>Explore the docs Β»</strong></a>
<br />
<br />
<a href="https://github.com/zrlf/bamboost-tui">Terminal User Interface (TUI)</a>
&middot;
<a href="https://github.com/zrlf/bamboost-docs">Doc site repo</a>
<br />
<br />
<a href="https://gitlab.com/cmbm-ethz/bamboost/issues/new?labels=bug&template=bug-report---.md">Report Bug</a>
&middot;
<a href="https://gitlab.com/cmbm-ethz/bamboost/issues/new?labels=enhancement&template=feature-request---.md">Request Feature</a>

  </p>
</div>

> [!important]
> Starting from version 0.10.0, bamboost breaks compatibility with previous versions.
> For previous versions, checkout the [legacy branch](https://gitlab.com/cmbm-ethz/bamboost/-/tree/legacy).

<details>
  <summary>Table of Contents</summary>

[[_TOC_]]

</details>



## About The Project

**bamboost** is a python data framework designed for managing scientific simulation data.
It provides an organized model for storing, indexing, and retrieving
simulation results, making it easier to work with large-scale computational
studies. 
In its core, it is a filesystem storage model, providing directories for
**simulations**, bundled in **collections**. 

### Principles
- **Independence:** Any dataset must be complete and understandable on it's own. You can copy or extract any of your data and distribute it without external dependencies.
- **Path redundancy:** Data must be referencable without knowledge of it's path. This serves several purposes: You can share your data easily ($e.g.$ supplementary material for papers), and renaming directories, moving files, switching computer, etc. will not break your data referencing.

This leads to the following requirements:
- Simulation parameters must be stored locally, inside the simulation directory. Crucially, not _exclusively_ in a global database of any kind.
- Collections must have unique identifiers that are independent of its path.
- Simulations must have unique identifiers that are independent of its path.

### Concept

We organize **simulations** in **collections** within structured
directories. 
Let's consider the following directory:

```
test_data/
β”œβ”€β”€ simulation_1/
β”‚   β”œβ”€β”€ data.h5
β”‚   β”œβ”€β”€ data.xdmf
β”‚   β”œβ”€β”€ additional_file_1.txt
β”‚   β”œβ”€β”€ additional_file_2.csv
β”œβ”€β”€ simulation_2/
β”‚   β”œβ”€β”€ data.h5
β”‚   β”œβ”€β”€ additional_file_3.txt
└── .bamboost-collection-ABCD1234
```

This is a valid `bamboost` collection at the path `./test_data`. It contains an
identifier file giving this collection a unique identifier. In this case, it is
`ABCD1234`.
This file defines the unique ID of the collection.

It contains two entries; `simulation_1` and `simulation_2`.
As you can see, each simulation owns a directory inside a collection. 
The directory names are simultaneously used as their _name_ as well as their _ID_.
The unique identifier for a single simulation becomes the combination of the collection _ID_ that it belongs to and the simulation _ID_.
That means, the full identifier of `simulation_1` is `ABCD1234:simulation_1`.

Each simulation contains a central _HDF5 file_ named `data.h5`. This file is used to store the _parameters_, as well as generated data.
The simulation API of `bamboost` provides extensive functionality to store and retrieve data from this file. However, users are not limited to this file, or using `python` in general.
The reason why simulations are directories instead of just a single HDF file is that you can dump any file that belongs to this simulation into its path. This can be output from 3rd party software (think LAMMPS), additional input files such as images, and also scripts to reproduce the generated data.


<p align="right">(<a href="#readme-top">back to top</a>)</p>


## Getting Started

**bamboost** is available from the Python Package Index (PyPI) and can be installed using `pip` (or `uv` of course):

```sh
pip install bamboost
```

### Prerequisites

To use **bamboost** with MPI, you need a working MPI installation. Additionally, you need
- `mpi4py`
- `h5py` with MPI support. For most distros you can get a version from it's package manager, or see [building against Parallel HDF5](https://docs.h5py.org/en/stable/mpi.html#building-against-parallel-hdf5)

### Installation

**bamboost** is available from the Python Package Index (PyPI) and can be installed using `pip` (or `uv` of course):

```sh
pip install bamboost
```

To install the latest version from this repository, you can use:

```sh
pip install git+https://gitlab.com/cmbm-ethz/bamboost.git
```

<p align="right">(<a href="#readme-top">back to top</a>)</p>



## Usage

For a getting started guide, please see here: [Getting started](https://bamboost.ch/docs/getting_started)

<p align="right">(<a href="#readme-top">back to top</a>)</p>



<!-- ROADMAP -->
## Roadmap

- [ ] Clear MPI handling

See the [open issues](https://gitlab.com/cmbm-ethz/bamboost/issues) for a full list of proposed features (and known issues).

<p align="right">(<a href="#readme-top">back to top</a>)</p>



<!-- CONTRIBUTING -->
## Contributing

If you have a suggestion that would make this better, please fork the repo and create a pull request. You can also simply open an issue with the tag "enhancement".
Don't forget to give the project a star! Thanks again!

1. Fork the Project
2. Create your Feature Branch (`git checkout -b feature/AmazingFeature`)
3. Commit your Changes (`git commit -m 'Add some AmazingFeature'`)
4. Push to the Branch (`git push origin feature/AmazingFeature`)
5. Open a Pull Request

<p align="right">(<a href="#readme-top">back to top</a>)</p>



<!-- LICENSE -->
## License

Distributed under the **MIT license**. See [LICENSE](https://gitlab.com/cmbm-ethz/bamboost/-/blob/main/LICENSE) for more information.

<p align="right">(<a href="#readme-top">back to top</a>)</p>



<!-- CONTACT -->
## Contact

zrlf - forez@ethz.ch

Project Link: [https://gitlab.com/cmbm-ethz/bamboost](https://gitlab.com/cmbm-ethz/bamboost)

<p align="right">(<a href="#readme-top">back to top</a>)</p>



<!-- ACKNOWLEDGMENTS -->
## Acknowledgments

* [Best-README-Template](https://github.com/othneildrew/Best-README-Template) for inspiration on how to structure this README.

<p align="right">(<a href="#readme-top">back to top</a>)</p>



<!-- MARKDOWN LINKS & IMAGES -->
<!-- https://www.markdownguide.org/basic-syntax/#reference-style-links -->
[docs-shield]: https://img.shields.io/badge/Docs-bamboost.ch-blue?style=flat-square
[docs-url]: https://bamboost.ch
[pipeline-shield]: https://gitlab.com/cmbm-ethz/bamboost/badges/main/pipeline.svg?style=flat-square
[pipeline-url]: https://gitlab.com/cmbm-ethz/bamboost/-/commits/main
[coverage-shield]: https://gitlab.com/cmbm-ethz/bamboost/badges/main/coverage.svg?style=flat-square
[coverage-url]: https://cmbm-ethz.gitlab.io/bamboost/
[pypi-shield]: https://img.shields.io/pypi/v/bamboost?style=flat-square
[pypi-url]: https://pypi.org/project/bamboost/
[pypi-downloads-shield]: https://img.shields.io/pypi/dm/bamboost?style=flat-square
[pypi-downloads-url]: https://pypi.org/project/bamboost/
[license-shield]: https://img.shields.io/pypi/l/bamboost?style=flat-square
[license-url]: https://gitlab.com/cmbm-ethz/bamboost/-/blob/main/LICENSE



            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "bamboost",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.10",
    "maintainer_email": null,
    "keywords": "database, database-manager, hdf5",
    "author": null,
    "author_email": "Flavio Lorez <florez@ethz.ch>",
    "download_url": "https://files.pythonhosted.org/packages/10/66/f9123861886b953250c4abe59ecca94111dbed3ed627eabea65b420be521/bamboost-0.10.3.tar.gz",
    "platform": null,
    "description": "<a id=\"readme-top\"></a>\n\n\n<br />\n<div align=\"center\">\n  <a href=\"https://gitlab.com/cmbm-ethz/bamboost\">\n\t<img src=\"https://gitlab.com/cmbm-ethz/bamboost/-/raw/main/assets/bamboost_icon.png?ref_type=heads\" width=\"150\" alt=\"Logo\"/><br/>\n  </a>\n\n<h3 align=\"center\">BAMBOOST</h3>\n\n  <p align=\"center\">\n    Bamboost is a Python library built for datamanagement using\n    the HDF5 file format.\n    bamboost stands for a <span style=\"font-weight: bold;\">lightweight</span> shelf which will <span style=\"font-weight: bold\">boost</span> your efficiency and which\n    will totally break if you load it heavily. Just kidding, bamboo can fully carry pandas. <br/>\n    \ud83d\udc3c\ud83d\udc3c\ud83d\udc3c\ud83d\udc3c\n    <br />\n\n[![Docs][docs-shield]][docs-url]\n[![Pipeline][pipeline-shield]][pipeline-url]\n[![Coverage][coverage-shield]][coverage-url]\n[![Pypi][pypi-shield]][pypi-url]\n[![PyPI_downloads][pypi-downloads-shield]][pypi-downloads-url]\n[![License][license-shield]][license-url]\n\n\n<a href=\"https://bamboost.ch/docs\"><strong>Explore the docs \u00bb</strong></a>\n<br />\n<br />\n<a href=\"https://github.com/zrlf/bamboost-tui\">Terminal User Interface (TUI)</a>\n&middot;\n<a href=\"https://github.com/zrlf/bamboost-docs\">Doc site repo</a>\n<br />\n<br />\n<a href=\"https://gitlab.com/cmbm-ethz/bamboost/issues/new?labels=bug&template=bug-report---.md\">Report Bug</a>\n&middot;\n<a href=\"https://gitlab.com/cmbm-ethz/bamboost/issues/new?labels=enhancement&template=feature-request---.md\">Request Feature</a>\n\n  </p>\n</div>\n\n> [!important]\n> Starting from version 0.10.0, bamboost breaks compatibility with previous versions.\n> For previous versions, checkout the [legacy branch](https://gitlab.com/cmbm-ethz/bamboost/-/tree/legacy).\n\n<details>\n  <summary>Table of Contents</summary>\n\n[[_TOC_]]\n\n</details>\n\n\n\n## About The Project\n\n**bamboost** is a python data framework designed for managing scientific simulation data.\nIt provides an organized model for storing, indexing, and retrieving\nsimulation results, making it easier to work with large-scale computational\nstudies. \nIn its core, it is a filesystem storage model, providing directories for\n**simulations**, bundled in **collections**. \n\n### Principles\n- **Independence:** Any dataset must be complete and understandable on it's own. You can copy or extract any of your data and distribute it without external dependencies.\n- **Path redundancy:** Data must be referencable without knowledge of it's path. This serves several purposes: You can share your data easily ($e.g.$ supplementary material for papers), and renaming directories, moving files, switching computer, etc. will not break your data referencing.\n\nThis leads to the following requirements:\n- Simulation parameters must be stored locally, inside the simulation directory. Crucially, not _exclusively_ in a global database of any kind.\n- Collections must have unique identifiers that are independent of its path.\n- Simulations must have unique identifiers that are independent of its path.\n\n### Concept\n\nWe organize **simulations** in **collections** within structured\ndirectories. \nLet's consider the following directory:\n\n```\ntest_data/\n\u251c\u2500\u2500 simulation_1/\n\u2502   \u251c\u2500\u2500 data.h5\n\u2502   \u251c\u2500\u2500 data.xdmf\n\u2502   \u251c\u2500\u2500 additional_file_1.txt\n\u2502   \u251c\u2500\u2500 additional_file_2.csv\n\u251c\u2500\u2500 simulation_2/\n\u2502   \u251c\u2500\u2500 data.h5\n\u2502   \u251c\u2500\u2500 additional_file_3.txt\n\u2514\u2500\u2500 .bamboost-collection-ABCD1234\n```\n\nThis is a valid `bamboost` collection at the path `./test_data`. It contains an\nidentifier file giving this collection a unique identifier. In this case, it is\n`ABCD1234`.\nThis file defines the unique ID of the collection.\n\nIt contains two entries; `simulation_1` and `simulation_2`.\nAs you can see, each simulation owns a directory inside a collection. \nThe directory names are simultaneously used as their _name_ as well as their _ID_.\nThe unique identifier for a single simulation becomes the combination of the collection _ID_ that it belongs to and the simulation _ID_.\nThat means, the full identifier of `simulation_1` is `ABCD1234:simulation_1`.\n\nEach simulation contains a central _HDF5 file_ named `data.h5`. This file is used to store the _parameters_, as well as generated data.\nThe simulation API of `bamboost` provides extensive functionality to store and retrieve data from this file. However, users are not limited to this file, or using `python` in general.\nThe reason why simulations are directories instead of just a single HDF file is that you can dump any file that belongs to this simulation into its path. This can be output from 3rd party software (think LAMMPS), additional input files such as images, and also scripts to reproduce the generated data.\n\n\n<p align=\"right\">(<a href=\"#readme-top\">back to top</a>)</p>\n\n\n## Getting Started\n\n**bamboost** is available from the Python Package Index (PyPI) and can be installed using `pip` (or `uv` of course):\n\n```sh\npip install bamboost\n```\n\n### Prerequisites\n\nTo use **bamboost** with MPI, you need a working MPI installation. Additionally, you need\n- `mpi4py`\n- `h5py` with MPI support. For most distros you can get a version from it's package manager, or see [building against Parallel HDF5](https://docs.h5py.org/en/stable/mpi.html#building-against-parallel-hdf5)\n\n### Installation\n\n**bamboost** is available from the Python Package Index (PyPI) and can be installed using `pip` (or `uv` of course):\n\n```sh\npip install bamboost\n```\n\nTo install the latest version from this repository, you can use:\n\n```sh\npip install git+https://gitlab.com/cmbm-ethz/bamboost.git\n```\n\n<p align=\"right\">(<a href=\"#readme-top\">back to top</a>)</p>\n\n\n\n## Usage\n\nFor a getting started guide, please see here: [Getting started](https://bamboost.ch/docs/getting_started)\n\n<p align=\"right\">(<a href=\"#readme-top\">back to top</a>)</p>\n\n\n\n<!-- ROADMAP -->\n## Roadmap\n\n- [ ] Clear MPI handling\n\nSee the [open issues](https://gitlab.com/cmbm-ethz/bamboost/issues) for a full list of proposed features (and known issues).\n\n<p align=\"right\">(<a href=\"#readme-top\">back to top</a>)</p>\n\n\n\n<!-- CONTRIBUTING -->\n## Contributing\n\nIf you have a suggestion that would make this better, please fork the repo and create a pull request. You can also simply open an issue with the tag \"enhancement\".\nDon't forget to give the project a star! Thanks again!\n\n1. Fork the Project\n2. Create your Feature Branch (`git checkout -b feature/AmazingFeature`)\n3. Commit your Changes (`git commit -m 'Add some AmazingFeature'`)\n4. Push to the Branch (`git push origin feature/AmazingFeature`)\n5. Open a Pull Request\n\n<p align=\"right\">(<a href=\"#readme-top\">back to top</a>)</p>\n\n\n\n<!-- LICENSE -->\n## License\n\nDistributed under the **MIT license**. See [LICENSE](https://gitlab.com/cmbm-ethz/bamboost/-/blob/main/LICENSE) for more information.\n\n<p align=\"right\">(<a href=\"#readme-top\">back to top</a>)</p>\n\n\n\n<!-- CONTACT -->\n## Contact\n\nzrlf - forez@ethz.ch\n\nProject Link: [https://gitlab.com/cmbm-ethz/bamboost](https://gitlab.com/cmbm-ethz/bamboost)\n\n<p align=\"right\">(<a href=\"#readme-top\">back to top</a>)</p>\n\n\n\n<!-- ACKNOWLEDGMENTS -->\n## Acknowledgments\n\n* [Best-README-Template](https://github.com/othneildrew/Best-README-Template) for inspiration on how to structure this README.\n\n<p align=\"right\">(<a href=\"#readme-top\">back to top</a>)</p>\n\n\n\n<!-- MARKDOWN LINKS & IMAGES -->\n<!-- https://www.markdownguide.org/basic-syntax/#reference-style-links -->\n[docs-shield]: https://img.shields.io/badge/Docs-bamboost.ch-blue?style=flat-square\n[docs-url]: https://bamboost.ch\n[pipeline-shield]: https://gitlab.com/cmbm-ethz/bamboost/badges/main/pipeline.svg?style=flat-square\n[pipeline-url]: https://gitlab.com/cmbm-ethz/bamboost/-/commits/main\n[coverage-shield]: https://gitlab.com/cmbm-ethz/bamboost/badges/main/coverage.svg?style=flat-square\n[coverage-url]: https://cmbm-ethz.gitlab.io/bamboost/\n[pypi-shield]: https://img.shields.io/pypi/v/bamboost?style=flat-square\n[pypi-url]: https://pypi.org/project/bamboost/\n[pypi-downloads-shield]: https://img.shields.io/pypi/dm/bamboost?style=flat-square\n[pypi-downloads-url]: https://pypi.org/project/bamboost/\n[license-shield]: https://img.shields.io/pypi/l/bamboost?style=flat-square\n[license-url]: https://gitlab.com/cmbm-ethz/bamboost/-/blob/main/LICENSE\n\n\n",
    "bugtrack_url": null,
    "license": null,
    "summary": "A light weight scientific data management system using HDF5",
    "version": "0.10.3",
    "project_urls": {
        "Homepage": "https://bamboost.ch/",
        "changelog": "https://gitlab.com/cmbm-ethz/bamboost/-/tree/main/CHANGELOG.md",
        "repository": "https://gitlab.com/cmbm-ethz/bamboost"
    },
    "split_keywords": [
        "database",
        " database-manager",
        " hdf5"
    ],
    "urls": [
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "69f4c66bf66a1d45e2ae16ee529ddf306271d363c739d35f5b4d7b135fd51ffe",
                "md5": "a1a67a7d621a5d8732fe05e054a7683e",
                "sha256": "04fa260bf01ed64cebe97d222c21f68faa6341275cb20c84adb065a6d1772b90"
            },
            "downloads": -1,
            "filename": "bamboost-0.10.3-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "a1a67a7d621a5d8732fe05e054a7683e",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.10",
            "size": 212417,
            "upload_time": "2025-10-08T10:45:09",
            "upload_time_iso_8601": "2025-10-08T10:45:09.641532Z",
            "url": "https://files.pythonhosted.org/packages/69/f4/c66bf66a1d45e2ae16ee529ddf306271d363c739d35f5b4d7b135fd51ffe/bamboost-0.10.3-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "1066f9123861886b953250c4abe59ecca94111dbed3ed627eabea65b420be521",
                "md5": "1e988eca0cb2bec0d2b295079409d3cb",
                "sha256": "a727979e7503a4c650965db3e18e2b29fbe3a85af6a21b6a688d89e3f7e516f7"
            },
            "downloads": -1,
            "filename": "bamboost-0.10.3.tar.gz",
            "has_sig": false,
            "md5_digest": "1e988eca0cb2bec0d2b295079409d3cb",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.10",
            "size": 393123,
            "upload_time": "2025-10-08T10:45:11",
            "upload_time_iso_8601": "2025-10-08T10:45:11.012271Z",
            "url": "https://files.pythonhosted.org/packages/10/66/f9123861886b953250c4abe59ecca94111dbed3ed627eabea65b420be521/bamboost-0.10.3.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-10-08 10:45:11",
    "github": false,
    "gitlab": true,
    "bitbucket": false,
    "codeberg": false,
    "gitlab_user": "cmbm-ethz",
    "gitlab_project": "bamboost",
    "lcname": "bamboost"
}
        
Elapsed time: 4.39817s