<p align="center">
<a href="https://github.com/davemlz/cubo"><img src="https://github.com/davemlz/cubo/raw/main/docs/_static/logo.png" alt="cubo"></a>
</p>
<p align="center">
<em>On-Demand Earth System Data Cubes (ESDCs) in Python</em>
</p>
<p align="center">
<a href='https://pypi.python.org/pypi/cubo'>
<img src='https://img.shields.io/pypi/v/cubo.svg' alt='PyPI' />
</a>
<a href='https://anaconda.org/conda-forge/cubo'>
<img src='https://img.shields.io/conda/vn/conda-forge/cubo.svg' alt='conda-forge' />
</a>
<a href='https://cubo.readthedocs.io/en/latest/?badge=latest'>
<img src='https://readthedocs.org/projects/cubo/badge/?version=latest' alt='Documentation Status' />
</a>
<a href='https://arxiv.org/abs/2404.13105'>
<img src='https://img.shields.io/badge/arXiv-2404.13105-b31b1b.svg' alt='Documentation Status' />
</a>
<a href="https://github.com/davemlz/cubo/actions/workflows/tests.yml" target="_blank">
<img src="https://github.com/davemlz/cubo/actions/workflows/tests.yml/badge.svg" alt="Tests">
</a>
<a href="https://opensource.org/licenses/MIT" target="_blank">
<img src="https://img.shields.io/badge/License-MIT-blue.svg" alt="License">
</a>
<a href="https://github.com/sponsors/davemlz" target="_blank">
<img src="https://img.shields.io/badge/GitHub%20Sponsors-Donate-ff69b4.svg" alt="GitHub Sponsors">
</a>
<a href="https://www.buymeacoffee.com/davemlz" target="_blank">
<img src="https://img.shields.io/badge/Buy%20me%20a%20coffee-Donate-ff69b4.svg" alt="Buy me a coffee">
</a>
<a href="https://ko-fi.com/davemlz" target="_blank">
<img src="https://img.shields.io/badge/kofi-Donate-ff69b4.svg" alt="Ko-fi">
</a>
<a href="https://twitter.com/dmlmont" target="_blank">
<img src="https://img.shields.io/twitter/follow/dmlmont?style=social" alt="Twitter">
</a>
<a href="https://github.com/psf/black" target="_blank">
<img src="https://img.shields.io/badge/code%20style-black-000000.svg" alt="Black">
</a>
<a href="https://pycqa.github.io/isort/" target="_blank">
<img src="https://img.shields.io/badge/%20imports-isort-%231674b1?style=flat&labelColor=ef8336" alt="isort">
</a>
</p>
---
**GitHub**: [https://github.com/davemlz/cubo](https://github.com/davemlz/cubo)
**Documentation**: [https://cubo.readthedocs.io/](https://cubo.readthedocs.io/)
**PyPI**: [https://pypi.org/project/cubo/](https://pypi.org/project/cubo/)
**Conda-forge**: [https://anaconda.org/conda-forge/cubo](https://anaconda.org/conda-forge/cubo)
**Tutorials**: [https://cubo.readthedocs.io/en/latest/tutorials.html](https://cubo.readthedocs.io/en/latest/tutorials.html)
**Paper**: [https://arxiv.org/abs/2404.13105](https://arxiv.org/abs/2404.13105)
---
## News
> [!IMPORTANT]
> :star: **Pinned (2024-04-19)**: Our `cubo` paper (preprint) is out in arXiv! Check it here: **[Montero, D., Aybar, C., Ji, C., Kraemer, G., Sochting, M., Teber, K., & Mahecha, M.D. (2024). On-Demand Earth System Data Cubes.](https://arxiv.org/abs/2404.13105)**
> [!NOTE]
> **Upcoming Event (2024-07-07)**: `cubo` will be presented at IGARSS24! More info here: **[2024 IEEE International Geoscience and Remote Sensing Symposium "Acting for Sustainability and Resilience", 7 - 12 July, 2024 • Athens, Greece.](https://www.2024.ieeeigarss.org/)**
## Overview
[SpatioTemporal Asset Catalogs (STAC)](https://stacspec.org/) provide a standardized format that describes
geospatial information. Multiple platforms are using this standard to provide clients several datasets.
Nice platforms such as [Planetary Computer](https://planetarycomputer.microsoft.com/) use this standard.
Additionally, [Google Earth Engine (GEE)](https://developers.google.com/earth-engine/datasets/)
also provides a gigantic catalogue that users can harness for different tasks in Python.
`cubo` is a Python package that provides users of STAC and GEE an easy way to create On-Demand Earth System Data Cubes (ESDCs). This is perfectly suitable for Deep Learning (DL) tasks. You can easily create a lot of ESDCs by just knowing a pair of coordinates and the edge size of the cube in pixels!
Check the simple usage of `cubo` with STAC here:
```python
import cubo
import xarray as xr
da = cubo.create(
lat=4.31, # Central latitude of the cube
lon=-76.2, # Central longitude of the cube
collection="sentinel-2-l2a", # Name of the STAC collection
bands=["B02","B03","B04"], # Bands to retrieve
start_date="2021-06-01", # Start date of the cube
end_date="2021-06-10", # End date of the cube
edge_size=64, # Edge size of the cube (px)
resolution=10, # Pixel size of the cube (m)
)
```
![Cubo Description](https://github.com/davemlz/cubo/raw/main/docs/_static/cubo_desc.png)
This chunk of code just created an `xr.DataArray` object given a pair of coordinates, the edge size of the cube (in pixels), and additional information to get the data from STAC (Planetary Computer by default, but you can use another provider!). Note that you can also use the resolution you want (in meters) and the bands that you require.
Now check the simple usage of `cubo` with GEE here:
```python
import cubo
import xarray as xr
da = cubo.create(
lat=51.079225, # Central latitude of the cube
lon=10.452173, # Central longitude of the cube
collection="COPERNICUS/S2_SR_HARMONIZED", # Id of the GEE collection
bands=["B2","B3","B4"], # Bands to retrieve
start_date="2016-06-01", # Start date of the cube
end_date="2017-07-01", # End date of the cube
edge_size=128, # Edge size of the cube (px)
resolution=10, # Pixel size of the cube (m)
gee=True # Use GEE instead of STAC
)
```
This chunk of code is very similar to the STAC-based cubo code. Note that the `collection`
is now the ID of the GEE collection to use, and note that the `gee` argument must be set to
`True`.
## How does it work?
The thing is super easy and simple.
1. You have the coordinates of a point of interest. The cube will be created around these coordinates (i.e., these coordinates will be approximately the spatial center of the cube).
2. Internally, the coordinates are transformed to the projected UTM coordinates [x,y] in meters (i.e., local UTM CRS). They are rounded to the closest pair of coordinates that are divisible by the resolution you requested.
3. The edge size you provide is used to create a Bounding Box (BBox) for the cube in the local UTM CRS given the exact amount of pixels (Note that the edge size should be a multiple of 2, otherwise it will be rounded, usual edge sizes for ML are 64, 128, 256, 512, etc.).
4. Additional information is used to retrieve the data from the STAC catalogue or from GEE: starts and end dates, name of the collection, endpoint of the catalogue (ignored for GEE), etc.
5. Then, by using `stackstac` and `pystac_client` the cube is retrieved as a `xr. DataArray`. In the case of GEE, the cube is retrieved
via `xee`.
6. Success! That's what `cubo` is doing for you, and you just need to provide the coordinates, the edge size, and the additional info to get the cube.
## Installation
Install the latest version from PyPI:
```
pip install cubo
```
Install `cubo` with the required GEE dependencies from PyPI:
```
pip install cubo[ee]
```
Upgrade `cubo` by running:
```
pip install -U cubo
```
Install the latest version from conda-forge:
```
conda install -c conda-forge cubo
```
Install the latest dev version from GitHub by running:
```
pip install git+https://github.com/davemlz/cubo
```
## Features
### Main function: `create()`
`cubo` is pretty straightforward, everything you need is in the `create()` function:
```python
da = cubo.create(
lat=4.31,
lon=-76.2,
collection="sentinel-2-l2a",
bands=["B02","B03","B04"],
start_date="2021-06-01",
end_date="2021-06-10",
edge_size=64,
resolution=10,
)
```
### Using different units for `edge_size`
By default, the units of `edge_size` are pixels. But you can modify this using the `units` argument:
```python
da = cubo.create(
lat=4.31,
lon=-76.2,
collection="sentinel-2-l2a",
bands=["B02","B03","B04"],
start_date="2021-06-01",
end_date="2021-06-10",
edge_size=1500,
units="m",
resolution=10,
)
```
> [!TIP]
> You can use "px" (pixels), "m" (meters), or any unit available in [`scipy.constants`](https://docs.scipy.org/doc/scipy/reference/constants.html#units).
```python
da = cubo.create(
lat=4.31,
lon=-76.2,
collection="sentinel-2-l2a",
bands=["B02","B03","B04"],
start_date="2021-06-01",
end_date="2021-06-10",
edge_size=1.5,
units="kilo",
resolution=10,
)
```
### Using another endpoint
By default, `cubo` uses Planetary Computer. But you can use another STAC provider endpoint if you want:
```python
da = cubo.create(
lat=4.31,
lon=-76.2,
collection="sentinel-s2-l2a-cogs",
bands=["B05","B06","B07"],
start_date="2020-01-01",
end_date="2020-06-01",
edge_size=128,
resolution=20,
stac="https://earth-search.aws.element84.com/v0"
)
```
### Keywords for searching data
You can pass `kwargs` to `pystac_client.Client.search()` if required:
```python
da = cubo.create(
lat=4.31,
lon=-76.2,
collection="sentinel-2-l2a",
bands=["B02","B03","B04"],
start_date="2021-01-01",
end_date="2021-06-10",
edge_size=64,
resolution=10,
query={"eo:cloud_cover": {"lt": 10}} # kwarg to pass
)
```
## License
The project is licensed under the MIT license.
## Citation
If you use this work, please consider citing the following paper:
```bibtex
@misc{montero2024ondemand,
title={On-Demand Earth System Data Cubes},
author={David Montero and César Aybar and Chaonan Ji and Guido Kraemer and Maximilian Söchting and Khalil Teber and Miguel D. Mahecha},
year={2024},
eprint={2404.13105},
archivePrefix={arXiv},
primaryClass={cs.DB}
}
```
## Logo Attribution
The logo and images were created using <a href="https://www.flaticon.com/free-icons/dice" title="dice icons">dice icons created by Freepik - Flaticon</a>.
[![RSC4Earth](https://github.com/davemlz/cubo/raw/main/docs/_static/esds.png)](https://rsc4earth.de/authors/esds/)
Raw data
{
"_id": null,
"home_page": "https://github.com/davemlz/cubo",
"name": "cubo",
"maintainer": null,
"docs_url": null,
"requires_python": null,
"maintainer_email": null,
"keywords": null,
"author": "David Montero Loaiza",
"author_email": "dml.mont@gmail.com",
"download_url": "https://files.pythonhosted.org/packages/5a/0b/094b1e65e38b5285387fc0d844f5423f051791f031cfe4ecf90043ae876d/cubo-2024.6.0.tar.gz",
"platform": null,
"description": "<p align=\"center\">\n <a href=\"https://github.com/davemlz/cubo\"><img src=\"https://github.com/davemlz/cubo/raw/main/docs/_static/logo.png\" alt=\"cubo\"></a>\n</p>\n<p align=\"center\">\n <em>On-Demand Earth System Data Cubes (ESDCs) in Python</em>\n</p>\n<p align=\"center\">\n<a href='https://pypi.python.org/pypi/cubo'>\n <img src='https://img.shields.io/pypi/v/cubo.svg' alt='PyPI' />\n</a>\n<a href='https://anaconda.org/conda-forge/cubo'>\n <img src='https://img.shields.io/conda/vn/conda-forge/cubo.svg' alt='conda-forge' />\n</a>\n<a href='https://cubo.readthedocs.io/en/latest/?badge=latest'>\n <img src='https://readthedocs.org/projects/cubo/badge/?version=latest' alt='Documentation Status' />\n</a>\n<a href='https://arxiv.org/abs/2404.13105'>\n <img src='https://img.shields.io/badge/arXiv-2404.13105-b31b1b.svg' alt='Documentation Status' />\n</a>\n<a href=\"https://github.com/davemlz/cubo/actions/workflows/tests.yml\" target=\"_blank\">\n <img src=\"https://github.com/davemlz/cubo/actions/workflows/tests.yml/badge.svg\" alt=\"Tests\">\n</a>\n<a href=\"https://opensource.org/licenses/MIT\" target=\"_blank\">\n <img src=\"https://img.shields.io/badge/License-MIT-blue.svg\" alt=\"License\">\n</a>\n<a href=\"https://github.com/sponsors/davemlz\" target=\"_blank\">\n <img src=\"https://img.shields.io/badge/GitHub%20Sponsors-Donate-ff69b4.svg\" alt=\"GitHub Sponsors\">\n</a>\n<a href=\"https://www.buymeacoffee.com/davemlz\" target=\"_blank\">\n <img src=\"https://img.shields.io/badge/Buy%20me%20a%20coffee-Donate-ff69b4.svg\" alt=\"Buy me a coffee\">\n</a>\n<a href=\"https://ko-fi.com/davemlz\" target=\"_blank\">\n <img src=\"https://img.shields.io/badge/kofi-Donate-ff69b4.svg\" alt=\"Ko-fi\">\n</a>\n<a href=\"https://twitter.com/dmlmont\" target=\"_blank\">\n <img src=\"https://img.shields.io/twitter/follow/dmlmont?style=social\" alt=\"Twitter\">\n</a>\n<a href=\"https://github.com/psf/black\" target=\"_blank\">\n <img src=\"https://img.shields.io/badge/code%20style-black-000000.svg\" alt=\"Black\">\n</a>\n<a href=\"https://pycqa.github.io/isort/\" target=\"_blank\">\n <img src=\"https://img.shields.io/badge/%20imports-isort-%231674b1?style=flat&labelColor=ef8336\" alt=\"isort\">\n</a>\n</p>\n\n---\n\n**GitHub**: [https://github.com/davemlz/cubo](https://github.com/davemlz/cubo)\n\n**Documentation**: [https://cubo.readthedocs.io/](https://cubo.readthedocs.io/)\n\n**PyPI**: [https://pypi.org/project/cubo/](https://pypi.org/project/cubo/)\n\n**Conda-forge**: [https://anaconda.org/conda-forge/cubo](https://anaconda.org/conda-forge/cubo)\n\n**Tutorials**: [https://cubo.readthedocs.io/en/latest/tutorials.html](https://cubo.readthedocs.io/en/latest/tutorials.html)\n\n**Paper**: [https://arxiv.org/abs/2404.13105](https://arxiv.org/abs/2404.13105)\n\n---\n\n## News\n\n> [!IMPORTANT] \n> :star: **Pinned (2024-04-19)**: Our `cubo` paper (preprint) is out in arXiv! Check it here: **[Montero, D., Aybar, C., Ji, C., Kraemer, G., Sochting, M., Teber, K., & Mahecha, M.D. (2024). On-Demand Earth System Data Cubes.](https://arxiv.org/abs/2404.13105)**\n\n> [!NOTE] \n> **Upcoming Event (2024-07-07)**: `cubo` will be presented at IGARSS24! More info here: **[2024 IEEE International Geoscience and Remote Sensing Symposium \"Acting for Sustainability and Resilience\", 7 - 12 July, 2024 \u2022 Athens, Greece.](https://www.2024.ieeeigarss.org/)**\n\n## Overview\n\n[SpatioTemporal Asset Catalogs (STAC)](https://stacspec.org/) provide a standardized format that describes\ngeospatial information. Multiple platforms are using this standard to provide clients several datasets.\nNice platforms such as [Planetary Computer](https://planetarycomputer.microsoft.com/) use this standard.\nAdditionally, [Google Earth Engine (GEE)](https://developers.google.com/earth-engine/datasets/) \nalso provides a gigantic catalogue that users can harness for different tasks in Python.\n\n`cubo` is a Python package that provides users of STAC and GEE an easy way to create On-Demand Earth System Data Cubes (ESDCs). This is perfectly suitable for Deep Learning (DL) tasks. You can easily create a lot of ESDCs by just knowing a pair of coordinates and the edge size of the cube in pixels!\n\nCheck the simple usage of `cubo` with STAC here:\n\n```python\nimport cubo\nimport xarray as xr\n\nda = cubo.create(\n lat=4.31, # Central latitude of the cube\n lon=-76.2, # Central longitude of the cube\n collection=\"sentinel-2-l2a\", # Name of the STAC collection\n bands=[\"B02\",\"B03\",\"B04\"], # Bands to retrieve\n start_date=\"2021-06-01\", # Start date of the cube\n end_date=\"2021-06-10\", # End date of the cube\n edge_size=64, # Edge size of the cube (px)\n resolution=10, # Pixel size of the cube (m)\n)\n```\n\n![Cubo Description](https://github.com/davemlz/cubo/raw/main/docs/_static/cubo_desc.png)\n\nThis chunk of code just created an `xr.DataArray` object given a pair of coordinates, the edge size of the cube (in pixels), and additional information to get the data from STAC (Planetary Computer by default, but you can use another provider!). Note that you can also use the resolution you want (in meters) and the bands that you require.\n\nNow check the simple usage of `cubo` with GEE here:\n\n```python\nimport cubo\nimport xarray as xr\n\nda = cubo.create(\n lat=51.079225, # Central latitude of the cube\n lon=10.452173, # Central longitude of the cube\n collection=\"COPERNICUS/S2_SR_HARMONIZED\", # Id of the GEE collection\n bands=[\"B2\",\"B3\",\"B4\"], # Bands to retrieve\n start_date=\"2016-06-01\", # Start date of the cube\n end_date=\"2017-07-01\", # End date of the cube\n edge_size=128, # Edge size of the cube (px)\n resolution=10, # Pixel size of the cube (m)\n gee=True # Use GEE instead of STAC\n)\n```\n\nThis chunk of code is very similar to the STAC-based cubo code. Note that the `collection`\nis now the ID of the GEE collection to use, and note that the `gee` argument must be set to\n`True`.\n\n## How does it work?\n\nThe thing is super easy and simple.\n\n1. You have the coordinates of a point of interest. The cube will be created around these coordinates (i.e., these coordinates will be approximately the spatial center of the cube).\n2. Internally, the coordinates are transformed to the projected UTM coordinates [x,y] in meters (i.e., local UTM CRS). They are rounded to the closest pair of coordinates that are divisible by the resolution you requested.\n3. The edge size you provide is used to create a Bounding Box (BBox) for the cube in the local UTM CRS given the exact amount of pixels (Note that the edge size should be a multiple of 2, otherwise it will be rounded, usual edge sizes for ML are 64, 128, 256, 512, etc.).\n4. Additional information is used to retrieve the data from the STAC catalogue or from GEE: starts and end dates, name of the collection, endpoint of the catalogue (ignored for GEE), etc.\n5. Then, by using `stackstac` and `pystac_client` the cube is retrieved as a `xr. DataArray`. In the case of GEE, the cube is retrieved\nvia `xee`.\n6. Success! That's what `cubo` is doing for you, and you just need to provide the coordinates, the edge size, and the additional info to get the cube.\n\n## Installation\n\nInstall the latest version from PyPI:\n\n```\npip install cubo\n```\n\nInstall `cubo` with the required GEE dependencies from PyPI:\n\n```\npip install cubo[ee]\n```\n\nUpgrade `cubo` by running:\n\n```\npip install -U cubo\n```\n\nInstall the latest version from conda-forge:\n\n```\nconda install -c conda-forge cubo\n```\n\nInstall the latest dev version from GitHub by running:\n\n```\npip install git+https://github.com/davemlz/cubo\n```\n\n## Features\n\n### Main function: `create()`\n\n`cubo` is pretty straightforward, everything you need is in the `create()` function:\n\n```python\nda = cubo.create(\n lat=4.31,\n lon=-76.2,\n collection=\"sentinel-2-l2a\",\n bands=[\"B02\",\"B03\",\"B04\"],\n start_date=\"2021-06-01\",\n end_date=\"2021-06-10\",\n edge_size=64,\n resolution=10,\n)\n```\n\n### Using different units for `edge_size`\n\nBy default, the units of `edge_size` are pixels. But you can modify this using the `units` argument:\n\n```python\nda = cubo.create(\n lat=4.31,\n lon=-76.2,\n collection=\"sentinel-2-l2a\",\n bands=[\"B02\",\"B03\",\"B04\"],\n start_date=\"2021-06-01\",\n end_date=\"2021-06-10\",\n edge_size=1500,\n units=\"m\",\n resolution=10,\n)\n```\n\n> [!TIP]\n> You can use \"px\" (pixels), \"m\" (meters), or any unit available in [`scipy.constants`](https://docs.scipy.org/doc/scipy/reference/constants.html#units).\n\n```python\nda = cubo.create(\n lat=4.31,\n lon=-76.2,\n collection=\"sentinel-2-l2a\",\n bands=[\"B02\",\"B03\",\"B04\"],\n start_date=\"2021-06-01\",\n end_date=\"2021-06-10\",\n edge_size=1.5,\n units=\"kilo\",\n resolution=10,\n)\n```\n\n### Using another endpoint\n\nBy default, `cubo` uses Planetary Computer. But you can use another STAC provider endpoint if you want:\n\n```python\nda = cubo.create(\n lat=4.31,\n lon=-76.2,\n collection=\"sentinel-s2-l2a-cogs\",\n bands=[\"B05\",\"B06\",\"B07\"],\n start_date=\"2020-01-01\",\n end_date=\"2020-06-01\",\n edge_size=128,\n resolution=20,\n stac=\"https://earth-search.aws.element84.com/v0\"\n)\n```\n\n### Keywords for searching data\n\nYou can pass `kwargs` to `pystac_client.Client.search()` if required:\n\n```python\nda = cubo.create(\n lat=4.31,\n lon=-76.2,\n collection=\"sentinel-2-l2a\",\n bands=[\"B02\",\"B03\",\"B04\"],\n start_date=\"2021-01-01\",\n end_date=\"2021-06-10\",\n edge_size=64,\n resolution=10,\n query={\"eo:cloud_cover\": {\"lt\": 10}} # kwarg to pass\n)\n```\n\n## License\n\nThe project is licensed under the MIT license.\n\n## Citation\n\nIf you use this work, please consider citing the following paper:\n\n```bibtex\n@misc{montero2024ondemand,\n title={On-Demand Earth System Data Cubes}, \n author={David Montero and C\u00e9sar Aybar and Chaonan Ji and Guido Kraemer and Maximilian S\u00f6chting and Khalil Teber and Miguel D. Mahecha},\n year={2024},\n eprint={2404.13105},\n archivePrefix={arXiv},\n primaryClass={cs.DB}\n}\n```\n\n## Logo Attribution\n\nThe logo and images were created using <a href=\"https://www.flaticon.com/free-icons/dice\" title=\"dice icons\">dice icons created by Freepik - Flaticon</a>.\n\n[![RSC4Earth](https://github.com/davemlz/cubo/raw/main/docs/_static/esds.png)](https://rsc4earth.de/authors/esds/)\n",
"bugtrack_url": null,
"license": "MIT",
"summary": "On-Demand Earth System Data Cubes in Python",
"version": "2024.6.0",
"project_urls": {
"Homepage": "https://github.com/davemlz/cubo"
},
"split_keywords": [],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "5a0b094b1e65e38b5285387fc0d844f5423f051791f031cfe4ecf90043ae876d",
"md5": "3952b1ea747c1e5198638de0258b9707",
"sha256": "83e29e5f174370fe8b749df54261166c5e856fd423af06362eb79f4466126ffd"
},
"downloads": -1,
"filename": "cubo-2024.6.0.tar.gz",
"has_sig": false,
"md5_digest": "3952b1ea747c1e5198638de0258b9707",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 12953,
"upload_time": "2024-06-23T14:28:34",
"upload_time_iso_8601": "2024-06-23T14:28:34.865017Z",
"url": "https://files.pythonhosted.org/packages/5a/0b/094b1e65e38b5285387fc0d844f5423f051791f031cfe4ecf90043ae876d/cubo-2024.6.0.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-06-23 14:28:34",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "davemlz",
"github_project": "cubo",
"travis_ci": false,
"coveralls": false,
"github_actions": true,
"requirements": [],
"tox": true,
"lcname": "cubo"
}