geosink


Namegeosink JSON
Version 0.2.0 PyPI version JSON
download
home_pageNone
SummaryGeodesic Sinkhorn with Chebyshev approximation
upload_time2024-06-28 13:13:42
maintainerNone
docs_urlNone
authorGuillaume Huguet
requires_python>=3.8
licenseMIT License Copyright (c) 2024 Guillaume Huguet Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions: The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software. THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
keywords diffusion optimal transport heat kernel sinkhorn
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # GeoSinkhorn
Code for the paper Geodesic Sinkhorn for Fast and Accurate Optimal Transport on Manifolds. 

> [!NOTE]  
>This repository is still in development.

### Installation
You can install the library from [PyPI](https://pypi.org/project/geosink/) by running:
```bash
pip install geosink
``` 
Or using Git, by first cloning the repository and running:
```bash
pip install -e .
```
If you want to use the pre existing graph tools, run:
```bash
pip install -e .['graph']
```
To run the tests, you will need additional packages. Install them by running:
```bash
pip install -e .['dev']
```


### Minimal Example
You can reproduce this example in the following notebook [![notebook](https://img.shields.io/static/v1?label=Run%20in&message=Google%20Colab&color=orange&logo=Google%20Cloud)](https://colab.research.google.com/drive/1Y_CHGb49aVXgTPtnD-Yf8GX_PYYwXwYx?usp=sharing).


We build a graph between two Gaussian distributions and compute the distance between two signals on that graph.
```python
import numpy as np
from geosink.sinkhorn import GeoSinkhorn 
from geosink.heat_kernel import laplacian_from_data

# Generate data and build graph.
data0 = np.random.normal(0, 1, (100, 5))
data1 = np.random.normal(5, 1, (100, 5))
data = np.concatenate([data0, data1], axis=0)
lap = laplacian_from_data(data, sigma=1.0)

# instantiate the GeoSinkhorn class
geo_sinkhorn = GeoSinkhorn(tau=5.0, order=10, method="cheb", lap=lap)

# create two signals
m_0 = np.zeros(200,)
m_0[:100] = 1
m_0 = m_0 / np.sum(m_0)
m_1 = np.zeros(200,)
m_1[100:] = 1
m_1 = m_1 / np.sum(m_1)

# compute the distance between the two signals
dist_w = geo_sinkhorn(m_0, m_1, max_iter=500)
print(dist_w)
```
Note that it is also possible to provide a graph instance directly to the `GeoSinkhorn` class with `GeoSinkhorn(tau=1.0, order=10, method="cheb", graph=graph)`. The `graph` must have a Laplacian attribute `graph.L`. We suggest using a sparse Laplacian (e.g. in COO format) for better performance.

### How to Cite

If you find this code useful in your research, please cite the following paper (expand for BibTeX):
<details>
<summary>
Huguet, G., Tong, A., Zapatero, M. R., Tape, C. J., Wolf, G., & Krishnaswamy, S. (2023). Geodesic Sinkhorn for fast and accurate optimal transport on manifolds. In 2023 IEEE 33rd International Workshop on Machine Learning for Signal Processing (MLSP).
</summary>

```bibtex
@inproceedings{huguet2023geodesic,
  title={Geodesic Sinkhorn for fast and accurate optimal transport on manifolds},
  author={Huguet, Guillaume and Tong, Alexander and Zapatero, Mar{\'\i}a Ramos and Tape, Christopher J and Wolf, Guy and Krishnaswamy, Smita},
  booktitle={2023 IEEE 33rd International Workshop on Machine Learning for Signal Processing (MLSP)},
  pages={1--6},
  year={2023},
  organization={IEEE}
}
```

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "geosink",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.8",
    "maintainer_email": null,
    "keywords": "diffusion, optimal transport, heat, kernel, sinkhorn",
    "author": "Guillaume Huguet",
    "author_email": null,
    "download_url": "https://files.pythonhosted.org/packages/33/78/2966824fe5ba03a75e7f1d54cd6883ec6b4b1957ac1a76c076aae2a5a31f/geosink-0.2.0.tar.gz",
    "platform": null,
    "description": "# GeoSinkhorn\nCode for the paper Geodesic Sinkhorn for Fast and Accurate Optimal Transport on Manifolds. \n\n> [!NOTE]  \n>This repository is still in development.\n\n### Installation\nYou can install the library from [PyPI](https://pypi.org/project/geosink/) by running:\n```bash\npip install geosink\n``` \nOr using Git, by first cloning the repository and running:\n```bash\npip install -e .\n```\nIf you want to use the pre existing graph tools, run:\n```bash\npip install -e .['graph']\n```\nTo run the tests, you will need additional packages. Install them by running:\n```bash\npip install -e .['dev']\n```\n\n\n### Minimal Example\nYou can reproduce this example in the following notebook [![notebook](https://img.shields.io/static/v1?label=Run%20in&message=Google%20Colab&color=orange&logo=Google%20Cloud)](https://colab.research.google.com/drive/1Y_CHGb49aVXgTPtnD-Yf8GX_PYYwXwYx?usp=sharing).\n\n\nWe build a graph between two Gaussian distributions and compute the distance between two signals on that graph.\n```python\nimport numpy as np\nfrom geosink.sinkhorn import GeoSinkhorn \nfrom geosink.heat_kernel import laplacian_from_data\n\n# Generate data and build graph.\ndata0 = np.random.normal(0, 1, (100, 5))\ndata1 = np.random.normal(5, 1, (100, 5))\ndata = np.concatenate([data0, data1], axis=0)\nlap = laplacian_from_data(data, sigma=1.0)\n\n# instantiate the GeoSinkhorn class\ngeo_sinkhorn = GeoSinkhorn(tau=5.0, order=10, method=\"cheb\", lap=lap)\n\n# create two signals\nm_0 = np.zeros(200,)\nm_0[:100] = 1\nm_0 = m_0 / np.sum(m_0)\nm_1 = np.zeros(200,)\nm_1[100:] = 1\nm_1 = m_1 / np.sum(m_1)\n\n# compute the distance between the two signals\ndist_w = geo_sinkhorn(m_0, m_1, max_iter=500)\nprint(dist_w)\n```\nNote that it is also possible to provide a graph instance directly to the `GeoSinkhorn` class with `GeoSinkhorn(tau=1.0, order=10, method=\"cheb\", graph=graph)`. The `graph` must have a Laplacian attribute `graph.L`. We suggest using a sparse Laplacian (e.g. in COO format) for better performance.\n\n### How to Cite\n\nIf you find this code useful in your research, please cite the following paper (expand for BibTeX):\n<details>\n<summary>\nHuguet, G., Tong, A., Zapatero, M. R., Tape, C. J., Wolf, G., & Krishnaswamy, S. (2023). Geodesic Sinkhorn for fast and accurate optimal transport on manifolds. In 2023 IEEE 33rd International Workshop on Machine Learning for Signal Processing (MLSP).\n</summary>\n\n```bibtex\n@inproceedings{huguet2023geodesic,\n  title={Geodesic Sinkhorn for fast and accurate optimal transport on manifolds},\n  author={Huguet, Guillaume and Tong, Alexander and Zapatero, Mar{\\'\\i}a Ramos and Tape, Christopher J and Wolf, Guy and Krishnaswamy, Smita},\n  booktitle={2023 IEEE 33rd International Workshop on Machine Learning for Signal Processing (MLSP)},\n  pages={1--6},\n  year={2023},\n  organization={IEEE}\n}\n```\n",
    "bugtrack_url": null,
    "license": "MIT License  Copyright (c) 2024 Guillaume Huguet  Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the \"Software\"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:  The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.  THE SOFTWARE IS PROVIDED \"AS IS\", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE. ",
    "summary": "Geodesic Sinkhorn with Chebyshev approximation",
    "version": "0.2.0",
    "project_urls": null,
    "split_keywords": [
        "diffusion",
        " optimal transport",
        " heat",
        " kernel",
        " sinkhorn"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "084f84bd70757ff46bcf8aafe7d24e46e98069d230434c3d268090d8cd7093e8",
                "md5": "f291fe3899429cb79e4fdbbc9dac8f47",
                "sha256": "6bf065dfd9d10d6651d190dafe4a70f93b61dca98b371af1e42ee21f511a379b"
            },
            "downloads": -1,
            "filename": "geosink-0.2.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "f291fe3899429cb79e4fdbbc9dac8f47",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.8",
            "size": 6956,
            "upload_time": "2024-06-28T13:13:41",
            "upload_time_iso_8601": "2024-06-28T13:13:41.312409Z",
            "url": "https://files.pythonhosted.org/packages/08/4f/84bd70757ff46bcf8aafe7d24e46e98069d230434c3d268090d8cd7093e8/geosink-0.2.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "33782966824fe5ba03a75e7f1d54cd6883ec6b4b1957ac1a76c076aae2a5a31f",
                "md5": "7a763b0a8b9dd1fe637aae917ca5af78",
                "sha256": "2ee04d6f17905ec1d6289d225ba51408da78e99bce7dd82b711c30993258706d"
            },
            "downloads": -1,
            "filename": "geosink-0.2.0.tar.gz",
            "has_sig": false,
            "md5_digest": "7a763b0a8b9dd1fe637aae917ca5af78",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.8",
            "size": 9358,
            "upload_time": "2024-06-28T13:13:42",
            "upload_time_iso_8601": "2024-06-28T13:13:42.609703Z",
            "url": "https://files.pythonhosted.org/packages/33/78/2966824fe5ba03a75e7f1d54cd6883ec6b4b1957ac1a76c076aae2a5a31f/geosink-0.2.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-06-28 13:13:42",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "geosink"
}
        
Elapsed time: 2.42031s