# autoESDA
[![image](https://img.shields.io/pypi/v/autoesda.svg)](https://pypi.python.org/pypi/autoesda)
[![image](https://img.shields.io/conda/vn/conda-forge/autoesda.svg)](https://anaconda.org/conda-forge/autoesda)
![Conda](https://img.shields.io/conda/dn/conda-forge/autoesda)
[![image](https://github.com/NicholasDeKock/autoesda/workflows/docs/badge.svg)](https://nicholasdekock.github.io/autoesda/)
[![image](https://github.com/NicholasDeKock/autoesda/workflows/build/badge.svg)](https://github.com/giswqs/autoesda/actions?query=workflow%3Abuild)
[![image](https://img.shields.io/pypi/l/autoesda?color=yellow&label=Licence&logo=BSD-3-Clause)](https://opensource.org/licenses/BSD-3-Clause)
![Visitor Badge](https://visitor-badge.laobi.icu/badge?page_id=NicholasDeKock.autoesda)
**A Python package that automates the exploratory spatial data analysis (ESDA) process by summarising the results into an HTML report.**
## Table of Contents
1. **Introduction**
2. **Key features**
3. **Installation**
4. **Dependancies**
5. **Usage**
6. **Examples**
7. **Contributing**
8. **License**
9. **References**
10. **Credits**
### 1. Introduction
Exploratory spatial data analysis (ESDA) is a term used to describe a various functions used to gain a surface-level understanding of a spatial dataset. Currently the ESDA process is repetitive as each of these functions need to be calculated individually. This makes it quite a time consuming process and also includes a large margin for human-induced errors. Additionally, results are not often easily viewed side-by-side for easy comparison and sharing with people who may not have the technical skills to do so.
**autoesda** is the solution to this by allowing the user to execute one line of code to generate an information-rich HTML report that can easily be shared with others.
### 2. Key features
- **HTML output report**
- **Extent map**
- **Dataset overview** (coordinate system, number of rows/columns, which rows/columns have been included/excluded in the report)
- **Descriptive statistics** (count, mean, standard deviation, minimum/maximum, 25<sup>th</sup>/50<sup>th</sup>/75<sup>th</sup> percentiles)
- **Sample of dataset**
- **Boxplot**
- **Histogram**
- **Moran's I simulation** (moran's I, number of features, p-value, z-score, number of permutations)
- **Local Indicator of Spatial Autocorrelation** (local scatterplot, LISA cluster map)
- **Choropleth maps** (quantiles, equal intervals, natural breaks, and percentiles classification schemes)
- **Correlation** (correlation matrix/heatmap, pairwise plot)
### 3. Installation
**autoesda** is available on [PyPI](https://pypi.org/project/autoesda/), to install **autoesda**, run this command in your terminal:
```
pip install autoesda
```
[geopandas](https://github.com/geopandas/geopandas) is a primary dependancy of **autoesda** and there are known challenges assosciated with using pip to install geopandas. The recommended strategy is thus, to use **autoesda** in a `conda environment`.
For advanced users, you can follow [this](https://geopandas.org/en/stable/getting_started/install.html) documentation which will guide you through the geopandas installation by downloading the [unofficial binary files](https://www.lfd.uci.edu/~gohlke/pythonlibs/) of some of the [geopandas](https://github.com/geopandas/geopandas) dependancies.
**autoesda** is also available on [conda-forge](https://anaconda.org/conda-forge/autoesda). If you have [Anaconda](https://www.anaconda.com/products/distribution#download-section) or [Miniconda](https://docs.conda.io/en/latest/miniconda.html) installed on your computer you can use this command in your Anaconda/Miniconda prompt:
```
conda install autoesda
```
### 4. Dependancies
- [geopandas](https://github.com/geopandas/geopandas)
- [pysal](https://github.com/pysal/pysal)
- [matplotlib](https://github.com/matplotlib/matplotlib)
- [libpysal](https://github.com/pysal/libpysal)
- [esda](https://github.com/pysal/esda)
- [splot](https://github.com/pysal/splot)
- [seaborn](https://github.com/mwaskom/seaborn)
### 5. Usage
To start off with, you need to ensure that you have imported both *geopandas* and *autoesda*.
```
import geopandas as gpd
import autoesda
```
Once both libraries have been sucessfully imported, you can import your dataset as a GeoDataFrame. This is done using *geopandas*. To read more about compatible file types, read the geopandas [documentation](https://geopandas.org/en/stable/docs/user_guide/io.html). In this example, a shapefile is imported.
```
gdf = gpd.read_file(r'example-file-path\example-shapefile.shp')
```
Once your data is stored in a GeoDataFrame, you can generate the report.
```
autoesda.generate_report(gdf)
```
The report will be saved to your working file directory.
### 6. Example Reports
| Vector Reports | Raster Reports |
| ---------- | ---------- |
| [Old COJ Demographic Data](https://autoesda.github.io/autoESDA-static/)| Global Terrestrial Precipitation <br> [Band 1](https://nicholasdekock.github.io/autoesda/example_reports/rain_1_original.html) \| [Band 2](https://nicholasdekock.github.io/autoesda/example_reports/rain_1_low_range.html) \| [Band 3](https://nicholasdekock.github.io/autoesda/example_reports/rain_1_med_range.html) \| [Band 4](https://nicholasdekock.github.io/autoesda/example_reports/rain_1_high_range.html) \| [Stacked](https://nicholasdekock.github.io/autoesda/example_reports/rain_1_stacked.html) |
| [AirbBnB Chicago 2015](https://nicholasdekock.github.io/autoesda/example_reports/v1-airbnb.html) | EU NOx Concentration <br> [Band 1](https://nicholasdekock.github.io/autoesda/example_reports/eu_air_1_original.html) \| [Band 2](https://nicholasdekock.github.io/autoesda/example_reports/eu_air_1_low_range.html) \| [Band 3](https://nicholasdekock.github.io/autoesda/example_reports/eu_air_1_med_range.html) \| [Band 4](https://nicholasdekock.github.io/autoesda/example_reports/eu_air_1_high_range.html) \| [Stacked](https://nicholasdekock.github.io/autoesda/example_reports/eu_air_1_stacked.html) |
| [Grid 100](https://nicholasdekock.github.io/autoesda/example_reports/v2-grid100.html) | South African Population <br> [Band 1](https://nicholasdekock.github.io/autoesda/example_reports/za_pop_1_original.html) \| [Band 2](https://nicholasdekock.github.io/autoesda/example_reports/za_pop_1_low_range.html) \| [Band 3](https://nicholasdekock.github.io/autoesda/example_reports/za_pop_1_med_range.html) \| [Band 4](https://nicholasdekock.github.io/autoesda/example_reports/za_pop_1_high_range.html) |
| [South African 2011 Census](https://nicholasdekock.github.io/autoesda/example_reports/v3-southafrica.html) | |
| [Natural Earth Country Boundaries](https://nicholasdekock.github.io/autoesda/example_reports/v4-naturalearth.html) | |
| [Malaria in Colombia](https://nicholasdekock.github.io/autoesda/example_reports/v5-colombia.html) | |
| [USA Election Results](https://nicholasdekock.github.io/autoesda/example_reports/v6-usa.html) | |
### 7. Contributing
Click [here](https://github.com/NicholasDeKock/autoesda/issues/new/choose) to report bugs
Click [here](https://github.com/NicholasDeKock/autoesda/issues/new/choose) to request a new feature
If you would like to assist with fixing bugs, further development or writing documentation you are most welcome to do so. Use the [issues](https://github.com/NicholasDeKock/autoesda/issues) page to guide what you can assist with.
In order to make a contribution you will need to:
1. Fork the autoesda repository on GitHub.
2. Clone your fork locally.
3. Commit your changes to your branch on GitHub
4. Once you are satsfied that your work is suitable, submit a pull request through the GitHub website.
### 8. License
This software is available under the BSD-3-Clause license.
For more information, see the [LICENSE](https://github.com/NicholasDeKock/autoesda/blob/main/LICENSE) file which contains details on the history of this software, terms & conditions for usage, and a disclaimer of all warranties.
### 9. References
When citing this library, please reference the following:
de Kock, N., Rautenbach, V., and Fabris-Rotelli, I.: TOWARDS AN OPEN SOURCE PYTHON LIBRARY FOR AUTOMATED EXPLORATORY SPATIAL DATA ANALYSIS, Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci., XLIII-B4-2022, 91–98, https://doi.org/10.5194/isprs-archives-XLIII-B4-2022-91-2022, 2022.
### 10. Credits
This package was created with [Cookiecutter](https://github.com/cookiecutter/cookiecutter) and the [giswqs/pypackage](https://github.com/giswqs/pypackage) project template.
Raw data
{
"_id": null,
"home_page": "https://github.com/NicholasDeKock/autoesda",
"name": "autoesda",
"maintainer": "",
"docs_url": null,
"requires_python": ">=3.5",
"maintainer_email": "",
"keywords": "autoesda",
"author": "Nicholas De Kock",
"author_email": "nicholas.dekock@tuks.co.za",
"download_url": "https://files.pythonhosted.org/packages/87/42/77b1880b0aa8dfebcfa3d826b860f49296fff1fcdcc0b05576b1aaf50f30/autoesda-1.0.0.tar.gz",
"platform": null,
"description": "# autoESDA\n\n\n[![image](https://img.shields.io/pypi/v/autoesda.svg)](https://pypi.python.org/pypi/autoesda)\n[![image](https://img.shields.io/conda/vn/conda-forge/autoesda.svg)](https://anaconda.org/conda-forge/autoesda)\n![Conda](https://img.shields.io/conda/dn/conda-forge/autoesda)\n[![image](https://github.com/NicholasDeKock/autoesda/workflows/docs/badge.svg)](https://nicholasdekock.github.io/autoesda/)\n[![image](https://github.com/NicholasDeKock/autoesda/workflows/build/badge.svg)](https://github.com/giswqs/autoesda/actions?query=workflow%3Abuild)\n[![image](https://img.shields.io/pypi/l/autoesda?color=yellow&label=Licence&logo=BSD-3-Clause)](https://opensource.org/licenses/BSD-3-Clause)\n![Visitor Badge](https://visitor-badge.laobi.icu/badge?page_id=NicholasDeKock.autoesda)\n\n**A Python package that automates the exploratory spatial data analysis (ESDA) process by summarising the results into an HTML report.**\n\n## Table of Contents\n1. **Introduction**\n2. **Key features**\n3. **Installation**\n4. **Dependancies**\n5. **Usage**\n6. **Examples**\n7. **Contributing**\n8. **License**\n9. **References**\n10. **Credits**\n\n### 1. Introduction\nExploratory spatial data analysis (ESDA) is a term used to describe a various functions used to gain a surface-level understanding of a spatial dataset. Currently the ESDA process is repetitive as each of these functions need to be calculated individually. This makes it quite a time consuming process and also includes a large margin for human-induced errors. Additionally, results are not often easily viewed side-by-side for easy comparison and sharing with people who may not have the technical skills to do so.\n\n**autoesda** is the solution to this by allowing the user to execute one line of code to generate an information-rich HTML report that can easily be shared with others.\n\n### 2. Key features\n- **HTML output report**\n- **Extent map**\n- **Dataset overview** (coordinate system, number of rows/columns, which rows/columns have been included/excluded in the report)\n- **Descriptive statistics** (count, mean, standard deviation, minimum/maximum, 25<sup>th</sup>/50<sup>th</sup>/75<sup>th</sup> percentiles)\n- **Sample of dataset**\n- **Boxplot**\n- **Histogram**\n- **Moran's I simulation** (moran's I, number of features, p-value, z-score, number of permutations)\n- **Local Indicator of Spatial Autocorrelation** (local scatterplot, LISA cluster map)\n- **Choropleth maps** (quantiles, equal intervals, natural breaks, and percentiles classification schemes)\n- **Correlation** (correlation matrix/heatmap, pairwise plot)\n\n### 3. Installation\n**autoesda** is available on [PyPI](https://pypi.org/project/autoesda/), to install **autoesda**, run this command in your terminal:\n```\npip install autoesda\n```\n[geopandas](https://github.com/geopandas/geopandas) is a primary dependancy of **autoesda** and there are known challenges assosciated with using pip to install geopandas. The recommended strategy is thus, to use **autoesda** in a `conda environment`.\n\nFor advanced users, you can follow [this](https://geopandas.org/en/stable/getting_started/install.html) documentation which will guide you through the geopandas installation by downloading the [unofficial binary files](https://www.lfd.uci.edu/~gohlke/pythonlibs/) of some of the [geopandas](https://github.com/geopandas/geopandas) dependancies.\n\n**autoesda** is also available on [conda-forge](https://anaconda.org/conda-forge/autoesda). If you have [Anaconda](https://www.anaconda.com/products/distribution#download-section) or [Miniconda](https://docs.conda.io/en/latest/miniconda.html) installed on your computer you can use this command in your Anaconda/Miniconda prompt:\n```\nconda install autoesda\n```\n\n### 4. Dependancies\n- [geopandas](https://github.com/geopandas/geopandas)\n- [pysal](https://github.com/pysal/pysal)\n- [matplotlib](https://github.com/matplotlib/matplotlib)\n- [libpysal](https://github.com/pysal/libpysal)\n- [esda](https://github.com/pysal/esda)\n- [splot](https://github.com/pysal/splot)\n- [seaborn](https://github.com/mwaskom/seaborn)\n\n### 5. Usage\nTo start off with, you need to ensure that you have imported both *geopandas* and *autoesda*.\n```\nimport geopandas as gpd\nimport autoesda\n```\n\nOnce both libraries have been sucessfully imported, you can import your dataset as a GeoDataFrame. This is done using *geopandas*. To read more about compatible file types, read the geopandas [documentation](https://geopandas.org/en/stable/docs/user_guide/io.html). In this example, a shapefile is imported.\n```\ngdf = gpd.read_file(r'example-file-path\\example-shapefile.shp')\n```\n\nOnce your data is stored in a GeoDataFrame, you can generate the report. \n```\nautoesda.generate_report(gdf)\n```\n\nThe report will be saved to your working file directory.\n\n### 6. Example Reports\n| Vector Reports | Raster Reports |\n| ---------- | ---------- |\n| [Old COJ Demographic Data](https://autoesda.github.io/autoESDA-static/)| Global Terrestrial Precipitation <br> [Band 1](https://nicholasdekock.github.io/autoesda/example_reports/rain_1_original.html) \\| [Band 2](https://nicholasdekock.github.io/autoesda/example_reports/rain_1_low_range.html) \\| [Band 3](https://nicholasdekock.github.io/autoesda/example_reports/rain_1_med_range.html) \\| [Band 4](https://nicholasdekock.github.io/autoesda/example_reports/rain_1_high_range.html) \\| [Stacked](https://nicholasdekock.github.io/autoesda/example_reports/rain_1_stacked.html) |\n| [AirbBnB Chicago 2015](https://nicholasdekock.github.io/autoesda/example_reports/v1-airbnb.html) | EU NOx Concentration <br> [Band 1](https://nicholasdekock.github.io/autoesda/example_reports/eu_air_1_original.html) \\| [Band 2](https://nicholasdekock.github.io/autoesda/example_reports/eu_air_1_low_range.html) \\| [Band 3](https://nicholasdekock.github.io/autoesda/example_reports/eu_air_1_med_range.html) \\| [Band 4](https://nicholasdekock.github.io/autoesda/example_reports/eu_air_1_high_range.html) \\| [Stacked](https://nicholasdekock.github.io/autoesda/example_reports/eu_air_1_stacked.html) |\n| [Grid 100](https://nicholasdekock.github.io/autoesda/example_reports/v2-grid100.html) | South African Population <br> [Band 1](https://nicholasdekock.github.io/autoesda/example_reports/za_pop_1_original.html) \\| [Band 2](https://nicholasdekock.github.io/autoesda/example_reports/za_pop_1_low_range.html) \\| [Band 3](https://nicholasdekock.github.io/autoesda/example_reports/za_pop_1_med_range.html) \\| [Band 4](https://nicholasdekock.github.io/autoesda/example_reports/za_pop_1_high_range.html) | \n| [South African 2011 Census](https://nicholasdekock.github.io/autoesda/example_reports/v3-southafrica.html) | |\n| [Natural Earth Country Boundaries](https://nicholasdekock.github.io/autoesda/example_reports/v4-naturalearth.html) | |\n| [Malaria in Colombia](https://nicholasdekock.github.io/autoesda/example_reports/v5-colombia.html) | |\n| [USA Election Results](https://nicholasdekock.github.io/autoesda/example_reports/v6-usa.html) | |\n\n### 7. Contributing\nClick [here](https://github.com/NicholasDeKock/autoesda/issues/new/choose) to report bugs\n\nClick [here](https://github.com/NicholasDeKock/autoesda/issues/new/choose) to request a new feature\n\nIf you would like to assist with fixing bugs, further development or writing documentation you are most welcome to do so. Use the [issues](https://github.com/NicholasDeKock/autoesda/issues) page to guide what you can assist with. \n\nIn order to make a contribution you will need to:\n\n1. Fork the autoesda repository on GitHub.\n2. Clone your fork locally.\n3. Commit your changes to your branch on GitHub\n4. Once you are satsfied that your work is suitable, submit a pull request through the GitHub website.\n\n### 8. License\nThis software is available under the BSD-3-Clause license.\n\nFor more information, see the [LICENSE](https://github.com/NicholasDeKock/autoesda/blob/main/LICENSE) file which contains details on the history of this software, terms & conditions for usage, and a disclaimer of all warranties.\n\n### 9. References\nWhen citing this library, please reference the following:\n\nde Kock, N., Rautenbach, V., and Fabris-Rotelli, I.: TOWARDS AN OPEN SOURCE PYTHON LIBRARY FOR AUTOMATED EXPLORATORY SPATIAL DATA ANALYSIS, Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci., XLIII-B4-2022, 91\u201398, https://doi.org/10.5194/isprs-archives-XLIII-B4-2022-91-2022, 2022.\n\n### 10. Credits\nThis package was created with [Cookiecutter](https://github.com/cookiecutter/cookiecutter) and the [giswqs/pypackage](https://github.com/giswqs/pypackage) project template.\n",
"bugtrack_url": null,
"license": "BSD license",
"summary": "A Python package that automates the exploratory spatial data analysis (ESDA) process by summarizing the results in an HTML report",
"version": "1.0.0",
"project_urls": {
"Homepage": "https://github.com/NicholasDeKock/autoesda"
},
"split_keywords": [
"autoesda"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "706118305cdc3a67a7cf8bfa2d2e216e5099248d270495f9b3fb703d90679ef1",
"md5": "e5dbc2a2c1262253c2eae1518bc14990",
"sha256": "c7bf1a7c7b9e531a2b91fbff9c7a4879f72dfe47bb7c21db34d0979e24be3e4d"
},
"downloads": -1,
"filename": "autoesda-1.0.0-py2.py3-none-any.whl",
"has_sig": false,
"md5_digest": "e5dbc2a2c1262253c2eae1518bc14990",
"packagetype": "bdist_wheel",
"python_version": "py2.py3",
"requires_python": ">=3.5",
"size": 60148,
"upload_time": "2024-03-13T12:20:45",
"upload_time_iso_8601": "2024-03-13T12:20:45.955991Z",
"url": "https://files.pythonhosted.org/packages/70/61/18305cdc3a67a7cf8bfa2d2e216e5099248d270495f9b3fb703d90679ef1/autoesda-1.0.0-py2.py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "874277b1880b0aa8dfebcfa3d826b860f49296fff1fcdcc0b05576b1aaf50f30",
"md5": "06d25733cd6c6300e47857ebcf316ffe",
"sha256": "25595faad07c6ef8770c704c8e05acd096719d9d7e0d255334d6e01150aafb2a"
},
"downloads": -1,
"filename": "autoesda-1.0.0.tar.gz",
"has_sig": false,
"md5_digest": "06d25733cd6c6300e47857ebcf316ffe",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3.5",
"size": 64835,
"upload_time": "2024-03-13T12:20:47",
"upload_time_iso_8601": "2024-03-13T12:20:47.617455Z",
"url": "https://files.pythonhosted.org/packages/87/42/77b1880b0aa8dfebcfa3d826b860f49296fff1fcdcc0b05576b1aaf50f30/autoesda-1.0.0.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-03-13 12:20:47",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "NicholasDeKock",
"github_project": "autoesda",
"travis_ci": false,
"coveralls": false,
"github_actions": true,
"requirements": [
{
"name": "esda",
"specs": []
},
{
"name": "geopandas",
"specs": [
[
"==",
"0.8.1"
]
]
},
{
"name": "libpysal",
"specs": [
[
"==",
"4.4.0"
]
]
},
{
"name": "matplotlib",
"specs": [
[
"==",
"3.4.2"
]
]
},
{
"name": "pysal",
"specs": [
[
"==",
"2.3.0"
]
]
},
{
"name": "seaborn",
"specs": [
[
"==",
"0.11.2"
]
]
},
{
"name": "splot",
"specs": [
[
">=",
"1.1.4"
]
]
}
],
"lcname": "autoesda"
}