Name | geochemistrypi JSON |
Version |
0.6.1
JSON |
| download |
home_page | None |
Summary | A highly automated machine learning Python framework dedicating to build up MLOps level 1 software product for data-driven geochemistry discovery |
upload_time | 2024-07-05 02:14:29 |
maintainer | None |
docs_url | None |
author | None |
requires_python | ~=3.9 |
license | MIT License Copyright (c) 2021 ZJUEarthData Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions: The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software. THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE. |
keywords |
automated
continuous training
data mining
geochemistry discovery
geochemistry π
mlops
machine learning
machine learning lifecycle management
model inference
|
VCS |
|
bugtrack_url |
|
requirements |
No requirements were recorded.
|
Travis-CI |
No Travis.
|
coveralls test coverage |
No coveralls.
|
<p>
<img src="https://github.com/ZJUEarthData/geochemistrypi/assets/47497750/6f970708-7811-40bb-8172-9b98563c7659" class="center"/>
</p>
<p align="center">
<img src="https://img.shields.io/github/actions/workflow/status/ZJUEarthData/geochemistrypi/geochemistrypi.yml?logo=github">
<img src="https://img.shields.io/github/license/ZJUEarthData/geochemistrypi">
<img src="https://img.shields.io/github/v/release/ZJUEarthData/geochemistrypi?include_prereleases">
<img src="https://static.pepy.tech/personalized-badge/geochemistrypi?period=total&units=international_system&left_color=grey&right_color=green&left_text=Downloads">
<img src="https://img.shields.io/pypi/pyversions/geochemistrypi">
</p>
---
**Documentation**: <a href="https://geochemistrypi.readthedocs.io" target="_blank">https://geochemistrypi.readthedocs.io</a>
**Source Code**: <a href="https://github.com/ZJUEarthData/geochemistrypi" target="_blank">https://github.com/ZJUEarthData/geochemistrypi</a>
---
Geochemistry π is an **open-sourced highly automated machine learning Python framework** dedicating to build up MLOps level 1 software product for data-driven geochemistry discovery on tabular data.
Core capabilities are:
+ **Continous Training**
+ **Machine Learning Lifecycle Management**
+ **Model Inference**
Key features are:
+ **Easy to use:** The automation of data mining process provides the users with simple number options to choose.
+ **Extensible:** It allows appending new algorithms through Scikit-learn with automatic hyper parameter searching by FLAML and Ray.
+ **Traceable**: It integrates MLflow to build special storage mechanism to streamline the end-to-end machine learning lifecycle.
Latest Update: follow up by clicking `Starred` and `Watch` on our [GitHub repository](https://github.com/ZJUEarthData/geochemistrypi), then get email notifications of the newest features automatically.
The following figure is the simplified overview of Geochemistry π: <br>
<p align="center">
<img src="https://github.com/ZJUEarthData/geochemistrypi/assets/47497750/28e174f0-1f2f-4367-96bd-9526352101bd" alt="Overview of workflow" width="600" />
</p>
The following figure is the frontend-backend separation architecture of Geochemistry: <br>
<p align="center">
<img src="https://github.com/ZJUEarthData/geochemistrypi/assets/47497750/3b27cbdb-ff50-4fa6-b1d1-4c75b253fdff" alt="Frontend-backend separation architecture of Geochemistry" width="450" />
</p>
**If the software contributes to your research, cite the work as :**
ZhangZhou J\*, He Can\*, Sun Jianhao, Zhao Jianming, Lyu Yang, Wang Shengxin, Zhao Wenyu, Li Anzhou, Ji Xiaohui. Geochemistry π: Automated machine learning python framework for tabular data (2024). Geochemistry, Geophysics,
Geosystems, 25, e2023GC011324
Download link: https://agupubs.onlinelibrary.wiley.com/doi/10.1029/2023GC011324
**Related report:**
Geochemistry π was selected for featuring as an Editor’s Highlight in EOS magazine by American Geophysical Union (fewer than 2 percent of paper are selected) and quoted in Geochemical NEWS by Geochemical Society.
Eos Website: https://eos.org/editor-highlights/machine-learning-for-geochemists-who-dont-want-to-code.
## Quick Installation
Our software is well tested on **macOS** and **Windows** system with **Python 3.9**. Other systems and Python version are not guranteed.
One instruction to download on **command line**, such as Terminal on macOS, Power Shell on Windows.
```
pip install geochemistrypi
```
Download the latest version to avoid some old version issues, such as dependency downloading.
```
pip install "geochemistrypi==0.6.1"
```
One instruction to download on **Jupyter Notebook** or **Google Colab**.
```
!pip install geochemistrypi
```
Download the latest version to avoid some old version issues, such as dependency downloading.
```
!pip install "geochemistrypi==0.6.1"
```
Check the downloaded version of our software:
```
geochemistrypi --version
```
**Note**: For more detail on installation, please refer to our online documentation in **Installation Manual** under the section of **FOR USER**. Over there, we highly recommend to use virtual environment (Conda) to avoid dependency version problems.
## Quick Update
One instruction to update the software to the latest version on **command line**, such as Terminal on macOS, Power Shell on Windows.
```
pip install --upgrade geochemistrypi
```
One instruction to download on **Jupyter Notebook** or **Google Colab**.
```
!pip install --upgrade geochemistrypi
```
Check the updated version of our software:
```
geochemistrypi --version
```
## Data Preparation
In order to utilize the functions provided by our software, your own data set should satisfy:
- be with the suffix **.xlsx** or **.csv**, which is supported by Microsoft Excel.
- be comprise of location information **LATITUDE** and **LONGITUDE**, two columns respectively. It is optional.
If you want to run **classification** algorithm, you data set should satisfy:
- a label column. You can name it as you wish, such as **Label**.
Column name specification:
- No restriction on the column names. You can name them as you want except for two special and optional column **LATITUDE** and **LONGITUDE**.
- every column can only one column name. Multi level column names are not allowed.
- Between two columns with values, a completed void column can exists.
The following are seven built-in data sets in our software stored on Google Drive and Tecent Docs, have a look on them. For the algorithm you intend to run, you can refer to the data format of the corresponding dataset.
+ Data_Regression.xlsx [[Google Drive]](https://docs.google.com/spreadsheets/d/13MB4t_2PiZ90tTMJKw7HcBUi2sb3tXej/edit?usp=sharing&ouid=110717816678586054594&rtpof=true&sd=true) | [[Tencent Docs]](https://docs.qq.com/document/DQ3VmdWZCTGV3bmpM?&u=6868f96d4a384b309036e04e637e367a)
+ ApplicationData_Regression.xlsx [[Google Drive]](https://docs.google.com/spreadsheets/d/1FCek2OOYQD887jfQz21g0ovqVuUJIjVoNI77D-Ufr9Y/edit?usp=sharing) | [[Tencent Docs]](
https://docs.qq.com/document/DQ3BDeHhxRGNzSXZN)
+ Data_Classification.xlsx [[Google Drive]](https://docs.google.com/spreadsheets/d/1xFBCYVmtZfuEAbeBljUlzqBjxVuLAt8x/edit?usp=sharing&ouid=110717816678586054594&rtpof=true&sd=true) | [[Tencent Docs]](https://docs.qq.com/document/DQ0JUaUFsZnRaZkNG?&u=6868f96d4a384b309036e04e637e367a)
+ ApplicationData_Classification.xlsx [[Google Drive]](https://docs.google.com/spreadsheets/d/1J7QvdvbbHJMlKtiumBgKDW7ALghfQQZyKGEoOqhKQjw/edit?usp=sharing) | [[Tencent Docs]](https://docs.qq.com/document/DQ2dnQWtubHRBTGtB)
+ Data_Clustering.xlsx [[Google Drive]](https://docs.google.com/spreadsheets/d/1sbuJdOzGNQ2Pk-bVURfPYg1rltyBbn5J/edit?usp=sharing&ouid=110717816678586054594&rtpof=true&sd=true) | [[Tencent Docs]](https://docs.qq.com/document/DQ3dKdGtlWkhZS2xR?&u=6868f96d4a384b309036e04e637e367a)
+ Data_Decomposition.xlsx [[Google Drive]](https://docs.google.com/spreadsheets/d/1kix82qj5--vhnm8-KhuUBH9dqYH6zcY8/edit?usp=sharing&ouid=110717816678586054594&rtpof=true&sd=true) | [[Tencent Docs]](https://docs.qq.com/document/DQ29oZ0lhUGtZUmdN?&u=6868f96d4a384b309036e04e637e367a)
+ Data_AbnormalDetectioon.xlsx [[Google Drive]](https://docs.google.com/spreadsheets/d/1NqTQZCkv74Sn_iOJOKRc-QnJzpaWmnzC_lET_0ZreiQ/edit?usp=sharing) | [[Tencent Docs]](
https://docs.qq.com/document/DQ2hqQ2N2ZGlOUWlT)
**Note**: For more detail on data preparation, please refer to our online documentation in **Model Example** under the section of **FOR USER**.
## Running Example
**How to run:** After successfully downloading, run this instruction on **command line / Jupyter Notebook / Google Colab** whatever directory it is.
### Case 1: Run with built-in data set for testing
On command line:
```
geochemistrypi data-mining
```
On Jupyter Notebook / Google Colab:
```
!geochemistrypi data-mining
```
**Note**: There are four built-in data sets corresponding to four kinds of model pattern.
### Case 2: Run with your own data set without model inference
On command line:
```
geochemistrypi data-mining --data your_own_data_set.xlsx
```
On Jupyter Notebook / Google Colab:
```
!geochemistrypi data-mining --data your_own_data_set.xlsx
```
**Note**: Currently, `.xlsx` and `.csv` files are supported. Please specify the path your data file exists. For Google Colab, don't forget to upload your dataset first.
### Case 3: Implement model inference on application data
On command line:
```
geochemistrypi data-mining --training your_own_training_data.xlsx --application your_own_application_data.xlsx
```
On Jupyter Notebook / Google Colab:
```
!geochemistrypi data-mining --training your_own_training_data.xlsx --application your_own_application_data.xlsx
```
**Note**: Please make sure the column names (data schema) in both training data file and application data file are the same. Because the operations you perform via our software on the training data will be record automatically and subsequently applied to the application data in the same order.
The training data in our pipeline will be divided into the train set and test set used for training the ML model and evaluating the model's performance. The score includes two types. The first type is the scores from the prediction on the test set while the second type is cv scores from the cross validation on the train set.
### Case 4: Activate MLflow web interface
On command line:
```
geochemistrypi data-mining --mlflow
```
On Jupyter Notebook / Google Colab:
```
!geochemistrypi data-mining --mlflow
```
**Note**: Once you run our software, there are two folders (`geopi_output` and `geopi_tracking`) generated automatically. Make sure the directory where you execute using the above command should have the genereted file `geopi_tracking`.
Copy the URL shown on the console into any browser to open the MLflow web interface. The URL is normally like this http://127.0.0.1:5000. Search MLflow online to see more operations and usages.
For more details: Please refer to:
- Geochemistry π - Download and Run the Beta Version [[Bilibili]](https://www.bilibili.com/video/BV1UM4y1Q7Ju/?spm_id_from=333.999.0.0&vd_source=27944ab3b73a78970c1a52a5dcbb9140) | [[YouTube]](https://www.youtube.com/watch?v=EeVaJ3H7_AU&list=PLy8hNsI55lvh1UHjhVhqNUj3xPdV9sEiM&index=9)
- MLflow UI user guide - Geochemistry π v0.5.0 [[Bilibili]](https://b23.tv/CW5Rjmo) | [[YouTube]](https://www.youtube.com/watch?v=Yu1nzNeLfRY)
The following screenshot shows the downloads and launching of our software on macOS:
<p align="center">
<img src="https://github.com/ZJUEarthData/geochemistrypi/assets/47497750/70728795-59b7-4741-ab5b-9e63d284ad37" alt="Downloads and Launching on macOS" width="450" />
</p>
## Roadmap
### First Phase
It works as a **software application** with a command-line interface (CLI) to automate **data mining** process with frequently-used **machine learning algorithms** and **statistical analysis methods**, which would further lower the threshold for the geochemists.
The highlight is that through choosing **simple number options**, the users are able to implement a full cycle of data mining **without knowledge of** SciPy, NumPy, Pandas, Scikit-learn, FLAML, Ray packages.
The following figure is the activity diagram of automated ML pipeline in Geochemistry π:
<img src="https://github.com/ZJUEarthData/geochemistrypi/assets/47497750/d7b45a7d-4c6d-472b-9498-c9ccb992212e" />
Its data section provides feature engineering based on **arithmatic operation**. It allows the users to have a statistic analysis on the data set as well as on the imputation result, which is supported by the combination of **Monte Carlo simulation** and **hypothesis testing**.
Its models section provides both **supervised learning** and **unsupervised learning** methods from **Scikit-learn** framework, including four types of algorithms, regression, classification, clustering, and dimensional reduction. Integrated with **FLAML** and **Ray** framework, it allows the users to run AutoML easily, fastly and cost-effectively on the built-in supervised learning algorithms in our framework.
The following figure is the hierarchical architecture of Geochemistry π:
<p align="center">
<img src="https://github.com/ZJUEarthData/geochemistrypi/assets/47497750/9c3ddc2b-700c-4685-b52f-f5f9a8931849" alt="Hierarchical Architecture" width="450" />
</p>
### Second Phase
Currently, we are building three access ways to provide more user-friendly service, including **web portal**, **CLI package** and **API**. It allows the user to perform **continuous training** and **model inference** by automating the ML pipeline and **machine learning lifecycle management** by unique storage mechanism in different access layers.
The following figure is the system architecture diagram: <br>
![System Architecture Diagram](https://github.com/ZJUEarthData/geochemistrypi/assets/47497750/20b5a2a4-f2de-492d-a2df-9282196d8c4f)
The following figure is the customized automated ML pipeline: <br>
<p align="center">
<img src="https://github.com/ZJUEarthData/geochemistrypi/assets/47497750/6275472d-9628-4df5-b4a3-c58a65cfc346" alt="Customized automated ML pipeline" width="400" />
</p>
The following figure is the design pattern hierarchical architecture: <br>
![Design Pattern](https://github.com/ZJUEarthData/geochemistrypi/assets/47497750/aa84ab12-c95e-4282-a60e-64ba2858c437)
![Workflow Object](https://github.com/ZJUEarthData/geochemistrypi/assets/47497750/f08885bf-1bec-4045-bf6b-82c5c18d3f8f)
The following figure is the storage mechanism: <br>
<p align="center">
<img src="https://github.com/ZJUEarthData/geochemistrypi/assets/47497750/401f3429-c44f-4b76-b085-7a9dcc987cde" alt="Storage Mechanism" width="500" />
</p>
The whole package is under construction and the documentation is progressively evolving.
## Geochemistry π Mind Map
[→ Click here for more details](https://docs.qq.com/mind/DZnhoa2NPamFYZHR6?u=40ac0718eb494b008b2f072197ea95db)
![Geochemistry π.png](https://github.com/ZJUEarthData/geochemistrypi/assets/97781484/e77b1f11-41ab-4354-9064-6d62cc1bf1e4)
## Team Info
**Leader:**
+ Can He (Sany, National University of Singapore, Singapore)
Email: sanyhew1097618435@163.com
**Technical Group:**
+ Jianming Zhao (Jamie, Zhejiang University, China)
+ Jianhao Sun (Jin, China University of Geosciences, Wuhan, China)
+ Yongkang Chan (Kill-virus, Lanzhou University, China)
+ Mengying Ye (Mary, Jilin University, China)
+ Mengqi Gao (China University of Geosciences, Beijing, China)
+ Chengtu Li(Trenki, Henan Polytechnic University, Beijing, China)
+ Yucheng Yan (Andy, University of Sydney, Australia)
**Product Group**:
+ Yang Lyu (Daisy, Zhejiang University, China)
+ Bailun Jiang (EPSI / Lille University, France)
+ Ruitao Chang (China University of Geosciences Beijing, China)
+ Panyan Weng (The University of Sydney, Australia)
+ Siqi Yao (Clara, Dongguan University of Technology, China)
+ Zhelan Lin(Lan, Fuzhou University, China)
+ ShuYi Li (Communication University Of China, Beijing, China)
+ Junbo Wang (China University Of Geosciences, Beijing, China)
+ Haibin Wang(Watson, University of Sydney, Australia)
## Join Us :)
**The recruitment of research interns is ongoing !!!**
**Key Point: All things are done online, remote work (\*^▽^\*)**
**What can you learn?**
+ Learning the full cycle of data mining (Scikit-learn, Ray, Mlflow) on tabular data, including the algorithms in regression,classification, clustering, and decomposition.
+ Learning to be a qualified Python developer, including any Python programing contents towards data mining, basic software engineering techniques like frontend (React, Typescript, Ant Design scaffold) and backend (SQL & NoSQL database, RESFful API, FastAPI) development, and cooperation tools like Git.
**What can you get?**
+ Research internship proof and reference letter after working for >> 100 hours.
+ Chance to pay a visit to Hangzhou, China, sponsored by ZJU Earth Data.
+ Chance to be guided by the experts from IT companies in Silicon Valley and Hangzhou.
+ Bonus depending on your performance.
**Current Working Pattern:**
+ Online working and cooperation
+ Three weeks per working cycle -> One online meeting per working cycle
+ One cycle report (see below) per cycle - 5 mins to finish
Even if you are not familiar with topics above, but if you are interested in and have plenty of time to do it. That's enough. We have a full-developed training system to help you, as a newbie of data mining or Python developer, learn steps by steps with seniors until you can make a significant contribution to our project.
**More details about the project?**
Please refer to:
English Page: https://person.zju.edu.cn/en/zhangzhou
Chinese Page: https://person.zju.edu.cn/zhangzhou#0
**Do you want to contribute to this open-source program?**
Contact with your CV: sanyhew1097618435@163.com
## In-house Materials
Materials are in both Chinese and English. Others unshown below are internal materials.
1. [Guideline Manual – Geochemistry π (International - Google drive)](https://docs.google.com/document/d/1LjwB5Lazk33E5vbtnFPJio_MyjYQxjEu/edit?usp=sharing&ouid=110717816678586054594&rtpof=true&sd=true)
2. [Guideline Manual – Geochemistry π (China - Tencent Docs)](https://docs.qq.com/doc/DQ21IZUdVQktqRWpm?&u=6868f96d4a384b309036e04e637e367a)
3. [Learning Steps for Newbies – Geochemistry π (International - Google drive)](https://docs.google.com/document/d/1GQO-SXwEx_8midr362pqfxNZtfUf-nA6/edit?usp=sharing&ouid=110717816678586054594&rtpof=true&sd=true)
4. [Learning Steps for Newbies - Geochemistry π (China - Tencent Docs)](https://docs.qq.com/doc/DTlVEakt2WnJrdkN1?&u=6868f96d4a384b309036e04e637e367a)
5. [Code Specification v2.1.2 - Geochemistry π (International - Google drive)](https://drive.google.com/file/d/12UPrGqrj9hl0_vK8r-m6xykh_6052OtI/view?usp=sharing)
6. [Code Specification v2.1.2 - Geochemistry π (China - Tencent Docs)](https://docs.qq.com/pdf/DQ2pmc1l1Z2t3QVFa?&u=6868f96d4a384b309036e04e637e367a)
7. [Cycle Report - Geochemistry π (International - Google drive)](https://drive.google.com/file/d/1JPZoSLcPRqzu6LDvw8wLQkV2GfJoER51/view?usp=sharing)
8. [Cycle Report - Geochemistry π (China - Tencent Docs)](https://docs.qq.com/pdf/DQ25VSGNlbGx4UkFZ?&u=6868f96d4a384b309036e04e637e367a)
## In-house Videos
Technical record videos are on Bilibili and Youtube synchronously while other meeting videos are internal materials.
More Videos will be recorded soon.
1. [ZJU_Earth_Data Introduction (Geochemical Data, Python, Geochemistry π) - Prof. Zhang](https://www.bilibili.com/video/BV1Lf4y1w7EK?spm_id_from=333.999.0.0)
2. [How to Collaborate and Provide Bug Report on Geochemistry π Through GitHub - Can He (Sany)](https://www.youtube.com/watch?v=1DWoEsqsfvQ&list=PLy8hNsI55lvh1UHjhVhqNUj3xPdV9sEiM&index=3)
3. [Geochemistry π - Download and Run the Beta Version](https://www.youtube.com/watch?v=EeVaJ3H7_AU&list=PLy8hNsI55lvh1UHjhVhqNUj3xPdV9sEiM&index=9)
4. [How to Create and Use Virtual Environment on Geochemistry π - Can He (Sany)](https://www.youtube.com/watch?v=4KFi7OXxD-c&list=PLy8hNsI55lvh1UHjhVhqNUj3xPdV9sEiM&index=4)
5. [How to use Github-Desktop in conflict resolution - Qiuhao Zhao (Brad)](https://www.youtube.com/watch?v=KT1g5JpuUVI&list=PLy8hNsI55lvh1UHjhVhqNUj3xPdV9sEiM)
6. [Virtual Environment & Packages On Windows - Jianming Zhao (Jamie)](https://www.youtube.com/watch?v=e4VqSBuNp_o&list=PLy8hNsI55lvh1UHjhVhqNUj3xPdV9sEiM&index=2)
7. [Git Workflow & Coordinating Synchronization - Jianming Zhao (Jamie)](https://www.bilibili.com/video/BV1Sa4y1f74k?spm_id_from=333.999.0.0&vd_source=9adcf2c5fdeffe1d11c89d441ef598ba)
## Contributors
+ Shengxin Wang (Samson, Lanzhou University, China)
+ Wenyu Zhao (Molly, Zhejiang University, China)
+ Qiuhao Zhao (Brad, Zhejiang University, China)
+ Kaixin Zheng (Hayne, Sun Yat-sen University, China)
+ Anzhou Li (Andrian, Zhejiang University, China)
+ Keran Li (Kirk, Chengdu University of Technology, China)
+ Dan Hu (Notre Dame University, United States)
+ Xunxin Liu (Tante, China University of Geosciences, Wuhan, China)
+ Fang Li (liv, Shenzhen University, China)
+ Xin Li (The University of Manchester, United Kingdom)
+ Ting Liu (Kira, Sun Yat-sen University, China)
+ Xirui Zhu (Rae, University of York, United Kingdom)
+ Aixiwake·Janganuer (Ayshuak, Sun Yat-sen University, China)
+ Zhenglin Xu (Garry, Jilin University, China)
+ Jianing Wang (National University of Singapore, Singapore)
+ Junchi Liao(Roceda, University of Electronic Science and Technology of China, China)
Raw data
{
"_id": null,
"home_page": null,
"name": "geochemistrypi",
"maintainer": null,
"docs_url": null,
"requires_python": "~=3.9",
"maintainer_email": "Can He <sanyhew1097618435@163.com>, Jianhao Sun <sjh20171502@gmail.com>, Jianming Zhao <zhaojianming@zju.edu.cn>, Yongkang Shan <kk1361207571@163.com>, Mengqi Gao <2534671415@qq.com>",
"keywords": "Automated, Continuous Training, Data Mining, Geochemistry Discovery, Geochemistry \u03c0, MLOps, Machine Learning, Machine Learning Lifecycle Management, Model Inference",
"author": null,
"author_email": "Can He <sanyhew1097618435@163.com>, Jianhao Sun <sjh20171502@gmail.com>, Jianming Zhao <zhaojianming@zju.edu.cn>, Yang Lyu <lyuyang1007@zju.edu.cn>",
"download_url": "https://files.pythonhosted.org/packages/dd/4d/6eeafa969578e7af394801871fa5ebc415e49d2192cc3dda85fd9d0722d5/geochemistrypi-0.6.1.tar.gz",
"platform": null,
"description": "<p>\n<img src=\"https://github.com/ZJUEarthData/geochemistrypi/assets/47497750/6f970708-7811-40bb-8172-9b98563c7659\" class=\"center\"/>\n</p>\n<p align=\"center\">\n<img src=\"https://img.shields.io/github/actions/workflow/status/ZJUEarthData/geochemistrypi/geochemistrypi.yml?logo=github\">\n<img src=\"https://img.shields.io/github/license/ZJUEarthData/geochemistrypi\">\n<img src=\"https://img.shields.io/github/v/release/ZJUEarthData/geochemistrypi?include_prereleases\">\n<img src=\"https://static.pepy.tech/personalized-badge/geochemistrypi?period=total&units=international_system&left_color=grey&right_color=green&left_text=Downloads\">\n<img src=\"https://img.shields.io/pypi/pyversions/geochemistrypi\">\n</p>\n\n---\n\n**Documentation**: <a href=\"https://geochemistrypi.readthedocs.io\" target=\"_blank\">https://geochemistrypi.readthedocs.io</a>\n\n**Source Code**: <a href=\"https://github.com/ZJUEarthData/geochemistrypi\" target=\"_blank\">https://github.com/ZJUEarthData/geochemistrypi</a>\n\n---\n\nGeochemistry \u03c0 is an **open-sourced highly automated machine learning Python framework** dedicating to build up MLOps level 1 software product for data-driven geochemistry discovery on tabular data.\n\nCore capabilities are:\n\n+ **Continous Training**\n+ **Machine Learning Lifecycle Management**\n+ **Model Inference**\n\nKey features are:\n\n+ **Easy to use:** The automation of data mining process provides the users with simple number options to choose.\n+ **Extensible:** It allows appending new algorithms through Scikit-learn with automatic hyper parameter searching by FLAML and Ray.\n+ **Traceable**: It integrates MLflow to build special storage mechanism to streamline the end-to-end machine learning lifecycle.\n\nLatest Update: follow up by clicking `Starred` and `Watch` on our [GitHub repository](https://github.com/ZJUEarthData/geochemistrypi), then get email notifications of the newest features automatically.\n\nThe following figure is the simplified overview of Geochemistry \u03c0: <br>\n\n<p align=\"center\">\n <img src=\"https://github.com/ZJUEarthData/geochemistrypi/assets/47497750/28e174f0-1f2f-4367-96bd-9526352101bd\" alt=\"Overview of workflow\" width=\"600\" />\n</p>\n\nThe following figure is the frontend-backend separation architecture of Geochemistry: <br>\n\n<p align=\"center\">\n <img src=\"https://github.com/ZJUEarthData/geochemistrypi/assets/47497750/3b27cbdb-ff50-4fa6-b1d1-4c75b253fdff\" alt=\"Frontend-backend separation architecture of Geochemistry\" width=\"450\" />\n</p>\n\n\n**If the software contributes to your research, cite the work as :**\n\nZhangZhou J\\*, He Can\\*, Sun Jianhao, Zhao Jianming, Lyu Yang, Wang Shengxin, Zhao Wenyu, Li Anzhou, Ji Xiaohui. Geochemistry \u03c0: Automated machine learning python framework for tabular data (2024). Geochemistry, Geophysics,\nGeosystems, 25, e2023GC011324\n\nDownload link: https://agupubs.onlinelibrary.wiley.com/doi/10.1029/2023GC011324\n\n**Related report:**\n\nGeochemistry \u03c0 was selected for featuring\u00a0as\u00a0an Editor\u2019s\u00a0Highlight in EOS magazine by American Geophysical Union (fewer than 2 percent of paper are selected) and quoted in Geochemical NEWS by Geochemical Society.\n\nEos Website:\u00a0https://eos.org/editor-highlights/machine-learning-for-geochemists-who-dont-want-to-code.\n\n## Quick Installation\n\nOur software is well tested on **macOS** and **Windows** system with **Python 3.9**. Other systems and Python version are not guranteed.\n\nOne instruction to download on **command line**, such as Terminal on macOS, Power Shell on Windows.\n\n```\npip install geochemistrypi\n```\n\nDownload the latest version to avoid some old version issues, such as dependency downloading.\n```\npip install \"geochemistrypi==0.6.1\"\n```\n\nOne instruction to download on **Jupyter Notebook** or **Google Colab**.\n\n```\n!pip install geochemistrypi\n```\nDownload the latest version to avoid some old version issues, such as dependency downloading.\n```\n!pip install \"geochemistrypi==0.6.1\"\n```\nCheck the downloaded version of our software:\n\n```\ngeochemistrypi --version\n```\n\n**Note**: For more detail on installation, please refer to our online documentation in **Installation Manual** under the section of **FOR USER**. Over there, we highly recommend to use virtual environment (Conda) to avoid dependency version problems.\n\n## Quick Update\n\nOne instruction to update the software to the latest version on **command line**, such as Terminal on macOS, Power Shell on Windows.\n\n```\npip install --upgrade geochemistrypi\n```\n\nOne instruction to download on **Jupyter Notebook** or **Google Colab**.\n\n```\n!pip install --upgrade geochemistrypi\n```\n\nCheck the updated version of our software:\n\n```\ngeochemistrypi --version\n```\n\n## Data Preparation\n\nIn order to utilize the functions provided by our software, your own data set should satisfy:\n\n- be with the suffix **.xlsx** or **.csv**, which is supported by Microsoft Excel.\n- be comprise of location information **LATITUDE** and **LONGITUDE**, two columns respectively. It is optional.\n\nIf you want to run **classification** algorithm, you data set should satisfy:\n\n- a label column. You can name it as you wish, such as **Label**.\n\nColumn name specification:\n\n- No restriction on the column names. You can name them as you want except for two special and optional column **LATITUDE** and **LONGITUDE**.\n\n- every column can only one column name. Multi level column names are not allowed.\n\n- Between two columns with values, a completed void column can exists.\n\nThe following are seven built-in data sets in our software stored on Google Drive and Tecent Docs, have a look on them. For the algorithm you intend to run, you can refer to the data format of the corresponding dataset.\n\n+ Data_Regression.xlsx [[Google Drive]](https://docs.google.com/spreadsheets/d/13MB4t_2PiZ90tTMJKw7HcBUi2sb3tXej/edit?usp=sharing&ouid=110717816678586054594&rtpof=true&sd=true) | [[Tencent Docs]](https://docs.qq.com/document/DQ3VmdWZCTGV3bmpM?&u=6868f96d4a384b309036e04e637e367a)\n\n+ ApplicationData_Regression.xlsx [[Google Drive]](https://docs.google.com/spreadsheets/d/1FCek2OOYQD887jfQz21g0ovqVuUJIjVoNI77D-Ufr9Y/edit?usp=sharing) | [[Tencent Docs]](\nhttps://docs.qq.com/document/DQ3BDeHhxRGNzSXZN)\n\n+ Data_Classification.xlsx [[Google Drive]](https://docs.google.com/spreadsheets/d/1xFBCYVmtZfuEAbeBljUlzqBjxVuLAt8x/edit?usp=sharing&ouid=110717816678586054594&rtpof=true&sd=true) | [[Tencent Docs]](https://docs.qq.com/document/DQ0JUaUFsZnRaZkNG?&u=6868f96d4a384b309036e04e637e367a)\n\n+ ApplicationData_Classification.xlsx [[Google Drive]](https://docs.google.com/spreadsheets/d/1J7QvdvbbHJMlKtiumBgKDW7ALghfQQZyKGEoOqhKQjw/edit?usp=sharing) | [[Tencent Docs]](https://docs.qq.com/document/DQ2dnQWtubHRBTGtB)\n\n+ Data_Clustering.xlsx [[Google Drive]](https://docs.google.com/spreadsheets/d/1sbuJdOzGNQ2Pk-bVURfPYg1rltyBbn5J/edit?usp=sharing&ouid=110717816678586054594&rtpof=true&sd=true) | [[Tencent Docs]](https://docs.qq.com/document/DQ3dKdGtlWkhZS2xR?&u=6868f96d4a384b309036e04e637e367a)\n\n+ Data_Decomposition.xlsx [[Google Drive]](https://docs.google.com/spreadsheets/d/1kix82qj5--vhnm8-KhuUBH9dqYH6zcY8/edit?usp=sharing&ouid=110717816678586054594&rtpof=true&sd=true) | [[Tencent Docs]](https://docs.qq.com/document/DQ29oZ0lhUGtZUmdN?&u=6868f96d4a384b309036e04e637e367a)\n\n+ Data_AbnormalDetectioon.xlsx [[Google Drive]](https://docs.google.com/spreadsheets/d/1NqTQZCkv74Sn_iOJOKRc-QnJzpaWmnzC_lET_0ZreiQ/edit?usp=sharing) | [[Tencent Docs]](\nhttps://docs.qq.com/document/DQ2hqQ2N2ZGlOUWlT)\n\n**Note**: For more detail on data preparation, please refer to our online documentation in **Model Example** under the section of **FOR USER**.\n\n## Running Example\n\n**How to run:** After successfully downloading, run this instruction on **command line / Jupyter Notebook / Google Colab** whatever directory it is.\n\n### Case 1: Run with built-in data set for testing\n\nOn command line:\n\n```\ngeochemistrypi data-mining\n```\n\nOn Jupyter Notebook / Google Colab:\n\n```\n!geochemistrypi data-mining\n```\n\n**Note**: There are four built-in data sets corresponding to four kinds of model pattern.\n\n### Case 2: Run with your own data set without model inference\n\nOn command line:\n\n```\ngeochemistrypi data-mining --data your_own_data_set.xlsx\n```\n\nOn Jupyter Notebook / Google Colab:\n\n```\n!geochemistrypi data-mining --data your_own_data_set.xlsx\n```\n\n**Note**: Currently, `.xlsx` and `.csv` files are supported. Please specify the path your data file exists. For Google Colab, don't forget to upload your dataset first.\n\n### Case 3: Implement model inference on application data\n\nOn command line:\n\n```\ngeochemistrypi data-mining --training your_own_training_data.xlsx --application your_own_application_data.xlsx\n```\n\nOn Jupyter Notebook / Google Colab:\n\n```\n!geochemistrypi data-mining --training your_own_training_data.xlsx --application your_own_application_data.xlsx\n```\n\n**Note**: Please make sure the column names (data schema) in both training data file and application data file are the same. Because the operations you perform via our software on the training data will be record automatically and subsequently applied to the application data in the same order.\n\nThe training data in our pipeline will be divided into the train set and test set used for training the ML model and evaluating the model's performance. The score includes two types. The first type is the scores from the prediction on the test set while the second type is cv scores from the cross validation on the train set.\n\n### Case 4: Activate MLflow web interface\n\nOn command line:\n\n```\ngeochemistrypi data-mining --mlflow\n```\n\nOn Jupyter Notebook / Google Colab:\n\n```\n!geochemistrypi data-mining --mlflow\n```\n\n**Note**: Once you run our software, there are two folders (`geopi_output` and `geopi_tracking`) generated automatically. Make sure the directory where you execute using the above command should have the genereted file `geopi_tracking`.\n\nCopy the URL shown on the console into any browser to open the MLflow web interface. The URL is normally like this http://127.0.0.1:5000. Search MLflow online to see more operations and usages.\n\nFor more details: Please refer to:\n\n- Geochemistry \u03c0 - Download and Run the Beta Version [[Bilibili]](https://www.bilibili.com/video/BV1UM4y1Q7Ju/?spm_id_from=333.999.0.0&vd_source=27944ab3b73a78970c1a52a5dcbb9140) | [[YouTube]](https://www.youtube.com/watch?v=EeVaJ3H7_AU&list=PLy8hNsI55lvh1UHjhVhqNUj3xPdV9sEiM&index=9)\n\n- MLflow UI user guide - Geochemistry \u03c0 v0.5.0 [[Bilibili]](https://b23.tv/CW5Rjmo) | [[YouTube]](https://www.youtube.com/watch?v=Yu1nzNeLfRY)\n\nThe following screenshot shows the downloads and launching of our software on macOS:\n\n<p align=\"center\">\n <img src=\"https://github.com/ZJUEarthData/geochemistrypi/assets/47497750/70728795-59b7-4741-ab5b-9e63d284ad37\" alt=\"Downloads and Launching on macOS\" width=\"450\" />\n</p>\n\n## Roadmap\n\n### First Phase\n\nIt works as a **software application** with a command-line interface (CLI) to automate **data mining** process with frequently-used **machine learning algorithms** and **statistical analysis methods**, which would further lower the threshold for the geochemists.\n\nThe highlight is that through choosing **simple number options**, the users are able to implement a full cycle of data mining **without knowledge of** SciPy, NumPy, Pandas, Scikit-learn, FLAML, Ray packages.\n\nThe following figure is the activity diagram of automated ML pipeline in Geochemistry \u03c0:\n\n<img src=\"https://github.com/ZJUEarthData/geochemistrypi/assets/47497750/d7b45a7d-4c6d-472b-9498-c9ccb992212e\" />\n\nIts data section provides feature engineering based on **arithmatic operation**. It allows the users to have a statistic analysis on the data set as well as on the imputation result, which is supported by the combination of **Monte Carlo simulation** and **hypothesis testing**.\n\nIts models section provides both **supervised learning** and **unsupervised learning** methods from **Scikit-learn** framework, including four types of algorithms, regression, classification, clustering, and dimensional reduction. Integrated with **FLAML** and **Ray** framework, it allows the users to run AutoML easily, fastly and cost-effectively on the built-in supervised learning algorithms in our framework.\n\nThe following figure is the hierarchical architecture of Geochemistry \u03c0:\n\n<p align=\"center\">\n <img src=\"https://github.com/ZJUEarthData/geochemistrypi/assets/47497750/9c3ddc2b-700c-4685-b52f-f5f9a8931849\" alt=\"Hierarchical Architecture\" width=\"450\" />\n</p>\n\n### Second Phase\n\nCurrently, we are building three access ways to provide more user-friendly service, including **web portal**, **CLI package** and **API**. It allows the user to perform **continuous training** and **model inference** by automating the ML pipeline and **machine learning lifecycle management** by unique storage mechanism in different access layers.\n\nThe following figure is the system architecture diagram: <br>\n\n![System Architecture Diagram](https://github.com/ZJUEarthData/geochemistrypi/assets/47497750/20b5a2a4-f2de-492d-a2df-9282196d8c4f)\n\nThe following figure is the customized automated ML pipeline: <br>\n\n<p align=\"center\">\n <img src=\"https://github.com/ZJUEarthData/geochemistrypi/assets/47497750/6275472d-9628-4df5-b4a3-c58a65cfc346\" alt=\"Customized automated ML pipeline\" width=\"400\" />\n</p>\n\nThe following figure is the design pattern hierarchical architecture: <br>\n\n![Design Pattern](https://github.com/ZJUEarthData/geochemistrypi/assets/47497750/aa84ab12-c95e-4282-a60e-64ba2858c437)\n![Workflow Object](https://github.com/ZJUEarthData/geochemistrypi/assets/47497750/f08885bf-1bec-4045-bf6b-82c5c18d3f8f)\n\nThe following figure is the storage mechanism: <br>\n\n<p align=\"center\">\n <img src=\"https://github.com/ZJUEarthData/geochemistrypi/assets/47497750/401f3429-c44f-4b76-b085-7a9dcc987cde\" alt=\"Storage Mechanism\" width=\"500\" />\n</p>\n\nThe whole package is under construction and the documentation is progressively evolving.\n\n## Geochemistry \u03c0 Mind Map\n\n[\u2192 Click here for more details](https://docs.qq.com/mind/DZnhoa2NPamFYZHR6?u=40ac0718eb494b008b2f072197ea95db)\n\n![Geochemistry \u03c0.png](https://github.com/ZJUEarthData/geochemistrypi/assets/97781484/e77b1f11-41ab-4354-9064-6d62cc1bf1e4)\n\n## Team Info\n\n**Leader:**\n\n+ Can He (Sany, National University of Singapore, Singapore)\n Email: sanyhew1097618435@163.com\n\n**Technical Group:**\n\n+ Jianming Zhao (Jamie, Zhejiang University, China)\n+ Jianhao Sun (Jin, China University of Geosciences, Wuhan, China)\n+ Yongkang Chan (Kill-virus, Lanzhou University, China)\n+ Mengying Ye (Mary, Jilin University, China)\n+ Mengqi Gao (China University of Geosciences, Beijing, China)\n+ Chengtu Li\uff08Trenki, Henan Polytechnic University, Beijing, China\uff09\n+ Yucheng Yan (Andy, University of Sydney, Australia)\n\n**Product Group**:\n\n+ Yang Lyu (Daisy, Zhejiang University, China)\n+ Bailun Jiang (EPSI / Lille University, France)\n+ Ruitao Chang (China University of Geosciences Beijing, China)\n+ Panyan Weng (The University of Sydney, Australia)\n+ Siqi Yao (Clara, Dongguan University of Technology, China)\n+ Zhelan Lin\uff08Lan, Fuzhou University, China\uff09\n+ ShuYi Li (Communication University Of China, Beijing, China)\n+ Junbo Wang (China University Of Geosciences, Beijing, China)\n+ Haibin Wang\uff08Watson, University of Sydney, Australia\uff09\n\n## Join Us :)\n\n**The recruitment of research interns is ongoing !!!**\n\n**Key Point: All things are done online, remote work (\\*^\u25bd^\\*)**\n\n**What can you learn?**\n\n+ Learning the full cycle of data mining (Scikit-learn, Ray, Mlflow) on tabular data, including the algorithms in regression,classification, clustering, and decomposition.\n+ Learning to be a qualified Python developer, including any Python programing contents towards data mining, basic software engineering techniques like frontend (React, Typescript, Ant Design scaffold) and backend (SQL & NoSQL database, RESFful API, FastAPI) development, and cooperation tools like Git.\n\n**What can you get?**\n\n+ Research internship proof and reference letter after working for >> 100 hours.\n+ Chance to pay a visit to Hangzhou, China, sponsored by ZJU Earth Data.\n+ Chance to be guided by the experts from IT companies in Silicon Valley and Hangzhou.\n+ Bonus depending on your performance.\n\n**Current Working Pattern:**\n\n+ Online working and cooperation\n+ Three weeks per working cycle -> One online meeting per working cycle\n+ One cycle report (see below) per cycle - 5 mins to finish\n\nEven if you are not familiar with topics above, but if you are interested in and have plenty of time to do it. That's enough. We have a full-developed training system to help you, as a newbie of data mining or Python developer, learn steps by steps with seniors until you can make a significant contribution to our project.\n\n**More details about the project?**\nPlease refer to:\nEnglish Page: https://person.zju.edu.cn/en/zhangzhou\nChinese Page: https://person.zju.edu.cn/zhangzhou#0\n\n**Do you want to contribute to this open-source program?**\nContact with your CV: sanyhew1097618435@163.com\n\n## In-house Materials\n\nMaterials are in both Chinese and English. Others unshown below are internal materials.\n\n1. [Guideline Manual \u2013 Geochemistry \u03c0 (International - Google drive)](https://docs.google.com/document/d/1LjwB5Lazk33E5vbtnFPJio_MyjYQxjEu/edit?usp=sharing&ouid=110717816678586054594&rtpof=true&sd=true)\n2. [Guideline Manual \u2013 Geochemistry \u03c0 (China - Tencent Docs)](https://docs.qq.com/doc/DQ21IZUdVQktqRWpm?&u=6868f96d4a384b309036e04e637e367a)\n3. [Learning Steps for Newbies \u2013 Geochemistry \u03c0 (International - Google drive)](https://docs.google.com/document/d/1GQO-SXwEx_8midr362pqfxNZtfUf-nA6/edit?usp=sharing&ouid=110717816678586054594&rtpof=true&sd=true)\n4. [Learning Steps for Newbies - Geochemistry \u03c0 (China - Tencent Docs)](https://docs.qq.com/doc/DTlVEakt2WnJrdkN1?&u=6868f96d4a384b309036e04e637e367a)\n5. [Code Specification v2.1.2 - Geochemistry \u03c0 (International - Google drive)](https://drive.google.com/file/d/12UPrGqrj9hl0_vK8r-m6xykh_6052OtI/view?usp=sharing)\n6. [Code Specification v2.1.2 - Geochemistry \u03c0 (China - Tencent Docs)](https://docs.qq.com/pdf/DQ2pmc1l1Z2t3QVFa?&u=6868f96d4a384b309036e04e637e367a)\n7. [Cycle Report - Geochemistry \u03c0 (International - Google drive)](https://drive.google.com/file/d/1JPZoSLcPRqzu6LDvw8wLQkV2GfJoER51/view?usp=sharing)\n8. [Cycle Report - Geochemistry \u03c0 (China - Tencent Docs)](https://docs.qq.com/pdf/DQ25VSGNlbGx4UkFZ?&u=6868f96d4a384b309036e04e637e367a)\n\n## In-house Videos\n\nTechnical record videos are on Bilibili and Youtube synchronously while other meeting videos are internal materials.\nMore Videos will be recorded soon.\n\n1. [ZJU_Earth_Data Introduction (Geochemical Data, Python, Geochemistry \u03c0) - Prof. Zhang](https://www.bilibili.com/video/BV1Lf4y1w7EK?spm_id_from=333.999.0.0)\n2. [How to Collaborate and Provide Bug Report on Geochemistry \u03c0 Through GitHub - Can He (Sany)](https://www.youtube.com/watch?v=1DWoEsqsfvQ&list=PLy8hNsI55lvh1UHjhVhqNUj3xPdV9sEiM&index=3)\n3. [Geochemistry \u03c0 - Download and Run the Beta Version](https://www.youtube.com/watch?v=EeVaJ3H7_AU&list=PLy8hNsI55lvh1UHjhVhqNUj3xPdV9sEiM&index=9)\n4. [How to Create and Use Virtual Environment on Geochemistry \u03c0 - Can He (Sany)](https://www.youtube.com/watch?v=4KFi7OXxD-c&list=PLy8hNsI55lvh1UHjhVhqNUj3xPdV9sEiM&index=4)\n5. [How to use Github-Desktop in conflict resolution - Qiuhao Zhao (Brad)](https://www.youtube.com/watch?v=KT1g5JpuUVI&list=PLy8hNsI55lvh1UHjhVhqNUj3xPdV9sEiM)\n6. [Virtual Environment & Packages On Windows - Jianming Zhao (Jamie)](https://www.youtube.com/watch?v=e4VqSBuNp_o&list=PLy8hNsI55lvh1UHjhVhqNUj3xPdV9sEiM&index=2)\n7. [Git Workflow & Coordinating Synchronization - Jianming Zhao (Jamie)](https://www.bilibili.com/video/BV1Sa4y1f74k?spm_id_from=333.999.0.0&vd_source=9adcf2c5fdeffe1d11c89d441ef598ba)\n\n## Contributors\n\n+ Shengxin Wang (Samson, Lanzhou University, China)\n+ Wenyu Zhao (Molly, Zhejiang University, China)\n+ Qiuhao Zhao (Brad, Zhejiang University, China)\n+ Kaixin Zheng (Hayne, Sun Yat-sen University, China)\n+ Anzhou Li (Andrian, Zhejiang University, China)\n+ Keran Li (Kirk, Chengdu University of Technology, China)\n+ Dan Hu (Notre Dame University, United States)\n+ Xunxin Liu (Tante, China University of Geosciences, Wuhan, China)\n+ Fang Li (liv, Shenzhen University, China)\n+ Xin Li (The University of Manchester, United Kingdom)\n+ Ting Liu (Kira, Sun Yat-sen University, China)\n+ Xirui Zhu (Rae, University of York, United Kingdom)\n+ Aixiwake\u00b7Janganuer (Ayshuak, Sun Yat-sen University, China)\n+ Zhenglin Xu (Garry, Jilin University, China)\n+ Jianing Wang (National University of Singapore, Singapore)\n+ Junchi Liao(Roceda, University of Electronic Science and Technology of China, China)\n",
"bugtrack_url": null,
"license": "MIT License Copyright (c) 2021 ZJUEarthData Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the \"Software\"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions: The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software. THE SOFTWARE IS PROVIDED \"AS IS\", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.",
"summary": "A highly automated machine learning Python framework dedicating to build up MLOps level 1 software product for data-driven geochemistry discovery",
"version": "0.6.1",
"project_urls": {
"Bug Tracker": "https://github.com/ZJUEarthData/geochemistrypi/issues",
"GitHub": "https://github.com/ZJUEarthData/geochemistrypi",
"Homepage": "https://geochemistrypi.deep-time.org",
"Online Documentation": "https://geochemistrypi.readthedocs.io/en/latest/"
},
"split_keywords": [
"automated",
" continuous training",
" data mining",
" geochemistry discovery",
" geochemistry \u03c0",
" mlops",
" machine learning",
" machine learning lifecycle management",
" model inference"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "da370d2413f53d31a60a439f3ebc13fc505b7cf6ddb15d370b4fe57354d1abd9",
"md5": "2b2bdc8d0d6bf82de81eaeb31990eb92",
"sha256": "ef53132137facdd5e1dc0b6e8d5a8ddb6c7d13d2cb2092a2deb6778afa2947bb"
},
"downloads": -1,
"filename": "geochemistrypi-0.6.1-py3-none-any.whl",
"has_sig": false,
"md5_digest": "2b2bdc8d0d6bf82de81eaeb31990eb92",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": "~=3.9",
"size": 934495,
"upload_time": "2024-07-05T02:14:24",
"upload_time_iso_8601": "2024-07-05T02:14:24.863061Z",
"url": "https://files.pythonhosted.org/packages/da/37/0d2413f53d31a60a439f3ebc13fc505b7cf6ddb15d370b4fe57354d1abd9/geochemistrypi-0.6.1-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "dd4d6eeafa969578e7af394801871fa5ebc415e49d2192cc3dda85fd9d0722d5",
"md5": "4fa7fafdd4edaa2bfec31113fae4c33a",
"sha256": "f4694ab5b27f630baeb3956e646549c31b782f3078c5179686aafdd50626c4de"
},
"downloads": -1,
"filename": "geochemistrypi-0.6.1.tar.gz",
"has_sig": false,
"md5_digest": "4fa7fafdd4edaa2bfec31113fae4c33a",
"packagetype": "sdist",
"python_version": "source",
"requires_python": "~=3.9",
"size": 1468438,
"upload_time": "2024-07-05T02:14:29",
"upload_time_iso_8601": "2024-07-05T02:14:29.679380Z",
"url": "https://files.pythonhosted.org/packages/dd/4d/6eeafa969578e7af394801871fa5ebc415e49d2192cc3dda85fd9d0722d5/geochemistrypi-0.6.1.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-07-05 02:14:29",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "ZJUEarthData",
"github_project": "geochemistrypi",
"travis_ci": false,
"coveralls": false,
"github_actions": true,
"lcname": "geochemistrypi"
}