[![PyPI - Python](https://img.shields.io/badge/python-v3.7+-blue.svg)](https://pypi.org/project/keypartx/)
[![docs](https://img.shields.io/badge/docs-Passing-green.svg)](https://github.com/pengKiina/KeypartX/)
[![PyPI - PyPi](https://img.shields.io/pypi/v/keypartx)](https://pypi.org/project/keypartx/)
[![PyPI - License](https://img.shields.io/badge/license-MIT-green.svg)](https://github.com/pengKiina/KeypartX/blob/main/LICENSE)
[![arXiv](https://img.shields.io/badge/arXiv-2203.05794-<COLOR>.svg)](https://arxiv.org/abs/2209.11844)
# KeypartX
<img src="https://github.com/pengKiina/KeypartX/raw/main/images/keypartx_logo.png" width="40%" height="20%" align="right" />
* No more Topic Modeling
* No need Training
* No more Machine Learning but Human-like Reading
* Get the Insights of Text Big and Small
KeypartX: a graph-based approach to represent perception (text in general) by key parts of speech. KeypartX solved the coherence crux that current topic modeling algorithms are trying to deal with but failed. KeypartX extracts the topics from text corpus syntactically, semantically and pragmatically instead of a meaningless combination of words from topic modeling.
## Key Parts: Noun, Adjective, Verb and Emoji
KeypartX Vs Topic Modeling results from the following text:
``` “Thai food was great we loved it. Thiland also has beautiful beach resorts, we will come to Thailand again👍” ```
* KeypartX Result
<img style="border:1px solid black"
src="https://github.com/pengKiina/KeypartX/raw/main/images/keyparts_sent_network.png" width="40%" height="40%" align="center" />
* Topic Modeling Result
```['food','thailand','resort','great','love', 'beautiful']```
## Installation
```bash
if need coreferee:
pip install keypartx[coreferee_spacy]
python3 -m coreferee install en
python -m spacy download en_core_web_lg
else:
pip install spacy
pip install keypartx
python -m spacy download en_core_web_lg
```
## Getting Started
For an in-depth overview of the features of KeypartX
you can check the [**Documents**](https://medium.com/@egalitrans/topic-modeling-is-dead-long-live-keypartx-a1998a94a0b0) or you can follow along
with one of the examples as follows:
| Name | Link |
|---|---|
| KeypartX Quick Start | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1hjAU-_RP7GGzMm6rnpdJZR7LSY0KS81E?usp=sharing) |
| KeypartX with Real Example | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1VdKIJtMMcYRnXsne87azY7B1FXp9FpD1?usp=sharing) |
| KeypartX VS Topic Modelling | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/14XvylCMBZ2oUnjpZhnf_658paVdvAhp0?usp=sharing) |
| KeypartX Network Comparison | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1vTx9LwIGXt5so5IdOr4zsqrgSm4sHnYe?usp=sharing) |
## Visualization Examples
* 1 NLP Target
Original sentence: """Thai food was great,delicousr and not expensive, we loved it. We visited 3 beach resorts, they are higly recommened... We had "Fire-Vodka" !!!"""
<img src="https://github.com/pengKiina/KeypartX/raw/main/images/nlp_target.png" width="70%" height="35%" align="center" />
* 2 Keyparts Wordclouds
The following wordclouds are generated from a real example of corpus comprised of reviews by those who visit Thailand.
<img src="https://github.com/pengKiina/KeypartX/raw/main/images/keypart_wordcloud.png" width="35%" height="35%" align="center" />
* 3 Community and Gray Perceptual Unit Networks
<img src="https://github.com/pengKiina/KeypartX/raw/main/images/community_gray.gif" width="70%" height="50%" align="center" />
## Citation
To cite the [KeypartX paper](https://arxiv.org/abs/2209.11844), please use the following bibtex reference:
```bibtext
@article{pengyang2022keypartx,
title={KeypartX: Graph-based Perception (Text) Representation},
author={Peng, Yang},
journal={arXiv preprint arXiv:2209.11844},
year={2022}
}
```
Raw data
{
"_id": null,
"home_page": "https://github.com/pengKiina/KeypartX",
"name": "keypartx",
"maintainer": "",
"docs_url": null,
"requires_python": "",
"maintainer_email": "",
"keywords": "text representation,text mining,nlp,topic modeling,emoji,image,perception",
"author": "Peng Yang",
"author_email": "pyseptimo@outlook.com",
"download_url": "https://files.pythonhosted.org/packages/6b/ae/8d3188e64dba84aaaf47f54759ac141d7aec5774c7a8165cde32f00ce33b/keypartx-0.1.20.tar.gz",
"platform": null,
"description": "[![PyPI - Python](https://img.shields.io/badge/python-v3.7+-blue.svg)](https://pypi.org/project/keypartx/)\r\n[![docs](https://img.shields.io/badge/docs-Passing-green.svg)](https://github.com/pengKiina/KeypartX/)\r\n[![PyPI - PyPi](https://img.shields.io/pypi/v/keypartx)](https://pypi.org/project/keypartx/)\r\n[![PyPI - License](https://img.shields.io/badge/license-MIT-green.svg)](https://github.com/pengKiina/KeypartX/blob/main/LICENSE)\r\n[![arXiv](https://img.shields.io/badge/arXiv-2203.05794-<COLOR>.svg)](https://arxiv.org/abs/2209.11844)\r\n\r\n# KeypartX\r\n<img src=\"https://github.com/pengKiina/KeypartX/raw/main/images/keypartx_logo.png\" width=\"40%\" height=\"20%\" align=\"right\" />\r\n\r\n* No more Topic Modeling\r\n* No need Training \r\n* No more Machine Learning but Human-like Reading\r\n* Get the Insights of Text Big and Small\r\n\r\nKeypartX: a graph-based approach to represent perception (text in general) by key parts of speech. KeypartX solved the coherence crux that current topic modeling algorithms are trying to deal with but failed. KeypartX extracts the topics from text corpus syntactically, semantically and pragmatically instead of a meaningless combination of words from topic modeling.\r\n\r\n\r\n## Key Parts: Noun, Adjective, Verb and Emoji \r\n\r\nKeypartX Vs Topic Modeling results from the following text:\r\n\r\n``` \u201cThai food was great we loved it. Thiland also has beautiful beach resorts, we will come to Thailand again\ud83d\udc4d\u201d ```\r\n\r\n* KeypartX Result \r\n\r\n<img style=\"border:1px solid black\"\r\nsrc=\"https://github.com/pengKiina/KeypartX/raw/main/images/keyparts_sent_network.png\" width=\"40%\" height=\"40%\" align=\"center\" />\r\n\r\n* Topic Modeling Result\r\n\r\n```['food','thailand','resort','great','love', 'beautiful']```\r\n\r\n\r\n## Installation\r\n\r\n```bash\r\nif need coreferee: \r\n pip install keypartx[coreferee_spacy] \r\n python3 -m coreferee install en \r\n python -m spacy download en_core_web_lg \r\n\r\nelse:\r\n pip install spacy \r\n pip install keypartx \r\n python -m spacy download en_core_web_lg\r\n```\r\n\r\n\r\n## Getting Started\r\nFor an in-depth overview of the features of KeypartX\r\nyou can check the [**Documents**](https://medium.com/@egalitrans/topic-modeling-is-dead-long-live-keypartx-a1998a94a0b0) or you can follow along \r\nwith one of the examples as follows:\r\n\r\n| Name | Link |\r\n|---|---|\r\n| KeypartX Quick Start | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1hjAU-_RP7GGzMm6rnpdJZR7LSY0KS81E?usp=sharing) |\r\n| KeypartX with Real Example | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1VdKIJtMMcYRnXsne87azY7B1FXp9FpD1?usp=sharing) |\r\n| KeypartX VS Topic Modelling | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/14XvylCMBZ2oUnjpZhnf_658paVdvAhp0?usp=sharing) |\r\n| KeypartX Network Comparison | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1vTx9LwIGXt5so5IdOr4zsqrgSm4sHnYe?usp=sharing) |\r\n\r\n## Visualization Examples \r\n* 1 NLP Target \r\n\r\nOriginal sentence: \"\"\"Thai food was great,delicousr and not expensive, we loved it. We visited 3 beach resorts, they are higly recommened... We had \"Fire-Vodka\" !!!\"\"\"\r\n\r\n<img src=\"https://github.com/pengKiina/KeypartX/raw/main/images/nlp_target.png\" width=\"70%\" height=\"35%\" align=\"center\" />\r\n\r\n* 2 Keyparts Wordclouds\r\n\r\nThe following wordclouds are generated from a real example of corpus comprised of reviews by those who visit Thailand.\r\n\r\n<img src=\"https://github.com/pengKiina/KeypartX/raw/main/images/keypart_wordcloud.png\" width=\"35%\" height=\"35%\" align=\"center\" />\r\n\r\n* 3 Community and Gray Perceptual Unit Networks \r\n\r\n<img src=\"https://github.com/pengKiina/KeypartX/raw/main/images/community_gray.gif\" width=\"70%\" height=\"50%\" align=\"center\" />\r\n\r\n\r\n\r\n## Citation\r\nTo cite the [KeypartX paper](https://arxiv.org/abs/2209.11844), please use the following bibtex reference:\r\n\r\n```bibtext\r\n@article{pengyang2022keypartx,\r\n title={KeypartX: Graph-based Perception (Text) Representation},\r\n author={Peng, Yang},\r\n journal={arXiv preprint arXiv:2209.11844},\r\n year={2022}\r\n}\r\n```\r\n",
"bugtrack_url": null,
"license": "MIT",
"summary": "A Graph-based Perception(Text) Representation",
"version": "0.1.20",
"split_keywords": [
"text representation",
"text mining",
"nlp",
"topic modeling",
"emoji",
"image",
"perception"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "16841e41fdf58ff318bb3ac5bcb1fbc7bde1f7fb3e12714b914f38cd38b93659",
"md5": "0493c6d40f821bdb3f6339f7632fde5d",
"sha256": "a1fa59554d7308eb554d9ddb8c49159c5e3be3de11ddc3319df4e1d9a8b187e5"
},
"downloads": -1,
"filename": "keypartx-0.1.20-py3-none-any.whl",
"has_sig": false,
"md5_digest": "0493c6d40f821bdb3f6339f7632fde5d",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 36609,
"upload_time": "2023-04-23T14:23:41",
"upload_time_iso_8601": "2023-04-23T14:23:41.139471Z",
"url": "https://files.pythonhosted.org/packages/16/84/1e41fdf58ff318bb3ac5bcb1fbc7bde1f7fb3e12714b914f38cd38b93659/keypartx-0.1.20-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "6bae8d3188e64dba84aaaf47f54759ac141d7aec5774c7a8165cde32f00ce33b",
"md5": "90db5812cd33f60f123aa9ab3f6dda2b",
"sha256": "87bb95ce481d39f74a64205e66eb23d3056a321d5aa4ee63bbf2139b758c94e8"
},
"downloads": -1,
"filename": "keypartx-0.1.20.tar.gz",
"has_sig": false,
"md5_digest": "90db5812cd33f60f123aa9ab3f6dda2b",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 32110,
"upload_time": "2023-04-23T14:23:43",
"upload_time_iso_8601": "2023-04-23T14:23:43.305345Z",
"url": "https://files.pythonhosted.org/packages/6b/ae/8d3188e64dba84aaaf47f54759ac141d7aec5774c7a8165cde32f00ce33b/keypartx-0.1.20.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2023-04-23 14:23:43",
"github": true,
"gitlab": false,
"bitbucket": false,
"github_user": "pengKiina",
"github_project": "KeypartX",
"travis_ci": false,
"coveralls": false,
"github_actions": false,
"lcname": "keypartx"
}