![image](https://img.shields.io/pypi/v/stopwords-zh.svg) ![image](https://img.shields.io/travis/yuanjie-ai/stopwords-zh.svg) ![image](https://readthedocs.org/projects/stopwords-zh/badge/?version=latest)
<h1 align = "center">🔥stopwords-zh🔥</h1>
---
### 欢迎提交更新,共建中文停用词库
# Install
```shell
pip install -U stopwords-zh
```
# [Docs](https://yuanjie-ai.github.io/stopwords-zh/)
# Usages
- source: string, 停用词来源,目前支持
- baidu: 百度停用词表
- hit: 哈工大停用词表
- ict: 中科院计算所停用词表
- scu: 四川大学机器智能实验室停用词库
- cn: 广为流传未知来源的中文停用词表
- marimo: Marimo multi-lingual stopwords collection 内的中文停用词
- iso: Stopwords ISO 内的中文停用词
- all: 上述所有停用词并集
```python
import jieba
from stopwords import stopwords, filter_stopwords
print(filter_stopwords(jieba.cut('欢迎提交更新,共建中文停用词库')))
```
---
# TODO
- [x] 停用词
- [ ] 情感字典
Raw data
{
"_id": null,
"home_page": "https://github.com/yuanjie-ai/stopwords-zh",
"name": "stopwords-zh",
"maintainer": "",
"docs_url": null,
"requires_python": ">=3.6",
"maintainer_email": "",
"keywords": "stopwords-zh",
"author": "stopwords-zh",
"author_email": "yuanjie@example.com",
"download_url": "https://files.pythonhosted.org/packages/87/23/c5d6fb809da70a4c93c62352c3d8f6e4dec98f9cd2fdeb47f6a336734cd3/stopwords-zh-2023.6.5.13.18.38.tar.gz",
"platform": null,
"description": "\n\n![image](https://img.shields.io/pypi/v/stopwords-zh.svg) ![image](https://img.shields.io/travis/yuanjie-ai/stopwords-zh.svg) ![image](https://readthedocs.org/projects/stopwords-zh/badge/?version=latest)\n\n\n\n<h1 align = \"center\">\ud83d\udd25stopwords-zh\ud83d\udd25</h1>\n\n---\n### \u6b22\u8fce\u63d0\u4ea4\u66f4\u65b0\uff0c\u5171\u5efa\u4e2d\u6587\u505c\u7528\u8bcd\u5e93\n\n# Install\n```shell\npip install -U stopwords-zh\n```\n\n# [Docs](https://yuanjie-ai.github.io/stopwords-zh/)\n\n# Usages\n- source: string, \u505c\u7528\u8bcd\u6765\u6e90\uff0c\u76ee\u524d\u652f\u6301\n - baidu: \u767e\u5ea6\u505c\u7528\u8bcd\u8868\n - hit: \u54c8\u5de5\u5927\u505c\u7528\u8bcd\u8868\n - ict: \u4e2d\u79d1\u9662\u8ba1\u7b97\u6240\u505c\u7528\u8bcd\u8868\n - scu: \u56db\u5ddd\u5927\u5b66\u673a\u5668\u667a\u80fd\u5b9e\u9a8c\u5ba4\u505c\u7528\u8bcd\u5e93\n - cn: \u5e7f\u4e3a\u6d41\u4f20\u672a\u77e5\u6765\u6e90\u7684\u4e2d\u6587\u505c\u7528\u8bcd\u8868\n - marimo: Marimo multi-lingual stopwords collection \u5185\u7684\u4e2d\u6587\u505c\u7528\u8bcd\n - iso: Stopwords ISO \u5185\u7684\u4e2d\u6587\u505c\u7528\u8bcd\n - all: \u4e0a\u8ff0\u6240\u6709\u505c\u7528\u8bcd\u5e76\u96c6\n```python\nimport jieba\nfrom stopwords import stopwords, filter_stopwords\n\nprint(filter_stopwords(jieba.cut('\u6b22\u8fce\u63d0\u4ea4\u66f4\u65b0\uff0c\u5171\u5efa\u4e2d\u6587\u505c\u7528\u8bcd\u5e93')))\n\n```\n\n---\n# TODO\n\n- [x] \u505c\u7528\u8bcd\n- [ ] \u60c5\u611f\u5b57\u5178\n\n\n\n\n",
"bugtrack_url": null,
"license": "MIT license",
"summary": "stopwords-zh",
"version": "2023.6.5.13.18.38",
"project_urls": {
"Homepage": "https://github.com/yuanjie-ai/stopwords-zh"
},
"split_keywords": [
"stopwords-zh"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "d71af5bc0439542b97f5f7705627142283e810da2deb14d74a463c67ed549fd0",
"md5": "996f8a1c7d5b50639e75a19550f7ed14",
"sha256": "ee009979e2cf52095db449569792dbfce16bff5b64fff9c744da646319c8858d"
},
"downloads": -1,
"filename": "stopwords_zh-2023.6.5.13.18.38-py3-none-any.whl",
"has_sig": false,
"md5_digest": "996f8a1c7d5b50639e75a19550f7ed14",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">=3.6",
"size": 39243,
"upload_time": "2023-06-05T05:18:41",
"upload_time_iso_8601": "2023-06-05T05:18:41.834163Z",
"url": "https://files.pythonhosted.org/packages/d7/1a/f5bc0439542b97f5f7705627142283e810da2deb14d74a463c67ed549fd0/stopwords_zh-2023.6.5.13.18.38-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "8723c5d6fb809da70a4c93c62352c3d8f6e4dec98f9cd2fdeb47f6a336734cd3",
"md5": "e7ee27095a19a199650334494f0c1d12",
"sha256": "70920c80e734f2b4449f9fd21e389365438a9b1328f25fefb0555f5b612b9cbc"
},
"downloads": -1,
"filename": "stopwords-zh-2023.6.5.13.18.38.tar.gz",
"has_sig": false,
"md5_digest": "e7ee27095a19a199650334494f0c1d12",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3.6",
"size": 38036,
"upload_time": "2023-06-05T05:18:43",
"upload_time_iso_8601": "2023-06-05T05:18:43.835455Z",
"url": "https://files.pythonhosted.org/packages/87/23/c5d6fb809da70a4c93c62352c3d8f6e4dec98f9cd2fdeb47f6a336734cd3/stopwords-zh-2023.6.5.13.18.38.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2023-06-05 05:18:43",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "yuanjie-ai",
"github_project": "stopwords-zh",
"travis_ci": false,
"coveralls": false,
"github_actions": false,
"requirements": [],
"lcname": "stopwords-zh"
}