stopwords-zh


Namestopwords-zh JSON
Version 2023.6.5.13.18.38 PyPI version JSON
download
home_pagehttps://github.com/yuanjie-ai/stopwords-zh
Summarystopwords-zh
upload_time2023-06-05 05:18:43
maintainer
docs_urlNone
authorstopwords-zh
requires_python>=3.6
licenseMIT license
keywords stopwords-zh
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            

![image](https://img.shields.io/pypi/v/stopwords-zh.svg) ![image](https://img.shields.io/travis/yuanjie-ai/stopwords-zh.svg) ![image](https://readthedocs.org/projects/stopwords-zh/badge/?version=latest)



<h1 align = "center">🔥stopwords-zh🔥</h1>

---
### 欢迎提交更新,共建中文停用词库

# Install
```shell
pip install -U stopwords-zh
```

# [Docs](https://yuanjie-ai.github.io/stopwords-zh/)

# Usages
- source: string, 停用词来源,目前支持
  - baidu: 百度停用词表
  - hit: 哈工大停用词表
  - ict: 中科院计算所停用词表
  - scu: 四川大学机器智能实验室停用词库
  - cn: 广为流传未知来源的中文停用词表
  - marimo: Marimo multi-lingual stopwords collection 内的中文停用词
  - iso: Stopwords ISO 内的中文停用词
  - all: 上述所有停用词并集
```python
import jieba
from stopwords import stopwords, filter_stopwords

print(filter_stopwords(jieba.cut('欢迎提交更新,共建中文停用词库')))

```

---
# TODO

- [x] 停用词
- [ ] 情感字典





            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/yuanjie-ai/stopwords-zh",
    "name": "stopwords-zh",
    "maintainer": "",
    "docs_url": null,
    "requires_python": ">=3.6",
    "maintainer_email": "",
    "keywords": "stopwords-zh",
    "author": "stopwords-zh",
    "author_email": "yuanjie@example.com",
    "download_url": "https://files.pythonhosted.org/packages/87/23/c5d6fb809da70a4c93c62352c3d8f6e4dec98f9cd2fdeb47f6a336734cd3/stopwords-zh-2023.6.5.13.18.38.tar.gz",
    "platform": null,
    "description": "\n\n![image](https://img.shields.io/pypi/v/stopwords-zh.svg) ![image](https://img.shields.io/travis/yuanjie-ai/stopwords-zh.svg) ![image](https://readthedocs.org/projects/stopwords-zh/badge/?version=latest)\n\n\n\n<h1 align = \"center\">\ud83d\udd25stopwords-zh\ud83d\udd25</h1>\n\n---\n### \u6b22\u8fce\u63d0\u4ea4\u66f4\u65b0\uff0c\u5171\u5efa\u4e2d\u6587\u505c\u7528\u8bcd\u5e93\n\n# Install\n```shell\npip install -U stopwords-zh\n```\n\n# [Docs](https://yuanjie-ai.github.io/stopwords-zh/)\n\n# Usages\n- source: string, \u505c\u7528\u8bcd\u6765\u6e90\uff0c\u76ee\u524d\u652f\u6301\n  - baidu: \u767e\u5ea6\u505c\u7528\u8bcd\u8868\n  - hit: \u54c8\u5de5\u5927\u505c\u7528\u8bcd\u8868\n  - ict: \u4e2d\u79d1\u9662\u8ba1\u7b97\u6240\u505c\u7528\u8bcd\u8868\n  - scu: \u56db\u5ddd\u5927\u5b66\u673a\u5668\u667a\u80fd\u5b9e\u9a8c\u5ba4\u505c\u7528\u8bcd\u5e93\n  - cn: \u5e7f\u4e3a\u6d41\u4f20\u672a\u77e5\u6765\u6e90\u7684\u4e2d\u6587\u505c\u7528\u8bcd\u8868\n  - marimo: Marimo multi-lingual stopwords collection \u5185\u7684\u4e2d\u6587\u505c\u7528\u8bcd\n  - iso: Stopwords ISO \u5185\u7684\u4e2d\u6587\u505c\u7528\u8bcd\n  - all: \u4e0a\u8ff0\u6240\u6709\u505c\u7528\u8bcd\u5e76\u96c6\n```python\nimport jieba\nfrom stopwords import stopwords, filter_stopwords\n\nprint(filter_stopwords(jieba.cut('\u6b22\u8fce\u63d0\u4ea4\u66f4\u65b0\uff0c\u5171\u5efa\u4e2d\u6587\u505c\u7528\u8bcd\u5e93')))\n\n```\n\n---\n# TODO\n\n- [x] \u505c\u7528\u8bcd\n- [ ] \u60c5\u611f\u5b57\u5178\n\n\n\n\n",
    "bugtrack_url": null,
    "license": "MIT license",
    "summary": "stopwords-zh",
    "version": "2023.6.5.13.18.38",
    "project_urls": {
        "Homepage": "https://github.com/yuanjie-ai/stopwords-zh"
    },
    "split_keywords": [
        "stopwords-zh"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "d71af5bc0439542b97f5f7705627142283e810da2deb14d74a463c67ed549fd0",
                "md5": "996f8a1c7d5b50639e75a19550f7ed14",
                "sha256": "ee009979e2cf52095db449569792dbfce16bff5b64fff9c744da646319c8858d"
            },
            "downloads": -1,
            "filename": "stopwords_zh-2023.6.5.13.18.38-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "996f8a1c7d5b50639e75a19550f7ed14",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.6",
            "size": 39243,
            "upload_time": "2023-06-05T05:18:41",
            "upload_time_iso_8601": "2023-06-05T05:18:41.834163Z",
            "url": "https://files.pythonhosted.org/packages/d7/1a/f5bc0439542b97f5f7705627142283e810da2deb14d74a463c67ed549fd0/stopwords_zh-2023.6.5.13.18.38-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "8723c5d6fb809da70a4c93c62352c3d8f6e4dec98f9cd2fdeb47f6a336734cd3",
                "md5": "e7ee27095a19a199650334494f0c1d12",
                "sha256": "70920c80e734f2b4449f9fd21e389365438a9b1328f25fefb0555f5b612b9cbc"
            },
            "downloads": -1,
            "filename": "stopwords-zh-2023.6.5.13.18.38.tar.gz",
            "has_sig": false,
            "md5_digest": "e7ee27095a19a199650334494f0c1d12",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.6",
            "size": 38036,
            "upload_time": "2023-06-05T05:18:43",
            "upload_time_iso_8601": "2023-06-05T05:18:43.835455Z",
            "url": "https://files.pythonhosted.org/packages/87/23/c5d6fb809da70a4c93c62352c3d8f6e4dec98f9cd2fdeb47f6a336734cd3/stopwords-zh-2023.6.5.13.18.38.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-06-05 05:18:43",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "yuanjie-ai",
    "github_project": "stopwords-zh",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "requirements": [],
    "lcname": "stopwords-zh"
}
        
Elapsed time: 0.27467s