jieba-migang


Name: jieba-migang
Version: 0.0.1
home_page: https://github.com/fxsjy/jieba
Summary: Chinese Words Segmentation Utilities
upload_time: 2024-08-02 03:20:43
maintainer: None
docs_url: None
author: Sun, Junyi
requires_python: None
license: MIT
keywords: nlp, tokenizing, chinese word segementation
requirements: No requirements were recorded.
Travis-CI: No Travis.
coveralls test coverage: No coveralls.
            
jieba
=====

"Jieba" (结巴, Chinese for "to stutter") is a Chinese text segmentation
library, built to be the best Python Chinese word segmentation module.

For the full documentation, see ``README.md``.

GitHub: https://github.com/fxsjy/jieba

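Features
========

-  Supports three segmentation modes:

   -  Precise mode tries to split a sentence as accurately as possible,
      which suits text analysis.
   -  Full mode scans out every word that can be formed from the sentence.
      It is very fast but cannot resolve ambiguity.
   -  Search engine mode builds on precise mode by splitting long words
      again to improve recall, which suits search engine indexing.

-  Supports Traditional Chinese segmentation
-  Supports custom dictionaries
-  MIT license

Online demo: http://jiebademo.ap01.aws.af.cm/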
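
To make the difference between the three modes concrete, here is a minimal pure-Python sketch. It is a toy illustration only, not jieba's implementation: the seven-word dictionary ``WORDS`` and the function names are hypothetical, and the precise-mode stand-in uses simple greedy longest-match rather than whatever scoring the library applies. In upstream jieba the corresponding calls are ``jieba.cut(text)``, ``jieba.cut(text, cut_all=True)`` and ``jieba.cut_for_search(text)``.

```python
# Toy dictionary (hypothetical); a real segmenter ships a far larger one.
WORDS = {"我", "来到", "北京", "清华", "华大", "大学", "清华大学"}


def full_mode(text, words=WORDS):
    """Return every dictionary word found at any position (no disambiguation)."""
    max_len = max(len(w) for w in words)
    found = []
    for start in range(len(text)):
        for end in range(start + 1, min(len(text), start + max_len) + 1):
            if text[start:end] in words:
                found.append(text[start:end])
    return found


def precise_mode(text, words=WORDS):
    """Greedy longest-match split; unknown characters become single tokens."""
    max_len = max(len(w) for w in words)
    tokens, pos = [], 0
    while pos < len(text):
        # Try the longest candidate first and fall back to one character.
        for size in range(min(max_len, len(text) - pos), 0, -1):
            piece = text[pos:pos + size]
            if size == 1 or piece in words:
                tokens.append(piece)
                pos += size
                break
    return tokens


def search_mode(text, words=WORDS):
    """Precise split, plus shorter dictionary words found inside long tokens."""
    tokens = []
    for token in precise_mode(text, words):
        if len(token) > 2:
            for size in (2, 3):
                if size >= len(token):
                    continue
                for i in range(len(token) - size + 1):
                    if token[i:i + size] in words:
                        tokens.append(token[i:i + size])
        tokens.append(token)
    return tokens


if __name__ == "__main__":
    sentence = "我来到北京清华大学"
    print("full:   ", "/".join(full_mode(sentence)))
    print("precise:", "/".join(precise_mode(sentence)))
    print("search: ", "/".join(search_mode(sentence)))
```

With this toy dictionary, full mode reports the overlapping words 清华, 清华大学, 华大 and 大学 side by side, precise mode keeps only 清华大学, and search mode emits the shorter sub-words before the long word so that a search index can match either.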

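Installation
============

The code is compatible with both Python 2 and Python 3.

-  Fully automatic: ``easy_install jieba``, ``pip install jieba``, or ``pip3 install jieba``
-  Semi-automatic: download the package from https://pypi.python.org/pypi/jieba/ ,
   unpack it, and run ``python setup.py install``
-  Manual: place the ``jieba`` directory in the current directory or in ``site-packages``
-  Load it with ``import jieba``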
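
After any of the installation routes above, it can be useful to confirm that the module is actually on the import path before relying on ``import jieba``. The following is a small sketch using only the standard library; the helper name ``is_installed`` is hypothetical and not part of jieba.

```python
import importlib.util


def is_installed(module_name):
    """Return True if a top-level module can be located on the import path."""
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    # Reports whether `import jieba` would succeed in this environment.
    print("jieba importable:", is_installed("jieba"))
```

The check only locates the module and does not import it, so it is cheap to run in a setup script.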


            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/fxsjy/jieba",
    "name": "jieba-migang",
    "maintainer": null,
    "docs_url": null,
    "requires_python": null,
    "maintainer_email": null,
    "keywords": "NLP, tokenizing, Chinese word segementation",
    "author": "Sun, Junyi",
    "author_email": "ccnusjy@gmail.com",
    "download_url": null,
    "platform": null,
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "Chinese Words Segmentation Utilities",
    "version": "0.0.1",
    "project_urls": {
        "Homepage": "https://github.com/fxsjy/jieba"
    },
    "split_keywords": [
        "nlp",
        " tokenizing",
        " chinese word segementation"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "03abcefe7c598f830dcf920eb1caced13c7b029807c519db18bf089000b249b0",
                "md5": "d7a52295744cabda4f3e66aac88214eb",
                "sha256": "5c1792089147cde622a8d0ed2df7876e632a57c4cdaeff1cf17c6a223ec23c8e"
            },
            "downloads": -1,
            "filename": "jieba_migang-0.0.1-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "d7a52295744cabda4f3e66aac88214eb",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": null,
            "size": 19399040,
            "upload_time": "2024-08-02T03:20:43",
            "upload_time_iso_8601": "2024-08-02T03:20:43.648577Z",
            "url": "https://files.pythonhosted.org/packages/03/ab/cefe7c598f830dcf920eb1caced13c7b029807c519db18bf089000b249b0/jieba_migang-0.0.1-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-08-02 03:20:43",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "fxsjy",
    "github_project": "jieba",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "lcname": "jieba-migang"
}
        