pypinyin-dict


| Field | Value |
| --- | --- |
| Name | pypinyin-dict |
| Version | 0.9.0 |
| home_page | https://github.com/mozillazg/pypinyin-dict |
| Summary | Overrides the pinyin data bundled with pypinyin with the pinyin data files from pinyin-data and phrase-pinyin-data, so that only the pinyin data from one or more chosen data files is used |
| upload_time | 2025-01-12 09:35:15 |
| maintainer | None |
| docs_url | None |
| author | mozillazg |
| requires_python | !=3.0.*,!=3.1.*,!=3.2.*,<4,>=2.6 |
| license | None |
| keywords | |
| VCS | |
| bugtrack_url | |
| requirements | No requirements were recorded. |
| Travis-CI | No Travis. |
| coveralls test coverage | No coveralls. |

# pypinyin-dict

Overrides the built-in pinyin data in [pypinyin](https://github.com/mozillazg/python-pinyin) with the pinyin data files from [pinyin-data](https://github.com/mozillazg/pinyin-data) and [phrase-pinyin-data](https://github.com/mozillazg/phrase-pinyin-data),
so that pypinyin uses only the pinyin data from the data file or files you choose.

## Installation

```
$ pip install pypinyin-dict
```

## Usage example

```python
>>> from pypinyin import pinyin
>>> pinyin('枯萎')
[['kū'], ['wēi']]

# Refine the result with the pinyin data from cc_cedict.txt in the phrase-pinyin-data project
>>> from pypinyin_dict.phrase_pinyin_data import cc_cedict
>>> cc_cedict.load()

>>> pinyin('枯萎')
[['kū'], ['wěi']]


>>> pinyin('扔', heteronym=True)
[['rēng', 'rèng']]

# Refine the result with the pinyin data from kXHC1983.txt in the pinyin-data project
>>> from pypinyin_dict.pinyin_data import kxhc1983
>>> kxhc1983.load()

>>> pinyin('扔', heteronym=True)
[['rēng']]
```

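## Modules

The modules map to the data files as shown below. Every module has a ``load()`` function that loads the corresponding pinyin data; see the usage example above for how to call it: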

```python
# pinyin-data/kTGHZ2013.txt
>>> from pypinyin_dict.pinyin_data import ktghz2013

# pinyin-data/kHanyuPinyin.txt
>>> from pypinyin_dict.pinyin_data import khanyupinyin

# pinyin-data/kXHC1983.txt
>>> from pypinyin_dict.pinyin_data import kxhc1983

# pinyin-data/kHanyuPinlu.txt
>>> from pypinyin_dict.pinyin_data import khanyupinlu

# pinyin-data/kMandarin_8105.txt
>>> from pypinyin_dict.pinyin_data import kmandarin_8105

# pinyin-data/pinyin.txt
>>> from pypinyin_dict.pinyin_data import pinyin

# pinyin-data/zdic.txt
>>> from pypinyin_dict.pinyin_data import zdic

# pinyin-data/cc_cedict.txt
>>> from pypinyin_dict.pinyin_data import cc_cedict


# phrase-pinyin-data/pinyin.txt
>>> from pypinyin_dict.phrase_pinyin_data import pinyin

# phrase-pinyin-data/zdic_cibs.txt
>>> from pypinyin_dict.phrase_pinyin_data import zdic_cibs

# phrase-pinyin-data/zdic_cybs.txt
>>> from pypinyin_dict.phrase_pinyin_data import zdic_cybs

# phrase-pinyin-data/cc_cedict.txt
>>> from pypinyin_dict.phrase_pinyin_data import cc_cedict

# phrase-pinyin-data/di.txt
>>> from pypinyin_dict.phrase_pinyin_data import di

# phrase-pinyin-data/large_pinyin.txt
>>> from pypinyin_dict.phrase_pinyin_data import large_pinyin
```
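
The listing above follows a visible naming pattern: the project directory becomes the subpackage (hyphens turned into underscores) and the data file name, lowercased and without its extension, becomes the module. The sketch below turns that pattern into code. It is only an illustration inferred from the listing, not part of pypinyin-dict itself; `module_name_for` and `load_data_files` are hypothetical helper names.

```python
import importlib


def module_name_for(data_file):
    """Map 'pinyin-data/kXHC1983.txt' to its dotted module path.

    Assumes the naming pattern seen in the listing above; this helper
    is hypothetical and not provided by pypinyin-dict.
    """
    project, _, file_name = data_file.partition("/")
    stem = file_name.rsplit(".", 1)[0]       # drop the .txt extension
    package = project.replace("-", "_")      # pinyin-data -> pinyin_data
    return "pypinyin_dict.{}.{}".format(package, stem.lower())


def load_data_files(data_files):
    """Import each mapped module and call its load() function, in order."""
    for data_file in data_files:
        module = importlib.import_module(module_name_for(data_file))
        module.load()


print(module_name_for("pinyin-data/kXHC1983.txt"))
# prints: pypinyin_dict.pinyin_data.kxhc1983
```

With pypinyin-dict installed, `load_data_files(["phrase-pinyin-data/cc_cedict.txt", "pinyin-data/kXHC1983.txt"])` would load the same two modules as the usage example.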

## Generating the files

Requires a Python 3 environment.

Clone the submodules:

```
git submodule init
git submodule update
```

Generate the py files:

```
make generate
```

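Edit the data files and then regenerate the py files (optional):

1. Enter the tools directory
2. Edit the txt files in pinyin-data or phrase-pinyin-data (the txt files referenced by the Makefile in the tools directory)
3. Run `make generate` to generate the latest py files
4. Use `git diff` to review the changes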
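
The commands from this section can also be chained from a small Python script, which is in keeping with the Python 3 requirement stated above. This is a minimal sketch rather than a tool shipped with the repository: `build_commands`, `regenerate`, and the `work_dir` parameter are hypothetical, and the directory each command should run in is an assumption you need to check against your checkout.

```python
import subprocess

# Commands copied from the steps in this section, in the order listed.
SUBMODULE_COMMANDS = [
    ["git", "submodule", "init"],
    ["git", "submodule", "update"],
]
GENERATE_COMMAND = ["make", "generate"]
DIFF_COMMAND = ["git", "diff"]


def build_commands(show_diff=True):
    """Return the command sequence, optionally ending with `git diff`."""
    commands = list(SUBMODULE_COMMANDS) + [GENERATE_COMMAND]
    if show_diff:
        commands.append(DIFF_COMMAND)
    return commands


def regenerate(work_dir, show_diff=True):
    """Run every command in work_dir and stop at the first failure.

    work_dir is an assumption: pass the directory in which these
    commands are meant to run in your checkout.
    """
    for command in build_commands(show_diff):
        subprocess.run(command, cwd=work_dir, check=True)
```

Because `check=True` raises `subprocess.CalledProcessError` on a non-zero exit status, a failed submodule update stops the run before `make generate` is attempted.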


            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/mozillazg/pypinyin-dict",
    "name": "pypinyin-dict",
    "maintainer": null,
    "docs_url": null,
    "requires_python": "!=3.0.*,!=3.1.*,!=3.2.*,<4,>=2.6",
    "maintainer_email": null,
    "keywords": null,
    "author": "mozillazg",
    "author_email": "mozillazg101@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/64/7a/f56b7096cde930a65f8d5dc8cb726136d53c23175148f6aa1daa75419126/pypinyin_dict-0.9.0.tar.gz",
    "platform": null,
    "bugtrack_url": null,
    "license": null,
    "summary": "\u4f7f\u7528 pinyin-data \u548c phrase-pinyin-data \u4e2d\u7684\u62fc\u97f3\u6570\u636e\u6587\u4ef6\u8986\u76d6 pypinyin \u4e2d\u7684\u81ea\u5e26\u62fc\u97f3\u6570\u636e\uff0c\u5b9e\u73b0\u53ea\u4f7f\u7528\u67d0\u4e2a\u6216\u67d0\u4e9b\u62fc\u97f3\u6570\u636e\u6587\u4ef6\u4e2d\u7684\u62fc\u97f3\u6570\u636e\u7684\u9700\u6c42",
    "version": "0.9.0",
    "project_urls": {
        "Bug Tracker": "https://github.com/mozillazg/pypinyin-dict/issues",
        "Homepage": "https://github.com/mozillazg/pypinyin-dict"
    },
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "418fadd772a61256a9ac91d95bf5ec3dffc1de97c8e5da53d40655044b2e1509",
                "md5": "31df43a092f49e06f49b817a94ff8584",
                "sha256": "10cfbe40af87d704b867533177be8cd72837da9e224755dd275798e88097067a"
            },
            "downloads": -1,
            "filename": "pypinyin_dict-0.9.0-py2.py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "31df43a092f49e06f49b817a94ff8584",
            "packagetype": "bdist_wheel",
            "python_version": "py2.py3",
            "requires_python": "!=3.0.*,!=3.1.*,!=3.2.*,<4,>=2.6",
            "size": 9506709,
            "upload_time": "2025-01-12T09:35:10",
            "upload_time_iso_8601": "2025-01-12T09:35:10.318122Z",
            "url": "https://files.pythonhosted.org/packages/41/8f/add772a61256a9ac91d95bf5ec3dffc1de97c8e5da53d40655044b2e1509/pypinyin_dict-0.9.0-py2.py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "647af56b7096cde930a65f8d5dc8cb726136d53c23175148f6aa1daa75419126",
                "md5": "247c1d7cd0d0b5c4a0ab5ef0a5fcd128",
                "sha256": "8c491396baa1567311f2ec759cbc154638f3bcefdc711d34e53e373e3a429fa5"
            },
            "downloads": -1,
            "filename": "pypinyin_dict-0.9.0.tar.gz",
            "has_sig": false,
            "md5_digest": "247c1d7cd0d0b5c4a0ab5ef0a5fcd128",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": "!=3.0.*,!=3.1.*,!=3.2.*,<4,>=2.6",
            "size": 9264679,
            "upload_time": "2025-01-12T09:35:15",
            "upload_time_iso_8601": "2025-01-12T09:35:15.704693Z",
            "url": "https://files.pythonhosted.org/packages/64/7a/f56b7096cde930a65f8d5dc8cb726136d53c23175148f6aa1daa75419126/pypinyin_dict-0.9.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-01-12 09:35:15",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "mozillazg",
    "github_project": "pypinyin-dict",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "lcname": "pypinyin-dict"
}