Name | pypinyin-dict JSON |
Version |
0.9.0
JSON |
| download |
home_page | https://github.com/mozillazg/pypinyin-dict |
Summary | 使用 pinyin-data 和 phrase-pinyin-data 中的拼音数据文件覆盖 pypinyin 中的自带拼音数据,实现只使用某个或某些拼音数据文件中的拼音数据的需求 |
upload_time | 2025-01-12 09:35:15 |
maintainer | None |
docs_url | None |
author | mozillazg |
requires_python | !=3.0.*,!=3.1.*,!=3.2.*,<4,>=2.6 |
license | None |
keywords |
|
VCS |
 |
bugtrack_url |
|
requirements |
No requirements were recorded.
|
Travis-CI |
No Travis.
|
coveralls test coverage |
No coveralls.
|
# pypinyin-dict
使用 [pinyin-data](https://github.com/mozillazg/pinyin-data) 和 [phrase-pinyin-data](https://github.com/mozillazg/phrase-pinyin-data) 中的拼音数据文件覆盖 [pypinyin](https://github.com/mozillazg/python-pinyin) 中的内置拼音数据,
实现只使用某个或某些拼音数据文件中的拼音数据的需求。
## 安装
```
$ pip install pypinyin-dict
```
## 使用示例
```python
>>> from pypinyin import pinyin
>>> pinyin('枯萎')
[['kū'], ['wēi']]
# 使用 phrase-pinyin-data 项目中 cc_cedict.txt 文件中的拼音数据优化结果
>>> from pypinyin_dict.phrase_pinyin_data import cc_cedict
>>> cc_cedict.load()
>>> pinyin('枯萎')
[['kū'], ['wěi']]
>>> pinyin('扔', heteronym=True)
[['rēng', 'rèng']]
# 使用 pinyin-data 项目中 kXHC1983.txt 文件中的拼音数据优化结果
>>> from pypinyin_dict.pinyin_data import kxhc1983
>>> kxhc1983.load()
>>> pinyin('扔', heteronym=True)
[['rēng']]
```
## 模块介绍
各个模块与数据文件关系如下(所有模块中都有一个 ``load()`` 函数用于导入对应的拼音数据,使用方法详见上方【使用示例】):
```python
# pinyin-data/kTGHZ2013.txt
>> from pypinyin_dict.pinyin_data import ktghz2013
# pinyin-data/kHanyuPinyin.txt
>> from pypinyin_dict.pinyin_data import khanyupinyin
# pinyin-data/kXHC1983.txt
>> from pypinyin_dict.pinyin_data import kxhc1983
# pinyin-data/kHanyuPinlu.txt
>> from pypinyin_dict.pinyin_data import khanyupinlu
# pinyin-data/kMandarin_8105.txt
>> from pypinyin_dict.pinyin_data import kmandarin_8105
# pinyin-data/pinyin.txt
>> from pypinyin_dict.pinyin_data import pinyin
# pinyin-data/zdic.txt
>> from pypinyin_dict.pinyin_data import zdic
# pinyin-data/cc_cedict.txt
>> from pypinyin_dict.pinyin_data import cc_cedict
# phrase-pinyin-data/pinyin.txt
>> from pypinyin_dict.phrase_pinyin_data import pinyin
# phrase-pinyin-data/zdic_cibs.txt
>> from pypinyin_dict.phrase_pinyin_data import zdic_cibs
# phrase-pinyin-data/zdic_cybs.txt
>> from pypinyin_dict.phrase_pinyin_data import zdic_cybs
# phrase-pinyin-data/cc_cedict.txt
>> from pypinyin_dict.phrase_pinyin_data import cc_cedict
# phrase-pinyin-data/di.txt
>> from pypinyin_dict.phrase_pinyin_data import di
# phrase-pinyin-data/large_pinyin.txt
>> from pypinyin_dict.phrase_pinyin_data import large_pinyin
```
## 生成文件
依赖 Python 3 环境。
clone 子模块:
```
git submodule init
git submodule update
```
生成 py 文件:
```
make generate
```
编辑数据文件然后生成 py 文件(可选):
1. 进入 tools 目录
2. 编辑 pinyin-data 或 phrase-pinyin-data 里的 txt 文件(tools 目录中 Makefile 里涉及的 txt 文件)
3. 执行 `make generate` 生成最新的 py 文件
4. 可以通过 `git diff` 查看变更内容
Raw data
{
"_id": null,
"home_page": "https://github.com/mozillazg/pypinyin-dict",
"name": "pypinyin-dict",
"maintainer": null,
"docs_url": null,
"requires_python": "!=3.0.*,!=3.1.*,!=3.2.*,<4,>=2.6",
"maintainer_email": null,
"keywords": null,
"author": "mozillazg",
"author_email": "mozillazg101@gmail.com",
"download_url": "https://files.pythonhosted.org/packages/64/7a/f56b7096cde930a65f8d5dc8cb726136d53c23175148f6aa1daa75419126/pypinyin_dict-0.9.0.tar.gz",
"platform": null,
"description": "# pypinyin-dict\n\n\u4f7f\u7528 [pinyin-data](https://github.com/mozillazg/pinyin-data) \u548c [phrase-pinyin-data](https://github.com/mozillazg/phrase-pinyin-data) \u4e2d\u7684\u62fc\u97f3\u6570\u636e\u6587\u4ef6\u8986\u76d6 [pypinyin](https://github.com/mozillazg/python-pinyin) \u4e2d\u7684\u5185\u7f6e\u62fc\u97f3\u6570\u636e\uff0c\n\u5b9e\u73b0\u53ea\u4f7f\u7528\u67d0\u4e2a\u6216\u67d0\u4e9b\u62fc\u97f3\u6570\u636e\u6587\u4ef6\u4e2d\u7684\u62fc\u97f3\u6570\u636e\u7684\u9700\u6c42\u3002\n\n## \u5b89\u88c5\n\n```\n$ pip install pypinyin-dict\n```\n\n## \u4f7f\u7528\u793a\u4f8b\n\n```python\n>>> from pypinyin import pinyin\n>>> pinyin('\u67af\u840e')\n[['k\u016b'], ['w\u0113i']]\n\n# \u4f7f\u7528 phrase-pinyin-data \u9879\u76ee\u4e2d cc_cedict.txt \u6587\u4ef6\u4e2d\u7684\u62fc\u97f3\u6570\u636e\u4f18\u5316\u7ed3\u679c\n>>> from pypinyin_dict.phrase_pinyin_data import cc_cedict\n>>> cc_cedict.load()\n\n>>> pinyin('\u67af\u840e')\n[['k\u016b'], ['w\u011bi']]\n\n\n>>> pinyin('\u6254', heteronym=True)\n[['r\u0113ng', 'r\u00e8ng']]\n\n# \u4f7f\u7528 pinyin-data \u9879\u76ee\u4e2d kXHC1983.txt \u6587\u4ef6\u4e2d\u7684\u62fc\u97f3\u6570\u636e\u4f18\u5316\u7ed3\u679c\n>>> from pypinyin_dict.pinyin_data import kxhc1983\n>>> kxhc1983.load()\n\n>>> pinyin('\u6254', heteronym=True)\n[['r\u0113ng']]\n```\n\n## \u6a21\u5757\u4ecb\u7ecd\n\n\u5404\u4e2a\u6a21\u5757\u4e0e\u6570\u636e\u6587\u4ef6\u5173\u7cfb\u5982\u4e0b\uff08\u6240\u6709\u6a21\u5757\u4e2d\u90fd\u6709\u4e00\u4e2a ``load()`` \u51fd\u6570\u7528\u4e8e\u5bfc\u5165\u5bf9\u5e94\u7684\u62fc\u97f3\u6570\u636e\uff0c\u4f7f\u7528\u65b9\u6cd5\u8be6\u89c1\u4e0a\u65b9\u3010\u4f7f\u7528\u793a\u4f8b\u3011\uff09\uff1a\n\n```python\n# pinyin-data/kTGHZ2013.txt\n>> from pypinyin_dict.pinyin_data import ktghz2013\n\n# pinyin-data/kHanyuPinyin.txt\n>> from pypinyin_dict.pinyin_data import khanyupinyin\n\n# pinyin-data/kXHC1983.txt\n>> from pypinyin_dict.pinyin_data import kxhc1983\n\n# pinyin-data/kHanyuPinlu.txt\n>> from pypinyin_dict.pinyin_data import khanyupinlu\n\n# pinyin-data/kMandarin_8105.txt\n>> from pypinyin_dict.pinyin_data import kmandarin_8105\n\n# pinyin-data/pinyin.txt\n>> from pypinyin_dict.pinyin_data import pinyin\n\n# pinyin-data/zdic.txt\n>> from pypinyin_dict.pinyin_data import zdic\n\n# pinyin-data/cc_cedict.txt\n>> from pypinyin_dict.pinyin_data import cc_cedict\n\n\n# phrase-pinyin-data/pinyin.txt\n>> from pypinyin_dict.phrase_pinyin_data import pinyin\n\n# phrase-pinyin-data/zdic_cibs.txt\n>> from pypinyin_dict.phrase_pinyin_data import zdic_cibs\n\n# phrase-pinyin-data/zdic_cybs.txt\n>> from pypinyin_dict.phrase_pinyin_data import zdic_cybs\n\n# phrase-pinyin-data/cc_cedict.txt\n>> from pypinyin_dict.phrase_pinyin_data import cc_cedict\n\n# phrase-pinyin-data/di.txt\n>> from pypinyin_dict.phrase_pinyin_data import di\n\n# phrase-pinyin-data/large_pinyin.txt\n>> from pypinyin_dict.phrase_pinyin_data import large_pinyin\n\n```\n\n## \u751f\u6210\u6587\u4ef6\n\n\u4f9d\u8d56 Python 3 \u73af\u5883\u3002\n\nclone \u5b50\u6a21\u5757\uff1a\n\n```\ngit submodule init\ngit submodule update\n```\n\n\u751f\u6210 py \u6587\u4ef6\uff1a\n\n```\nmake generate\n```\n\n\u7f16\u8f91\u6570\u636e\u6587\u4ef6\u7136\u540e\u751f\u6210 py \u6587\u4ef6\uff08\u53ef\u9009\uff09\uff1a\n\n1. \u8fdb\u5165 tools \u76ee\u5f55\n2. \u7f16\u8f91 pinyin-data \u6216 phrase-pinyin-data \u91cc\u7684 txt \u6587\u4ef6\uff08tools \u76ee\u5f55\u4e2d Makefile \u91cc\u6d89\u53ca\u7684 txt \u6587\u4ef6\uff09\n3. \u6267\u884c `make generate` \u751f\u6210\u6700\u65b0\u7684 py \u6587\u4ef6\n4. \u53ef\u4ee5\u901a\u8fc7 `git diff` \u67e5\u770b\u53d8\u66f4\u5185\u5bb9\n\n",
"bugtrack_url": null,
"license": null,
"summary": "\u4f7f\u7528 pinyin-data \u548c phrase-pinyin-data \u4e2d\u7684\u62fc\u97f3\u6570\u636e\u6587\u4ef6\u8986\u76d6 pypinyin \u4e2d\u7684\u81ea\u5e26\u62fc\u97f3\u6570\u636e\uff0c\u5b9e\u73b0\u53ea\u4f7f\u7528\u67d0\u4e2a\u6216\u67d0\u4e9b\u62fc\u97f3\u6570\u636e\u6587\u4ef6\u4e2d\u7684\u62fc\u97f3\u6570\u636e\u7684\u9700\u6c42",
"version": "0.9.0",
"project_urls": {
"Bug Tracker": "https://github.com/mozillazg/pypinyin-dict/issues",
"Homepage": "https://github.com/mozillazg/pypinyin-dict"
},
"split_keywords": [],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "418fadd772a61256a9ac91d95bf5ec3dffc1de97c8e5da53d40655044b2e1509",
"md5": "31df43a092f49e06f49b817a94ff8584",
"sha256": "10cfbe40af87d704b867533177be8cd72837da9e224755dd275798e88097067a"
},
"downloads": -1,
"filename": "pypinyin_dict-0.9.0-py2.py3-none-any.whl",
"has_sig": false,
"md5_digest": "31df43a092f49e06f49b817a94ff8584",
"packagetype": "bdist_wheel",
"python_version": "py2.py3",
"requires_python": "!=3.0.*,!=3.1.*,!=3.2.*,<4,>=2.6",
"size": 9506709,
"upload_time": "2025-01-12T09:35:10",
"upload_time_iso_8601": "2025-01-12T09:35:10.318122Z",
"url": "https://files.pythonhosted.org/packages/41/8f/add772a61256a9ac91d95bf5ec3dffc1de97c8e5da53d40655044b2e1509/pypinyin_dict-0.9.0-py2.py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "647af56b7096cde930a65f8d5dc8cb726136d53c23175148f6aa1daa75419126",
"md5": "247c1d7cd0d0b5c4a0ab5ef0a5fcd128",
"sha256": "8c491396baa1567311f2ec759cbc154638f3bcefdc711d34e53e373e3a429fa5"
},
"downloads": -1,
"filename": "pypinyin_dict-0.9.0.tar.gz",
"has_sig": false,
"md5_digest": "247c1d7cd0d0b5c4a0ab5ef0a5fcd128",
"packagetype": "sdist",
"python_version": "source",
"requires_python": "!=3.0.*,!=3.1.*,!=3.2.*,<4,>=2.6",
"size": 9264679,
"upload_time": "2025-01-12T09:35:15",
"upload_time_iso_8601": "2025-01-12T09:35:15.704693Z",
"url": "https://files.pythonhosted.org/packages/64/7a/f56b7096cde930a65f8d5dc8cb726136d53c23175148f6aa1daa75419126/pypinyin_dict-0.9.0.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2025-01-12 09:35:15",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "mozillazg",
"github_project": "pypinyin-dict",
"travis_ci": false,
"coveralls": false,
"github_actions": true,
"lcname": "pypinyin-dict"
}