uyghur


Nameuyghur JSON
Version 0.1.2 PyPI version JSON
download
home_pageNone
SummaryAll things you need for Uyghur Language.
upload_time2025-10-23 06:25:37
maintainerNone
docs_urlNone
authorTang Ziya
requires_python>=3.8
licenseNone
keywords uyghur npl
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # 维吾尔文处理

[English](https://github.com/tcztzy/uyghur/blob/master/README.md)

## 动机

本人因课题需要,需要对维吾尔文进行自然语言处理,但维文的文本是自右向左排版的,在很多时候不利手动处理(如选中复制,虽然应避免手动处理,但仍有场景),而且与其他自左向右的文字混在一起时会出现一些问题(例如显示在在右侧的数字其实一个在句首一个在句末),且很多字母在词首词中词尾和单列时是不一样的,所以有些没有连字字体的系统需要单独定义字符。为了方便我这个小辣鸡我需要一个小工具转换老维文为拉丁维文。

## 术语

UEY
: 老维文,中国新疆官方唯一官方字母表,在公共媒体和日常生活中使用;

UKY
: 西里尔维文,在中亚尤其是哈萨克斯坦使用;

ULY
: 维吾尔语拉丁字母是在2008年推出的,只在计算机相关领域作为辅助书写系统使用,但在所有设备上扩大使用UEY键盘后,现在基本上已经废弃了。

UYY
: 新维字(也叫拼音Yeziⱪi或UPNY),这种字母也是基于拉丁文的,但现在大多数想用拉丁文打字的人都用ULY代替。

## 安装

```sh
$ pip install uyghur
```

## 用法

```python
from uyghur.conversion import uey2uly

print(uey2uly('پلام، جهان'))
```

## 测试

在 CPython 3.8/3.9/3.10 和 Pypy 3.8 测试,如需自行运行测试,执行下列命令

```shell
tox
```

## 待办

* [x] UEY2ULY
* [ ] ULY2UEY
* [ ] UEY2UKY
* [ ] UKY2UEY
* [ ] UEY2UYY
* [ ] UYY2UEY
* [ ] TEXT2SPEECH

## 参考文献

1. DB65/T 3690-2015 现行维吾尔文与拉丁维吾尔文编码字符转换规则

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "uyghur",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.8",
    "maintainer_email": null,
    "keywords": "Uyghur, NPL",
    "author": "Tang Ziya",
    "author_email": "Tang Ziya <tcztzy@gmail.com>",
    "download_url": "https://files.pythonhosted.org/packages/6f/02/44e22ed92f8436dfd1b01d5879f7784e63071b1e906accd401deb0b5331f/uyghur-0.1.2.tar.gz",
    "platform": null,
    "description": "# \u7ef4\u543e\u5c14\u6587\u5904\u7406\n\n[English](https://github.com/tcztzy/uyghur/blob/master/README.md)\n\n## \u52a8\u673a\n\n\u672c\u4eba\u56e0\u8bfe\u9898\u9700\u8981\uff0c\u9700\u8981\u5bf9\u7ef4\u543e\u5c14\u6587\u8fdb\u884c\u81ea\u7136\u8bed\u8a00\u5904\u7406\uff0c\u4f46\u7ef4\u6587\u7684\u6587\u672c\u662f\u81ea\u53f3\u5411\u5de6\u6392\u7248\u7684\uff0c\u5728\u5f88\u591a\u65f6\u5019\u4e0d\u5229\u624b\u52a8\u5904\u7406\uff08\u5982\u9009\u4e2d\u590d\u5236\uff0c\u867d\u7136\u5e94\u907f\u514d\u624b\u52a8\u5904\u7406\uff0c\u4f46\u4ecd\u6709\u573a\u666f\uff09\uff0c\u800c\u4e14\u4e0e\u5176\u4ed6\u81ea\u5de6\u5411\u53f3\u7684\u6587\u5b57\u6df7\u5728\u4e00\u8d77\u65f6\u4f1a\u51fa\u73b0\u4e00\u4e9b\u95ee\u9898\uff08\u4f8b\u5982\u663e\u793a\u5728\u5728\u53f3\u4fa7\u7684\u6570\u5b57\u5176\u5b9e\u4e00\u4e2a\u5728\u53e5\u9996\u4e00\u4e2a\u5728\u53e5\u672b\uff09\uff0c\u4e14\u5f88\u591a\u5b57\u6bcd\u5728\u8bcd\u9996\u8bcd\u4e2d\u8bcd\u5c3e\u548c\u5355\u5217\u65f6\u662f\u4e0d\u4e00\u6837\u7684\uff0c\u6240\u4ee5\u6709\u4e9b\u6ca1\u6709\u8fde\u5b57\u5b57\u4f53\u7684\u7cfb\u7edf\u9700\u8981\u5355\u72ec\u5b9a\u4e49\u5b57\u7b26\u3002\u4e3a\u4e86\u65b9\u4fbf\u6211\u8fd9\u4e2a\u5c0f\u8fa3\u9e21\u6211\u9700\u8981\u4e00\u4e2a\u5c0f\u5de5\u5177\u8f6c\u6362\u8001\u7ef4\u6587\u4e3a\u62c9\u4e01\u7ef4\u6587\u3002\n\n## \u672f\u8bed\n\nUEY\n: \u8001\u7ef4\u6587\uff0c\u4e2d\u56fd\u65b0\u7586\u5b98\u65b9\u552f\u4e00\u5b98\u65b9\u5b57\u6bcd\u8868\uff0c\u5728\u516c\u5171\u5a92\u4f53\u548c\u65e5\u5e38\u751f\u6d3b\u4e2d\u4f7f\u7528\uff1b\n\nUKY\n: \u897f\u91cc\u5c14\u7ef4\u6587\uff0c\u5728\u4e2d\u4e9a\u5c24\u5176\u662f\u54c8\u8428\u514b\u65af\u5766\u4f7f\u7528\uff1b\n\nULY\n: \u7ef4\u543e\u5c14\u8bed\u62c9\u4e01\u5b57\u6bcd\u662f\u57282008\u5e74\u63a8\u51fa\u7684\uff0c\u53ea\u5728\u8ba1\u7b97\u673a\u76f8\u5173\u9886\u57df\u4f5c\u4e3a\u8f85\u52a9\u4e66\u5199\u7cfb\u7edf\u4f7f\u7528\uff0c\u4f46\u5728\u6240\u6709\u8bbe\u5907\u4e0a\u6269\u5927\u4f7f\u7528UEY\u952e\u76d8\u540e\uff0c\u73b0\u5728\u57fa\u672c\u4e0a\u5df2\u7ecf\u5e9f\u5f03\u4e86\u3002\n\nUYY\n: \u65b0\u7ef4\u5b57\uff08\u4e5f\u53eb\u62fc\u97f3Yezi\u2c6ai\u6216UPNY\uff09\uff0c\u8fd9\u79cd\u5b57\u6bcd\u4e5f\u662f\u57fa\u4e8e\u62c9\u4e01\u6587\u7684\uff0c\u4f46\u73b0\u5728\u5927\u591a\u6570\u60f3\u7528\u62c9\u4e01\u6587\u6253\u5b57\u7684\u4eba\u90fd\u7528ULY\u4ee3\u66ff\u3002\n\n## \u5b89\u88c5\n\n```sh\n$ pip install uyghur\n```\n\n## \u7528\u6cd5\n\n```python\nfrom uyghur.conversion import uey2uly\n\nprint(uey2uly('\u067e\u0644\u0627\u0645\u060c \u062c\u0647\u0627\u0646'))\n```\n\n## \u6d4b\u8bd5\n\n\u5728 CPython 3.8/3.9/3.10 \u548c Pypy 3.8 \u6d4b\u8bd5\uff0c\u5982\u9700\u81ea\u884c\u8fd0\u884c\u6d4b\u8bd5\uff0c\u6267\u884c\u4e0b\u5217\u547d\u4ee4\n\n```shell\ntox\n```\n\n## \u5f85\u529e\n\n* [x] UEY2ULY\n* [ ] ULY2UEY\n* [ ] UEY2UKY\n* [ ] UKY2UEY\n* [ ] UEY2UYY\n* [ ] UYY2UEY\n* [ ] TEXT2SPEECH\n\n## \u53c2\u8003\u6587\u732e\n\n1. DB65/T 3690-2015 \u73b0\u884c\u7ef4\u543e\u5c14\u6587\u4e0e\u62c9\u4e01\u7ef4\u543e\u5c14\u6587\u7f16\u7801\u5b57\u7b26\u8f6c\u6362\u89c4\u5219\n",
    "bugtrack_url": null,
    "license": null,
    "summary": "All things you need for Uyghur Language.",
    "version": "0.1.2",
    "project_urls": {
        "repository": "https://github.com/tcztzy/uyghur"
    },
    "split_keywords": [
        "uyghur",
        " npl"
    ],
    "urls": [
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "fa352785bc8184be8b1ee1b4deac83b1bd9285cb33aa4b28341c594750b2bdb1",
                "md5": "f45eaf45f345eb8233956b1edf3179bd",
                "sha256": "260910a9cfb3340dd083e0509510e78a4226fc8565b6f10b47d25a32c0723b13"
            },
            "downloads": -1,
            "filename": "uyghur-0.1.2-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "f45eaf45f345eb8233956b1edf3179bd",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.8",
            "size": 4949,
            "upload_time": "2025-10-23T06:25:36",
            "upload_time_iso_8601": "2025-10-23T06:25:36.205458Z",
            "url": "https://files.pythonhosted.org/packages/fa/35/2785bc8184be8b1ee1b4deac83b1bd9285cb33aa4b28341c594750b2bdb1/uyghur-0.1.2-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "6f0244e22ed92f8436dfd1b01d5879f7784e63071b1e906accd401deb0b5331f",
                "md5": "04e08e2cc67d8d329d74a15c60a85845",
                "sha256": "f640ff1c405cd065fee08430e826c8fb4e35e6d633f6fd3921716e58a4ee4331"
            },
            "downloads": -1,
            "filename": "uyghur-0.1.2.tar.gz",
            "has_sig": false,
            "md5_digest": "04e08e2cc67d8d329d74a15c60a85845",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.8",
            "size": 4549,
            "upload_time": "2025-10-23T06:25:37",
            "upload_time_iso_8601": "2025-10-23T06:25:37.515271Z",
            "url": "https://files.pythonhosted.org/packages/6f/02/44e22ed92f8436dfd1b01d5879f7784e63071b1e906accd401deb0b5331f/uyghur-0.1.2.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-10-23 06:25:37",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "tcztzy",
    "github_project": "uyghur",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "lcname": "uyghur"
}
        
Elapsed time: 2.62899s