umsc


Nameumsc JSON
Version 0.3.0 PyPI version JSON
download
home_pagehttps://github.com/neouyghur/ScriptConverter4Uyghur
SummaryScript Converter for Uyghur Language
upload_time2024-03-27 05:24:52
maintainerOsman Tursun
docs_urlNone
authorOsman Tursun
requires_python>=3.0
licenseNone
keywords uyghur script converter arabic latin cyrillic ipa ئۇيغۇر
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # Script Converter for Uyghur Language
This converter converts multiple Uyghur scripts such as **ULS** (Uyghur Latin Script), **UAS** (Uyghur Arabic Script), 
**CTS** (Common Turkish Script), **UCS** (Uyghur Cyrillic Script), **UYS** (Uyghur Yengi (new) Script), 
International Phonetic Alphabet (IPA), Uzbek Latin Script (UZLS) and Xinjiang University Script (XJUS).

## Installation
```
pip install umsc
```


## Mapping table
| UAS | CTS | ULS| UCS|UYS| IPA   | UZLS | XJUS |
|-----|----| ---- | --- | -- |-------|---|------|
| ا   | a  | a    | а   |a | /ɑ/   | o | a    |
| ە   | e  | e    | ә   |ə | /æ/   | a | A    |
| ب   | b  | b    | б   |b | /b/   | b | b    |
| پ   | p  | p    | п   |p | /p/   | p | p    |
| ت   | t  | t    | т   |t | /t/   | t | t    |
| ج   | c  | j    | җ   |j | /d͡ʒ/ |  j | j   |
| چ   | ç  | ch   | ч   |q | /t͡ʃ/ | ch | c   |
| خ   | x  | x    | х   |h | /χ/   | x | H    |
| د   | d  | d    | д   |d | /d/   | d | d    |
| ر   | r  | r    | р   |r | /r/   | r | r    |
| ز   | z  | z    | з   |z | /z/   | z | z    |
| ژ   | j  | zh   | ж   |ⱬ | /ʒ/   |  j | J   |
| س   | s  | s    | с   |s | /s/   | s | s    |
| ش   | ş  | sh   | ш   |x | /ʃ/   | sh | x   |
| ف   | f  | f    | ф   |f | /f/   | f | f    |
| ڭ   | ñ  | ng   | ң   |ng | /ŋ/   | ng | N  |
| ل   | l  | l    | л   |l | /l/   | l | l    |
| م   | m  | m    | м   |m | /m/   | m | m    |
| ھ   | h  | h    | һ   |ⱨ | /h/   | h | h    |
| و   | o  | o    | о   |o | /o/   | oʻ | o    |
| ۇ   | u  | u    | у   |u | /u/   | u | u    |
| ۆ   | ö  | ö    | ө   |ɵ | /ø/   | oʻ | O   |
| ۈ   | ü  | ü    | ү   |ü | /y/   | uʻ | U   |
| ۋ   | v  | w    | в   |w | /w/   | v | w    |
| ې   | é  | é    | е   |e | /ɛ/   | e | e    |
| ى   | i  | i    | и   |i | /i/   | i | i    |
| ي   | y  | y    | й   |y | /j/   | y | y    |
| ق   | q  | q    | қ   |ⱪ | /q/   | q | q    |
| ك   | k  | k    | к   |k | /k/   | k | k    |
| گ   | g  | g    | г   |g | /ɡ/   | g | g    |
| ن   | n  | n    | н   |n | /n/   | n | n    |
| غ   | ğ  | gh   | ғ   |ƣ | /ʁ/   | gʻ | G   |
| ئ   |    |      |     | |       |   | v    |
| يا  | ya | ya   | я   |ya |       | ya | ya   |
| يۇ  | yu | yu   | ю   |yu |       | yu | yu   |

## Sample input and output examples

Review the files in the tests directory for examples of converting between different scripts.

## Usage

```
from umsc import UgMultiScriptConverter
# To convert text, you need to define source and target scripts
# The abbreviation of scrips
# ULS | Uyghur Latin Script
# UYS | Uyghur Yengi (New) Script
# CPS | Chinese Pinyin Script
# UAS | Uyghur Arabic Script
# CTS |Common Turkic Script
# UCS | Uyghur Cyrillic Script
# XJU | Xinjinag University English Case Sensitive
# UZLS | Uzbek Latin Script
# Convert Uyghur Arabic Script to Uyghur Latin Script
source_script = 'UAS'
target_script = 'ULS'
converter = UgMultiScriptConverter(source_script, target_script)
text1 = 'ياخشىمۇسىز!'
text1 = converter(text1)
print(text1)
# Convert Uyghur Latin Script to Uyghur Arabic Script
source_script = 'ULS'
target_script = 'UAS'
converter = UgMultiScriptConverter(source_script, target_script)
text2 = 'yaxshimusiz!'
text2 = converter(text2)
print(text2)
```

## Notes
- API and Uzbek are currently developing. The mapping might not be very accurate. Especially for Uzbek, it is not very clear how to map "ئا" and "ئە" to Uzbek.


## Citation

If you wish to cite this project, please use `cite this repository`. 

## Contributing
Feel free to raise issue and pull request.

## License
Distributed under the Apache 2.0 License. See [`LICENSE`](LICENSE) for more information.

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/neouyghur/ScriptConverter4Uyghur",
    "name": "umsc",
    "maintainer": "Osman Tursun",
    "docs_url": null,
    "requires_python": ">=3.0",
    "maintainer_email": "mpcabd@gmail.com",
    "keywords": "uyghur script converter arabic latin cyrillic IPA \u0626\u06c7\u064a\u063a\u06c7\u0631",
    "author": "Osman Tursun",
    "author_email": "osmanjan.t@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/c1/58/dc5faf952c59041bb3d9966affb091a26bba0084895284ff314c96033efb/umsc-0.3.0.tar.gz",
    "platform": null,
    "description": "# Script Converter for Uyghur Language\nThis converter converts multiple Uyghur scripts such as **ULS** (Uyghur Latin Script), **UAS** (Uyghur Arabic Script), \n**CTS** (Common Turkish Script), **UCS** (Uyghur Cyrillic Script), **UYS** (Uyghur Yengi (new) Script), \nInternational Phonetic Alphabet (IPA), Uzbek Latin Script (UZLS) and Xinjiang University Script (XJUS).\n\n## Installation\n```\npip install umsc\n```\n\n\n## Mapping table\n| UAS | CTS | ULS| UCS|UYS| IPA   | UZLS | XJUS |\n|-----|----| ---- | --- | -- |-------|---|------|\n| \u0627   | a  | a    | \u0430   |a | /\u0251/   | o | a    |\n| \u06d5   | e  | e    | \u04d9   |\u0259 | /\u00e6/   | a | A    |\n| \u0628   | b  | b    | \u0431   |b | /b/   | b | b    |\n| \u067e   | p  | p    | \u043f   |p | /p/   | p | p    |\n| \u062a   | t  | t    | \u0442   |t | /t/   | t | t    |\n| \u062c   | c  | j    | \u0497   |j | /d\u0361\u0292/ |  j | j   |\n| \u0686   | \u00e7  | ch   | \u0447   |q | /t\u0361\u0283/ | ch | c   |\n| \u062e   | x  | x    | \u0445   |h | /\u03c7/   | x | H    |\n| \u062f   | d  | d    | \u0434   |d | /d/   | d | d    |\n| \u0631   | r  | r    | \u0440   |r | /r/   | r | r    |\n| \u0632   | z  | z    | \u0437   |z | /z/   | z | z    |\n| \u0698   | j  | zh   | \u0436   |\u2c6c | /\u0292/   |  j | J   |\n| \u0633   | s  | s    | \u0441   |s | /s/   | s | s    |\n| \u0634   | \u015f  | sh   | \u0448   |x | /\u0283/   | sh | x   |\n| \u0641   | f  | f    | \u0444   |f | /f/   | f | f    |\n| \u06ad   | \u00f1  | ng   | \u04a3   |ng | /\u014b/   | ng | N  |\n| \u0644   | l  | l    | \u043b   |l | /l/   | l | l    |\n| \u0645   | m  | m    | \u043c   |m | /m/   | m | m    |\n| \u06be   | h  | h    | \u04bb   |\u2c68 | /h/   | h | h    |\n| \u0648   | o  | o    | \u043e   |o | /o/   | o\u02bb | o    |\n| \u06c7   | u  | u    | \u0443   |u | /u/   | u | u    |\n| \u06c6   | \u00f6  | \u00f6    | \u04e9   |\u0275 | /\u00f8/   | o\u02bb | O   |\n| \u06c8   | \u00fc  | \u00fc    | \u04af   |\u00fc | /y/   | u\u02bb | U   |\n| \u06cb   | v  | w    | \u0432   |w | /w/   | v | w    |\n| \u06d0   | \u00e9  | \u00e9    | \u0435   |e | /\u025b/   | e | e    |\n| \u0649   | i  | i    | \u0438   |i | /i/   | i | i    |\n| \u064a   | y  | y    | \u0439   |y | /j/   | y | y    |\n| \u0642   | q  | q    | \u049b   |\u2c6a | /q/   | q | q    |\n| \u0643   | k  | k    | \u043a   |k | /k/   | k | k    |\n| \u06af   | g  | g    | \u0433   |g | /\u0261/   | g | g    |\n| \u0646   | n  | n    | \u043d   |n | /n/   | n | n    |\n| \u063a   | \u011f  | gh   | \u0493   |\u01a3 | /\u0281/   | g\u02bb | G   |\n| \u0626   |    |      |     | |       |   | v    |\n| \u064a\u0627  | ya | ya   | \u044f   |ya |       | ya | ya   |\n| \u064a\u06c7  | yu | yu   | \u044e   |yu |       | yu | yu   |\n\n## Sample input and output examples\n\nReview the files in the tests directory for examples of converting between different scripts.\n\n## Usage\n\n```\nfrom umsc import UgMultiScriptConverter\n# To convert text, you need to define source and target scripts\n# The abbreviation of scrips\n# ULS | Uyghur Latin Script\n# UYS | Uyghur Yengi (New) Script\n# CPS | Chinese Pinyin Script\n# UAS | Uyghur Arabic Script\n# CTS |Common Turkic Script\n# UCS | Uyghur Cyrillic Script\n# XJU | Xinjinag University English Case Sensitive\n# UZLS | Uzbek Latin Script\n# Convert Uyghur Arabic Script to Uyghur Latin Script\nsource_script = 'UAS'\ntarget_script = 'ULS'\nconverter = UgMultiScriptConverter(source_script, target_script)\ntext1 = '\u064a\u0627\u062e\u0634\u0649\u0645\u06c7\u0633\u0649\u0632!'\ntext1 = converter(text1)\nprint(text1)\n# Convert Uyghur Latin Script to Uyghur Arabic Script\nsource_script = 'ULS'\ntarget_script = 'UAS'\nconverter = UgMultiScriptConverter(source_script, target_script)\ntext2 = 'yaxshimusiz!'\ntext2 = converter(text2)\nprint(text2)\n```\n\n## Notes\n- API and Uzbek are currently developing. The mapping might not be very accurate. Especially for Uzbek, it is not very clear how to map \"\u0626\u0627\" and \"\u0626\u06d5\" to Uzbek.\n\n\n## Citation\n\nIf you wish to cite this project, please use `cite this repository`. \n\n## Contributing\nFeel free to raise issue and pull request.\n\n## License\nDistributed under the Apache 2.0 License. See [`LICENSE`](LICENSE) for more information.\n",
    "bugtrack_url": null,
    "license": null,
    "summary": "Script Converter for Uyghur Language",
    "version": "0.3.0",
    "project_urls": {
        "Homepage": "https://github.com/neouyghur/ScriptConverter4Uyghur"
    },
    "split_keywords": [
        "uyghur",
        "script",
        "converter",
        "arabic",
        "latin",
        "cyrillic",
        "ipa",
        "\u0626\u06c7\u064a\u063a\u06c7\u0631"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "58c5a4bc5591da37a09f0b6494e2e2005deefab538bb2654eb1ed74fb5483fc7",
                "md5": "871027c63e1963a57908aa2a705ca9eb",
                "sha256": "0aa8eb1698782f9197e08f8d38c8a0d5f8faa9f98bf1cd22193f23c1d2493a53"
            },
            "downloads": -1,
            "filename": "umsc-0.3.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "871027c63e1963a57908aa2a705ca9eb",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.0",
            "size": 22177,
            "upload_time": "2024-03-27T05:24:50",
            "upload_time_iso_8601": "2024-03-27T05:24:50.823377Z",
            "url": "https://files.pythonhosted.org/packages/58/c5/a4bc5591da37a09f0b6494e2e2005deefab538bb2654eb1ed74fb5483fc7/umsc-0.3.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "c158dc5faf952c59041bb3d9966affb091a26bba0084895284ff314c96033efb",
                "md5": "f919868fdf3f195323d41746bac9d0c5",
                "sha256": "829f17ed0f21e49d68707429cc85f20e7406eb14772dfca3131bebcee2464ce4"
            },
            "downloads": -1,
            "filename": "umsc-0.3.0.tar.gz",
            "has_sig": false,
            "md5_digest": "f919868fdf3f195323d41746bac9d0c5",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.0",
            "size": 16686,
            "upload_time": "2024-03-27T05:24:52",
            "upload_time_iso_8601": "2024-03-27T05:24:52.233518Z",
            "url": "https://files.pythonhosted.org/packages/c1/58/dc5faf952c59041bb3d9966affb091a26bba0084895284ff314c96033efb/umsc-0.3.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-03-27 05:24:52",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "neouyghur",
    "github_project": "ScriptConverter4Uyghur",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "lcname": "umsc"
}
        
Elapsed time: 0.21882s