# uztagger
https://pypi.org/project/uztagger <br>
https://github.com/UlugbekSalaev/uztagger
uztagger tags the words of an Uzbek sentence with morphological Part of Speech (POS) tags, based on morphemes and a limited lexicon. The tool provides a POS tagset and a tagging method.
It is a Python library published on [PyPI](https://pypi.org/). You can use it directly in a Python project, or from projects in other programming languages via the API.
## About project
The tool assigns a morphological Part of Speech (POS) tag to each word of an Uzbek sentence, based on the word's morphemes. It includes the list of POS tags and a tagging method.
## Quick links
- [Github](https://github.com/UlugbekSalaev/uztagger)
- [PyPI](https://pypi.org/project/uztagger/)
- [Web-UI](https://nlp.urdu.uz/?menu=uztagger)
## Demo
You can try the tagger in the [web interface](http://nlp.urdu.uz/?menu=uztagger).
## Features
- Tagging
- POS tag list
- Help function
## Usage
There are three ways to run uztagger:
- pip
- API
- Web interface
### pip installation
To install uztagger, simply run:
```shell
pip install uztagger
```
After installation, use it in Python as follows:
```python
# import the library
from uztagger import Tagger

# create a tagger object
tagger = Tagger()

# call the tagging method and print the result
result = tagger.pos_tag('Bizlar bugun maktabga bormoqchimiz.')
print(result)

# output
# [('Bizlar', 'NOUN'), ('bugun', 'NOUN'), ('maktabga', 'NOUN'), ('bormoqchimiz', 'VERB'), ('.', 'PUNC')]
```
### API
API configurations:
- Method: `GET`
- Response type: `string`
- URL: `https://nlp.urdu.uz:8080/uztagger/pos_tag`
- Parameters: `text:string`
- Sample Request: `https://nlp.urdu.uz:8080/uztagger/pos_tag?text=Ular%20maktabga%20borayaptilar.`
- Sample output: `[("Ular","NOUN"),("maktabga",""),("borayaptilar",""),(".","PUNC")]`
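
The configuration above can be exercised from Python with only the standard library. This is a minimal sketch, not part of uztagger itself: the helper names `build_request_url`, `parse_response`, and `pos_tag_remote` are hypothetical, and the parsing step assumes the response body is a Python-style list literal like the sample output.

```python
import ast
import urllib.parse
import urllib.request

# Endpoint taken from the API configuration above.
API_URL = "https://nlp.urdu.uz:8080/uztagger/pos_tag"


def build_request_url(text):
    """Return the GET URL with the text percent-encoded (spaces become %20)."""
    query = urllib.parse.urlencode({"text": text}, quote_via=urllib.parse.quote)
    return f"{API_URL}?{query}"


def parse_response(body):
    """Parse the string response into a list of (token, pos) tuples.

    Assumption: the body looks like the sample output, i.e. a list literal
    of pairs that ast.literal_eval can read safely.
    """
    return [tuple(item) for item in ast.literal_eval(body)]


def pos_tag_remote(text, timeout=10):
    """Call the API and return the parsed result (requires network access)."""
    with urllib.request.urlopen(build_request_url(text), timeout=timeout) as resp:
        return parse_response(resp.read().decode("utf-8"))
```

Calling `pos_tag_remote('Ular maktabga borayaptilar.')` would send the sample request shown above. As the sample output shows, some tokens may come back with an empty tag, so callers should be prepared to handle `""`.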
### Web-UI
The web interface was created to make the library easy to use.
It is available [here](http://nlp.urdu.uz/?page=uztagger).
![Demo image](src/uztagger/web-interface-ui.png)
### POS tag list
The tagger uses the following POS tags:<br>
`NOUN` Noun {Ot}<br>
`VERB` Verb {Fe'l}<br>
`ADJ ` Adjective {Sifat}<br>
`NUM ` Numeric {Son}<br>
`ADV ` Adverb {Ravish}<br>
`PRN ` Pronoun {Olmosh}<br>
`CNJ ` Conjunction {Bog'lovchi}<br>
`ADP ` Adposition {Ko'makchi}<br>
`PRT ` Particle {Yuklama}<br>
`INTJ` Interjection {Undov}<br>
`MOD ` Modal {Modal}<br>
`IMIT` Imitation {Taqlid}<br>
`AUX ` Auxiliary verb {Yordamchi fe'l}<br>
`PPN ` Proper noun {Atoqli ot}<br>
`PUNC` Punctuation {Tinish belgi}<br>
`SYM ` Symbol {Belgi}<br>
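
For readers who want the tagset in code, the list above can be written as a lookup table. This is an illustrative sketch only: `POS_TAGS` and `describe` are hypothetical names, not part of the uztagger API, and the fallback for unknown tags is an assumption.

```python
# Tag -> (English name, Uzbek name), copied from the list above.
POS_TAGS = {
    "NOUN": ("Noun", "Ot"),
    "VERB": ("Verb", "Fe'l"),
    "ADJ": ("Adjective", "Sifat"),
    "NUM": ("Numeric", "Son"),
    "ADV": ("Adverb", "Ravish"),
    "PRN": ("Pronoun", "Olmosh"),
    "CNJ": ("Conjunction", "Bog'lovchi"),
    "ADP": ("Adposition", "Ko'makchi"),
    "PRT": ("Particle", "Yuklama"),
    "INTJ": ("Interjection", "Undov"),
    "MOD": ("Modal", "Modal"),
    "IMIT": ("Imitation", "Taqlid"),
    "AUX": ("Auxiliary verb", "Yordamchi fe'l"),
    "PPN": ("Proper noun", "Atoqli ot"),
    "PUNC": ("Punctuation", "Tinish belgi"),
    "SYM": ("Symbol", "Belgi"),
}


def describe(tag):
    """Return a readable description of a tag, or 'Unknown' if it is not listed."""
    names = POS_TAGS.get(tag.strip())
    if names is None:
        return "Unknown"
    english, uzbek = names
    return f"{english} ({uzbek})"
```

For example, `describe("VERB")` gives `Verb (Fe'l)`, and an empty tag gives `Unknown`.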
### Result explanation
The `pos_tag` method returns a list containing one tuple per token of the input text, in the format `(token, pos)`. For the possible values of `pos`, see the <i>POS tag list</i> section above.
#### Result from the `pos_tag` method
`[('Bizlar','NOUN'),('bugun', 'NOUN'), ('maktabga', 'NOUN'), ('bormoqchimiz', 'VERB'), ('.', 'PUNC')]`
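
Because the result is a plain list of `(token, pos)` tuples, it can be post-processed with ordinary Python. The sketch below works on the sample result shown above, so it does not need uztagger installed; `tokens_by_tag` and `tag_counts` are hypothetical helper names, not library functions.

```python
from collections import Counter, defaultdict

# Sample result copied from the section above.
result = [('Bizlar', 'NOUN'), ('bugun', 'NOUN'), ('maktabga', 'NOUN'),
          ('bormoqchimiz', 'VERB'), ('.', 'PUNC')]


def tokens_by_tag(tagged):
    """Group tokens by their POS tag, preserving sentence order."""
    groups = defaultdict(list)
    for token, pos in tagged:
        groups[pos].append(token)
    return dict(groups)


def tag_counts(tagged):
    """Count how many tokens received each POS tag."""
    return Counter(pos for _, pos in tagged)


print(tokens_by_tag(result))
print(tag_counts(result))
```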
## Documentation
See [here](https://github.com/UlugbekSalaev/uztagger).
## Citation
```tex
@misc{uztagger,
title={{uztagger}: Morphological Part of Speech Tagger Tool for Uzbek},
url={https://github.com/UlugbekSalaev/uztagger},
note={Software available from https://github.com/UlugbekSalaev/uztagger},
author={Ulugbek Salaev},
year={2022},
}
```
## Contact
For help and feedback, please feel free to contact [the author](https://github.com/UlugbekSalaev).