Corpus
============
Video Lectures
============
[<img src="https://github.com/StarlangSoftware/Corpus/blob/master/video.jpg" width="50%">](https://youtu.be/xTrdKY5uI08)
For Developers
============
You can also see [Python](https://github.com/starlangsoftware/Corpus-Py), [Java](https://github.com/starlangsoftware/Corpus), [C++](https://github.com/starlangsoftware/Corpus-CPP), [Swift](https://github.com/starlangsoftware/Corpus-Swift), [Js](https://github.com/starlangsoftware/Corpus-Js), or [C#](https://github.com/starlangsoftware/Corpus-CS) repository.
## Requirements
* [Python 3.7 or higher](#python)
* [Git](#git)
### Python
To check if you have a compatible version of Python installed, use the following command:
python -V
You can find the latest version of Python [here](https://www.python.org/downloads/).
### Git
Install the [latest version of Git](https://git-scm.com/book/en/v2/Getting-Started-Installing-Git).
## Pip Install
pip3 install NlpToolkit-Corpus-Cy
## Download Code
In order to work on code, create a fork from GitHub page.
Use Git for cloning the code to your local or below line for Ubuntu:
git clone <your-fork-git-link>
A directory called Corpus will be created. Or you can use below link for exploring the code:
git clone https://github.com/olcaytaner/Corpus-Cy.git
## Open project with Pycharm IDE
Steps for opening the cloned project:
* Start IDE
* Select **File | Open** from main menu
* Choose `Corpus-Cy` file
* Select open as project option
* Couple of seconds, dependencies will be downloaded.
Detailed Description
============
+ [Corpus](#corpus)
+ [TurkishSplitter](#turkishsplitter)
## Corpus
To store a corpus in memory
a = Corpus("derlem.txt")
If this corpus is split with dots but not in sentences
Corpus(self, fileName=None, splitterOrChecker=None)
The number of sentences in the corpus
sentenceCount(self) -> int
To get ith sentence in the corpus
getSentence(self, index: int) -> Sentence
## TurkishSplitter
TurkishSplitter class is used to split the text into sentences in accordance with the . rules of Turkish.
split(self, line: str) -> list
Raw data
{
"_id": null,
"home_page": "https://github.com/StarlangSoftware/Corpus-Cy",
"name": "NlpToolkit-Corpus-Cy",
"maintainer": "",
"docs_url": null,
"requires_python": "",
"maintainer_email": "",
"keywords": "",
"author": "olcaytaner",
"author_email": "olcay.yildiz@ozyegin.edu.tr",
"download_url": "https://files.pythonhosted.org/packages/32/e6/1d50ec3bda4c2f0b35590ea86fd5c0609f12ac8daa9c64c2198f3fa32331/NlpToolkit-Corpus-Cy-1.0.21.tar.gz",
"platform": null,
"description": "Corpus\n============\n\nVideo Lectures\n============\n\n[<img src=\"https://github.com/StarlangSoftware/Corpus/blob/master/video.jpg\" width=\"50%\">](https://youtu.be/xTrdKY5uI08)\n\nFor Developers\n============\nYou can also see [Python](https://github.com/starlangsoftware/Corpus-Py), [Java](https://github.com/starlangsoftware/Corpus), [C++](https://github.com/starlangsoftware/Corpus-CPP), [Swift](https://github.com/starlangsoftware/Corpus-Swift), [Js](https://github.com/starlangsoftware/Corpus-Js), or [C#](https://github.com/starlangsoftware/Corpus-CS) repository.\n\n## Requirements\n\n* [Python 3.7 or higher](#python)\n* [Git](#git)\n\n### Python \n\nTo check if you have a compatible version of Python installed, use the following command:\n\n python -V\n \nYou can find the latest version of Python [here](https://www.python.org/downloads/).\n\n### Git\n\nInstall the [latest version of Git](https://git-scm.com/book/en/v2/Getting-Started-Installing-Git).\n\n## Pip Install\n\n\tpip3 install NlpToolkit-Corpus-Cy\n\n## Download Code\n\nIn order to work on code, create a fork from GitHub page. \nUse Git for cloning the code to your local or below line for Ubuntu:\n\n\tgit clone <your-fork-git-link>\n\nA directory called Corpus will be created. Or you can use below link for exploring the code:\n\n\tgit clone https://github.com/olcaytaner/Corpus-Cy.git\n\n## Open project with Pycharm IDE\n\nSteps for opening the cloned project:\n\n* Start IDE\n* Select **File | Open** from main menu\n* Choose `Corpus-Cy` file\n* Select open as project option\n* Couple of seconds, dependencies will be downloaded. \n\nDetailed Description\n============\n\n+ [Corpus](#corpus)\n+ [TurkishSplitter](#turkishsplitter)\n\n## Corpus\n\nTo store a corpus in memory\n\n\ta = Corpus(\"derlem.txt\")\n\nIf this corpus is split with dots but not in sentences\n\n\tCorpus(self, fileName=None, splitterOrChecker=None)\n\nThe number of sentences in the corpus\n\n\tsentenceCount(self) -> int\n\nTo get ith sentence in the corpus\n\n\tgetSentence(self, index: int) -> Sentence\n\n## TurkishSplitter\n\nTurkishSplitter class is used to split the text into sentences in accordance with the . rules of Turkish.\n\n\tsplit(self, line: str) -> list",
"bugtrack_url": null,
"license": "",
"summary": "Corpus library",
"version": "1.0.21",
"split_keywords": [],
"urls": [
{
"comment_text": "",
"digests": {
"md5": "5312d144d11c78359799c127d1d4534b",
"sha256": "b7df4339532e1b43e2084e7134cd341319eee7b9e22748b520a18d76e27eb6d9"
},
"downloads": -1,
"filename": "NlpToolkit-Corpus-Cy-1.0.21.tar.gz",
"has_sig": false,
"md5_digest": "5312d144d11c78359799c127d1d4534b",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 478957,
"upload_time": "2022-12-07T12:04:12",
"upload_time_iso_8601": "2022-12-07T12:04:12.563105Z",
"url": "https://files.pythonhosted.org/packages/32/e6/1d50ec3bda4c2f0b35590ea86fd5c0609f12ac8daa9c64c2198f3fa32331/NlpToolkit-Corpus-Cy-1.0.21.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2022-12-07 12:04:12",
"github": true,
"gitlab": false,
"bitbucket": false,
"github_user": "StarlangSoftware",
"github_project": "Corpus-Cy",
"travis_ci": false,
"coveralls": false,
"github_actions": false,
"lcname": "nlptoolkit-corpus-cy"
}