# `getcl`: Mapping Lexical Data from CLDF Dictionaries to Wordlists
`getcl` offers the code for mapping lexical data from a CLDF dictionary to a wordlist as described in the following paper:
> Blum, Frederic and Englisch, Johannes and Hermida Rodriguez, Alba and van Gijn, Rik and List, Johann-Mattis (2024). To be published in the Proceedings of the 2nd Meeting of the ELRA/ISCA Special Interest Group on Under-Resourced Languages.
To install the tool, clone this repository and run the following command from its root directory:
```
pip install -e .
```
This will provide a new command `conceptlist` that you can use to map concepts in the sense descriptions of a dictionary to Concepticon.
To test the package, run the following commands:
```
$ git clone https://github.com/dictionaria/kalamang.git
$ conceptlist --data kalamang/cldf/cldf-metadata.json --conceptlist Swadesh-1955-100 --output kalamang.tsv | wc -l
```
The output should be 197. This means that 196 matches for the 100 concepts of Swadesh's concept list could be identified. These matches contain duplicates and potential errors, so to get a useful concept list you would now look at the file `kalamang.tsv` and modify it manually to arrive at a Swadesh list derived from the dictionary.
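As a starting point for the manual review, the following minimal sketch groups the rows of a tab-separated file by one column and reports the values that occur more than once. It is not part of `getcl`: the helper names `group_matches` and `duplicated`, the column names `CONCEPT` and `FORM`, and the sample rows are all invented for illustration, and the actual header of `kalamang.tsv` may look different.

```python
import csv
import io
from collections import defaultdict


def group_matches(handle, key="CONCEPT"):
    """Group the rows of a tab-separated file by the value in column `key`.

    The default column name is an assumption; check the header of the
    real output file before relying on it.
    """
    reader = csv.DictReader(handle, delimiter="\t")
    groups = defaultdict(list)
    for row in reader:
        groups[row[key]].append(row)
    return dict(groups)


def duplicated(groups):
    """Keep only the keys that are matched by more than one row."""
    return {k: rows for k, rows in groups.items() if len(rows) > 1}


# Invented sample data standing in for the real output file.
sample = io.StringIO(
    "CONCEPT\tFORM\n"
    "HAND\tform_a\n"
    "HAND\tform_b\n"
    "WATER\tform_c\n"
)

groups = group_matches(sample)
for concept, rows in duplicated(groups).items():
    # Each printed line is one concept with all of its candidate forms.
    print(concept, [row["FORM"] for row in rows])
```

To apply this to the real output, open `kalamang.tsv` with `open("kalamang.tsv", encoding="utf-8", newline="")`, pass the file handle to `group_matches`, and set `key` to whichever column in the file identifies the concept.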
Raw data
{
"_id": null,
"home_page": "https://github.com/lingpy/getcl/",
"name": "getcl",
"maintainer": null,
"docs_url": null,
"requires_python": ">=3.6",
"maintainer_email": null,
"keywords": "word prediction",
"author": "Johann-Mattis List",
"author_email": "mattis.list@uni-passau.de",
"download_url": "https://files.pythonhosted.org/packages/a4/f1/3b73169d2173c42c9c7131b9734419283aed2e7e550b1138bc9ab1c14b9e/getcl-0.1.tar.gz",
"platform": "any",
"bugtrack_url": null,
"license": "MIT",
"summary": "Python Package for the Mapping of Conceptlists",
"version": "0.1",
"project_urls": {
"Homepage": "https://github.com/lingpy/getcl/"
},
"split_keywords": [
"word",
"prediction"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "97eeca021e69d42a712425053850c3c3e962f2a623949460c7cca007673ee289",
"md5": "e4c459f0c65acabe6db03dd764d55093",
"sha256": "ec65367eb3b187a9332e2f50b54bda008d76732e0184e95d9ce92c9034a3c5ef"
},
"downloads": -1,
"filename": "getcl-0.1-py3-none-any.whl",
"has_sig": false,
"md5_digest": "e4c459f0c65acabe6db03dd764d55093",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">=3.6",
"size": 4053,
"upload_time": "2024-04-07T20:32:27",
"upload_time_iso_8601": "2024-04-07T20:32:27.837561Z",
"url": "https://files.pythonhosted.org/packages/97/ee/ca021e69d42a712425053850c3c3e962f2a623949460c7cca007673ee289/getcl-0.1-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "a4f13b73169d2173c42c9c7131b9734419283aed2e7e550b1138bc9ab1c14b9e",
"md5": "65fa74eeba8cdc22f43b676544a73590",
"sha256": "2482768a66e0c209b31ef4346511f1e31e8935911845579ab863deeea8084391"
},
"downloads": -1,
"filename": "getcl-0.1.tar.gz",
"has_sig": false,
"md5_digest": "65fa74eeba8cdc22f43b676544a73590",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3.6",
"size": 4100,
"upload_time": "2024-04-07T20:32:10",
"upload_time_iso_8601": "2024-04-07T20:32:10.014663Z",
"url": "https://files.pythonhosted.org/packages/a4/f1/3b73169d2173c42c9c7131b9734419283aed2e7e550b1138bc9ab1c14b9e/getcl-0.1.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-04-07 20:32:10",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "lingpy",
"github_project": "getcl",
"travis_ci": false,
"coveralls": false,
"github_actions": false,
"lcname": "getcl"
}