Textable
========
Textable is an open source add-on bringing advanced text-analytical
functionalities to the `Orange Canvas <http://orange.biolab.si/>`_ data mining
software package (itself open source). Look at the following `example
<http://orange-textable.readthedocs.io/en/latest/illustration.html>`_ to see
it in typical action.
The project's website is http://textable.io. It hosts a repository of
`recipes <http://textable.io/find-recipes>`_ to help you get started with
Textable.
Documentation is hosted at http://orange3-textable.readthedocs.io/ and
you can get further support at https://textable.freshdesk.com/ or by e-mail
to `support@textable.io <mailto:support@textable.io>`_
Orange Textable was designed and implemented by `LangTech Sarl
<http://langtech.ch>`_ on behalf of the `department of language and
information sciences (SLI) <http://www.unil.ch/sli>`_ at the `University of
Lausanne <http://www.unil.ch>`_ (see `Credits
<http://orange-textable.readthedocs.io/en/latest/credits.html>`_ and
`How to cite Orange Textable
<http://orange-textable.readthedocs.io/en/latest/citing.html>`_).
Features
--------
Basic text analysis
~~~~~~~~~~~~~~~~~~~
* use regular expressions to segment letters, words, sentences, etc. or full-text query
* use regexes to extract annotations from many input formats
* import in-line XML markup (e.g. TEI)
* include/exclude segments based on user-defined lists (stoplists)
* filter segments based on frequency
* easily generate random text samples
Advanced text analysis
~~~~~~~~~~~~~~~~~~~~~~
* concordances and collocations, also based on annotations
* segment distribution, document-term matrix, transition matrix, etc.
* co-occurrence tables, also between different types of segments
* lemmatization and POS-tagging via Treetagger
* robust linguistic complexity measures, incl. mean length of word, lexical diversity, etc.
* many advanced data mining algorithms: clustering, classification, factor analyses, etc.
Text recoding
~~~~~~~~~~~~~
* Unicode-aware preprocessing functions, e.g. remove accents from Ancient Greek text
* recode and restructure texts using regexes, e.g. rewrite CSV as XML
Extensibility
~~~~~~~~~~~~~
* handles hundreds of text files
* use Python script for custom text processing or to access external tools: NLTK, Pattern, GenSim, etc.
Interoperability
~~~~~~~~~~~~~~~~
* import text from keyboard, files, or URLs
* process any kind of raw text format: TXT, HTML, XML, CSV, etc.
* supports many text encodings, incl. Unicode
* export results in text files or copy-paste
* easy interfacing with Orange's Text Mining add-on
Ease of access
~~~~~~~~~~~~~~
* user-friendly visual interface
* ready-made recipes for a range of frequent use cases
* extensive documentation
* support and community forums
Raw data
{
"_id": null,
"home_page": "http://textable.io",
"name": "Orange3-Textable",
"maintainer": null,
"docs_url": null,
"requires_python": null,
"maintainer_email": null,
"keywords": "text mining, text analysis, textable, orange3, orange3 add-on",
"author": "LangTech Sarl",
"author_email": "info@langtech.ch",
"download_url": "https://files.pythonhosted.org/packages/6e/4b/82a1cd2a4521d8dc06b441ea3ec9be8cad98570aee4496fb9cc758e4d6d2/orange3_textable-3.2.7.tar.gz",
"platform": null,
"description": "Textable\r\n========\r\n\r\nTextable is an open source add-on bringing advanced text-analytical\r\nfunctionalities to the `Orange Canvas <http://orange.biolab.si/>`_ data mining\r\nsoftware package (itself open source). Look at the following `example\r\n<http://orange-textable.readthedocs.io/en/latest/illustration.html>`_ to see\r\nit in typical action.\r\n\r\nThe project's website is http://textable.io. It hosts a repository of\r\n`recipes <http://textable.io/find-recipes>`_ to help you get started with\r\nTextable.\r\n\r\nDocumentation is hosted at http://orange3-textable.readthedocs.io/ and\r\nyou can get further support at https://textable.freshdesk.com/ or by e-mail\r\nto `support@textable.io <mailto:support@textable.io>`_\r\n\r\nOrange Textable was designed and implemented by `LangTech Sarl\r\n<http://langtech.ch>`_ on behalf of the `department of language and\r\ninformation sciences (SLI) <http://www.unil.ch/sli>`_ at the `University of\r\nLausanne <http://www.unil.ch>`_ (see `Credits\r\n<http://orange-textable.readthedocs.io/en/latest/credits.html>`_ and\r\n`How to cite Orange Textable\r\n<http://orange-textable.readthedocs.io/en/latest/citing.html>`_).\r\n\r\nFeatures\r\n--------\r\n\r\nBasic text analysis\r\n~~~~~~~~~~~~~~~~~~~\r\n\r\n* use regular expressions to segment letters, words, sentences, etc. or full-text query\r\n* use regexes to extract annotations from many input formats\r\n* import in-line XML markup (e.g. TEI)\r\n* include/exclude segments based on user-defined lists (stoplists)\r\n* filter segments based on frequency\r\n* easily generate random text samples\r\n\r\nAdvanced text analysis\r\n~~~~~~~~~~~~~~~~~~~~~~\r\n\r\n* concordances and collocations, also based on annotations\r\n* segment distribution, document-term matrix, transition matrix, etc.\r\n* co-occurrence tables, also between different types of segments\r\n* lemmatization and POS-tagging via Treetagger\r\n* robust linguistic complexity measures, incl. mean length of word, lexical diversity, etc.\r\n* many advanced data mining algorithms: clustering, classification, factor analyses, etc.\r\n\r\nText recoding\r\n~~~~~~~~~~~~~\r\n\r\n* Unicode-aware preprocessing functions, e.g. remove accents from Ancient Greek text\r\n* recode and restructure texts using regexes, e.g. rewrite CSV as XML\r\n\r\nExtensibility\r\n~~~~~~~~~~~~~\r\n\r\n* handles hundreds of text files\r\n* use Python script for custom text processing or to access external tools: NLTK, Pattern, GenSim, etc.\r\n\r\nInteroperability\r\n~~~~~~~~~~~~~~~~\r\n\r\n* import text from keyboard, files, or URLs\r\n* process any kind of raw text format: TXT, HTML, XML, CSV, etc.\r\n* supports many text encodings, incl. Unicode\r\n* export results in text files or copy-paste\r\n* easy interfacing with Orange's Text Mining add-on\r\n\r\nEase of access\r\n~~~~~~~~~~~~~~\r\n\r\n* user-friendly visual interface\r\n* ready-made recipes for a range of frequent use cases\r\n* extensive documentation\r\n* support and community forums\r\n",
"bugtrack_url": null,
"license": "GPLv3",
"summary": "Textable add-on for Orange 3 data mining software package.",
"version": "3.2.7",
"project_urls": {
"Download": "https://github.com/axanthos/orange3-textable/archive/master.zip",
"Homepage": "http://textable.io"
},
"split_keywords": [
"text mining",
" text analysis",
" textable",
" orange3",
" orange3 add-on"
],
"urls": [
{
"comment_text": null,
"digests": {
"blake2b_256": "b4fc64b2504d41603e1b53c411492eca3d4f5b09133494194823da4ade75f346",
"md5": "6fb2cced4101911db53ff0b5ab3b75ee",
"sha256": "2758987393e7a54a6c80d630a619b6a4527b87ebbc43140fdecacec4dd048c65"
},
"downloads": -1,
"filename": "orange3_textable-3.2.7-py3-none-any.whl",
"has_sig": false,
"md5_digest": "6fb2cced4101911db53ff0b5ab3b75ee",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 205023,
"upload_time": "2025-10-20T14:34:49",
"upload_time_iso_8601": "2025-10-20T14:34:49.089537Z",
"url": "https://files.pythonhosted.org/packages/b4/fc/64b2504d41603e1b53c411492eca3d4f5b09133494194823da4ade75f346/orange3_textable-3.2.7-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": null,
"digests": {
"blake2b_256": "6e4b82a1cd2a4521d8dc06b441ea3ec9be8cad98570aee4496fb9cc758e4d6d2",
"md5": "c233bfad1c0ce88a4ef28f8c2b44f111",
"sha256": "92721d90acc1c76776b67f42587a9ef6b36a1dcffdc0c4eebd6e9b22afa750f4"
},
"downloads": -1,
"filename": "orange3_textable-3.2.7.tar.gz",
"has_sig": false,
"md5_digest": "c233bfad1c0ce88a4ef28f8c2b44f111",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 7023508,
"upload_time": "2025-10-20T14:33:50",
"upload_time_iso_8601": "2025-10-20T14:33:50.435650Z",
"url": "https://files.pythonhosted.org/packages/6e/4b/82a1cd2a4521d8dc06b441ea3ec9be8cad98570aee4496fb9cc758e4d6d2/orange3_textable-3.2.7.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2025-10-20 14:33:50",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "axanthos",
"github_project": "orange3-textable",
"travis_ci": false,
"coveralls": false,
"github_actions": false,
"lcname": "orange3-textable"
}