ChemDataExtractor
==================================
ChemDataExtractor v2 is a toolkit for extracting chemical information from the scientific literature. Python 3.5 to Python 3.8 supported.
Installation
------------
`pip install chemdataextractor2`
Features
--------
- HTML, XML and PDF document readers
- Chemistry-aware natural language processing pipeline
- Chemical named entity recognition
- Rule-based parsing grammars for property and spectra extraction
- Table parser for extracting tabulated data
- Document processing to resolve data interdependencies
Documentation & Development
-----------------------------
Please read the documentation for instructions on contributing to the project.
https://cambridgemolecularengineering-chemdataextractor-development.readthedocs-hosted.com/en/latest/
License
-------
ChemDataExtractor v2 is licensed under the `MIT license`_, a permissive, business-friendly license for open source
software.
MIT license: https://github.com/CambridgeMolecularEngineering/ChemDataExtractor/blob/master/LICENSE
Raw data
{
"_id": null,
"home_page": "https://github.com/CambridgeMolecularEngineering/ChemDataExtractor2",
"name": "chemdataextractor2",
"maintainer": "",
"docs_url": null,
"requires_python": "",
"maintainer_email": "",
"keywords": "text-mining mining chemistry cheminformatics nlp html xml science scientific",
"author": "Matt Swain, Callum Court, Juraj Mavracic, Taketomo Isazawa, and contributors",
"author_email": "m.swain@me.com, cc889@cam.ac.uk, jm2111@cam.ac.uk, ti250@cam.ac.uk",
"download_url": "https://files.pythonhosted.org/packages/7b/c2/bb79103208b511defb44b6428df76da93c4650a0a1bb74d74b77d3d82072/chemdataextractor2-2.2.2.tar.gz",
"platform": null,
"description": "ChemDataExtractor\n==================================\n\nChemDataExtractor v2 is a toolkit for extracting chemical information from the scientific literature. Python 3.5 to Python 3.8 supported.\n\n\nInstallation\n------------\n\n`pip install chemdataextractor2`\n\n\nFeatures\n--------\n\n- HTML, XML and PDF document readers\n- Chemistry-aware natural language processing pipeline\n- Chemical named entity recognition\n- Rule-based parsing grammars for property and spectra extraction\n- Table parser for extracting tabulated data\n- Document processing to resolve data interdependencies\n\nDocumentation & Development\n-----------------------------\n\nPlease read the documentation for instructions on contributing to the project.\n\nhttps://cambridgemolecularengineering-chemdataextractor-development.readthedocs-hosted.com/en/latest/\n\nLicense\n-------\n\nChemDataExtractor v2 is licensed under the `MIT license`_, a permissive, business-friendly license for open source\nsoftware.\n\nMIT license: https://github.com/CambridgeMolecularEngineering/ChemDataExtractor/blob/master/LICENSE\n\n\n",
"bugtrack_url": null,
"license": "MIT",
"summary": "A toolkit for extracting chemical information from the scientific literature.",
"version": "2.2.2",
"project_urls": {
"Homepage": "https://github.com/CambridgeMolecularEngineering/ChemDataExtractor2"
},
"split_keywords": [
"text-mining",
"mining",
"chemistry",
"cheminformatics",
"nlp",
"html",
"xml",
"science",
"scientific"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "7bc2bb79103208b511defb44b6428df76da93c4650a0a1bb74d74b77d3d82072",
"md5": "15166d476c176f7427b106762bcbc29e",
"sha256": "e8d4af74019bb7c1c9f18aae0df9af053e63dcdeafaefc1a81040ddcea84def4"
},
"downloads": -1,
"filename": "chemdataextractor2-2.2.2.tar.gz",
"has_sig": false,
"md5_digest": "15166d476c176f7427b106762bcbc29e",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 1099152,
"upload_time": "2024-01-18T12:54:05",
"upload_time_iso_8601": "2024-01-18T12:54:05.445100Z",
"url": "https://files.pythonhosted.org/packages/7b/c2/bb79103208b511defb44b6428df76da93c4650a0a1bb74d74b77d3d82072/chemdataextractor2-2.2.2.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-01-18 12:54:05",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "CambridgeMolecularEngineering",
"github_project": "ChemDataExtractor2",
"travis_ci": true,
"coveralls": false,
"github_actions": true,
"appveyor": true,
"requirements": [],
"lcname": "chemdataextractor2"
}