ChemDataExtractor
==================================
ChemDataExtractor v2 is a toolkit for extracting chemical information from the scientific literature. Python 3.5 to Python 3.8 supported.
Installation
------------
`pip install chemdataextractor2`
Features
--------
- HTML, XML and PDF document readers
- Chemistry-aware natural language processing pipeline
- Chemical named entity recognition
- Rule-based parsing grammars for property and spectra extraction
- Table parser for extracting tabulated data
- Document processing to resolve data interdependencies
Documentation & Development
-----------------------------
Please read the documentation for instructions on contributing to the project.
https://cambridgemolecularengineering-chemdataextractor-development.readthedocs-hosted.com/en/latest/
License
-------
ChemDataExtractor v2 is licensed under the `MIT license`_, a permissive, business-friendly license for open source
software.
MIT license: https://github.com/CambridgeMolecularEngineering/ChemDataExtractor/blob/master/LICENSE
Raw data
{
"_id": null,
"home_page": "https://github.com/CambridgeMolecularEngineering/ChemDataExtractor2",
"name": "chemdataextractor2",
"maintainer": null,
"docs_url": null,
"requires_python": null,
"maintainer_email": null,
"keywords": "text-mining mining chemistry cheminformatics nlp html xml science scientific",
"author": "Matt Swain, Callum Court, Juraj Mavracic, Taketomo Isazawa, and contributors",
"author_email": "m.swain@me.com, cc889@cam.ac.uk, jm2111@cam.ac.uk, ti250@cam.ac.uk",
"download_url": "https://files.pythonhosted.org/packages/62/f5/bd57ce5a8917cf9161f0892181dede8e3731e8e211d3640f3c2b8cd44344/chemdataextractor2-2.3.2.tar.gz",
"platform": null,
"description": "ChemDataExtractor\n==================================\n\nChemDataExtractor v2 is a toolkit for extracting chemical information from the scientific literature. Python 3.5 to Python 3.8 supported.\n\n\nInstallation\n------------\n\n`pip install chemdataextractor2`\n\n\nFeatures\n--------\n\n- HTML, XML and PDF document readers\n- Chemistry-aware natural language processing pipeline\n- Chemical named entity recognition\n- Rule-based parsing grammars for property and spectra extraction\n- Table parser for extracting tabulated data\n- Document processing to resolve data interdependencies\n\nDocumentation & Development\n-----------------------------\n\nPlease read the documentation for instructions on contributing to the project.\n\nhttps://cambridgemolecularengineering-chemdataextractor-development.readthedocs-hosted.com/en/latest/\n\nLicense\n-------\n\nChemDataExtractor v2 is licensed under the `MIT license`_, a permissive, business-friendly license for open source\nsoftware.\n\nMIT license: https://github.com/CambridgeMolecularEngineering/ChemDataExtractor/blob/master/LICENSE\n",
"bugtrack_url": null,
"license": "MIT",
"summary": "A toolkit for extracting chemical information from the scientific literature.",
"version": "2.3.2",
"project_urls": {
"Homepage": "https://github.com/CambridgeMolecularEngineering/ChemDataExtractor2"
},
"split_keywords": [
"text-mining",
"mining",
"chemistry",
"cheminformatics",
"nlp",
"html",
"xml",
"science",
"scientific"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "62f5bd57ce5a8917cf9161f0892181dede8e3731e8e211d3640f3c2b8cd44344",
"md5": "3cc4cbde8151a075962ffaa48b17d3f7",
"sha256": "9dd40dd25609d3252f14d9951e743b34682a32e21fa1dfd7b2a60f8e3ffc436d"
},
"downloads": -1,
"filename": "chemdataextractor2-2.3.2.tar.gz",
"has_sig": false,
"md5_digest": "3cc4cbde8151a075962ffaa48b17d3f7",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 961460,
"upload_time": "2024-05-06T15:40:16",
"upload_time_iso_8601": "2024-05-06T15:40:16.864439Z",
"url": "https://files.pythonhosted.org/packages/62/f5/bd57ce5a8917cf9161f0892181dede8e3731e8e211d3640f3c2b8cd44344/chemdataextractor2-2.3.2.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-05-06 15:40:16",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "CambridgeMolecularEngineering",
"github_project": "ChemDataExtractor2",
"travis_ci": true,
"coveralls": false,
"github_actions": true,
"appveyor": true,
"requirements": [],
"lcname": "chemdataextractor2"
}