Name | auto-find-date-pdf JSON |
Version |
0.2.22
JSON |
| download |
home_page | |
Summary | A simple lib to find dates from any txt/ pdf/ docx/ rtf source. For documentation see |
upload_time | 2023-12-25 23:55:06 |
maintainer | |
docs_url | None |
author | Your Name |
requires_python | |
license | |
keywords |
|
VCS |
|
bugtrack_url |
|
requirements |
No requirements were recorded.
|
Travis-CI |
No Travis.
|
coveralls test coverage |
No coveralls.
|
# Simple use Date and text parsing from pdf rtf and images (with use of call back function)
This is a simple package provided by Marvsai healthcare LTD. It can find any format regular dates in a str
as python Datetime objects.
Easy to use method
` def find_dates(file_contents: str): `
Find any dates in a large python string usually taken from a file or pdf
Args:
file_contents (str): The string in which to find any format of dates
Returns:
List[datetime.datetime]: A list of datetime objects the latest can be found using max()
Optimised replacement of multiple strings in a string
` replace_multiple_strings(input_string, replacements_dict) `
Replace multiple strings in the input string using a dictionary of replacement pairs.
Args:
input_string (str): The string in which to replace the substrings.
replacements_dict (dict): A dictionary of replacement pairs, where the keys are the
substrings to be replaced and the values are the replacement strings.
Returns:
str: The input string with all instances of the substrings replaced with their
corresponding replacement strings.
"""
Easy to use extraction of text from PDF or RTF files:
`def extract_rtf_pdf(name: str, get_ai_text:Callable=None)->str:`
Find text from pdf and rtf
Args:
name (str): The string in which to find any format of dates
get_ai_text: call back function that can call google vision api or AWS or Azure equivalents for text extraction
Called for images and image PDFs.
Returns:
List[datetime.datetime]: A list of datetime objects the latest can be found using max()
"""
Raw data
{
"_id": null,
"home_page": "",
"name": "auto-find-date-pdf",
"maintainer": "",
"docs_url": null,
"requires_python": "",
"maintainer_email": "",
"keywords": "",
"author": "Your Name",
"author_email": "open@marvsai.com",
"download_url": "https://files.pythonhosted.org/packages/06/7c/b8cd0473fa210b2dc15fd423292ec9f4b7fa1fe12f6a0dc82cf4cc9b0f40/auto_find_date_pdf-0.2.22.tar.gz",
"platform": null,
"description": "# Simple use Date and text parsing from pdf rtf and images (with use of call back function)\r\n\r\nThis is a simple package provided by Marvsai healthcare LTD. It can find any format regular dates in a str \r\nas python Datetime objects.\r\n\r\nEasy to use method\r\n\r\n\r\n` def find_dates(file_contents: str): `\r\n\r\n Find any dates in a large python string usually taken from a file or pdf\r\n\r\n Args:\r\n file_contents (str): The string in which to find any format of dates\r\n\r\n\r\n Returns:\r\n List[datetime.datetime]: A list of datetime objects the latest can be found using max()\r\n\r\n\r\nOptimised replacement of multiple strings in a string\r\n\r\n` replace_multiple_strings(input_string, replacements_dict) ` \r\n\r\n Replace multiple strings in the input string using a dictionary of replacement pairs.\r\n\r\n Args:\r\n input_string (str): The string in which to replace the substrings.\r\n replacements_dict (dict): A dictionary of replacement pairs, where the keys are the\r\n substrings to be replaced and the values are the replacement strings.\r\n\r\n Returns:\r\n str: The input string with all instances of the substrings replaced with their\r\n corresponding replacement strings.\r\n \"\"\"\r\nEasy to use extraction of text from PDF or RTF files:\r\n\r\n`def extract_rtf_pdf(name: str, get_ai_text:Callable=None)->str:`\r\n \r\n Find text from pdf and rtf\r\n\r\n Args:\r\n name (str): The string in which to find any format of dates\r\n get_ai_text: call back function that can call google vision api or AWS or Azure equivalents for text extraction\r\n Called for images and image PDFs.\r\n\r\n Returns:\r\n List[datetime.datetime]: A list of datetime objects the latest can be found using max()\r\n \"\"\"\r\n",
"bugtrack_url": null,
"license": "",
"summary": "A simple lib to find dates from any txt/ pdf/ docx/ rtf source. For documentation see",
"version": "0.2.22",
"project_urls": null,
"split_keywords": [],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "e4268b0da77e94bf38e511571e381d36f0a112169ec27ca84594280cd93d8b45",
"md5": "caa82a09e705ede3466c00f06eee6ab3",
"sha256": "c9e0de9e4cc56582e475f715467f73723f5e00539c995392bc5f1869d325967a"
},
"downloads": -1,
"filename": "auto_find_date_pdf-0.2.22-py3-none-any.whl",
"has_sig": false,
"md5_digest": "caa82a09e705ede3466c00f06eee6ab3",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 5712,
"upload_time": "2023-12-25T23:55:04",
"upload_time_iso_8601": "2023-12-25T23:55:04.651641Z",
"url": "https://files.pythonhosted.org/packages/e4/26/8b0da77e94bf38e511571e381d36f0a112169ec27ca84594280cd93d8b45/auto_find_date_pdf-0.2.22-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "067cb8cd0473fa210b2dc15fd423292ec9f4b7fa1fe12f6a0dc82cf4cc9b0f40",
"md5": "c5e9da1f5e0869a54cc95339f23d6db4",
"sha256": "2e1b3915759eea4313d344ff76ac77b1ea817529d5ca09e112374721d682e536"
},
"downloads": -1,
"filename": "auto_find_date_pdf-0.2.22.tar.gz",
"has_sig": false,
"md5_digest": "c5e9da1f5e0869a54cc95339f23d6db4",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 5229,
"upload_time": "2023-12-25T23:55:06",
"upload_time_iso_8601": "2023-12-25T23:55:06.007417Z",
"url": "https://files.pythonhosted.org/packages/06/7c/b8cd0473fa210b2dc15fd423292ec9f4b7fa1fe12f6a0dc82cf4cc9b0f40/auto_find_date_pdf-0.2.22.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2023-12-25 23:55:06",
"github": false,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"lcname": "auto-find-date-pdf"
}