auto-find-date-pdf


Nameauto-find-date-pdf JSON
Version 0.2.22 PyPI version JSON
download
home_page
SummaryA simple lib to find dates from any txt/ pdf/ docx/ rtf source. For documentation see
upload_time2023-12-25 23:55:06
maintainer
docs_urlNone
authorYour Name
requires_python
license
keywords
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # Simple use Date and text parsing from pdf rtf and images (with use of call back function)

This is a simple package provided by Marvsai healthcare LTD. It can find any format regular dates in a str 
as python Datetime objects.

Easy to use method


` def find_dates(file_contents: str): `

      Find any dates in a large python string usually taken from a file or pdf

      Args:
          file_contents (str): The string in which to find any format of dates


      Returns:
          List[datetime.datetime]: A list of datetime objects the latest can be found using max()


Optimised replacement of multiple strings in a string

` replace_multiple_strings(input_string, replacements_dict) ` 

    Replace multiple strings in the input string using a dictionary of replacement pairs.

    Args:
        input_string (str): The string in which to replace the substrings.
        replacements_dict (dict): A dictionary of replacement pairs, where the keys are the
            substrings to be replaced and the values are the replacement strings.

    Returns:
        str: The input string with all instances of the substrings replaced with their
            corresponding replacement strings.
    """
Easy to use extraction of text from PDF or RTF files:

`def extract_rtf_pdf(name: str, get_ai_text:Callable=None)->str:`
 
      Find text from pdf and rtf

      Args:
          name (str): The string in which to find any format of dates
          get_ai_text: call back function that can call google vision api or AWS or Azure equivalents for text extraction
          Called for images and image PDFs.

      Returns:
          List[datetime.datetime]: A list of datetime objects the latest can be found using max()
      """

            

Raw data

            {
    "_id": null,
    "home_page": "",
    "name": "auto-find-date-pdf",
    "maintainer": "",
    "docs_url": null,
    "requires_python": "",
    "maintainer_email": "",
    "keywords": "",
    "author": "Your Name",
    "author_email": "open@marvsai.com",
    "download_url": "https://files.pythonhosted.org/packages/06/7c/b8cd0473fa210b2dc15fd423292ec9f4b7fa1fe12f6a0dc82cf4cc9b0f40/auto_find_date_pdf-0.2.22.tar.gz",
    "platform": null,
    "description": "# Simple use Date and text parsing from pdf rtf and images (with use of call back function)\r\n\r\nThis is a simple package provided by Marvsai healthcare LTD. It can find any format regular dates in a str \r\nas python Datetime objects.\r\n\r\nEasy to use method\r\n\r\n\r\n` def find_dates(file_contents: str): `\r\n\r\n      Find any dates in a large python string usually taken from a file or pdf\r\n\r\n      Args:\r\n          file_contents (str): The string in which to find any format of dates\r\n\r\n\r\n      Returns:\r\n          List[datetime.datetime]: A list of datetime objects the latest can be found using max()\r\n\r\n\r\nOptimised replacement of multiple strings in a string\r\n\r\n` replace_multiple_strings(input_string, replacements_dict) ` \r\n\r\n    Replace multiple strings in the input string using a dictionary of replacement pairs.\r\n\r\n    Args:\r\n        input_string (str): The string in which to replace the substrings.\r\n        replacements_dict (dict): A dictionary of replacement pairs, where the keys are the\r\n            substrings to be replaced and the values are the replacement strings.\r\n\r\n    Returns:\r\n        str: The input string with all instances of the substrings replaced with their\r\n            corresponding replacement strings.\r\n    \"\"\"\r\nEasy to use extraction of text from PDF or RTF files:\r\n\r\n`def extract_rtf_pdf(name: str, get_ai_text:Callable=None)->str:`\r\n \r\n      Find text from pdf and rtf\r\n\r\n      Args:\r\n          name (str): The string in which to find any format of dates\r\n          get_ai_text: call back function that can call google vision api or AWS or Azure equivalents for text extraction\r\n          Called for images and image PDFs.\r\n\r\n      Returns:\r\n          List[datetime.datetime]: A list of datetime objects the latest can be found using max()\r\n      \"\"\"\r\n",
    "bugtrack_url": null,
    "license": "",
    "summary": "A simple lib to find dates from any txt/ pdf/ docx/ rtf source. For documentation see",
    "version": "0.2.22",
    "project_urls": null,
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "e4268b0da77e94bf38e511571e381d36f0a112169ec27ca84594280cd93d8b45",
                "md5": "caa82a09e705ede3466c00f06eee6ab3",
                "sha256": "c9e0de9e4cc56582e475f715467f73723f5e00539c995392bc5f1869d325967a"
            },
            "downloads": -1,
            "filename": "auto_find_date_pdf-0.2.22-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "caa82a09e705ede3466c00f06eee6ab3",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": null,
            "size": 5712,
            "upload_time": "2023-12-25T23:55:04",
            "upload_time_iso_8601": "2023-12-25T23:55:04.651641Z",
            "url": "https://files.pythonhosted.org/packages/e4/26/8b0da77e94bf38e511571e381d36f0a112169ec27ca84594280cd93d8b45/auto_find_date_pdf-0.2.22-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "067cb8cd0473fa210b2dc15fd423292ec9f4b7fa1fe12f6a0dc82cf4cc9b0f40",
                "md5": "c5e9da1f5e0869a54cc95339f23d6db4",
                "sha256": "2e1b3915759eea4313d344ff76ac77b1ea817529d5ca09e112374721d682e536"
            },
            "downloads": -1,
            "filename": "auto_find_date_pdf-0.2.22.tar.gz",
            "has_sig": false,
            "md5_digest": "c5e9da1f5e0869a54cc95339f23d6db4",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": null,
            "size": 5229,
            "upload_time": "2023-12-25T23:55:06",
            "upload_time_iso_8601": "2023-12-25T23:55:06.007417Z",
            "url": "https://files.pythonhosted.org/packages/06/7c/b8cd0473fa210b2dc15fd423292ec9f4b7fa1fe12f6a0dc82cf4cc9b0f40/auto_find_date_pdf-0.2.22.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-12-25 23:55:06",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "auto-find-date-pdf"
}
        
Elapsed time: 0.19320s