UniParse


NameUniParse JSON
Version 1.0.1 PyPI version JSON
download
home_pagehttps://github.com/hridesh-net/praserlib.git
SummaryA library to parse PDF, DOCX, and TXT files
upload_time2024-10-18 16:20:49
maintainerNone
docs_urlNone
authorHridesh
requires_python>=3.8
licenseNone
keywords parse parser pdf docx txt uniparse uniparser
VCS
bugtrack_url
requirements PyMuPDF python-docx
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # UniParse

A Python library to parse PDF, DOCX, and TXT files, now with resume summarization capabilities.

## Installation

```bash
pip install UniParse
```

## How to Use
```python
from UniParse import FileParser

parser = FileParser('path/to/your/file.pdf')
content = parser.parse()
print(content)
```

## Features
- Parse text from PDF files
- Extract content from DOCX documents
- Read text from TXT files

### Parsing Resumes and Extracting Information

```python
from UniParse import ResumeParser

parser = ResumeParser('path/to/resume.pdf')
data = parser.get_extracted_data()

print("Resume Data:")
print(data)

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/hridesh-net/praserlib.git",
    "name": "UniParse",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.8",
    "maintainer_email": null,
    "keywords": "parse parser pdf docx txt uniparse uniparser",
    "author": "Hridesh",
    "author_email": "hridesh.khandal@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/c5/b1/364537ab991b42143776f4019ca817706e2711cdb1e7a23e97e93ccedb6c/uniparse-1.0.1.tar.gz",
    "platform": null,
    "description": "# UniParse\n\nA Python library to parse PDF, DOCX, and TXT files, now with resume summarization capabilities.\n\n## Installation\n\n```bash\npip install UniParse\n```\n\n## How to Use\n```python\nfrom UniParse import FileParser\n\nparser = FileParser('path/to/your/file.pdf')\ncontent = parser.parse()\nprint(content)\n```\n\n## Features\n- Parse text from PDF files\n- Extract content from DOCX documents\n- Read text from TXT files\n\n### Parsing Resumes and Extracting Information\n\n```python\nfrom UniParse import ResumeParser\n\nparser = ResumeParser('path/to/resume.pdf')\ndata = parser.get_extracted_data()\n\nprint(\"Resume Data:\")\nprint(data)\n",
    "bugtrack_url": null,
    "license": null,
    "summary": "A library to parse PDF, DOCX, and TXT files",
    "version": "1.0.1",
    "project_urls": {
        "Homepage": "https://github.com/hridesh-net/praserlib.git"
    },
    "split_keywords": [
        "parse",
        "parser",
        "pdf",
        "docx",
        "txt",
        "uniparse",
        "uniparser"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "371369cc3abe7da0a326cc3c159a059ceb7fbaaf1aa13d50fc794cdfd6f7f197",
                "md5": "d212eba203ae84bd7ad78c30f5f6dde9",
                "sha256": "7856807b32ac189f368d66dd678176819c61ef9f50d2b1206fa7b13c58928dcf"
            },
            "downloads": -1,
            "filename": "UniParse-1.0.1-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "d212eba203ae84bd7ad78c30f5f6dde9",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.8",
            "size": 8451,
            "upload_time": "2024-10-18T16:20:47",
            "upload_time_iso_8601": "2024-10-18T16:20:47.432034Z",
            "url": "https://files.pythonhosted.org/packages/37/13/69cc3abe7da0a326cc3c159a059ceb7fbaaf1aa13d50fc794cdfd6f7f197/UniParse-1.0.1-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "c5b1364537ab991b42143776f4019ca817706e2711cdb1e7a23e97e93ccedb6c",
                "md5": "8ad30729497a553fd1bf01104f12e2f7",
                "sha256": "da62edb8caeb4aa9cf76eb61cb920bc45d7da40523f5d8b2188e985b35af2fe1"
            },
            "downloads": -1,
            "filename": "uniparse-1.0.1.tar.gz",
            "has_sig": false,
            "md5_digest": "8ad30729497a553fd1bf01104f12e2f7",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.8",
            "size": 8781,
            "upload_time": "2024-10-18T16:20:49",
            "upload_time_iso_8601": "2024-10-18T16:20:49.213192Z",
            "url": "https://files.pythonhosted.org/packages/c5/b1/364537ab991b42143776f4019ca817706e2711cdb1e7a23e97e93ccedb6c/uniparse-1.0.1.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-10-18 16:20:49",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "hridesh-net",
    "github_project": "praserlib",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "requirements": [
        {
            "name": "PyMuPDF",
            "specs": [
                [
                    ">=",
                    "1.18.0"
                ]
            ]
        },
        {
            "name": "python-docx",
            "specs": [
                [
                    ">=",
                    "0.8.10"
                ]
            ]
        }
    ],
    "lcname": "uniparse"
}
        
Elapsed time: 0.32984s