cameo-txt


Namecameo-txt JSON
Version 0.0.1 PyPI version JSON
download
home_page
Summary將各種格式的檔案提取成txt
upload_time2023-08-08 12:17:45
maintainer
docs_urlNone
authorJcXGTcW
requires_python
license
keywords
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # cameo-txt

`cameo-txt` 是一個用於將不同檔案格式(如 docx、pdf、csv、odt 等)轉換為純文本文件的 Python 庫。

## 安裝

您可以使用以下命令安裝此套件:

```bash
pip install cameo-txt
```
## 用法
以下是一個簡單的例子,說明如何使用這個函數庫:

```
from cameo_txt import convert_to_txt

# 單個檔案
result = convert_to_txt('path/to/your/file.docx')

# 多個檔案
results = convert_to_txt(['path/to/your/file1.pdf', 'path/to/your/file2.csv'])

# 保存到特定輸出資料夾
results = convert_to_txt(['path/to/your/file1.pdf', 'path/to/your/file2.csv'], output_folder='path/to/output/folder')
```
## 功能
cameo-txt主要提供以下功能:
### 下載文件
如果提供了URL,庫將自動下載文件並保存為臨時文件。
### 支援多種格式
支援docx、pdf、csv和odt格式的文件。您可以輕鬆添加對更多格式的支援。
### 並行處理
使用concurrent.futures並行處理多個文件,以提高效率。
### 自動編碼檢測
使用chardet自動檢測和處理不同編碼的文件。

            

Raw data

            {
    "_id": null,
    "home_page": "",
    "name": "cameo-txt",
    "maintainer": "",
    "docs_url": null,
    "requires_python": "",
    "maintainer_email": "",
    "keywords": "",
    "author": "JcXGTcW",
    "author_email": "",
    "download_url": "https://files.pythonhosted.org/packages/66/47/d50c62cfb058b18763a28d47a8dca76f262be0fe429252add82a87131684/cameo-txt-0.0.1.tar.gz",
    "platform": null,
    "description": "# cameo-txt\r\n\r\n`cameo-txt` \u662f\u4e00\u500b\u7528\u65bc\u5c07\u4e0d\u540c\u6a94\u6848\u683c\u5f0f\uff08\u5982 docx\u3001pdf\u3001csv\u3001odt \u7b49\uff09\u8f49\u63db\u70ba\u7d14\u6587\u672c\u6587\u4ef6\u7684 Python \u5eab\u3002\r\n\r\n## \u5b89\u88dd\r\n\r\n\u60a8\u53ef\u4ee5\u4f7f\u7528\u4ee5\u4e0b\u547d\u4ee4\u5b89\u88dd\u6b64\u5957\u4ef6\uff1a\r\n\r\n```bash\r\npip install cameo-txt\r\n```\r\n## \u7528\u6cd5\r\n\u4ee5\u4e0b\u662f\u4e00\u500b\u7c21\u55ae\u7684\u4f8b\u5b50\uff0c\u8aaa\u660e\u5982\u4f55\u4f7f\u7528\u9019\u500b\u51fd\u6578\u5eab\uff1a\r\n\r\n```\r\nfrom cameo_txt import convert_to_txt\r\n\r\n# \u55ae\u500b\u6a94\u6848\r\nresult = convert_to_txt('path/to/your/file.docx')\r\n\r\n# \u591a\u500b\u6a94\u6848\r\nresults = convert_to_txt(['path/to/your/file1.pdf', 'path/to/your/file2.csv'])\r\n\r\n# \u4fdd\u5b58\u5230\u7279\u5b9a\u8f38\u51fa\u8cc7\u6599\u593e\r\nresults = convert_to_txt(['path/to/your/file1.pdf', 'path/to/your/file2.csv'], output_folder='path/to/output/folder')\r\n```\r\n## \u529f\u80fd\r\ncameo-txt\u4e3b\u8981\u63d0\u4f9b\u4ee5\u4e0b\u529f\u80fd\uff1a\r\n### \u4e0b\u8f09\u6587\u4ef6\r\n\u5982\u679c\u63d0\u4f9b\u4e86URL\uff0c\u5eab\u5c07\u81ea\u52d5\u4e0b\u8f09\u6587\u4ef6\u4e26\u4fdd\u5b58\u70ba\u81e8\u6642\u6587\u4ef6\u3002\r\n### \u652f\u63f4\u591a\u7a2e\u683c\u5f0f\r\n\u652f\u63f4docx\u3001pdf\u3001csv\u548codt\u683c\u5f0f\u7684\u6587\u4ef6\u3002\u60a8\u53ef\u4ee5\u8f15\u9b06\u6dfb\u52a0\u5c0d\u66f4\u591a\u683c\u5f0f\u7684\u652f\u63f4\u3002\r\n### \u4e26\u884c\u8655\u7406\r\n\u4f7f\u7528concurrent.futures\u4e26\u884c\u8655\u7406\u591a\u500b\u6587\u4ef6\uff0c\u4ee5\u63d0\u9ad8\u6548\u7387\u3002\r\n### \u81ea\u52d5\u7de8\u78bc\u6aa2\u6e2c\r\n\u4f7f\u7528chardet\u81ea\u52d5\u6aa2\u6e2c\u548c\u8655\u7406\u4e0d\u540c\u7de8\u78bc\u7684\u6587\u4ef6\u3002\r\n",
    "bugtrack_url": null,
    "license": "",
    "summary": "\u5c07\u5404\u7a2e\u683c\u5f0f\u7684\u6a94\u6848\u63d0\u53d6\u6210txt",
    "version": "0.0.1",
    "project_urls": null,
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "1960de69c7e1dc49afae97eef7a83f28c85b25a934d483bdf05c45b0e049c0bf",
                "md5": "bd24fa92587570294cad19c24ca849dc",
                "sha256": "6b5724688b63280a62f08eb80d86c0bce084d99b4e6994ebc270de2fc9eaafc0"
            },
            "downloads": -1,
            "filename": "cameo_txt-0.0.1-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "bd24fa92587570294cad19c24ca849dc",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": null,
            "size": 3526,
            "upload_time": "2023-08-08T12:17:43",
            "upload_time_iso_8601": "2023-08-08T12:17:43.702148Z",
            "url": "https://files.pythonhosted.org/packages/19/60/de69c7e1dc49afae97eef7a83f28c85b25a934d483bdf05c45b0e049c0bf/cameo_txt-0.0.1-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "6647d50c62cfb058b18763a28d47a8dca76f262be0fe429252add82a87131684",
                "md5": "067e692de6da26f527d971ac8932d7b9",
                "sha256": "c590a80ca360b9acdffded7be755b933cb58dfa5db07d54f48846a5fc1dab23d"
            },
            "downloads": -1,
            "filename": "cameo-txt-0.0.1.tar.gz",
            "has_sig": false,
            "md5_digest": "067e692de6da26f527d971ac8932d7b9",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": null,
            "size": 3277,
            "upload_time": "2023-08-08T12:17:45",
            "upload_time_iso_8601": "2023-08-08T12:17:45.557661Z",
            "url": "https://files.pythonhosted.org/packages/66/47/d50c62cfb058b18763a28d47a8dca76f262be0fe429252add82a87131684/cameo-txt-0.0.1.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-08-08 12:17:45",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "cameo-txt"
}
        
Elapsed time: 0.31689s