textract-cli


Nametextract-cli JSON
Version 0.1 PyPI version JSON
download
home_pageNone
SummaryCLI for running files through AWS Textract
upload_time2024-03-29 18:49:12
maintainerNone
docs_urlNone
authorSimon Willison
requires_python>=3.8
licenseApache-2.0
keywords
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # textract-cli

[![PyPI](https://img.shields.io/pypi/v/textract-cli.svg)](https://pypi.org/project/textract-cli/)
[![Changelog](https://img.shields.io/github/v/release/simonw/textract-cli?include_prereleases&label=changelog)](https://github.com/simonw/textract-cli/releases)
[![Tests](https://github.com/simonw/textract-cli/actions/workflows/test.yml/badge.svg)](https://github.com/simonw/textract-cli/actions/workflows/test.yml)
[![License](https://img.shields.io/badge/license-Apache%202.0-blue.svg)](https://github.com/simonw/textract-cli/blob/master/LICENSE)

CLI for running files through AWS Textract

## Installation

Install this tool using `pip`:
```bash
pip install textract-cli
```
## Configuration

Any of the [methods for configuring](https://boto3.amazonaws.com/v1/documentation/api/latest/guide/configuration.html) `boto3` will work with this tool. Environment variables or a `~/.aws/config` file are good options here.

## Usage

To run Textract OCR against a JPEG or PNG file (must be smaller than 5MB):
```bash
textract-cli image.jpeg
```
This will output to standard out. To save to a file use this:
```bash
textract-cli image.jpeg > output.txt
```
Or use the `-o/--output` option like this:
```bash
textract-cli image.jpeg -o output.txt
```

For help, run:
```bash
textract-cli --help
```
You can also use:
```bash
python -m textract_cli --help
```
## Development

To contribute to this tool, first checkout the code. Then create a new virtual environment:
```bash
cd textract-cli
python -m venv venv
source venv/bin/activate
```
Now install the dependencies and test dependencies:
```bash
pip install -e '.[test]'
```
To run the tests:
```bash
pytest
```

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "textract-cli",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.8",
    "maintainer_email": null,
    "keywords": null,
    "author": "Simon Willison",
    "author_email": null,
    "download_url": "https://files.pythonhosted.org/packages/a0/5c/de8cc5e2c38e41955a9b0185e83d1c38128e176c53b87d23597fcbc4df16/textract-cli-0.1.tar.gz",
    "platform": null,
    "description": "# textract-cli\n\n[![PyPI](https://img.shields.io/pypi/v/textract-cli.svg)](https://pypi.org/project/textract-cli/)\n[![Changelog](https://img.shields.io/github/v/release/simonw/textract-cli?include_prereleases&label=changelog)](https://github.com/simonw/textract-cli/releases)\n[![Tests](https://github.com/simonw/textract-cli/actions/workflows/test.yml/badge.svg)](https://github.com/simonw/textract-cli/actions/workflows/test.yml)\n[![License](https://img.shields.io/badge/license-Apache%202.0-blue.svg)](https://github.com/simonw/textract-cli/blob/master/LICENSE)\n\nCLI for running files through AWS Textract\n\n## Installation\n\nInstall this tool using `pip`:\n```bash\npip install textract-cli\n```\n## Configuration\n\nAny of the [methods for configuring](https://boto3.amazonaws.com/v1/documentation/api/latest/guide/configuration.html) `boto3` will work with this tool. Environment variables or a `~/.aws/config` file are good options here.\n\n## Usage\n\nTo run Textract OCR against a JPEG or PNG file (must be smaller than 5MB):\n```bash\ntextract-cli image.jpeg\n```\nThis will output to standard out. To save to a file use this:\n```bash\ntextract-cli image.jpeg > output.txt\n```\nOr use the `-o/--output` option like this:\n```bash\ntextract-cli image.jpeg -o output.txt\n```\n\nFor help, run:\n```bash\ntextract-cli --help\n```\nYou can also use:\n```bash\npython -m textract_cli --help\n```\n## Development\n\nTo contribute to this tool, first checkout the code. Then create a new virtual environment:\n```bash\ncd textract-cli\npython -m venv venv\nsource venv/bin/activate\n```\nNow install the dependencies and test dependencies:\n```bash\npip install -e '.[test]'\n```\nTo run the tests:\n```bash\npytest\n```\n",
    "bugtrack_url": null,
    "license": "Apache-2.0",
    "summary": "CLI for running files through AWS Textract",
    "version": "0.1",
    "project_urls": {
        "CI": "https://github.com/simonw/textract-cli/actions",
        "Changelog": "https://github.com/simonw/textract-cli/releases",
        "Homepage": "https://github.com/simonw/textract-cli",
        "Issues": "https://github.com/simonw/textract-cli/issues"
    },
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "fd746102999fc493cbfcfb944fd1ee4ff54831cfa2c4d672ca7df72359d8bcea",
                "md5": "90b1eeae1e65bba72bb3cb9152bb0c74",
                "sha256": "ed17add973cf94a43fc95a25ace82fb828747bb2867683cedd2cb9d2469d39e4"
            },
            "downloads": -1,
            "filename": "textract_cli-0.1-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "90b1eeae1e65bba72bb3cb9152bb0c74",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.8",
            "size": 7318,
            "upload_time": "2024-03-29T18:49:11",
            "upload_time_iso_8601": "2024-03-29T18:49:11.034082Z",
            "url": "https://files.pythonhosted.org/packages/fd/74/6102999fc493cbfcfb944fd1ee4ff54831cfa2c4d672ca7df72359d8bcea/textract_cli-0.1-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "a05cde8cc5e2c38e41955a9b0185e83d1c38128e176c53b87d23597fcbc4df16",
                "md5": "36e9783b54f6a52bf89b995576d76caa",
                "sha256": "ef555f6e1e52d63e8efd20e16c2fa950874a3e361a86f248755eccebdf17dffa"
            },
            "downloads": -1,
            "filename": "textract-cli-0.1.tar.gz",
            "has_sig": false,
            "md5_digest": "36e9783b54f6a52bf89b995576d76caa",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.8",
            "size": 6980,
            "upload_time": "2024-03-29T18:49:12",
            "upload_time_iso_8601": "2024-03-29T18:49:12.584375Z",
            "url": "https://files.pythonhosted.org/packages/a0/5c/de8cc5e2c38e41955a9b0185e83d1c38128e176c53b87d23597fcbc4df16/textract-cli-0.1.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-03-29 18:49:12",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "simonw",
    "github_project": "textract-cli",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "lcname": "textract-cli"
}
        
Elapsed time: 0.46758s