nendo-plugin-transcribe-whisper


Namenendo-plugin-transcribe-whisper JSON
Version 0.1.0 PyPI version JSON
download
home_pagehttps://okio.ai
SummaryA nendo plugin for speech transcription, based on Whisper by OpenAI.
upload_time2024-02-21 10:26:16
maintainer
docs_urlNone
authorAaron Abebe
requires_python>=3.8,<3.11
licenseMIT
keywords ai generative music okio nendo audio production audio transcription music analysis song analysis
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # Nendo Plugin Transcribe Whisper

<br>
<p align="left">
    <img src="https://okio.ai/docs/assets/nendo_core_logo.png" width="350" alt="nendo core">
</p>
<br>

<p align="left">
<a href="https://okio.ai" target="_blank">
    <img src="https://img.shields.io/website/https/okio.ai" alt="Website">
</a>
<a href="https://twitter.com/okio_ai" target="_blank">
    <img src="https://img.shields.io/twitter/url/https/twitter.com/okio_ai.svg?style=social&label=Follow%20%40okio_ai" alt="Twitter">
</a>
<a href="https://discord.gg/gaZMZKzScj" target="_blank">
    <img src="https://dcbadge.vercel.app/api/server/XpkUsjwXTp?compact=true&style=flat" alt="Discord">
</a>
</p>

---

A nendo plugin for speech transcription, based on Whisper by OpenAI.

## Features

- Fast speech transcription with optional word-level timestamps.

## Requirements

Since we depend on `transformers`, please make sure that you fulfill their requirements.
You also need Pytorch installed on your system, please refer to the [pytorch installation instructions](https://pytorch.org/get-started/locally/).

## Installation

1. [Install Nendo](https://github.com/okio-ai/nendo#installation)
2. `pip install nendo-plugin-transcribe-whisper`

If you have a cuda GPU on your machine you can also install `flash-attn` to get an additional speedup:

`pip install flash-attn --no-build-isolation`

Then set `ATTN_IMPLEMENTATION=flash_attention_2` in your environment variables.

## Usage
```pycon
>>> from nendo import Nendo
>>> nd = Nendo(plugins=["nendo_plugin_transcribe_whisper"])
>>> track = nd.library.add_track(file_path="path/to/file.mp3")

>>> nd.plugins.transcribe_whisper(track=track)
>>> track.get_plugin_value("transcription")
```

## Contributing
Visit our docs to learn all about how to contribute to Nendo: [Contributing](https://okio.ai/docs/contributing/)

## License
Nendo: MIT License

Pretrained models: The weights are released under the Apache 2.0 license.

            

Raw data

            {
    "_id": null,
    "home_page": "https://okio.ai",
    "name": "nendo-plugin-transcribe-whisper",
    "maintainer": "",
    "docs_url": null,
    "requires_python": ">=3.8,<3.11",
    "maintainer_email": "",
    "keywords": "AI,generative,music,okio,nendo,audio production,audio transcription,music analysis,song analysis",
    "author": "Aaron Abebe",
    "author_email": "aaron@okio.ai",
    "download_url": "https://files.pythonhosted.org/packages/a4/f1/d3401aedb19b1a2650d10872524f885129a6114ce6dc4a72bea373613256/nendo_plugin_transcribe_whisper-0.1.0.tar.gz",
    "platform": null,
    "description": "# Nendo Plugin Transcribe Whisper\n\n<br>\n<p align=\"left\">\n    <img src=\"https://okio.ai/docs/assets/nendo_core_logo.png\" width=\"350\" alt=\"nendo core\">\n</p>\n<br>\n\n<p align=\"left\">\n<a href=\"https://okio.ai\" target=\"_blank\">\n    <img src=\"https://img.shields.io/website/https/okio.ai\" alt=\"Website\">\n</a>\n<a href=\"https://twitter.com/okio_ai\" target=\"_blank\">\n    <img src=\"https://img.shields.io/twitter/url/https/twitter.com/okio_ai.svg?style=social&label=Follow%20%40okio_ai\" alt=\"Twitter\">\n</a>\n<a href=\"https://discord.gg/gaZMZKzScj\" target=\"_blank\">\n    <img src=\"https://dcbadge.vercel.app/api/server/XpkUsjwXTp?compact=true&style=flat\" alt=\"Discord\">\n</a>\n</p>\n\n---\n\nA nendo plugin for speech transcription, based on Whisper by OpenAI.\n\n## Features\n\n- Fast speech transcription with optional word-level timestamps.\n\n## Requirements\n\nSince we depend on `transformers`, please make sure that you fulfill their requirements.\nYou also need Pytorch installed on your system, please refer to the [pytorch installation instructions](https://pytorch.org/get-started/locally/).\n\n## Installation\n\n1. [Install Nendo](https://github.com/okio-ai/nendo#installation)\n2. `pip install nendo-plugin-transcribe-whisper`\n\nIf you have a cuda GPU on your machine you can also install `flash-attn` to get an additional speedup:\n\n`pip install flash-attn --no-build-isolation`\n\nThen set `ATTN_IMPLEMENTATION=flash_attention_2` in your environment variables.\n\n## Usage\n```pycon\n>>> from nendo import Nendo\n>>> nd = Nendo(plugins=[\"nendo_plugin_transcribe_whisper\"])\n>>> track = nd.library.add_track(file_path=\"path/to/file.mp3\")\n\n>>> nd.plugins.transcribe_whisper(track=track)\n>>> track.get_plugin_value(\"transcription\")\n```\n\n## Contributing\nVisit our docs to learn all about how to contribute to Nendo: [Contributing](https://okio.ai/docs/contributing/)\n\n## License\nNendo: MIT License\n\nPretrained models: The weights are released under the Apache 2.0 license.\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "A nendo plugin for speech transcription, based on Whisper by OpenAI.",
    "version": "0.1.0",
    "project_urls": {
        "Homepage": "https://okio.ai",
        "Repository": "https://github.com/okio-ai/nendo_plugin_transcribe_whisper"
    },
    "split_keywords": [
        "ai",
        "generative",
        "music",
        "okio",
        "nendo",
        "audio production",
        "audio transcription",
        "music analysis",
        "song analysis"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "608e1ed3c433c9733ae6ada8d9b7b8e39c1a5a8b074f98feb1666ed933b84b58",
                "md5": "13cd9bf2a13ae4c0e2ebd61aff5b848c",
                "sha256": "b90d094e63f345a7fc6cdefc6fd8e430921690e899f3a45a4d1fbc5016be0131"
            },
            "downloads": -1,
            "filename": "nendo_plugin_transcribe_whisper-0.1.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "13cd9bf2a13ae4c0e2ebd61aff5b848c",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.8,<3.11",
            "size": 5461,
            "upload_time": "2024-02-21T10:26:14",
            "upload_time_iso_8601": "2024-02-21T10:26:14.429944Z",
            "url": "https://files.pythonhosted.org/packages/60/8e/1ed3c433c9733ae6ada8d9b7b8e39c1a5a8b074f98feb1666ed933b84b58/nendo_plugin_transcribe_whisper-0.1.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "a4f1d3401aedb19b1a2650d10872524f885129a6114ce6dc4a72bea373613256",
                "md5": "405cc9921f5acb93872a41749734538a",
                "sha256": "e9c5326476588e41495c41838a1f975b88973733f16d9b68d3f3c81a74c92b78"
            },
            "downloads": -1,
            "filename": "nendo_plugin_transcribe_whisper-0.1.0.tar.gz",
            "has_sig": false,
            "md5_digest": "405cc9921f5acb93872a41749734538a",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.8,<3.11",
            "size": 5244,
            "upload_time": "2024-02-21T10:26:16",
            "upload_time_iso_8601": "2024-02-21T10:26:16.289426Z",
            "url": "https://files.pythonhosted.org/packages/a4/f1/d3401aedb19b1a2650d10872524f885129a6114ce6dc4a72bea373613256/nendo_plugin_transcribe_whisper-0.1.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-02-21 10:26:16",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "okio-ai",
    "github_project": "nendo_plugin_transcribe_whisper",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "lcname": "nendo-plugin-transcribe-whisper"
}
        
Elapsed time: 0.44286s