Name | bcut-asr JSON |
Version |
0.0.3
JSON |
| download |
home_page | None |
Summary | 使用必剪API的语音字幕识别 |
upload_time | 2024-04-28 03:06:44 |
maintainer | None |
docs_url | None |
author | SocialSisterYi |
requires_python | >=3.10 |
license | MIT |
keywords |
|
VCS |
|
bugtrack_url |
|
requirements |
No requirements were recorded.
|
Travis-CI |
No Travis.
|
coveralls test coverage |
No coveralls.
|
<h1 align="center">Bcut-ASR</h1>
使用必剪 API 进行云端语音字幕识别,支持 CLI 和 module 调用
## ✨Feature
- 可直接上传`flac`, `aac`, `m4a`, `mp3`, `wav`音频格式
- 自动调用 ffmpeg, 实现视频伴音和其他音频格式转码
- 支持`srt`, `json`, `lrc`, `txt`格式字幕输出
- 字幕支持断句和首位时间标记
- 可使用 stdout 输出字幕文本
## 🚀Install
首先确保 ffmpeg 已安装,且 PATH 中可以访问,若未安装可以使用如下命令(已安装请无视):
Linux:
```bash
sudo apt install ffmpeg
```
Windows:
```powershell
winget install ffmpeg
```
本项目暂时未发布 pypi,应使用本地安装,Python 版本应 >= 3.10,需要安装 poetry
```bash
git clone https://github.com/SocialSisterYi/bcut-asr
cd bcut-asr
poetry build -f wheel
pip install dist/bcut_asr-0.0.3-py3-none-any.whl # Example
```
## 📃Usage
### CLI Interface
```bash
bcut_asr video.mp4
```
或
```bash
bcut_asr video.mp4 subtitle.srt
```
或
```bash
bcut_asr video.mp4 -f srt - > subtitle.srt
```
长音频指定任务状态轮询间隔(秒),避免接口频繁调用
```bash
bcut_asr video.mp4 -f srt -i 30 - > subtitle.srt
```
```
bcut_asr -h
usage: bcut-asr [-h] [-f [{srt,json,lrc,txt}]] [-i [1.0]] input [output]
必剪语音识别
positional arguments:
input 输入媒体文件
output 输出字幕文件, 可stdout
options:
-h, --help show this help message and exit
-f [{srt,json,lrc,txt}], --format [{srt,json,lrc,txt}]
输出字幕格式
-i [1.0], --interval [1.0]
任务状态轮询间隔(秒)
支持输入音频格式: flac, aac, m4a, mp3, wav 支持自动调用ffmpeg提取视频伴音
```
### Module
```python
from bcut_asr import BcutASR
from bcut_asr.orm import ResultStateEnum
asr = BcutASR('voice.mp3')
asr.upload() # 上传文件
asr.create_task() # 创建任务
# 轮询检查结果
while True:
result = asr.result()
# 判断识别成功
if result.state == ResultStateEnum.COMPLETE:
break
# 解析字幕内容
subtitle = result.parse()
# 判断是否存在字幕
if subtitle.has_data():
# 输出srt格式
print(subtitle.to_srt())
```
输入视频
```python
from bcut_asr import run_everywhere
from argparse import Namespace
f = open("file.mp4", "rb")
argg = Namespace(format="srt", interval=30.0, input=f, output=None)
run_everywhere(argg)
```
Raw data
{
"_id": null,
"home_page": null,
"name": "bcut-asr",
"maintainer": null,
"docs_url": null,
"requires_python": ">=3.10",
"maintainer_email": null,
"keywords": null,
"author": "SocialSisterYi",
"author_email": "1440239038@qq.com",
"download_url": null,
"platform": null,
"description": "<h1 align=\"center\">Bcut-ASR</h1>\n\n\u4f7f\u7528\u5fc5\u526a API \u8fdb\u884c\u4e91\u7aef\u8bed\u97f3\u5b57\u5e55\u8bc6\u522b\uff0c\u652f\u6301 CLI \u548c module \u8c03\u7528\n\n## \u2728Feature\n\n- \u53ef\u76f4\u63a5\u4e0a\u4f20`flac`, `aac`, `m4a`, `mp3`, `wav`\u97f3\u9891\u683c\u5f0f\n- \u81ea\u52a8\u8c03\u7528 ffmpeg, \u5b9e\u73b0\u89c6\u9891\u4f34\u97f3\u548c\u5176\u4ed6\u97f3\u9891\u683c\u5f0f\u8f6c\u7801\n- \u652f\u6301`srt`, `json`, `lrc`, `txt`\u683c\u5f0f\u5b57\u5e55\u8f93\u51fa\n- \u5b57\u5e55\u652f\u6301\u65ad\u53e5\u548c\u9996\u4f4d\u65f6\u95f4\u6807\u8bb0\n- \u53ef\u4f7f\u7528 stdout \u8f93\u51fa\u5b57\u5e55\u6587\u672c\n\n## \ud83d\ude80Install\n\n\u9996\u5148\u786e\u4fdd ffmpeg \u5df2\u5b89\u88c5\uff0c\u4e14 PATH \u4e2d\u53ef\u4ee5\u8bbf\u95ee\uff0c\u82e5\u672a\u5b89\u88c5\u53ef\u4ee5\u4f7f\u7528\u5982\u4e0b\u547d\u4ee4\uff08\u5df2\u5b89\u88c5\u8bf7\u65e0\u89c6\uff09\uff1a\n\nLinux\uff1a\n\n```bash\nsudo apt install ffmpeg\n```\n\nWindows\uff1a\n\n```powershell\nwinget install ffmpeg\n```\n\n\u672c\u9879\u76ee\u6682\u65f6\u672a\u53d1\u5e03 pypi\uff0c\u5e94\u4f7f\u7528\u672c\u5730\u5b89\u88c5\uff0cPython \u7248\u672c\u5e94 >= 3.10\uff0c\u9700\u8981\u5b89\u88c5 poetry \n\n```bash\ngit clone https://github.com/SocialSisterYi/bcut-asr\ncd bcut-asr\npoetry build -f wheel\npip install dist/bcut_asr-0.0.3-py3-none-any.whl # Example\n```\n\n## \ud83d\udcc3Usage\n\n### CLI Interface\n\n```bash\nbcut_asr video.mp4\n```\n\n\u6216\n\n```bash\nbcut_asr video.mp4 subtitle.srt\n```\n\n\u6216\n\n```bash\nbcut_asr video.mp4 -f srt - > subtitle.srt\n```\n\n\u957f\u97f3\u9891\u6307\u5b9a\u4efb\u52a1\u72b6\u6001\u8f6e\u8be2\u95f4\u9694(\u79d2)\uff0c\u907f\u514d\u63a5\u53e3\u9891\u7e41\u8c03\u7528\n\n```bash\nbcut_asr video.mp4 -f srt -i 30 - > subtitle.srt\n```\n\n```\nbcut_asr -h\nusage: bcut-asr [-h] [-f [{srt,json,lrc,txt}]] [-i [1.0]] input [output]\n\n\u5fc5\u526a\u8bed\u97f3\u8bc6\u522b\n\npositional arguments:\n input \u8f93\u5165\u5a92\u4f53\u6587\u4ef6\n output \u8f93\u51fa\u5b57\u5e55\u6587\u4ef6, \u53efstdout\n\noptions:\n -h, --help show this help message and exit\n -f [{srt,json,lrc,txt}], --format [{srt,json,lrc,txt}]\n \u8f93\u51fa\u5b57\u5e55\u683c\u5f0f\n -i [1.0], --interval [1.0]\n \u4efb\u52a1\u72b6\u6001\u8f6e\u8be2\u95f4\u9694(\u79d2)\n\n\u652f\u6301\u8f93\u5165\u97f3\u9891\u683c\u5f0f: flac, aac, m4a, mp3, wav \u652f\u6301\u81ea\u52a8\u8c03\u7528ffmpeg\u63d0\u53d6\u89c6\u9891\u4f34\u97f3\n```\n\n### Module\n\n```python\nfrom bcut_asr import BcutASR\nfrom bcut_asr.orm import ResultStateEnum\n\nasr = BcutASR('voice.mp3')\nasr.upload() # \u4e0a\u4f20\u6587\u4ef6\nasr.create_task() # \u521b\u5efa\u4efb\u52a1\n\n# \u8f6e\u8be2\u68c0\u67e5\u7ed3\u679c\nwhile True:\n result = asr.result()\n # \u5224\u65ad\u8bc6\u522b\u6210\u529f\n if result.state == ResultStateEnum.COMPLETE:\n break\n\n# \u89e3\u6790\u5b57\u5e55\u5185\u5bb9\nsubtitle = result.parse()\n# \u5224\u65ad\u662f\u5426\u5b58\u5728\u5b57\u5e55\nif subtitle.has_data():\n # \u8f93\u51fasrt\u683c\u5f0f\n print(subtitle.to_srt())\n```\n\n\u8f93\u5165\u89c6\u9891\n\n```python\nfrom bcut_asr import run_everywhere\nfrom argparse import Namespace\n\n\nf = open(\"file.mp4\", \"rb\")\nargg = Namespace(format=\"srt\", interval=30.0, input=f, output=None)\nrun_everywhere(argg)\n\n```\n",
"bugtrack_url": null,
"license": "MIT",
"summary": "\u4f7f\u7528\u5fc5\u526aAPI\u7684\u8bed\u97f3\u5b57\u5e55\u8bc6\u522b",
"version": "0.0.3",
"project_urls": null,
"split_keywords": [],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "1729e4bf1bef9dd5c56e759df97eb55c2f57526f70b577c37873da021b84dbb6",
"md5": "9e16f62d7d2807230166e585c7ebb5d0",
"sha256": "5f74e7d3c5f05c96d1e71d82d93c34388c8f2c797f3cf03f0a392f269e5f729f"
},
"downloads": -1,
"filename": "bcut_asr-0.0.3-py3-none-any.whl",
"has_sig": false,
"md5_digest": "9e16f62d7d2807230166e585c7ebb5d0",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">=3.10",
"size": 9770,
"upload_time": "2024-04-28T03:06:44",
"upload_time_iso_8601": "2024-04-28T03:06:44.837619Z",
"url": "https://files.pythonhosted.org/packages/17/29/e4bf1bef9dd5c56e759df97eb55c2f57526f70b577c37873da021b84dbb6/bcut_asr-0.0.3-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-04-28 03:06:44",
"github": false,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"lcname": "bcut-asr"
}