# DLU Voice Transcriber
A Python tool for recording audio and transcribing it to text in multiple languages (English and German).
## Features
- High-quality audio recording (48kHz sample rate, 16-bit depth)
- Automatic input device detection and selection
- Support for multiple audio formats (WAV, MP3)
- Configurable MP3 quality settings
- Transcription to English and German
- Progress monitoring during recording
- Save transcriptions to text files
- Detailed device information and listing
- Intelligent device selection with native sample rate support
## Installation
```bash
pip install dlu_voice_transcriber
```
### System Requirements
- Python 3.8 or higher
- Working microphone
- Linux: `sudo apt-get install python3-pyaudio portaudio19-dev`
- macOS: `brew install portaudio`
- Windows: No additional requirements
## Usage
### Basic Commands
List available audio devices:
```bash
dlu_transcribe --list-devices
```
Basic recording (5 seconds, default settings):
```bash
dlu_transcribe
```
### Advanced Usage
Record with specific settings:
```bash
dlu_transcribe --device 2 --duration 10 --output my_recording --format mp3 --mp3-quality 320 --language both --save-text
```
### Command Line Options
- `--device`: Specify input device index (shown by --list-devices)
- `--duration`: Recording duration in seconds (default: 5)
- `--output`: Output filename without extension (default: recording)
- `--format`: Output format: 'wav' or 'mp3' (default: mp3)
- `--mp3-quality`: MP3 quality in kbps (default: 320)
- `--language`: Transcription language: 'de', 'en', or 'both' (default: both)
- `--save-text`: Save transcriptions to text files
- `--list-devices`: Show available audio input devices
### Examples
List all available devices:
```bash
dlu_transcribe --list-devices
```
Record high-quality MP3 for 15 seconds:
```bash
dlu_transcribe --duration 15 --format mp3 --mp3-quality 320 --output high_quality_recording
```
Record using specific device with German-only transcription:
```bash
dlu_transcribe --device 2 --language de --save-text --output german_speech
```
## Output Files
The tool generates the following files in the `recordings` directory:
- Audio file: `<output>.<format>` (e.g., recording.mp3)
- Transcription files (if --save-text is used):
- English: `<output>_en.txt`
- German: `<output>_de.txt`
## License
This project is licensed under the MIT License - see the LICENSE file for details.
Raw data
{
"_id": null,
"home_page": null,
"name": "dlu-voice-transcriber",
"maintainer": null,
"docs_url": null,
"requires_python": "<3.12,>=3.8",
"maintainer_email": null,
"keywords": "audio recording, speech recognition, transcription, german transcription, voice to text, multilingual, audio processing",
"author": null,
"author_email": "Dirk Ulbricht <dirk.ulbricht@gmail.com>",
"download_url": "https://files.pythonhosted.org/packages/28/43/cd30cdb3e7c49cd79c2d932b7444dcd2f12c760f41246d8ce1b84d21470c/dlu_voice_transcriber-0.2.3.tar.gz",
"platform": null,
"description": "# DLU Voice Transcriber\n\nA Python tool for recording audio and transcribing it to text in multiple languages (English and German).\n\n## Features\n\n- High-quality audio recording (48kHz sample rate, 16-bit depth)\n- Automatic input device detection and selection\n- Support for multiple audio formats (WAV, MP3)\n- Configurable MP3 quality settings\n- Transcription to English and German\n- Progress monitoring during recording\n- Save transcriptions to text files\n- Detailed device information and listing\n- Intelligent device selection with native sample rate support\n\n## Installation\n\n```bash\npip install dlu_voice_transcriber\n```\n\n### System Requirements\n\n- Python 3.8 or higher\n- Working microphone\n- Linux: `sudo apt-get install python3-pyaudio portaudio19-dev`\n- macOS: `brew install portaudio`\n- Windows: No additional requirements\n\n## Usage\n\n### Basic Commands\n\nList available audio devices:\n```bash\ndlu_transcribe --list-devices\n```\n\nBasic recording (5 seconds, default settings):\n```bash\ndlu_transcribe\n```\n\n### Advanced Usage\n\nRecord with specific settings:\n```bash\ndlu_transcribe --device 2 --duration 10 --output my_recording --format mp3 --mp3-quality 320 --language both --save-text\n```\n\n### Command Line Options\n\n- `--device`: Specify input device index (shown by --list-devices)\n- `--duration`: Recording duration in seconds (default: 5)\n- `--output`: Output filename without extension (default: recording)\n- `--format`: Output format: 'wav' or 'mp3' (default: mp3)\n- `--mp3-quality`: MP3 quality in kbps (default: 320)\n- `--language`: Transcription language: 'de', 'en', or 'both' (default: both)\n- `--save-text`: Save transcriptions to text files\n- `--list-devices`: Show available audio input devices\n\n### Examples\n\nList all available devices:\n```bash\ndlu_transcribe --list-devices\n```\n\nRecord high-quality MP3 for 15 seconds:\n```bash\ndlu_transcribe --duration 15 --format mp3 --mp3-quality 320 --output high_quality_recording\n```\n\nRecord using specific device with German-only transcription:\n```bash\ndlu_transcribe --device 2 --language de --save-text --output german_speech\n```\n\n## Output Files\n\nThe tool generates the following files in the `recordings` directory:\n- Audio file: `<output>.<format>` (e.g., recording.mp3)\n- Transcription files (if --save-text is used):\n - English: `<output>_en.txt`\n - German: `<output>_de.txt`\n\n## License\n\nThis project is licensed under the MIT License - see the LICENSE file for details.\n",
"bugtrack_url": null,
"license": "MIT",
"summary": "A multilingual voice recording and transcription tool with German and English support",
"version": "0.2.3",
"project_urls": {
"Bug Tracker": "https://github.com/dullibri/dlu_voice_transcriber/issues",
"Changelog": "https://github.com/dullibri/dlu_voice_transcriber/blob/main/CHANGELOG.md",
"Documentation": "https://github.com/dullibri/dlu_voice_transcriber#readme",
"Homepage": "https://github.com/dullibri/dlu_voice_transcriber",
"Source Code": "https://github.com/dullibri/dlu_voice_transcriber"
},
"split_keywords": [
"audio recording",
" speech recognition",
" transcription",
" german transcription",
" voice to text",
" multilingual",
" audio processing"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "57c402e097d052bdf8c87099ada39ba25787351d82736857a313a9f35f86cf25",
"md5": "e5a2721c1ecb20d8d08c49fb589a49af",
"sha256": "17e31a13b50ba8583218a92e5446eff96acae73987747f3e9bf54967469f83a4"
},
"downloads": -1,
"filename": "dlu_voice_transcriber-0.2.3-py3-none-any.whl",
"has_sig": false,
"md5_digest": "e5a2721c1ecb20d8d08c49fb589a49af",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": "<3.12,>=3.8",
"size": 9016,
"upload_time": "2024-10-25T09:34:23",
"upload_time_iso_8601": "2024-10-25T09:34:23.201539Z",
"url": "https://files.pythonhosted.org/packages/57/c4/02e097d052bdf8c87099ada39ba25787351d82736857a313a9f35f86cf25/dlu_voice_transcriber-0.2.3-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "2843cd30cdb3e7c49cd79c2d932b7444dcd2f12c760f41246d8ce1b84d21470c",
"md5": "f1dcb3b5ed9e2bd7adea75a3123dd4ab",
"sha256": "9dc995a2749fe7f7d46783632602cd3e1764b8e373366e6f2cfc2b463b7f8d13"
},
"downloads": -1,
"filename": "dlu_voice_transcriber-0.2.3.tar.gz",
"has_sig": false,
"md5_digest": "f1dcb3b5ed9e2bd7adea75a3123dd4ab",
"packagetype": "sdist",
"python_version": "source",
"requires_python": "<3.12,>=3.8",
"size": 8626,
"upload_time": "2024-10-25T09:34:24",
"upload_time_iso_8601": "2024-10-25T09:34:24.550152Z",
"url": "https://files.pythonhosted.org/packages/28/43/cd30cdb3e7c49cd79c2d932b7444dcd2f12c760f41246d8ce1b84d21470c/dlu_voice_transcriber-0.2.3.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-10-25 09:34:24",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "dullibri",
"github_project": "dlu_voice_transcriber",
"travis_ci": false,
"coveralls": false,
"github_actions": false,
"lcname": "dlu-voice-transcriber"
}