dlu-voice-transcriber


- Name: dlu-voice-transcriber
- Version: 0.2.3
- home_page: None
- Summary: A multilingual voice recording and transcription tool with German and English support
- upload_time: 2024-10-25 09:34:24
- maintainer: None
- docs_url: None
- author: None
- requires_python: <3.12,>=3.8
- license: MIT
- keywords: audio recording, speech recognition, transcription, german transcription, voice to text, multilingual, audio processing
- requirements: No requirements were recorded.
- Travis-CI: No Travis.
- coveralls test coverage: No coveralls.

# DLU Voice Transcriber

A Python tool that records audio and transcribes it to text in English, German, or both.

## Features

- High-quality audio recording (48kHz sample rate, 16-bit depth)
- Automatic input device detection and selection
- Support for multiple audio formats (WAV, MP3)
- Configurable MP3 quality settings
- Transcription to English and German
- Progress monitoring during recording
- Save transcriptions to text files
- Detailed device information and listing
- Intelligent device selection with native sample rate support
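
To put the recording figures above into perspective, the following sketch estimates file sizes from the stated 48 kHz sample rate, 16-bit depth, and the 320 kbps MP3 setting. It assumes mono input and a constant-bitrate MP3 stream; neither is confirmed by this README, and the helper names `wav_bytes` and `mp3_bytes` are purely illustrative.

```python
def wav_bytes(seconds, sample_rate=48_000, bit_depth=16, channels=1):
    """Size of the raw PCM payload in bytes (the small WAV header is ignored).

    channels=1 (mono) is an assumption; the README does not state the channel count.
    """
    return int(seconds * sample_rate * (bit_depth // 8) * channels)


def mp3_bytes(seconds, kbps=320):
    """Approximate size of a constant-bitrate MP3 stream (1 kbps = 1000 bit/s)."""
    return int(seconds * kbps * 1000 / 8)


if __name__ == "__main__":
    for seconds in (5, 15):
        print(f"{seconds} s: WAV about {wav_bytes(seconds):,} bytes, "
              f"MP3 about {mp3_bytes(seconds):,} bytes")
```

Under these assumptions, a mono WAV recording is roughly 2.4 times the size of the same recording as a 320 kbps MP3.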

## Installation

```bash
pip install dlu_voice_transcriber
```

### System Requirements

- Python 3.8 to 3.11 (the package metadata declares `requires_python` as `<3.12,>=3.8`)
- Working microphone
- Linux: `sudo apt-get install python3-pyaudio portaudio19-dev`
- macOS: `brew install portaudio`
- Windows: No additional requirements
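
The package metadata declares `requires_python` as `<3.12,>=3.8`, so it can be useful to check the interpreter before installing. The snippet below is a minimal sketch of such a check; the function name `is_supported` is hypothetical and not part of the package.

```python
import sys

MIN_VERSION = (3, 8)   # inclusive lower bound, from ">=3.8"
MAX_VERSION = (3, 12)  # exclusive upper bound, from "<3.12"


def is_supported(version=None):
    """Return True if the (major, minor) version falls in the declared range."""
    if version is None:
        version = sys.version_info
    major_minor = tuple(version[:2])
    return MIN_VERSION <= major_minor < MAX_VERSION


if __name__ == "__main__":
    current = ".".join(str(part) for part in sys.version_info[:3])
    status = "supported" if is_supported() else "outside the declared range"
    print(f"Python {current} is {status}")
```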

## Usage

### Basic Commands

List available audio devices:
```bash
dlu_transcribe --list-devices
```

Basic recording (5 seconds, default settings):
```bash
dlu_transcribe
```

### Advanced Usage

Record with specific settings:
```bash
dlu_transcribe --device 2 --duration 10 --output my_recording --format mp3 --mp3-quality 320 --language both --save-text
```

### Command Line Options

- `--device`: Specify input device index (shown by --list-devices)
- `--duration`: Recording duration in seconds (default: 5)
- `--output`: Output filename without extension (default: recording)
- `--format`: Output format: 'wav' or 'mp3' (default: mp3)
- `--mp3-quality`: MP3 quality in kbps (default: 320)
- `--language`: Transcription language: 'de', 'en', or 'both' (default: both)
- `--save-text`: Save transcriptions to text files
- `--list-devices`: Show available audio input devices
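
For readers who want to see how the options and defaults listed above fit together, here is a hypothetical `argparse` reconstruction. It is not the tool's actual parser: the argument types (for example, whether `--duration` accepts fractional seconds) and the use of `None` to mean "pick a device automatically" are assumptions.

```python
import argparse


def build_parser():
    """Hypothetical parser mirroring the documented options and defaults."""
    parser = argparse.ArgumentParser(prog="dlu_transcribe")
    # Assumption: omitting --device triggers automatic device selection.
    parser.add_argument("--device", type=int, default=None,
                        help="input device index (shown by --list-devices)")
    # Assumption: fractional durations are accepted.
    parser.add_argument("--duration", type=float, default=5,
                        help="recording duration in seconds")
    parser.add_argument("--output", default="recording",
                        help="output filename without extension")
    parser.add_argument("--format", choices=["wav", "mp3"], default="mp3",
                        help="output format")
    parser.add_argument("--mp3-quality", type=int, default=320,
                        help="MP3 quality in kbps")
    parser.add_argument("--language", choices=["de", "en", "both"],
                        default="both", help="transcription language")
    parser.add_argument("--save-text", action="store_true",
                        help="save transcriptions to text files")
    parser.add_argument("--list-devices", action="store_true",
                        help="show available audio input devices")
    return parser


if __name__ == "__main__":
    # Parse a fixed example instead of sys.argv so the sketch is self-contained.
    print(build_parser().parse_args(["--device", "2", "--language", "de"]))
```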

### Examples

List all available devices:
```bash
dlu_transcribe --list-devices
```

Record high-quality MP3 for 15 seconds:
```bash
dlu_transcribe --duration 15 --format mp3 --mp3-quality 320 --output high_quality_recording
```

Record from a specific device with German-only transcription:
```bash
dlu_transcribe --device 2 --language de --save-text --output german_speech
```
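
The commands above can also be driven from a Python script. The sketch below assembles the documented flags into an argument list and runs the `dlu_transcribe` executable through `subprocess`. The helper names `build_command` and `run_recording` are hypothetical, and the package may expose a Python API that this README does not describe.

```python
import shutil
import subprocess


def build_command(output, device=None, duration=None, fmt=None,
                  mp3_quality=None, language=None, save_text=False):
    """Build an argv list using only the flags documented in this README."""
    cmd = ["dlu_transcribe", "--output", output]
    if device is not None:
        cmd += ["--device", str(device)]
    if duration is not None:
        cmd += ["--duration", str(duration)]
    if fmt is not None:
        cmd += ["--format", fmt]
    if mp3_quality is not None:
        cmd += ["--mp3-quality", str(mp3_quality)]
    if language is not None:
        cmd += ["--language", language]
    if save_text:
        cmd.append("--save-text")
    return cmd


def run_recording(**options):
    """Run the CLI; raises if the executable is missing or exits non-zero."""
    if shutil.which("dlu_transcribe") is None:
        raise FileNotFoundError("dlu_transcribe is not on PATH")
    return subprocess.run(build_command(**options), check=True)


if __name__ == "__main__":
    # Only print the command here; call run_recording(...) to actually record.
    print(" ".join(build_command("german_speech", device=2,
                                 language="de", save_text=True)))
```

Passing the arguments as a list avoids shell quoting issues with output names that contain spaces.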

## Output Files

The tool generates the following files in the `recordings` directory:
- Audio file: `<output>.<format>` (e.g., recording.mp3)
- Transcription files (if --save-text is used):
  - English: `<output>_en.txt`
  - German: `<output>_de.txt`
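
The naming scheme above can be expressed as a small helper that predicts which files a run should produce, which is handy when post-processing results. This is a sketch only: `expected_outputs` is a hypothetical function, and it assumes that `recordings` is relative to the current working directory and that a single-language run writes only that language's text file.

```python
from pathlib import Path


def expected_outputs(output="recording", fmt="mp3", language="both",
                     save_text=False, base_dir="recordings"):
    """Map each expected artifact ("audio", "en", "de") to its path."""
    base = Path(base_dir)
    paths = {"audio": base / f"{output}.{fmt}"}
    if save_text:
        # Assumption: "both" yields one text file per language.
        languages = ("en", "de") if language == "both" else (language,)
        for lang in languages:
            paths[lang] = base / f"{output}_{lang}.txt"
    return paths


if __name__ == "__main__":
    for kind, path in expected_outputs("my_recording", save_text=True).items():
        print(f"{kind}: {path}")
```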

## License

This project is licensed under the MIT License - see the LICENSE file for details.

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "dlu-voice-transcriber",
    "maintainer": null,
    "docs_url": null,
    "requires_python": "<3.12,>=3.8",
    "maintainer_email": null,
    "keywords": "audio recording, speech recognition, transcription, german transcription, voice to text, multilingual, audio processing",
    "author": null,
    "author_email": "Dirk Ulbricht <dirk.ulbricht@gmail.com>",
    "download_url": "https://files.pythonhosted.org/packages/28/43/cd30cdb3e7c49cd79c2d932b7444dcd2f12c760f41246d8ce1b84d21470c/dlu_voice_transcriber-0.2.3.tar.gz",
    "platform": null,
    "description": "# DLU Voice Transcriber\n\nA Python tool for recording audio and transcribing it to text in multiple languages (English and German).\n\n## Features\n\n- High-quality audio recording (48kHz sample rate, 16-bit depth)\n- Automatic input device detection and selection\n- Support for multiple audio formats (WAV, MP3)\n- Configurable MP3 quality settings\n- Transcription to English and German\n- Progress monitoring during recording\n- Save transcriptions to text files\n- Detailed device information and listing\n- Intelligent device selection with native sample rate support\n\n## Installation\n\n```bash\npip install dlu_voice_transcriber\n```\n\n### System Requirements\n\n- Python 3.8 or higher\n- Working microphone\n- Linux: `sudo apt-get install python3-pyaudio portaudio19-dev`\n- macOS: `brew install portaudio`\n- Windows: No additional requirements\n\n## Usage\n\n### Basic Commands\n\nList available audio devices:\n```bash\ndlu_transcribe --list-devices\n```\n\nBasic recording (5 seconds, default settings):\n```bash\ndlu_transcribe\n```\n\n### Advanced Usage\n\nRecord with specific settings:\n```bash\ndlu_transcribe --device 2 --duration 10 --output my_recording --format mp3 --mp3-quality 320 --language both --save-text\n```\n\n### Command Line Options\n\n- `--device`: Specify input device index (shown by --list-devices)\n- `--duration`: Recording duration in seconds (default: 5)\n- `--output`: Output filename without extension (default: recording)\n- `--format`: Output format: 'wav' or 'mp3' (default: mp3)\n- `--mp3-quality`: MP3 quality in kbps (default: 320)\n- `--language`: Transcription language: 'de', 'en', or 'both' (default: both)\n- `--save-text`: Save transcriptions to text files\n- `--list-devices`: Show available audio input devices\n\n### Examples\n\nList all available devices:\n```bash\ndlu_transcribe --list-devices\n```\n\nRecord high-quality MP3 for 15 seconds:\n```bash\ndlu_transcribe --duration 15 --format mp3 --mp3-quality 320 
--output high_quality_recording\n```\n\nRecord using specific device with German-only transcription:\n```bash\ndlu_transcribe --device 2 --language de --save-text --output german_speech\n```\n\n## Output Files\n\nThe tool generates the following files in the `recordings` directory:\n- Audio file: `<output>.<format>` (e.g., recording.mp3)\n- Transcription files (if --save-text is used):\n  - English: `<output>_en.txt`\n  - German: `<output>_de.txt`\n\n## License\n\nThis project is licensed under the MIT License - see the LICENSE file for details.\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "A multilingual voice recording and transcription tool with German and English support",
    "version": "0.2.3",
    "project_urls": {
        "Bug Tracker": "https://github.com/dullibri/dlu_voice_transcriber/issues",
        "Changelog": "https://github.com/dullibri/dlu_voice_transcriber/blob/main/CHANGELOG.md",
        "Documentation": "https://github.com/dullibri/dlu_voice_transcriber#readme",
        "Homepage": "https://github.com/dullibri/dlu_voice_transcriber",
        "Source Code": "https://github.com/dullibri/dlu_voice_transcriber"
    },
    "split_keywords": [
        "audio recording",
        " speech recognition",
        " transcription",
        " german transcription",
        " voice to text",
        " multilingual",
        " audio processing"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "57c402e097d052bdf8c87099ada39ba25787351d82736857a313a9f35f86cf25",
                "md5": "e5a2721c1ecb20d8d08c49fb589a49af",
                "sha256": "17e31a13b50ba8583218a92e5446eff96acae73987747f3e9bf54967469f83a4"
            },
            "downloads": -1,
            "filename": "dlu_voice_transcriber-0.2.3-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "e5a2721c1ecb20d8d08c49fb589a49af",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": "<3.12,>=3.8",
            "size": 9016,
            "upload_time": "2024-10-25T09:34:23",
            "upload_time_iso_8601": "2024-10-25T09:34:23.201539Z",
            "url": "https://files.pythonhosted.org/packages/57/c4/02e097d052bdf8c87099ada39ba25787351d82736857a313a9f35f86cf25/dlu_voice_transcriber-0.2.3-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "2843cd30cdb3e7c49cd79c2d932b7444dcd2f12c760f41246d8ce1b84d21470c",
                "md5": "f1dcb3b5ed9e2bd7adea75a3123dd4ab",
                "sha256": "9dc995a2749fe7f7d46783632602cd3e1764b8e373366e6f2cfc2b463b7f8d13"
            },
            "downloads": -1,
            "filename": "dlu_voice_transcriber-0.2.3.tar.gz",
            "has_sig": false,
            "md5_digest": "f1dcb3b5ed9e2bd7adea75a3123dd4ab",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": "<3.12,>=3.8",
            "size": 8626,
            "upload_time": "2024-10-25T09:34:24",
            "upload_time_iso_8601": "2024-10-25T09:34:24.550152Z",
            "url": "https://files.pythonhosted.org/packages/28/43/cd30cdb3e7c49cd79c2d932b7444dcd2f12c760f41246d8ce1b84d21470c/dlu_voice_transcriber-0.2.3.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-10-25 09:34:24",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "dullibri",
    "github_project": "dlu_voice_transcriber",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "lcname": "dlu-voice-transcriber"
}
        