wikibee


Namewikibee JSON
Version 0.1.0 PyPI version JSON
download
home_pageNone
SummaryExtract Wikipedia articles and convert them to TTS-friendly text and audio
upload_time2025-08-25 02:40:50
maintainerNone
docs_urlNone
authorNone
requires_python>=3.8
licenseMIT
keywords audio extraction text-to-speech tts wikipedia
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # wikibee

[![CI](https://github.com/patrickdeanbrown/wikibee/actions/workflows/python-tests.yml/badge.svg)](https://github.com/patrickdeanbrown/wikibee/actions/workflows/python-tests.yml)
[![PyPI version](https://badge.fury.io/py/wikibee.svg)](https://badge.fury.io/py/wikibee)
[![Python 3.8+](https://img.shields.io/badge/python-3.8+-blue.svg)](https://www.python.org/downloads/)
[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)

Extract Wikipedia articles and convert them to TTS-friendly text and audio. Search by URL or keywords, get clean markdown and audio-optimized output.

## What is wikibee?

wikibee transforms Wikipedia articles into clean, accessible content perfect for:
- **Accessibility**: Convert text to speech for visually impaired users
- **Podcast creation**: Research and content preparation
- **Educational materials**: Clean text for study guides
- **Offline reading**: Save articles in markdown format
- **Audio content**: Generate TTS-ready text and audio files

## Quick Start

Install wikibee and create your first audio file in under 30 seconds:

```bash
# Install (recommended method)
pipx install wikibee

# Search and convert an article
wikibee "Albert Einstein" --tts --output my-audio/

# Or use a direct URL  
wikibee "https://en.wikipedia.org/wiki/Artificial_intelligence" --tts
```

That's it! You now have clean markdown and TTS-optimized text files ready for audio conversion.

## Installation

### Option 1: pipx (Recommended)
```bash
pipx install wikibee
```

### Option 2: pip
```bash
pip install wikibee
```

### Option 3: Standalone Binaries

**Windows/Linux**: Download from [releases](https://github.com/patrickdeanbrown/wikibee/releases)

**macOS**: Use pipx installation (binaries not provided for macOS)

### System Requirements
- Python 3.8 or higher
- Internet connection for Wikipedia access
- Optional: TTS server for audio generation

## Basic Usage

### Search by Keywords
```bash
# Simple search with fuzzy matching
wikibee "war of the roses"

# Auto-select first result (no interactive menu)
wikibee "napoleon bonaparte" --yolo

# Save to specific directory
wikibee "quantum computing" --output research/
```

### Use Direct URLs
```bash
# Process specific Wikipedia page
wikibee "https://en.wikipedia.org/wiki/Machine_learning"

# Get only the introduction section
wikibee "https://en.wikipedia.org/wiki/Python" --lead-only
```

### Generate Audio-Ready Content
```bash
# Create TTS-optimized text
wikibee "Ancient Rome" --tts

# Customize heading format for TTS
wikibee "World War II" --tts --heading-prefix "Section:"

# Specify output filename
wikibee "Mozart" --tts --output music/ --filename mozart_biography
```

## Common Use Cases

### For Accessibility
```bash
# Create clean, readable text files
wikibee "Marie Curie" --tts --output biographies/
# Output: marie_curie.md and marie_curie.txt (TTS-ready)
```

### For Podcast Research
```bash
# Quick research with minimal processing
wikibee "blockchain technology" --lead-only --output research/
```

### For Educational Content
```bash
# Batch process multiple topics
wikibee "photosynthesis" --tts --output biology/
wikibee "mitosis" --tts --output biology/
wikibee "evolution" --tts --output biology/
```

## Key Features

- **Smart Search**: Fuzzy matching handles typos and partial queries
- **Interactive Selection**: Choose from multiple search results with a numbered menu
- **Clean Output**: Removes Wikipedia markup, leaving clean markdown
- **TTS Optimization**: Strips formatting markers that confuse text-to-speech engines
- **Flexible Output**: Save to custom directories with custom filenames
- **Error Handling**: Graceful handling of network issues and missing pages
- **Cross-Platform**: Works on Windows, macOS, and Linux

## Documentation

- **[Quick Start Guide](docs/quickstart.md)** - Get up and running in 5 minutes
- **[Complete Tutorial](docs/tutorial/)** - Comprehensive usage guide
- **[CLI Reference](docs/reference/cli-reference.md)** - All commands and options
- **[API Reference](docs/reference/api-reference.md)** - Python API documentation
- **[Troubleshooting](docs/reference/troubleshooting.md)** - Common issues and solutions

## Development

wikibee is built with modern Python practices:

```bash
# Set up development environment
uv venv --python 3.12
source .venv/bin/activate
uv pip install -e ".[dev]"

# Run tests
pytest -q

# Check code style
ruff check .
```

See [CONTRIBUTING.md](CONTRIBUTING.md) for detailed development guidelines.

## Examples and Scripts

The `scripts/` directory contains working examples:
- `smoke_extract.py` - Basic extraction example
- See `docs/guides/examples.md` for more real-world scenarios

## Getting Help

- **Documentation**: Check the `docs/` directory for comprehensive guides
- **Issues**: [Report bugs or request features](https://github.com/patrickdeanbrown/wikibee/issues)
- **Discussions**: Ask questions in GitHub discussions

## License

MIT License - see [LICENSE](LICENSE) for details.

## Contributing

We welcome contributions! See [CONTRIBUTING.md](CONTRIBUTING.md) for guidelines on:
- Setting up your development environment
- Code style and testing requirements
- Submitting pull requests
- Reporting bugs and requesting features

---

**Made with care for the Wikipedia community and accessibility advocates everywhere.**

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "wikibee",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.8",
    "maintainer_email": null,
    "keywords": "audio, extraction, text-to-speech, tts, wikipedia",
    "author": null,
    "author_email": "Patrick Brown <patrickdeanbrown@icloud.com>",
    "download_url": "https://files.pythonhosted.org/packages/fb/70/25c380c90e2aacb4973904923ed284f9caf334985b42049e86678ceefc34/wikibee-0.1.0.tar.gz",
    "platform": null,
    "description": "# wikibee\n\n[![CI](https://github.com/patrickdeanbrown/wikibee/actions/workflows/python-tests.yml/badge.svg)](https://github.com/patrickdeanbrown/wikibee/actions/workflows/python-tests.yml)\n[![PyPI version](https://badge.fury.io/py/wikibee.svg)](https://badge.fury.io/py/wikibee)\n[![Python 3.8+](https://img.shields.io/badge/python-3.8+-blue.svg)](https://www.python.org/downloads/)\n[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)\n\nExtract Wikipedia articles and convert them to TTS-friendly text and audio. Search by URL or keywords, get clean markdown and audio-optimized output.\n\n## What is wikibee?\n\nwikibee transforms Wikipedia articles into clean, accessible content perfect for:\n- **Accessibility**: Convert text to speech for visually impaired users\n- **Podcast creation**: Research and content preparation\n- **Educational materials**: Clean text for study guides\n- **Offline reading**: Save articles in markdown format\n- **Audio content**: Generate TTS-ready text and audio files\n\n## Quick Start\n\nInstall wikibee and create your first audio file in under 30 seconds:\n\n```bash\n# Install (recommended method)\npipx install wikibee\n\n# Search and convert an article\nwikibee \"Albert Einstein\" --tts --output my-audio/\n\n# Or use a direct URL  \nwikibee \"https://en.wikipedia.org/wiki/Artificial_intelligence\" --tts\n```\n\nThat's it! You now have clean markdown and TTS-optimized text files ready for audio conversion.\n\n## Installation\n\n### Option 1: pipx (Recommended)\n```bash\npipx install wikibee\n```\n\n### Option 2: pip\n```bash\npip install wikibee\n```\n\n### Option 3: Standalone Binaries\n\n**Windows/Linux**: Download from [releases](https://github.com/patrickdeanbrown/wikibee/releases)\n\n**macOS**: Use pipx installation (binaries not provided for macOS)\n\n### System Requirements\n- Python 3.8 or higher\n- Internet connection for Wikipedia access\n- Optional: TTS server for audio generation\n\n## Basic Usage\n\n### Search by Keywords\n```bash\n# Simple search with fuzzy matching\nwikibee \"war of the roses\"\n\n# Auto-select first result (no interactive menu)\nwikibee \"napoleon bonaparte\" --yolo\n\n# Save to specific directory\nwikibee \"quantum computing\" --output research/\n```\n\n### Use Direct URLs\n```bash\n# Process specific Wikipedia page\nwikibee \"https://en.wikipedia.org/wiki/Machine_learning\"\n\n# Get only the introduction section\nwikibee \"https://en.wikipedia.org/wiki/Python\" --lead-only\n```\n\n### Generate Audio-Ready Content\n```bash\n# Create TTS-optimized text\nwikibee \"Ancient Rome\" --tts\n\n# Customize heading format for TTS\nwikibee \"World War II\" --tts --heading-prefix \"Section:\"\n\n# Specify output filename\nwikibee \"Mozart\" --tts --output music/ --filename mozart_biography\n```\n\n## Common Use Cases\n\n### For Accessibility\n```bash\n# Create clean, readable text files\nwikibee \"Marie Curie\" --tts --output biographies/\n# Output: marie_curie.md and marie_curie.txt (TTS-ready)\n```\n\n### For Podcast Research\n```bash\n# Quick research with minimal processing\nwikibee \"blockchain technology\" --lead-only --output research/\n```\n\n### For Educational Content\n```bash\n# Batch process multiple topics\nwikibee \"photosynthesis\" --tts --output biology/\nwikibee \"mitosis\" --tts --output biology/\nwikibee \"evolution\" --tts --output biology/\n```\n\n## Key Features\n\n- **Smart Search**: Fuzzy matching handles typos and partial queries\n- **Interactive Selection**: Choose from multiple search results with a numbered menu\n- **Clean Output**: Removes Wikipedia markup, leaving clean markdown\n- **TTS Optimization**: Strips formatting markers that confuse text-to-speech engines\n- **Flexible Output**: Save to custom directories with custom filenames\n- **Error Handling**: Graceful handling of network issues and missing pages\n- **Cross-Platform**: Works on Windows, macOS, and Linux\n\n## Documentation\n\n- **[Quick Start Guide](docs/quickstart.md)** - Get up and running in 5 minutes\n- **[Complete Tutorial](docs/tutorial/)** - Comprehensive usage guide\n- **[CLI Reference](docs/reference/cli-reference.md)** - All commands and options\n- **[API Reference](docs/reference/api-reference.md)** - Python API documentation\n- **[Troubleshooting](docs/reference/troubleshooting.md)** - Common issues and solutions\n\n## Development\n\nwikibee is built with modern Python practices:\n\n```bash\n# Set up development environment\nuv venv --python 3.12\nsource .venv/bin/activate\nuv pip install -e \".[dev]\"\n\n# Run tests\npytest -q\n\n# Check code style\nruff check .\n```\n\nSee [CONTRIBUTING.md](CONTRIBUTING.md) for detailed development guidelines.\n\n## Examples and Scripts\n\nThe `scripts/` directory contains working examples:\n- `smoke_extract.py` - Basic extraction example\n- See `docs/guides/examples.md` for more real-world scenarios\n\n## Getting Help\n\n- **Documentation**: Check the `docs/` directory for comprehensive guides\n- **Issues**: [Report bugs or request features](https://github.com/patrickdeanbrown/wikibee/issues)\n- **Discussions**: Ask questions in GitHub discussions\n\n## License\n\nMIT License - see [LICENSE](LICENSE) for details.\n\n## Contributing\n\nWe welcome contributions! See [CONTRIBUTING.md](CONTRIBUTING.md) for guidelines on:\n- Setting up your development environment\n- Code style and testing requirements\n- Submitting pull requests\n- Reporting bugs and requesting features\n\n---\n\n**Made with care for the Wikipedia community and accessibility advocates everywhere.**\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "Extract Wikipedia articles and convert them to TTS-friendly text and audio",
    "version": "0.1.0",
    "project_urls": {
        "Homepage": "https://github.com/patrickdeanbrown/wikibee",
        "Issues": "https://github.com/patrickdeanbrown/wikibee/issues",
        "Repository": "https://github.com/patrickdeanbrown/wikibee"
    },
    "split_keywords": [
        "audio",
        " extraction",
        " text-to-speech",
        " tts",
        " wikipedia"
    ],
    "urls": [
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "682c0fcd6417bb6630499f93a824e53211860466fb20e5d6e4178ca597e7072a",
                "md5": "0c856db76f94870786c0a34ec4fc8ee6",
                "sha256": "03ee219e7f8084b3431262107b28f5d5d70513d52a1ca53852867a53065a3e80"
            },
            "downloads": -1,
            "filename": "wikibee-0.1.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "0c856db76f94870786c0a34ec4fc8ee6",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.8",
            "size": 16485,
            "upload_time": "2025-08-25T02:40:49",
            "upload_time_iso_8601": "2025-08-25T02:40:49.634314Z",
            "url": "https://files.pythonhosted.org/packages/68/2c/0fcd6417bb6630499f93a824e53211860466fb20e5d6e4178ca597e7072a/wikibee-0.1.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "fb7025c380c90e2aacb4973904923ed284f9caf334985b42049e86678ceefc34",
                "md5": "b704bab5067342653b279331e809339b",
                "sha256": "ddc52e7d91631441d7c992cb11c34063cb25e6f441576f4e25f2ddacae015249"
            },
            "downloads": -1,
            "filename": "wikibee-0.1.0.tar.gz",
            "has_sig": false,
            "md5_digest": "b704bab5067342653b279331e809339b",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.8",
            "size": 20488,
            "upload_time": "2025-08-25T02:40:50",
            "upload_time_iso_8601": "2025-08-25T02:40:50.995153Z",
            "url": "https://files.pythonhosted.org/packages/fb/70/25c380c90e2aacb4973904923ed284f9caf334985b42049e86678ceefc34/wikibee-0.1.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-08-25 02:40:50",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "patrickdeanbrown",
    "github_project": "wikibee",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "lcname": "wikibee"
}
        
Elapsed time: 0.66359s