pnm


Namepnm JSON
Version 0.0.1 PyPI version JSON
download
home_pagehttps://github.com/pstwh/pnm
SummaryConvert audio to phonetic text and practice improving your speech accent.
upload_time2025-01-14 04:51:36
maintainerNone
docs_urlNone
authorNone
requires_python<4,>=3.5
licenseNone
keywords phonetics machine learning neural network english speech practice
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # pnm

pnm is an audio-to-phoneme conversion tool designed to transform spoken English into phonetic transcriptions. This project is a mini-project derived from a larger, unfinished personal project aimed at creating a tool for English phonetic practice. Although the main project wasn't completed, PNM is being transformed into a Python library for open-source use.

Currently, the tool is a work-in-progress but is functional and offers a simple way to convert audio into phonemes.

It is possible to classify the speech quality of the person training using the pnm tool. By analyzing the phonetic transcriptions generated from the spoken audio (by token confidence). This analysis can help in evaluating the quality of the speaker’s pronunciation and progress over time, allowing for personalized feedback during training. 

## Installation

To install the required dependencies, use the following command:

For cpu

```bash
pip install "pnm[cpu]"
```

For cuda 11.X
```bash
pip install "pnm[gpu]"
```

For cuda 12.X
```bash
pip install "pnm[gpu]" --extra-index-url https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/onnxruntime-cuda-12/pypi/simple/
```

## Usage

### Command Line Interface

For get the phonemes of an audio file:

```bash
pnm file --file_path path/to/audio.wav
```

For get the phonemes of an audio recorder (default input device):

```bash
pnm recorder
```

For practice (default input device):

```bash
pnm practice
```

# Images

<p align="center">
  <img src="https://raw.githubusercontent.com/pstwh/pnm/main/examples/image.png" width="768" />
</p>

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/pstwh/pnm",
    "name": "pnm",
    "maintainer": null,
    "docs_url": null,
    "requires_python": "<4,>=3.5",
    "maintainer_email": null,
    "keywords": "phonetics, machine learning, neural network, english, speech, practice",
    "author": null,
    "author_email": null,
    "download_url": null,
    "platform": null,
    "description": "# pnm\n\npnm is an audio-to-phoneme conversion tool designed to transform spoken English into phonetic transcriptions. This project is a mini-project derived from a larger, unfinished personal project aimed at creating a tool for English phonetic practice. Although the main project wasn't completed, PNM is being transformed into a Python library for open-source use.\n\nCurrently, the tool is a work-in-progress but is functional and offers a simple way to convert audio into phonemes.\n\nIt is possible to classify the speech quality of the person training using the pnm tool. By analyzing the phonetic transcriptions generated from the spoken audio (by token confidence). This analysis can help in evaluating the quality of the speaker\u2019s pronunciation and progress over time, allowing for personalized feedback during training. \n\n## Installation\n\nTo install the required dependencies, use the following command:\n\nFor cpu\n\n```bash\npip install \"pnm[cpu]\"\n```\n\nFor cuda 11.X\n```bash\npip install \"pnm[gpu]\"\n```\n\nFor cuda 12.X\n```bash\npip install \"pnm[gpu]\" --extra-index-url https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/onnxruntime-cuda-12/pypi/simple/\n```\n\n## Usage\n\n### Command Line Interface\n\nFor get the phonemes of an audio file:\n\n```bash\npnm file --file_path path/to/audio.wav\n```\n\nFor get the phonemes of an audio recorder (default input device):\n\n```bash\npnm recorder\n```\n\nFor practice (default input device):\n\n```bash\npnm practice\n```\n\n# Images\n\n<p align=\"center\">\n  <img src=\"https://raw.githubusercontent.com/pstwh/pnm/main/examples/image.png\" width=\"768\" />\n</p>\n",
    "bugtrack_url": null,
    "license": null,
    "summary": "Convert audio to phonetic text and practice improving your speech accent.",
    "version": "0.0.1",
    "project_urls": {
        "Homepage": "https://github.com/pstwh/pnm"
    },
    "split_keywords": [
        "phonetics",
        " machine learning",
        " neural network",
        " english",
        " speech",
        " practice"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "f19c8e461c6fcfe898451440e6ad6202a69be546dd78c9742172335174bcd294",
                "md5": "6d749f14e0fb511c2bb8a3215afdc78a",
                "sha256": "3131139de268606183d8cadf48bb7474d4f158c382a1197f3425738f29486284"
            },
            "downloads": -1,
            "filename": "pnm-0.0.1-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "6d749f14e0fb511c2bb8a3215afdc78a",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": "<4,>=3.5",
            "size": 39352415,
            "upload_time": "2025-01-14T04:51:36",
            "upload_time_iso_8601": "2025-01-14T04:51:36.373277Z",
            "url": "https://files.pythonhosted.org/packages/f1/9c/8e461c6fcfe898451440e6ad6202a69be546dd78c9742172335174bcd294/pnm-0.0.1-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-01-14 04:51:36",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "pstwh",
    "github_project": "pnm",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "lcname": "pnm"
}
        
Elapsed time: 0.42231s