nemo-text-processing


Namenemo-text-processing JSON
Version 0.3.0rc0 PyPI version JSON
download
home_pagehttps://github.com/nvidia/nemo-text-processing
SummaryNeMo text processing for ASR and TTS
upload_time2024-03-13 21:50:34
maintainerNVIDIA
docs_urlNone
authorNVIDIA
requires_python
licenseApache2
keywords nemo nvidia tts asr text processing text normalization inverse text normalization language
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            **NeMo Text Processing**
==========================

Introduction
------------

`nemo-text-processing` is a Python package for text normalization and inverse text normalization.

Documentation
-------------

[NeMo-text-processing (text normalization and inverse text normalization)](https://docs.nvidia.com/deeplearning/nemo/user-guide/docs/en/main/nlp/text_normalization/intro.html).

Tutorials
-----------------

| Google Collab Notebook      | Description |
| ----------- | ----------- |
| [Text_(Inverse)_Normalization.ipynb](https://github.com/NVIDIA/NeMo-text-processing/blob/main/tutorials/Text_(Inverse)_Normalization.ipynb)     | Quick-start guide       |
| [WFST_Tutorial](https://github.com/NVIDIA/NeMo-text-processing/blob/main/tutorials/WFST_Tutorial.ipynb)   | In-depth tutorial on grammar customization        |


Getting help
--------------
If you have a question which is not answered in the [Github discussions](https://github.com/NVIDIA/NeMo-text-processing/discussions), encounter a bug or have a feature request, please create a [Github issue](https://github.com/NVIDIA/NeMo-text-processing/issues). We also welcome you to directly open a [pull request](https://github.com/NVIDIA/NeMo-text-processing/pulls) to fix a bug or add a feature.


Installation
------------

### Conda virtual environment

We recommend setting up a fresh Conda environment to install NeMo-text-processing.

```bash
conda create --name nemo_tn python==3.8
conda activate nemo_tn
```

(Optional) To use [hybrid text normalization](nemo_text_processing/hybrid/README.md) install PyTorch using their [configurator](https://pytorch.org/get-started/locally/). 

```
conda install pytorch torchvision torchaudio cudatoolkit=11.3 -c pytorch
```
**_NOTE:_** The command used to install PyTorch may depend on your system.


###  Pip

Use this installation mode if you want the latest released version.
```
pip install nemo_text_processing
```

**_NOTE:_** This should work on any Linux OS with x86_64. Pip installation on MacOS and Windows are not supported due to the dependency [Pynini](https://www.openfst.org/twiki/bin/view/GRM/Pynini). On a platform other than Linux x86_64, installing from Pip tries to compile Pynini from scratch, and requires OpenFst headers and libraries to be in the expected place. So if it's working for you, it's because you happen to have installed OpenFst in the right way in the right place. So if you want to Pip install Pynini on MacOS, you have to have pre-compiled and pre-installed OpenFst. The Pynini README for that version should tell you which version it needs and what `--enable-foo` flags to use.
Instead, we recommend you to use conda-forge to install Pynini on MacOS or Windows:
`conda install -c conda-forge pynini=2.1.5`.


###  Pip from source

Use this installation mode if you want the a version from particular GitHub branch (e.g main).

```
pip install Cython
python -m pip install git+https://github.com/NVIDIA/NeMo-text-processing.git@{BRANCH}#egg=nemo_text_processing
```


### From source

Use this installation mode if you are contributing to NeMo-text-processing.

```
git clone https://github.com/NVIDIA/NeMo-text-processing
cd NeMo-text-processing
./reinstall.sh
```

**_NOTE:_** If you only want the toolkit without additional conda-based dependencies, you may replace ``reinstall.sh`` with ``pip install -e .`` with the NeMo-text-processing root directory as your current working director.


Contributing
------------
We welcome community contributions! Please refer to the [CONTRIBUTING.md](CONTRIBUTING.md) for guidelines.



Citation
--------

```
@inproceedings{zhang21ja_interspeech,
  author={Yang Zhang and Evelina Bakhturina and Boris Ginsburg},
  title={{NeMo (Inverse) Text Normalization: From Development to Production}},
  year=2021,
  booktitle={Proc. Interspeech 2021},
  pages={4857--4859}
}

@inproceedings{bakhturina22_interspeech,
  author={Evelina Bakhturina and Yang Zhang and Boris Ginsburg},
  title={{Shallow Fusion of Weighted Finite-State Transducer and Language Model for
Text Normalization}},
  year=2022,
  booktitle={Proc. Interspeech 2022}
}
```

License
-------
NeMo-text-processing is under [Apache 2.0 license](LICENSE).

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/nvidia/nemo-text-processing",
    "name": "nemo-text-processing",
    "maintainer": "NVIDIA",
    "docs_url": null,
    "requires_python": "",
    "maintainer_email": "nemo-toolkit@nvidia.com",
    "keywords": "NeMo,nvidia,tts,asr,text processing,text normalization,inverse text normalization,language",
    "author": "NVIDIA",
    "author_email": "nemo-toolkit@nvidia.com",
    "download_url": "https://files.pythonhosted.org/packages/9e/6f/865bc2b4bf2f957441869723df96c435921655ddfca35da9f0545bba30a7/nemo_text_processing-0.3.0rc0.tar.gz",
    "platform": null,
    "description": "**NeMo Text Processing**\n==========================\n\nIntroduction\n------------\n\n`nemo-text-processing` is a Python package for text normalization and inverse text normalization.\n\nDocumentation\n-------------\n\n[NeMo-text-processing (text normalization and inverse text normalization)](https://docs.nvidia.com/deeplearning/nemo/user-guide/docs/en/main/nlp/text_normalization/intro.html).\n\nTutorials\n-----------------\n\n| Google Collab Notebook      | Description |\n| ----------- | ----------- |\n| [Text_(Inverse)_Normalization.ipynb](https://github.com/NVIDIA/NeMo-text-processing/blob/main/tutorials/Text_(Inverse)_Normalization.ipynb)     | Quick-start guide       |\n| [WFST_Tutorial](https://github.com/NVIDIA/NeMo-text-processing/blob/main/tutorials/WFST_Tutorial.ipynb)   | In-depth tutorial on grammar customization        |\n\n\nGetting help\n--------------\nIf you have a question which is not answered in the [Github discussions](https://github.com/NVIDIA/NeMo-text-processing/discussions), encounter a bug or have a feature request, please create a [Github issue](https://github.com/NVIDIA/NeMo-text-processing/issues). We also welcome you to directly open a [pull request](https://github.com/NVIDIA/NeMo-text-processing/pulls) to fix a bug or add a feature.\n\n\nInstallation\n------------\n\n### Conda virtual environment\n\nWe recommend setting up a fresh Conda environment to install NeMo-text-processing.\n\n```bash\nconda create --name nemo_tn python==3.8\nconda activate nemo_tn\n```\n\n(Optional) To use [hybrid text normalization](nemo_text_processing/hybrid/README.md) install PyTorch using their [configurator](https://pytorch.org/get-started/locally/). \n\n```\nconda install pytorch torchvision torchaudio cudatoolkit=11.3 -c pytorch\n```\n**_NOTE:_** The command used to install PyTorch may depend on your system.\n\n\n###  Pip\n\nUse this installation mode if you want the latest released version.\n```\npip install nemo_text_processing\n```\n\n**_NOTE:_** This should work on any Linux OS with x86_64. Pip installation on MacOS and Windows are not supported due to the dependency [Pynini](https://www.openfst.org/twiki/bin/view/GRM/Pynini). On a platform other than Linux x86_64, installing from Pip tries to compile Pynini from scratch, and requires OpenFst headers and libraries to be in the expected place. So if it's working for you, it's because you happen to have installed OpenFst in the right way in the right place. So if you want to Pip install Pynini on MacOS, you have to have pre-compiled and pre-installed OpenFst. The Pynini README for that version should tell you which version it needs and what `--enable-foo` flags to use.\nInstead, we recommend you to use conda-forge to install Pynini on MacOS or Windows:\n`conda install -c conda-forge pynini=2.1.5`.\n\n\n###  Pip from source\n\nUse this installation mode if you want the a version from particular GitHub branch (e.g main).\n\n```\npip install Cython\npython -m pip install git+https://github.com/NVIDIA/NeMo-text-processing.git@{BRANCH}#egg=nemo_text_processing\n```\n\n\n### From source\n\nUse this installation mode if you are contributing to NeMo-text-processing.\n\n```\ngit clone https://github.com/NVIDIA/NeMo-text-processing\ncd NeMo-text-processing\n./reinstall.sh\n```\n\n**_NOTE:_** If you only want the toolkit without additional conda-based dependencies, you may replace ``reinstall.sh`` with ``pip install -e .`` with the NeMo-text-processing root directory as your current working director.\n\n\nContributing\n------------\nWe welcome community contributions! Please refer to the [CONTRIBUTING.md](CONTRIBUTING.md) for guidelines.\n\n\n\nCitation\n--------\n\n```\n@inproceedings{zhang21ja_interspeech,\n  author={Yang Zhang and Evelina Bakhturina and Boris Ginsburg},\n  title={{NeMo (Inverse) Text Normalization: From Development to Production}},\n  year=2021,\n  booktitle={Proc. Interspeech 2021},\n  pages={4857--4859}\n}\n\n@inproceedings{bakhturina22_interspeech,\n  author={Evelina Bakhturina and Yang Zhang and Boris Ginsburg},\n  title={{Shallow Fusion of Weighted Finite-State Transducer and Language Model for\nText Normalization}},\n  year=2022,\n  booktitle={Proc. Interspeech 2022}\n}\n```\n\nLicense\n-------\nNeMo-text-processing is under [Apache 2.0 license](LICENSE).\n",
    "bugtrack_url": null,
    "license": "Apache2",
    "summary": "NeMo text processing for ASR and TTS",
    "version": "0.3.0rc0",
    "project_urls": {
        "Download": "https://github.com/NVIDIA/NeMo-text-processing/releases",
        "Homepage": "https://github.com/nvidia/nemo-text-processing"
    },
    "split_keywords": [
        "nemo",
        "nvidia",
        "tts",
        "asr",
        "text processing",
        "text normalization",
        "inverse text normalization",
        "language"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "7a89eda8673d56e0b42000a9f5a0bb01a0d2a261c3bba5b64b96ea534318839d",
                "md5": "251c3ab8083e84c91cb666249cfb2397",
                "sha256": "bea2f5d27a2c2eb5c822212f897edd8c62a2f8798f177ef5a300a58de6ed5b7d"
            },
            "downloads": -1,
            "filename": "nemo_text_processing-0.3.0rc0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "251c3ab8083e84c91cb666249cfb2397",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": null,
            "size": 2595814,
            "upload_time": "2024-03-13T21:50:32",
            "upload_time_iso_8601": "2024-03-13T21:50:32.047729Z",
            "url": "https://files.pythonhosted.org/packages/7a/89/eda8673d56e0b42000a9f5a0bb01a0d2a261c3bba5b64b96ea534318839d/nemo_text_processing-0.3.0rc0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "9e6f865bc2b4bf2f957441869723df96c435921655ddfca35da9f0545bba30a7",
                "md5": "70b4d2c9cfa6ebdbda3dac109d5ce772",
                "sha256": "b6924eb5b71639ef05e9f9088dd916e6f3bc91d56a9c99579dc98fa86feb57de"
            },
            "downloads": -1,
            "filename": "nemo_text_processing-0.3.0rc0.tar.gz",
            "has_sig": false,
            "md5_digest": "70b4d2c9cfa6ebdbda3dac109d5ce772",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": null,
            "size": 1629424,
            "upload_time": "2024-03-13T21:50:34",
            "upload_time_iso_8601": "2024-03-13T21:50:34.538627Z",
            "url": "https://files.pythonhosted.org/packages/9e/6f/865bc2b4bf2f957441869723df96c435921655ddfca35da9f0545bba30a7/nemo_text_processing-0.3.0rc0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-03-13 21:50:34",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "nvidia",
    "github_project": "nemo-text-processing",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "lcname": "nemo-text-processing"
}
        
Elapsed time: 0.20057s