# ailia AI Voice Python API
!! CAUTION !!
“ailia” IS NOT OPEN SOURCE SOFTWARE (OSS).
As long as user complies with the conditions stated in [License Document](https://ailia.ai/license/), user may use the Software for free of charge, but the Software is basically paid software.
## About ailia AI Voice
ailia AI Voice is a library for speech synthesis using AI. It provides APIs for C# for Unity and for C for native applications. By using ailia AI Voice, it is possible to easily implement AI-based speech synthesis in applications.
ailia AI Voice can perform speech synthesis offline, only on edge devices, without the need for cloud connectivity. It also supports the latest GPT-SoVITS, enabling speech synthesis in any voice timbre.
## Install from pip
You can install the ailia AI Voice free evaluation package with the following command.
```
pip3 install ailia_voice
```
## Install from package
You can install the ailia AI Voice from Package with the following command.
```
python3 bootstrap.py
pip3 install ./
```
## Usage
```python
import ailia_voice
import librosa
import time
import soundfile
import os
import urllib.request
# Load reference audio
ref_text = "水をマレーシアから買わなくてはならない。"
ref_file_path = "reference_audio_girl.wav"
if not os.path.exists(ref_file_path):
urllib.request.urlretrieve(
"https://github.com/axinc-ai/ailia-models/raw/refs/heads/master/audio_processing/gpt-sovits/reference_audio_captured_by_ax.wav",
"reference_audio_girl.wav"
)
audio_waveform, sampling_rate = librosa.load(ref_file_path, mono=True)
# Infer
voice = ailia_voice.GPTSoVITS()
voice.initialize_model(model_path = "./models/")
voice.set_reference_audio(ref_text, ailia_voice.AILIA_VOICE_G2P_TYPE_GPT_SOVITS_JA, audio_waveform, sampling_rate)
buf, sampling_rate = voice.synthesize_voice("こんにちは。今日はいい天気ですね。", ailia_voice.AILIA_VOICE_G2P_TYPE_GPT_SOVITS_JA)
#buf, sampling_rate = voice.synthesize_voice("Hello world.", ailia_voice.AILIA_VOICE_G2P_TYPE_GPT_SOVITS_EN)
# Save result
soundfile.write("output.wav", buf, sampling_rate)
```
## API specification
https://github.com/axinc-ai/ailia-sdk
Raw data
{
"_id": null,
"home_page": "https://ailia.jp/",
"name": "ailia-voice",
"maintainer": null,
"docs_url": null,
"requires_python": ">3.6",
"maintainer_email": null,
"keywords": null,
"author": "ax Inc.",
"author_email": "contact@axinc.jp",
"download_url": "https://files.pythonhosted.org/packages/61/84/864a597e1e326297cfac15bb4bae1243db087caf6093e2cbe2251b3260ac/ailia_voice-1.2.0.0.tar.gz",
"platform": null,
"description": "# ailia AI Voice Python API\n\n!! CAUTION !!\n\u201cailia\u201d IS NOT OPEN SOURCE SOFTWARE (OSS).\nAs long as user complies with the conditions stated in [License Document](https://ailia.ai/license/), user may use the Software for free of charge, but the Software is basically paid software.\n\n## About ailia AI Voice\n\nailia AI Voice is a library for speech synthesis using AI. It provides APIs for C# for Unity and for C for native applications. By using ailia AI Voice, it is possible to easily implement AI-based speech synthesis in applications.\n\nailia AI Voice can perform speech synthesis offline, only on edge devices, without the need for cloud connectivity. It also supports the latest GPT-SoVITS, enabling speech synthesis in any voice timbre.\n\n## Install from pip\n\nYou can install the ailia AI Voice free evaluation package with the following command.\n\n```\npip3 install ailia_voice\n```\n\n## Install from package\n\nYou can install the ailia AI Voice from Package with the following command.\n\n```\npython3 bootstrap.py\npip3 install ./\n```\n\n## Usage\n\n```python\nimport ailia_voice\n\nimport librosa\nimport time\nimport soundfile\n\nimport os\nimport urllib.request\n\n# Load reference audio\nref_text = \"\u6c34\u3092\u30de\u30ec\u30fc\u30b7\u30a2\u304b\u3089\u8cb7\u308f\u306a\u304f\u3066\u306f\u306a\u3089\u306a\u3044\u3002\"\nref_file_path = \"reference_audio_girl.wav\"\nif not os.path.exists(ref_file_path):\n\turllib.request.urlretrieve(\n\t\t\"https://github.com/axinc-ai/ailia-models/raw/refs/heads/master/audio_processing/gpt-sovits/reference_audio_captured_by_ax.wav\",\n\t\t\"reference_audio_girl.wav\"\n\t)\naudio_waveform, sampling_rate = librosa.load(ref_file_path, mono=True)\n\n# Infer\nvoice = ailia_voice.GPTSoVITS()\nvoice.initialize_model(model_path = \"./models/\")\nvoice.set_reference_audio(ref_text, ailia_voice.AILIA_VOICE_G2P_TYPE_GPT_SOVITS_JA, audio_waveform, sampling_rate)\nbuf, sampling_rate = voice.synthesize_voice(\"\u3053\u3093\u306b\u3061\u306f\u3002\u4eca\u65e5\u306f\u3044\u3044\u5929\u6c17\u3067\u3059\u306d\u3002\", ailia_voice.AILIA_VOICE_G2P_TYPE_GPT_SOVITS_JA)\n#buf, sampling_rate = voice.synthesize_voice(\"Hello world.\", ailia_voice.AILIA_VOICE_G2P_TYPE_GPT_SOVITS_EN)\n\n# Save result\nsoundfile.write(\"output.wav\", buf, sampling_rate)\n```\n\n## API specification\n\nhttps://github.com/axinc-ai/ailia-sdk\n\n",
"bugtrack_url": null,
"license": "https://ailia.ai/en/license/",
"summary": "ailia AI Voice",
"version": "1.2.0.0",
"project_urls": {
"Homepage": "https://ailia.jp/"
},
"split_keywords": [],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "e43077a6335fc358d12ba622b6b915d4dc54f4a84f73e3c69559f2cb63d0f2dd",
"md5": "13f26dfe040d2cba0f8587055ef69302",
"sha256": "9b7cc2ff52e89eb212e84edfb931e2d3c5071635437e1e653d9aa228b65edd01"
},
"downloads": -1,
"filename": "ailia_voice-1.2.0.0-py3-none-any.whl",
"has_sig": false,
"md5_digest": "13f26dfe040d2cba0f8587055ef69302",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">3.6",
"size": 1998137,
"upload_time": "2024-10-25T04:28:35",
"upload_time_iso_8601": "2024-10-25T04:28:35.054514Z",
"url": "https://files.pythonhosted.org/packages/e4/30/77a6335fc358d12ba622b6b915d4dc54f4a84f73e3c69559f2cb63d0f2dd/ailia_voice-1.2.0.0-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "6184864a597e1e326297cfac15bb4bae1243db087caf6093e2cbe2251b3260ac",
"md5": "8cb8a5d99d3d0a2e74bd0e4a8ed67b7a",
"sha256": "f7012119e3e93c552fc1e8bb04cbd54859fa19956cd0610d3b0814f0a5f68caf"
},
"downloads": -1,
"filename": "ailia_voice-1.2.0.0.tar.gz",
"has_sig": false,
"md5_digest": "8cb8a5d99d3d0a2e74bd0e4a8ed67b7a",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">3.6",
"size": 1992000,
"upload_time": "2024-10-25T04:28:37",
"upload_time_iso_8601": "2024-10-25T04:28:37.069565Z",
"url": "https://files.pythonhosted.org/packages/61/84/864a597e1e326297cfac15bb4bae1243db087caf6093e2cbe2251b3260ac/ailia_voice-1.2.0.0.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-10-25 04:28:37",
"github": false,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"lcname": "ailia-voice"
}