[![Multi-Modality](agorabanner.png)](https://discord.gg/qUtxnK2NMf)
# AudioFlamingo
Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities". [PAPER LINK](https://arxiv.org/pdf/2402.01831.pdf)
## Install
`pip3 install audio-flamingo`
## Usage
```python
import torch
from audio_flamingo.model import AudioFlamingo
# Generate a random input sequence
text = torch.randint(0, 256, (1, 1024))
audio = torch.randn(1, 16000)
# Initialize AudioFlamingo model
model = AudioFlamingo(
dim=512,
num_tokens=256,
max_seq_len=1024,
heads=8,
depth=6,
dim_head=64,
dropout=0.1,
context_dim=512,
)
# Pass the input sequence through the model
output = model(text, audio) # (1, 1024, 256)
# Print the output shape
print(output.shape)
# Path: audio_flamingo/model.py
```
# License
MIT
Raw data
{
"_id": null,
"home_page": "https://github.com/kyegomez/AudioFlamingo",
"name": "audio-flamingo",
"maintainer": "",
"docs_url": null,
"requires_python": ">=3.6,<4.0",
"maintainer_email": "",
"keywords": "artificial intelligence,deep learning,optimizers,Prompt Engineering",
"author": "Kye Gomez",
"author_email": "kye@apac.ai",
"download_url": "https://files.pythonhosted.org/packages/81/0c/ed19e09199dc516035028509e35dd75ae9271aa8d13895ab973c1f342e54/audio_flamingo-0.0.3.tar.gz",
"platform": null,
"description": "[![Multi-Modality](agorabanner.png)](https://discord.gg/qUtxnK2NMf)\n\n# AudioFlamingo\nImplementation of the model \"AudioFlamingo\" from the paper: \"Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities\". [PAPER LINK](https://arxiv.org/pdf/2402.01831.pdf)\n\n\n## Install\n`pip3 install audio-flamingo`\n\n## Usage\n```python\nimport torch\nfrom audio_flamingo.model import AudioFlamingo\n\n# Generate a random input sequence\ntext = torch.randint(0, 256, (1, 1024))\naudio = torch.randn(1, 16000)\n\n# Initialize AudioFlamingo model\nmodel = AudioFlamingo(\n dim=512,\n num_tokens=256,\n max_seq_len=1024,\n heads=8,\n depth=6,\n dim_head=64,\n dropout=0.1,\n context_dim=512,\n)\n\n# Pass the input sequence through the model\noutput = model(text, audio) # (1, 1024, 256)\n\n# Print the output shape\nprint(output.shape)\n# Path: audio_flamingo/model.py\n\n```\n\n# License\nMIT\n",
"bugtrack_url": null,
"license": "MIT",
"summary": "Paper - Pytorch",
"version": "0.0.3",
"project_urls": {
"Documentation": "https://github.com/kyegomez/AudioFlamingo",
"Homepage": "https://github.com/kyegomez/AudioFlamingo",
"Repository": "https://github.com/kyegomez/AudioFlamingo"
},
"split_keywords": [
"artificial intelligence",
"deep learning",
"optimizers",
"prompt engineering"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "0ab8f5a549c7de7b265d667998537094e78434f98a8499336648207271f2ff61",
"md5": "41932f5d7cc5f9bd234d9e2fc9cee5cd",
"sha256": "c3bfda9840a88e7a59da33c2d66d5b51815e5b67dc6aba2bfcfcc181fb7d8409"
},
"downloads": -1,
"filename": "audio_flamingo-0.0.3-py3-none-any.whl",
"has_sig": false,
"md5_digest": "41932f5d7cc5f9bd234d9e2fc9cee5cd",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">=3.6,<4.0",
"size": 6156,
"upload_time": "2024-02-07T17:54:48",
"upload_time_iso_8601": "2024-02-07T17:54:48.919149Z",
"url": "https://files.pythonhosted.org/packages/0a/b8/f5a549c7de7b265d667998537094e78434f98a8499336648207271f2ff61/audio_flamingo-0.0.3-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "810ced19e09199dc516035028509e35dd75ae9271aa8d13895ab973c1f342e54",
"md5": "600c77034cb28e6e76f9e6315f9b6d52",
"sha256": "ce8f94ed6a4edaa05631a5e42ce47645a98792aedc54c22d9ac309f055faecd6"
},
"downloads": -1,
"filename": "audio_flamingo-0.0.3.tar.gz",
"has_sig": false,
"md5_digest": "600c77034cb28e6e76f9e6315f9b6d52",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3.6,<4.0",
"size": 6226,
"upload_time": "2024-02-07T17:54:50",
"upload_time_iso_8601": "2024-02-07T17:54:50.563870Z",
"url": "https://files.pythonhosted.org/packages/81/0c/ed19e09199dc516035028509e35dd75ae9271aa8d13895ab973c1f342e54/audio_flamingo-0.0.3.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-02-07 17:54:50",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "kyegomez",
"github_project": "AudioFlamingo",
"travis_ci": false,
"coveralls": false,
"github_actions": true,
"requirements": [],
"lcname": "audio-flamingo"
}