audio-flamingo

Name	audio-flamingo JSON
Version	0.0.3 JSON
	download
home_page	https://github.com/kyegomez/AudioFlamingo
Summary	Paper - Pytorch
upload_time	2024-02-07 17:54:50
maintainer
docs_url	None
author	Kye Gomez
requires_python	>=3.6,<4.0
license	MIT
keywords	artificial intelligence deep learning optimizers prompt engineering
VCS
bugtrack_url
requirements	No requirements were recorded.
Travis-CI	No Travis.
coveralls test coverage	No coveralls.

            [![Multi-Modality](agorabanner.png)](https://discord.gg/qUtxnK2NMf)

# AudioFlamingo
Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities". [PAPER LINK](https://arxiv.org/pdf/2402.01831.pdf)


## Install
`pip3 install audio-flamingo`

## Usage
```python
import torch
from audio_flamingo.model import AudioFlamingo

# Generate a random input sequence
text = torch.randint(0, 256, (1, 1024))
audio = torch.randn(1, 16000)

# Initialize AudioFlamingo model
model = AudioFlamingo(
    dim=512,
    num_tokens=256,
    max_seq_len=1024,
    heads=8,
    depth=6,
    dim_head=64,
    dropout=0.1,
    context_dim=512,
)

# Pass the input sequence through the model
output = model(text, audio)  # (1, 1024, 256)

# Print the output shape
print(output.shape)
# Path: audio_flamingo/model.py

```

# License
MIT

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/kyegomez/AudioFlamingo",
    "name": "audio-flamingo",
    "maintainer": "",
    "docs_url": null,
    "requires_python": ">=3.6,<4.0",
    "maintainer_email": "",
    "keywords": "artificial intelligence,deep learning,optimizers,Prompt Engineering",
    "author": "Kye Gomez",
    "author_email": "kye@apac.ai",
    "download_url": "https://files.pythonhosted.org/packages/81/0c/ed19e09199dc516035028509e35dd75ae9271aa8d13895ab973c1f342e54/audio_flamingo-0.0.3.tar.gz",
    "platform": null,
    "description": "[![Multi-Modality](agorabanner.png)](https://discord.gg/qUtxnK2NMf)\n\n# AudioFlamingo\nImplementation of the model \"AudioFlamingo\" from the paper: \"Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities\". [PAPER LINK](https://arxiv.org/pdf/2402.01831.pdf)\n\n\n## Install\n`pip3 install audio-flamingo`\n\n## Usage\n```python\nimport torch\nfrom audio_flamingo.model import AudioFlamingo\n\n# Generate a random input sequence\ntext = torch.randint(0, 256, (1, 1024))\naudio = torch.randn(1, 16000)\n\n# Initialize AudioFlamingo model\nmodel = AudioFlamingo(\n    dim=512,\n    num_tokens=256,\n    max_seq_len=1024,\n    heads=8,\n    depth=6,\n    dim_head=64,\n    dropout=0.1,\n    context_dim=512,\n)\n\n# Pass the input sequence through the model\noutput = model(text, audio)  # (1, 1024, 256)\n\n# Print the output shape\nprint(output.shape)\n# Path: audio_flamingo/model.py\n\n```\n\n# License\nMIT\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "Paper - Pytorch",
    "version": "0.0.3",
    "project_urls": {
        "Documentation": "https://github.com/kyegomez/AudioFlamingo",
        "Homepage": "https://github.com/kyegomez/AudioFlamingo",
        "Repository": "https://github.com/kyegomez/AudioFlamingo"
    },
    "split_keywords": [
        "artificial intelligence",
        "deep learning",
        "optimizers",
        "prompt engineering"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "0ab8f5a549c7de7b265d667998537094e78434f98a8499336648207271f2ff61",
                "md5": "41932f5d7cc5f9bd234d9e2fc9cee5cd",
                "sha256": "c3bfda9840a88e7a59da33c2d66d5b51815e5b67dc6aba2bfcfcc181fb7d8409"
            },
            "downloads": -1,
            "filename": "audio_flamingo-0.0.3-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "41932f5d7cc5f9bd234d9e2fc9cee5cd",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.6,<4.0",
            "size": 6156,
            "upload_time": "2024-02-07T17:54:48",
            "upload_time_iso_8601": "2024-02-07T17:54:48.919149Z",
            "url": "https://files.pythonhosted.org/packages/0a/b8/f5a549c7de7b265d667998537094e78434f98a8499336648207271f2ff61/audio_flamingo-0.0.3-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "810ced19e09199dc516035028509e35dd75ae9271aa8d13895ab973c1f342e54",
                "md5": "600c77034cb28e6e76f9e6315f9b6d52",
                "sha256": "ce8f94ed6a4edaa05631a5e42ce47645a98792aedc54c22d9ac309f055faecd6"
            },
            "downloads": -1,
            "filename": "audio_flamingo-0.0.3.tar.gz",
            "has_sig": false,
            "md5_digest": "600c77034cb28e6e76f9e6315f9b6d52",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.6,<4.0",
            "size": 6226,
            "upload_time": "2024-02-07T17:54:50",
            "upload_time_iso_8601": "2024-02-07T17:54:50.563870Z",
            "url": "https://files.pythonhosted.org/packages/81/0c/ed19e09199dc516035028509e35dd75ae9271aa8d13895ab973c1f342e54/audio_flamingo-0.0.3.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-02-07 17:54:50",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "kyegomez",
    "github_project": "AudioFlamingo",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "requirements": [],
    "lcname": "audio-flamingo"
}

Kye Gomez