[![Multi-Modality](agorabanner.png)](https://discord.gg/qUtxnK2NMf)
# AST
Implementation of AST from the paper: "AST: Audio Spectrogram Transformer' in PyTorch and Zeta. In this implementation we basically take an 2d input tensor representing audio -> then patchify it -> linear proj -> then position embeddings -> then attention and feedforward in a loop for layers. Please Join Agora and tag me if this could be improved in any capacity.
## Install
`pip3 install ast-torch`
## Usage
```python
import torch
from ast_torch.model import ASTransformer
# Create dummy data
x = torch.randn(2, 16)
# Initialize model
model = ASTransformer(
dim=4, seqlen=16, dim_head=4, heads=4, depth=2, patch_size=4
)
# Run model and print output shape
print(model(x).shape)
```
# Citation
```bibtex
@misc{gong2021ast,
title={AST: Audio Spectrogram Transformer},
author={Yuan Gong and Yu-An Chung and James Glass},
year={2021},
eprint={2104.01778},
archivePrefix={arXiv},
primaryClass={cs.SD}
}
```
# License
MIT
Raw data
{
"_id": null,
"home_page": "https://github.com/kyegomez/AST",
"name": "ast-torch",
"maintainer": "",
"docs_url": null,
"requires_python": ">=3.6,<4.0",
"maintainer_email": "",
"keywords": "artificial intelligence,deep learning,optimizers,Prompt Engineering",
"author": "Kye Gomez",
"author_email": "kye@apac.ai",
"download_url": "https://files.pythonhosted.org/packages/89/49/5c19a9ad444f07eea9d7f4e051a24ff0364e4151431f1b151e596c98b189/ast_torch-0.0.5.tar.gz",
"platform": null,
"description": "[![Multi-Modality](agorabanner.png)](https://discord.gg/qUtxnK2NMf)\n\n# AST\nImplementation of AST from the paper: \"AST: Audio Spectrogram Transformer' in PyTorch and Zeta. In this implementation we basically take an 2d input tensor representing audio -> then patchify it -> linear proj -> then position embeddings -> then attention and feedforward in a loop for layers. Please Join Agora and tag me if this could be improved in any capacity.\n\n## Install\n`pip3 install ast-torch`\n\n## Usage\n\n```python\nimport torch\nfrom ast_torch.model import ASTransformer\n\n# Create dummy data\nx = torch.randn(2, 16)\n\n# Initialize model\nmodel = ASTransformer(\n dim=4, seqlen=16, dim_head=4, heads=4, depth=2, patch_size=4\n)\n\n# Run model and print output shape\nprint(model(x).shape)\n\n\n```\n\n\n# Citation\n```bibtex\n@misc{gong2021ast,\n title={AST: Audio Spectrogram Transformer}, \n author={Yuan Gong and Yu-An Chung and James Glass},\n year={2021},\n eprint={2104.01778},\n archivePrefix={arXiv},\n primaryClass={cs.SD}\n}\n\n```\n\n# License\nMIT",
"bugtrack_url": null,
"license": "MIT",
"summary": "ast - Pytorch",
"version": "0.0.5",
"project_urls": {
"Documentation": "https://github.com/kyegomez/AST",
"Homepage": "https://github.com/kyegomez/AST",
"Repository": "https://github.com/kyegomez/AST"
},
"split_keywords": [
"artificial intelligence",
"deep learning",
"optimizers",
"prompt engineering"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "5ea1c942462042444954e43c5476d5868289268ef9b8993b31e40c5233b5a7ed",
"md5": "9c9ac66dd4ba09e6696016e8c9b7dc15",
"sha256": "1920586c09473c403878e20c3e483194d0d8f4137ea644d8d9a24ae49bc36e55"
},
"downloads": -1,
"filename": "ast_torch-0.0.5-py3-none-any.whl",
"has_sig": false,
"md5_digest": "9c9ac66dd4ba09e6696016e8c9b7dc15",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">=3.6,<4.0",
"size": 7199,
"upload_time": "2024-01-01T08:28:51",
"upload_time_iso_8601": "2024-01-01T08:28:51.523418Z",
"url": "https://files.pythonhosted.org/packages/5e/a1/c942462042444954e43c5476d5868289268ef9b8993b31e40c5233b5a7ed/ast_torch-0.0.5-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "89495c19a9ad444f07eea9d7f4e051a24ff0364e4151431f1b151e596c98b189",
"md5": "0dac2ae3aaef4391931c40ba2c45ef40",
"sha256": "2db6f9d1c9b9f732881a1328905e084ef6073b7fda858d4aa12bb79a6d86582c"
},
"downloads": -1,
"filename": "ast_torch-0.0.5.tar.gz",
"has_sig": false,
"md5_digest": "0dac2ae3aaef4391931c40ba2c45ef40",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3.6,<4.0",
"size": 6853,
"upload_time": "2024-01-01T08:28:52",
"upload_time_iso_8601": "2024-01-01T08:28:52.569808Z",
"url": "https://files.pythonhosted.org/packages/89/49/5c19a9ad444f07eea9d7f4e051a24ff0364e4151431f1b151e596c98b189/ast_torch-0.0.5.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-01-01 08:28:52",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "kyegomez",
"github_project": "AST",
"travis_ci": false,
"coveralls": false,
"github_actions": true,
"requirements": [],
"lcname": "ast-torch"
}