A simple augmentation of PyTorch's VisionTransformer class from torchvision.models to include registers, as per: https://arxiv.org/abs/2309.16588
Introduces registers to the encoder: they are appended as tokens to the 'patchified' sequence and excluded from the output.
The tokens are learnable parameters and do not receive positional embeddings.
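
As a rough illustration of the bookkeeping this implies, the sketch below counts tokens at each stage. It assumes a square image, a single class token (as in torchvision's VisionTransformer), and registers that exist only inside the encoder; `token_counts` is a hypothetical helper, not part of rvit.

```python
def token_counts(image_size: int, patch_size: int, num_registers: int) -> dict:
    """Count tokens for a ViT with registers (illustrative helper, not rvit's API)."""
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    num_patches = (image_size // patch_size) ** 2
    base = num_patches + 1  # patch tokens plus one class token
    return {
        # Registers receive no positional embeddings, so only the base tokens count.
        "positional_embeddings": base,
        # Inside the encoder the registers are appended to the sequence.
        "encoder_sequence": base + num_registers,
        # Registers are dropped again before the output is returned.
        "output_sequence": base,
    }


print(token_counts(image_size=224, patch_size=16, num_registers=4))
# {'positional_embeddings': 197, 'encoder_sequence': 201, 'output_sequence': 197}
```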
The API of the class is identical to that of VisionTransformer, except for the additional init argument 'num_registers', which specifies the number of register tokens.
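
The mechanism described above can be sketched in plain PyTorch. This is a minimal illustration under stated assumptions, not rvit's actual implementation: the class name `RegisterEncoderSketch`, its constructor arguments other than `num_registers`, and the use of `nn.TransformerEncoder` in place of torchvision's own encoder blocks are all choices made for this example.

```python
import torch
from torch import nn


class RegisterEncoderSketch(nn.Module):
    """Illustrative encoder that appends register tokens and drops them again."""

    def __init__(self, seq_length: int, hidden_dim: int, num_heads: int,
                 num_layers: int, num_registers: int = 4):
        super().__init__()
        self.num_registers = num_registers
        # Positional embeddings cover only the original tokens (class + patches).
        self.pos_embedding = nn.Parameter(
            torch.empty(1, seq_length, hidden_dim).normal_(std=0.02))
        # Registers are learnable parameters with no positional embedding.
        self.registers = nn.Parameter(
            torch.empty(1, num_registers, hidden_dim).normal_(std=0.02))
        layer = nn.TransformerEncoderLayer(
            d_model=hidden_dim, nhead=num_heads,
            dim_feedforward=4 * hidden_dim, batch_first=True)
        self.layers = nn.TransformerEncoder(layer, num_layers=num_layers)

    def forward(self, tokens: torch.Tensor) -> torch.Tensor:
        # tokens: (batch, seq_length, hidden_dim)
        batch = tokens.shape[0]
        x = tokens + self.pos_embedding
        # Append the registers after the positionally embedded tokens.
        x = torch.cat([x, self.registers.expand(batch, -1, -1)], dim=1)
        x = self.layers(x)
        # Exclude the registers from the output.
        return x[:, : x.shape[1] - self.num_registers]


if __name__ == "__main__":
    encoder = RegisterEncoderSketch(seq_length=197, hidden_dim=32,
                                    num_heads=4, num_layers=2, num_registers=4)
    out = encoder(torch.randn(2, 197, 32))
    print(out.shape)  # torch.Size([2, 197, 32])
```

The registers take part in attention in every layer, but because they are sliced off at the end, code that consumes the encoder output sees the same sequence length as it would without them.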
## Installation
```
pip install rvit
```
## Raw data
{
"_id": null,
"home_page": null,
"name": "rvit",
"maintainer": null,
"docs_url": null,
"requires_python": null,
"maintainer_email": null,
"keywords": "python, pytorch, vision transformer, register, registers",
"author": "Joe Griffith",
"author_email": "<joeagriffith@gmail.com",
"download_url": "https://files.pythonhosted.org/packages/c8/78/a3b65b56cedb793e3dc8faac565f9ac9dee757470e4851e0984572340db8/rvit-1.0.3.tar.gz",
"platform": null,
"description": "\r\nA simple augmentation of PyTorch's VisionTransform class from torchvision.models to include registers, as per: https://arxiv.org/abs/2309.16588\r\n\r\n\r\n\r\nIntroduces registers to the encoder that are appended as tokens to the 'patchified' sequence, and excluded in the output.\r\n\r\nThe tokens are learnable parameters and do not receive positional embeddings.\r\n\r\n\r\n\r\nThe API of the class is identical with VisionTransformer, except the additional init argument for 'num_registers', which specifies the number of register tokens.\r\n\r\n\r\n\r\n## Installation\r\n\r\n```\r\n\r\npip install rvit\r\n\r\n```\r\n",
"bugtrack_url": null,
"license": null,
"summary": "Vision Transformer with Registers",
"version": "1.0.3",
"project_urls": null,
"split_keywords": [
"python",
" pytorch",
" vision transformer",
" register",
" registers"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "2293f6e6647715e9ea194d172a541a3042a3a15aa74489816d9a6d36ff9ccc05",
"md5": "08617510d8085c73d173e5544b155f35",
"sha256": "4e4d024ade3e8cea6ba59db9eee35cecf0106d45f927fb13a1faaaf19744fdde"
},
"downloads": -1,
"filename": "rvit-1.0.3-py3-none-any.whl",
"has_sig": false,
"md5_digest": "08617510d8085c73d173e5544b155f35",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 3083,
"upload_time": "2024-05-15T16:28:22",
"upload_time_iso_8601": "2024-05-15T16:28:22.900201Z",
"url": "https://files.pythonhosted.org/packages/22/93/f6e6647715e9ea194d172a541a3042a3a15aa74489816d9a6d36ff9ccc05/rvit-1.0.3-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "c878a3b65b56cedb793e3dc8faac565f9ac9dee757470e4851e0984572340db8",
"md5": "97e8673b06f69ac6370363408b119e7c",
"sha256": "7cea8527dfaff35df676089db4befd7090ad3d5f8f5b9cb5f5c414eb59c28fb4"
},
"downloads": -1,
"filename": "rvit-1.0.3.tar.gz",
"has_sig": false,
"md5_digest": "97e8673b06f69ac6370363408b119e7c",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 2979,
"upload_time": "2024-05-15T16:28:25",
"upload_time_iso_8601": "2024-05-15T16:28:25.092768Z",
"url": "https://files.pythonhosted.org/packages/c8/78/a3b65b56cedb793e3dc8faac565f9ac9dee757470e4851e0984572340db8/rvit-1.0.3.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-05-15 16:28:25",
"github": false,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"lcname": "rvit"
}