# AugShuffleNet: Communicate More, Compute Less
AugShuffle1_0x has slightly different channels than the paper due to constraints in of the default `r`.
See: <https://arxiv.org/abs/2203.06589>
## Usage
```
import torch
from augshufflenet_pytorch import AugShuffleNet0_5x, AugShuffleNet1_0x, AugShuffleNet1_5x, AugShuffleNet
model = AugShuffleNet0_5x(input_channels=3)
x = model(torch.randn(1, 3, 64, 64)) # [1, 192]
# Equivalent to 0_5x
model = AugShuffleNet(stages_repeats=[3, 7, 3], stages_out_channels=[24, 48, 96, 192], input_channels=3, r=0.375)
x = model(torch.randn(1, 3, 64, 64)) # [1, 192]
```
> NOTE: each of the int(out_channels * r) & out_channels putneeds to be divisible by 2
## Citation
```
@misc{ye_augshufflenet_2022,
title = {{AugShuffleNet}: {Communicate} {More}, {Compute} {Less}},
shorttitle = {{AugShuffleNet}},
url = {http://arxiv.org/abs/2203.06589},
doi = {10.48550/arXiv.2203.06589},
abstract = {As a remarkable compact model, ShuffleNetV2 offers a good example to design efficient ConvNets but its limit is rarely noticed. In this paper, we rethink the design pattern of ShuffleNetV2 and find that the channel-wise redundancy problem still constrains the efficiency improvement of Shuffle block in the wider ShuffleNetV2. To resolve this issue, we propose another augmented variant of shuffle block in the form of bottleneck-like structure and more implicit short connections. To verify the effectiveness of this building block, we further build a more powerful and efficient model family, termed as AugShuffleNets. Evaluated on the CIFAR-10 and CIFAR-100 datasets, AugShuffleNet consistently outperforms ShuffleNetV2 in terms of accuracy with less computational cost and fewer parameter count.},
urldate = {2023-06-09},
publisher = {arXiv},
author = {Ye, Longqing},
month = aug,
year = {2022},
note = {arXiv:2203.06589 [cs]},
keywords = {Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning},
}
```
Raw data
{
"_id": null,
"home_page": "https://github.com/i404788/augshufflenet-pytorch",
"name": "augshufflenet-pytorch",
"maintainer": "",
"docs_url": null,
"requires_python": "",
"maintainer_email": "",
"keywords": "artificial intelligence,deep learning,shufflenet,convnet,computer vision",
"author": "Ferris Kwaijtaal",
"author_email": "ferris+gh@devdroplets.com",
"download_url": "https://files.pythonhosted.org/packages/58/b2/08103c308fd90c43e0389c8a36affbff3b003ddc36e3927810241c19165b/augshufflenet-pytorch-0.0.2.tar.gz",
"platform": null,
"description": "# AugShuffleNet: Communicate More, Compute Less\n\nAugShuffle1_0x has slightly different channels than the paper due to constraints in of the default `r`.\n\nSee: <https://arxiv.org/abs/2203.06589>\n\n## Usage\n```\nimport torch\nfrom augshufflenet_pytorch import AugShuffleNet0_5x, AugShuffleNet1_0x, AugShuffleNet1_5x, AugShuffleNet\n\n\nmodel = AugShuffleNet0_5x(input_channels=3)\nx = model(torch.randn(1, 3, 64, 64)) # [1, 192]\n\n# Equivalent to 0_5x\nmodel = AugShuffleNet(stages_repeats=[3, 7, 3], stages_out_channels=[24, 48, 96, 192], input_channels=3, r=0.375)\nx = model(torch.randn(1, 3, 64, 64)) # [1, 192]\n```\n\n> NOTE: each of the int(out_channels * r) & out_channels putneeds to be divisible by 2\n\n## Citation\n```\n@misc{ye_augshufflenet_2022,\n\ttitle = {{AugShuffleNet}: {Communicate} {More}, {Compute} {Less}},\n\tshorttitle = {{AugShuffleNet}},\n\turl = {http://arxiv.org/abs/2203.06589},\n\tdoi = {10.48550/arXiv.2203.06589},\n\tabstract = {As a remarkable compact model, ShuffleNetV2 offers a good example to design efficient ConvNets but its limit is rarely noticed. In this paper, we rethink the design pattern of ShuffleNetV2 and find that the channel-wise redundancy problem still constrains the efficiency improvement of Shuffle block in the wider ShuffleNetV2. To resolve this issue, we propose another augmented variant of shuffle block in the form of bottleneck-like structure and more implicit short connections. To verify the effectiveness of this building block, we further build a more powerful and efficient model family, termed as AugShuffleNets. Evaluated on the CIFAR-10 and CIFAR-100 datasets, AugShuffleNet consistently outperforms ShuffleNetV2 in terms of accuracy with less computational cost and fewer parameter count.},\n\turldate = {2023-06-09},\n\tpublisher = {arXiv},\n\tauthor = {Ye, Longqing},\n\tmonth = aug,\n\tyear = {2022},\n\tnote = {arXiv:2203.06589 [cs]},\n\tkeywords = {Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning},\n}\n```\n\n",
"bugtrack_url": null,
"license": "MIT",
"summary": "AugShuffleNet: Communicate More, Compute Less - Pytorch",
"version": "0.0.2",
"project_urls": {
"Homepage": "https://github.com/i404788/augshufflenet-pytorch"
},
"split_keywords": [
"artificial intelligence",
"deep learning",
"shufflenet",
"convnet",
"computer vision"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "58b208103c308fd90c43e0389c8a36affbff3b003ddc36e3927810241c19165b",
"md5": "a76602bcb6fe33537bd7d5ffc06b4b47",
"sha256": "ebc60812de5935f68d2f3aa208e7d35bacf7cd025cdce805b331b43e0021ce1b"
},
"downloads": -1,
"filename": "augshufflenet-pytorch-0.0.2.tar.gz",
"has_sig": false,
"md5_digest": "a76602bcb6fe33537bd7d5ffc06b4b47",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 9230,
"upload_time": "2023-06-09T15:03:36",
"upload_time_iso_8601": "2023-06-09T15:03:36.789850Z",
"url": "https://files.pythonhosted.org/packages/58/b2/08103c308fd90c43e0389c8a36affbff3b003ddc36e3927810241c19165b/augshufflenet-pytorch-0.0.2.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2023-06-09 15:03:36",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "i404788",
"github_project": "augshufflenet-pytorch",
"travis_ci": false,
"coveralls": false,
"github_actions": false,
"lcname": "augshufflenet-pytorch"
}