maskgit

Name	maskgit
Version	0.0.1.dev0
home_page	None
Summary	maskgit
upload_time	2024-06-26 04:43:14
maintainer	None
docs_url	None
author	Shadow Walker
requires_python	None
license	None
keywords	maskgit
requirements	No requirements were recorded.

# MaskGIT: Masked Generative Image Transformer
Official JAX implementation of the CVPR 2022 paper

[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/maskgit-masked-generative-image-transformer/image-generation-on-imagenet-512x512)](https://paperswithcode.com/sota/image-generation-on-imagenet-512x512?p=maskgit-masked-generative-image-transformer)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/maskgit-masked-generative-image-transformer/image-generation-on-imagenet-256x256)](https://paperswithcode.com/sota/image-generation-on-imagenet-256x256?p=maskgit-masked-generative-image-transformer)

[[Paper](https://arxiv.org/abs/2202.04200)] [[Project Page](https://masked-generative-image-transformer.github.io/)] [[Demo Colab](https://colab.research.google.com/github/google-research/maskgit/blob/main/MaskGIT_demo.ipynb)]

![teaser](imgs/teaser.png)

## Summary
MaskGIT is a novel image synthesis paradigm using a bidirectional transformer decoder. During training, MaskGIT learns to predict randomly masked tokens by attending to tokens in all directions. At inference time, the model begins by generating all tokens of an image simultaneously, and then refines the image iteratively, conditioned on the previous generation.
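
The iterative refinement described above can be illustrated with a toy sketch. This is not the repository's code: `toy_logits` is a hypothetical stand-in that returns random logits instead of running a transformer, `MASK_ID` is an arbitrary sentinel, and the cosine masking schedule is an assumption based on the schedule described in the paper. Each step proposes a token for every masked position, keeps the most confident proposals, and re-masks the rest for the next step.

```python
import math

import jax
import jax.numpy as jnp

MASK_ID = -1  # hypothetical sentinel meaning "not decoded yet"


def toy_logits(tokens, vocab_size, key):
    """Stand-in for the bidirectional transformer (assumption).

    A real model would attend to `tokens`; this toy ignores their values
    and returns random logits of shape [seq_len, vocab_size].
    """
    return jax.random.normal(key, (tokens.shape[0], vocab_size))


def iterative_decode(key, seq_len=16, vocab_size=8, num_steps=4):
    # Start from a fully masked sequence.
    tokens = jnp.full((seq_len,), MASK_ID, dtype=jnp.int32)
    for step in range(num_steps):
        key, model_key, sample_key = jax.random.split(key, 3)
        logits = toy_logits(tokens, vocab_size, model_key)

        # Propose a token at every position in parallel.
        sampled = jax.random.categorical(sample_key, logits, axis=-1)
        probs = jax.nn.softmax(logits, axis=-1)
        confidence = jnp.take_along_axis(probs, sampled[:, None], axis=-1)[:, 0]

        # Tokens fixed in earlier steps are kept and never re-masked.
        is_masked = tokens == MASK_ID
        proposal = jnp.where(is_masked, sampled, tokens)
        confidence = jnp.where(is_masked, confidence, jnp.inf)

        # Cosine schedule (assumption): how many tokens stay masked after this step.
        ratio = math.cos(math.pi / 2 * (step + 1) / num_steps)
        num_masked = int(math.floor(ratio * seq_len))

        # Re-mask the `num_masked` least confident positions.
        ranks = jnp.argsort(jnp.argsort(confidence))
        tokens = jnp.where(ranks < num_masked, MASK_ID, proposal)
    return tokens


tokens = iterative_decode(jax.random.PRNGKey(0))
print(tokens)
```

With this schedule the masked count reaches zero on the last step, so every position holds a token id after `num_steps` iterations.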

## Running pretrained models

Class-conditional image generation models:

| Dataset  | Resolution | Model | Link | FID |
| ------------- | ------------- | ------------- | ------------- | ------------- |
| ImageNet  | 256 x 256 | Tokenizer | [checkpoint](https://storage.googleapis.com/maskgit-public/checkpoints/tokenizer_imagenet256_checkpoint)| 2.28 (reconstruction) |
| ImageNet  | 512 x 512 | Tokenizer | [checkpoint](https://storage.googleapis.com/maskgit-public/checkpoints/tokenizer_imagenet512_checkpoint)| 1.97 (reconstruction) |
| ImageNet  | 256 x 256 | MaskGIT Transformer |[checkpoint](https://storage.googleapis.com/maskgit-public/checkpoints/maskgit_imagenet256_checkpoint)| 6.06 (generation) |
| ImageNet  | 512 x 512 | MaskGIT Transformer | [checkpoint](https://storage.googleapis.com/maskgit-public/checkpoints/maskgit_imagenet512_checkpoint) | 7.32 (generation) |

You can run these models for class-conditional image **generation** and **editing** in the [demo Colab](https://colab.research.google.com/github/google-research/maskgit/blob/main/MaskGIT_demo.ipynb).
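
To work with the checkpoints outside the Colab, the files can be fetched directly. The sketch below is a minimal example that assumes the URL pattern visible in the table above; `checkpoint_url` and `download_checkpoint` are hypothetical helper names, not part of the package, and loading the downloaded file into a model is left to the demo notebook.

```python
import urllib.request
from pathlib import Path

# Base path shared by the four checkpoint links in the table.
BASE_URL = "https://storage.googleapis.com/maskgit-public/checkpoints"


def checkpoint_url(model: str, resolution: int) -> str:
    """Build a checkpoint URL; `model` is "tokenizer" or "maskgit"."""
    if model not in ("tokenizer", "maskgit"):
        raise ValueError(f"unknown model: {model!r}")
    if resolution not in (256, 512):
        raise ValueError(f"unsupported resolution: {resolution}")
    return f"{BASE_URL}/{model}_imagenet{resolution}_checkpoint"


def download_checkpoint(model: str, resolution: int, out_dir: str = "checkpoints") -> Path:
    """Download a checkpoint into `out_dir`, skipping files that already exist."""
    target_dir = Path(out_dir)
    target_dir.mkdir(parents=True, exist_ok=True)
    url = checkpoint_url(model, resolution)
    destination = target_dir / url.rsplit("/", 1)[-1]
    if not destination.exists():
        urllib.request.urlretrieve(url, destination)
    return destination
```

For example, `download_checkpoint("tokenizer", 256)` would save `tokenizer_imagenet256_checkpoint` under `checkpoints/`.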

![teaser](imgs/class-conditional-teaser-small.png)

## Training
[Coming Soon]


## BibTeX

```
@InProceedings{chang2022maskgit,
  title = {MaskGIT: Masked Generative Image Transformer},
  author = {Huiwen Chang and Han Zhang and Lu Jiang and Ce Liu and William T. Freeman},
  booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  month = {June},
  year = {2022}
}
```

## Disclaimer

This is not an officially supported Google product.

Raw data

{
    "_id": null,
    "home_page": null,
    "name": "maskgit",
    "maintainer": null,
    "docs_url": null,
    "requires_python": null,
    "maintainer_email": null,
    "keywords": "maskgit",
    "author": "Shadow Walker",
    "author_email": null,
    "download_url": null,
    "platform": null,
    "bugtrack_url": null,
    "license": null,
    "summary": "maskgit",
    "version": "0.0.1.dev0",
    "project_urls": null,
    "split_keywords": [
        "maskgit"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "feaf7835247ad28cf0dfae940d1b9f08e893c2ff2da7854c36c035037e468b85",
                "md5": "a8437984ad8a76190b7da2904ce4eab8",
                "sha256": "2339a47edbd06b12a1b8477e27e14c270c80ba2187838890d205ce8e3b380f9e"
            },
            "downloads": -1,
            "filename": "maskgit-0.0.1.dev0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "a8437984ad8a76190b7da2904ce4eab8",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": null,
            "size": 30500,
            "upload_time": "2024-06-26T04:43:14",
            "upload_time_iso_8601": "2024-06-26T04:43:14.683224Z",
            "url": "https://files.pythonhosted.org/packages/fe/af/7835247ad28cf0dfae940d1b9f08e893c2ff2da7854c36c035037e468b85/maskgit-0.0.1.dev0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-06-26 04:43:14",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "maskgit"
}
