sdgne


Namesdgne JSON
Version 4.0.0 PyPI version JSON
download
home_pagehttps://github.com/SartajBhuvaji
SummarySynthetic Data Generation and Evaluation
upload_time2024-02-10 01:20:35
maintainer
docs_urlNone
authorSartaj Bhuvaji
requires_python
license
keywords python synthetic data autoencoders smote
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            


  [![Build](https://github.com/SartajBhuvaji/SDGnE/actions/workflows/main.yaml/badge.svg)](https://github.com/SartajBhuvaji/SDGnE/actions/workflows/main.yaml)





## About

- SDGnE (Synthetic Data Generation and Evaluation) is a Python package designed to generate synthetic data and evaluate its quality using neural network models. 

- This tool is intended for developers and researchers who require synthetic datasets for testing and development.

- The current dittto version `v1.0.0` uses <i>Autoencoders</i> and <i>SMOTE</i> to generate synthetic data.



## Getting Started

`pip install sdgne`

 

 ## Notebooks

 To get started, we have created notebook for the Autoencoder and SMOTE algorithm.



  ### Auto Encoder

  Autoencoders are a class of neural networks designed for unsupervised learning and representing features in a smaller space. They consist of an encoder and a decoder, intending to learn the input data's compressed representation (encoding).  We leverage this architecture to generate synthetic data.



  [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/SartajBhuvaji/SDGnE/blob/main/notebooks/SDGnE_Autoencoder_Notebook.ipynb)



  ### SMOTE

  SMOTE, abbreviated as Synthetic Minority Oversampling Technique, is used to generate synthetic data from the original dataset. Over the years, several variants of SMOTE have been developed, each tailored to specific scenarios and requirements. These variants employ distinct methodologies and innovations to enhance the generation of synthetic data, thereby improving model performance by ensuring a more balanced distribution of classes. We provide a few SMOTE variants for synthetic data generation.



  [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/SartajBhuvaji/SDGnE/blob/main/notebooks/SDGnE_SMOTE_Notebook.ipynb)



  ### Comparison

  In this notebook, we will compare the `Single Encoder Autoencoder` and the `SMOTE Algorithm` for synthetic data generation. We will generate synthetic data using both the algorithms and perform statistical evaluation.

  

  [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/SartajBhuvaji/SDGnE/blob/main/notebooks/SDGnE_Comparison_Notebook.ipynb)



 ## Features



- **Data Generation**: Create synthetic datasets that mimic the statistical properties of real-world data.

- **Neural Autoencoders**: Utilize various autoencoder architectures to learn data representations.

- **Evaluation Metrics**: Assess the quality of synthetic data using built-in evaluation metrics.

- **Extensibility**: Easily extend the package with custom data generators and evaluators.



 ## Links

 - **Documentation**: https://seattle-university.gitbook.io/sdgne/

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/SartajBhuvaji",
    "name": "sdgne",
    "maintainer": "",
    "docs_url": null,
    "requires_python": "",
    "maintainer_email": "",
    "keywords": "python,synthetic data,autoencoders,smote",
    "author": "Sartaj Bhuvaji",
    "author_email": "s.bhuvaj@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/5c/7c/aba637683b0930f8aa927168f811de462dbee33d184a981d6092595e4818/sdgne-4.0.0.tar.gz",
    "platform": null,
    "description": "\r\n\r\n\r\n  [![Build](https://github.com/SartajBhuvaji/SDGnE/actions/workflows/main.yaml/badge.svg)](https://github.com/SartajBhuvaji/SDGnE/actions/workflows/main.yaml)\r\n\r\n\r\n\r\n\r\n\r\n## About\r\n\r\n- SDGnE (Synthetic Data Generation and Evaluation) is a Python package designed to generate synthetic data and evaluate its quality using neural network models. \r\n\r\n- This tool is intended for developers and researchers who require synthetic datasets for testing and development.\r\n\r\n- The current dittto version `v1.0.0` uses <i>Autoencoders</i> and <i>SMOTE</i> to generate synthetic data.\r\n\r\n\r\n\r\n## Getting Started\r\n\r\n`pip install sdgne`\r\n\r\n \r\n\r\n ## Notebooks\r\n\r\n To get started, we have created notebook for the Autoencoder and SMOTE algorithm.\r\n\r\n\r\n\r\n  ### Auto Encoder\r\n\r\n  Autoencoders are a class of neural networks designed for unsupervised learning and representing features in a smaller space. They consist of an encoder and a decoder, intending to learn the input data's compressed representation (encoding).  We leverage this architecture to generate synthetic data.\r\n\r\n\r\n\r\n  [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/SartajBhuvaji/SDGnE/blob/main/notebooks/SDGnE_Autoencoder_Notebook.ipynb)\r\n\r\n\r\n\r\n  ### SMOTE\r\n\r\n  SMOTE, abbreviated as Synthetic Minority Oversampling Technique, is used to generate synthetic data from the original dataset. Over the years, several variants of SMOTE have been developed, each tailored to specific scenarios and requirements. These variants employ distinct methodologies and innovations to enhance the generation of synthetic data, thereby improving model performance by ensuring a more balanced distribution of classes. We provide a few SMOTE variants for synthetic data\u00a0generation.\r\n\r\n\r\n\r\n  [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/SartajBhuvaji/SDGnE/blob/main/notebooks/SDGnE_SMOTE_Notebook.ipynb)\r\n\r\n\r\n\r\n  ### Comparison\r\n\r\n  In this notebook, we will compare the `Single Encoder Autoencoder` and the `SMOTE Algorithm` for synthetic data generation. We will generate synthetic data using both the algorithms and perform statistical evaluation.\r\n\r\n  \r\n\r\n  [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/SartajBhuvaji/SDGnE/blob/main/notebooks/SDGnE_Comparison_Notebook.ipynb)\r\n\r\n\r\n\r\n ## Features\r\n\r\n\r\n\r\n- **Data Generation**: Create synthetic datasets that mimic the statistical properties of real-world data.\r\n\r\n- **Neural Autoencoders**: Utilize various autoencoder architectures to learn data representations.\r\n\r\n- **Evaluation Metrics**: Assess the quality of synthetic data using built-in evaluation metrics.\r\n\r\n- **Extensibility**: Easily extend the package with custom data generators and evaluators.\r\n\r\n\r\n\r\n ## Links\r\n\r\n - **Documentation**: https://seattle-university.gitbook.io/sdgne/\r\n",
    "bugtrack_url": null,
    "license": "",
    "summary": "Synthetic Data Generation and Evaluation",
    "version": "4.0.0",
    "project_urls": {
        "Homepage": "https://github.com/SartajBhuvaji"
    },
    "split_keywords": [
        "python",
        "synthetic data",
        "autoencoders",
        "smote"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "78edc1781b8cc46046186716f0d6e79364b258ea848d1ca9112c2a0a096ef7d2",
                "md5": "db7143ca305a311fe27c0e730c2fe36a",
                "sha256": "1b83ee659a5da787d08e26417a9218f3aacb994d2017e714228f733a25419b4a"
            },
            "downloads": -1,
            "filename": "sdgne-4.0.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "db7143ca305a311fe27c0e730c2fe36a",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": null,
            "size": 293289,
            "upload_time": "2024-02-10T01:20:33",
            "upload_time_iso_8601": "2024-02-10T01:20:33.057072Z",
            "url": "https://files.pythonhosted.org/packages/78/ed/c1781b8cc46046186716f0d6e79364b258ea848d1ca9112c2a0a096ef7d2/sdgne-4.0.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "5c7caba637683b0930f8aa927168f811de462dbee33d184a981d6092595e4818",
                "md5": "37827acde501aea8dd44b2693f5ac18e",
                "sha256": "c771e7bc81612fa95a783da759a63f30ea44c9a6e88c705f0fdcf2301cd7bcc1"
            },
            "downloads": -1,
            "filename": "sdgne-4.0.0.tar.gz",
            "has_sig": false,
            "md5_digest": "37827acde501aea8dd44b2693f5ac18e",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": null,
            "size": 286614,
            "upload_time": "2024-02-10T01:20:35",
            "upload_time_iso_8601": "2024-02-10T01:20:35.809735Z",
            "url": "https://files.pythonhosted.org/packages/5c/7c/aba637683b0930f8aa927168f811de462dbee33d184a981d6092595e4818/sdgne-4.0.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-02-10 01:20:35",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "sdgne"
}
        
Elapsed time: 0.21335s