sb3-contrib


Name: sb3-contrib
Version: 2.4.0
Home page: https://github.com/Stable-Baselines-Team/stable-baselines3-contrib
Summary: Contrib package of Stable Baselines3, experimental code.
Upload time: 2024-11-18 10:20:53
Author: Antonin Raffin
Requires Python: >=3.8
License: MIT
Keywords: reinforcement-learning-algorithms reinforcement-learning machine-learning gym openai stable baselines toolbox python data-science

# Stable-Baselines3 - Contrib (SB3-Contrib)

Contrib package for [Stable-Baselines3](https://github.com/DLR-RM/stable-baselines3) - Experimental reinforcement learning (RL) code.
"sb3-contrib" for short.

### What is SB3-Contrib?

A place for RL algorithms and tools that are considered experimental, e.g. implementations of the latest publications. The goal is to keep the simplicity, documentation and style of stable-baselines3, but for less mature implementations.

### Why create this repository?

Over the span of stable-baselines and stable-baselines3, the community has been eager to contribute in the form of better logging utilities, environment wrappers, extended support (e.g. different action spaces) and learning algorithms.

However, sometimes these utilities were too niche to be considered for stable-baselines, or proved too difficult to integrate into the existing code without creating a mess. sb3-contrib aims to fix this by not requiring the neatest integration with existing code and by not setting limits on what is too niche: almost everything remotely useful goes!
We hope this allows us to provide reliable implementations that follow stable-baselines' usual standards (consistent style, documentation, etc.) beyond the relatively small scope of utilities in the main repository.


## Features

See the documentation for the full list of included features.

**RL Algorithms**:
- [Augmented Random Search (ARS)](https://arxiv.org/abs/1803.07055)
- [Quantile Regression DQN (QR-DQN)](https://arxiv.org/abs/1710.10044)
- [PPO with invalid action masking (MaskablePPO)](https://arxiv.org/abs/2006.14171)
- [PPO with recurrent policy (RecurrentPPO aka PPO LSTM)](https://ppo-details.cleanrl.dev//2021/11/05/ppo-implementation-details/)
- [Truncated Quantile Critics (TQC)](https://arxiv.org/abs/2005.04269)
- [Trust Region Policy Optimization (TRPO)](https://arxiv.org/abs/1502.05477)
- [Batch Normalization in Deep Reinforcement Learning (CrossQ)](https://openreview.net/forum?id=PczQtTsTIX)

**Gym Wrappers**:
- [Time Feature Wrapper](https://arxiv.org/abs/1712.00378)
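
The algorithms listed above follow the same interface as Stable-Baselines3 models. As a minimal sketch (not taken from this README), the snippet below trains TQC for a few steps. The environment id `Pendulum-v1`, the step budget and the helper names `train_tqc` and `sb3_contrib_available` are illustrative assumptions, not recommended settings.

```python
"""Minimal usage sketch for one SB3-Contrib algorithm (TQC)."""
import importlib.util


def sb3_contrib_available() -> bool:
    """Return True if the sb3_contrib package can be imported."""
    return importlib.util.find_spec("sb3_contrib") is not None


def train_tqc(env_id: str = "Pendulum-v1", total_timesteps: int = 1_000):
    """Train TQC briefly and return the model.

    Both default values are illustrative assumptions, not tuned settings.
    """
    # Imported lazily so this file can be loaded without sb3-contrib installed.
    from sb3_contrib import TQC

    # TQC targets continuous action spaces. Passing an environment id as a
    # string lets the library create the environment itself.
    model = TQC("MlpPolicy", env_id, verbose=1)
    model.learn(total_timesteps=total_timesteps)
    return model


if __name__ == "__main__":
    if sb3_contrib_available():
        train_tqc()
    else:
        print("sb3-contrib is not installed; see the Installation section.")
```

The other algorithms are imported from `sb3_contrib` in the same way; check the documentation for the policies and action spaces each one supports.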

## Documentation

Documentation is available online: [https://sb3-contrib.readthedocs.io/](https://sb3-contrib.readthedocs.io/)


## Installation

**Note:** You need the `master` version of [Stable Baselines3](https://github.com/DLR-RM/stable-baselines3/).
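
To see which versions are currently installed, a small check along these lines may help. It is only a sketch: comparing the first two version components is an assumed heuristic for "the two packages match", not an official compatibility rule, and the helper names are hypothetical.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name: str):
    """Return the installed version string of a distribution, or None."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


def same_release_series(a: str, b: str) -> bool:
    """Compare the first two components, e.g. the "2.4" of "2.4.0".

    Assumed heuristic only; it is not a rule stated by the project.
    """
    return a.split(".")[:2] == b.split(".")[:2]


if __name__ == "__main__":
    sb3 = installed_version("stable-baselines3")
    contrib = installed_version("sb3-contrib")
    print(f"stable-baselines3: {sb3}")
    print(f"sb3-contrib:       {contrib}")
    if sb3 and contrib and not same_release_series(sb3, contrib):
        print("Warning: the two packages come from different release series.")
```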

To install the `master` version of Stable Baselines3:
```
pip install git+https://github.com/DLR-RM/stable-baselines3
```

To install the `master` version of Stable Baselines3 contrib:
```
pip install git+https://github.com/Stable-Baselines-Team/stable-baselines3-contrib
```

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/Stable-Baselines-Team/stable-baselines3-contrib",
    "name": "sb3-contrib",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.8",
    "maintainer_email": null,
    "keywords": "reinforcement-learning-algorithms reinforcement-learning machine-learning gym openai stable baselines toolbox python data-science",
    "author": "Antonin Raffin",
    "author_email": "antonin.raffin@dlr.de",
    "download_url": "https://files.pythonhosted.org/packages/c7/4e/dd31aba603634a7b9473b502fe85b694e4392c448eae0d345640b87211c2/sb3_contrib-2.4.0.tar.gz",
    "platform": null,
    "description": "\n\n# Stable-Baselines3 - Contrib (SB3-Contrib)\n\nContrib package for [Stable-Baselines3](https://github.com/DLR-RM/stable-baselines3) - Experimental reinforcement learning (RL) code.\n\"sb3-contrib\" for short.\n\n### What is SB3-Contrib?\n\nA place for RL algorithms and tools that are considered experimental, e.g. implementations of the latest publications. Goal is to keep the simplicity, documentation and style of stable-baselines3 but for less matured implementations.\n\n### Why create this repository?\n\nOver the span of stable-baselines and stable-baselines3, the community has been eager to contribute in form of better logging utilities, environment wrappers, extended support (e.g. different action spaces) and learning algorithms.\n\nHowever sometimes these utilities were too niche to be considered for stable-baselines or proved to be too difficult to integrate well into the existing code without creating a mess. sb3-contrib aims to fix this by not requiring the neatest code integration with existing code and not setting limits on what is too niche: almost everything remotely useful goes!\nWe hope this allows us to provide reliable implementations following stable-baselines usual standards (consistent style, documentation, etc) beyond the relatively small scope of utilities in the main repository.\n\n\n## Features\n\nSee documentation for the full list of included features.\n\n**RL Algorithms**:\n- [Augmented Random Search (ARS)](https://arxiv.org/abs/1803.07055)\n- [Quantile Regression DQN (QR-DQN)](https://arxiv.org/abs/1710.10044)\n- [PPO with invalid action masking (MaskablePPO)](https://arxiv.org/abs/2006.14171)\n- [PPO with recurrent policy (RecurrentPPO aka PPO LSTM)](https://ppo-details.cleanrl.dev//2021/11/05/ppo-implementation-details/)\n- [Truncated Quantile Critics (TQC)](https://arxiv.org/abs/2005.04269)\n- [Trust Region Policy Optimization (TRPO)](https://arxiv.org/abs/1502.05477)\n- [Batch Normalization in Deep 
Reinforcement Learning (CrossQ)](https://openreview.net/forum?id=PczQtTsTIX)\n\n**Gym Wrappers**:\n- [Time Feature Wrapper](https://arxiv.org/abs/1712.00378)\n\n## Documentation\n\nDocumentation is available online: [https://sb3-contrib.readthedocs.io/](https://sb3-contrib.readthedocs.io/)\n\n\n## Installation\n\n**Note:** You need the `master` version of [Stable Baselines3](https://github.com/DLR-RM/stable-baselines3/).\n\nTo install Stable Baselines3 `master` version:\n```\npip install git+https://github.com/DLR-RM/stable-baselines3\n```\n\nTo install Stable Baselines3 contrib `master` version:\n```\npip install git+https://github.com/Stable-Baselines-Team/stable-baselines3-contrib\n\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "Contrib package of Stable Baselines3, experimental code.",
    "version": "2.4.0",
    "project_urls": {
        "Changelog": "https://stable-baselines3.readthedocs.io/en/master/misc/changelog.html",
        "Code": "https://github.com/Stable-Baselines-Team/stable-baselines3-contrib",
        "Documentation": "https://sb3-contrib.readthedocs.io/",
        "Homepage": "https://github.com/Stable-Baselines-Team/stable-baselines3-contrib",
        "RL-Zoo": "https://github.com/DLR-RM/rl-baselines3-zoo",
        "SBX": "https://github.com/araffin/sbx",
        "Stable-Baselines3": "https://github.com/DLR-RM/stable-baselines3"
    },
    "split_keywords": [
        "reinforcement-learning-algorithms",
        "reinforcement-learning",
        "machine-learning",
        "gym",
        "openai",
        "stable",
        "baselines",
        "toolbox",
        "python",
        "data-science"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "50de33d3b00116488de9371a56009089ab408d949cda7057e2560c8efae810df",
                "md5": "e32beab3d64a5becde8bd64c6685c8d9",
                "sha256": "725d90157028a94c69804f2e7332128518fb7cbab39e7e141d2d0355547a72ab"
            },
            "downloads": -1,
            "filename": "sb3_contrib-2.4.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "e32beab3d64a5becde8bd64c6685c8d9",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.8",
            "size": 92752,
            "upload_time": "2024-11-18T10:20:51",
            "upload_time_iso_8601": "2024-11-18T10:20:51.074603Z",
            "url": "https://files.pythonhosted.org/packages/50/de/33d3b00116488de9371a56009089ab408d949cda7057e2560c8efae810df/sb3_contrib-2.4.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "c74edd31aba603634a7b9473b502fe85b694e4392c448eae0d345640b87211c2",
                "md5": "c7fc0802a27c060538a11aa05ed116c2",
                "sha256": "a3709d97ddd529c45e3d56a5ae7e61a380bb2e11bacb9a3e6951ac0c481350ca"
            },
            "downloads": -1,
            "filename": "sb3_contrib-2.4.0.tar.gz",
            "has_sig": false,
            "md5_digest": "c7fc0802a27c060538a11aa05ed116c2",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.8",
            "size": 89358,
            "upload_time": "2024-11-18T10:20:53",
            "upload_time_iso_8601": "2024-11-18T10:20:53.214292Z",
            "url": "https://files.pythonhosted.org/packages/c7/4e/dd31aba603634a7b9473b502fe85b694e4392c448eae0d345640b87211c2/sb3_contrib-2.4.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-11-18 10:20:53",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "Stable-Baselines-Team",
    "github_project": "stable-baselines3-contrib",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "lcname": "sb3-contrib"
}
        