kaggle-environments


Namekaggle-environments JSON
Version 1.14.5 PyPI version JSON
download
home_pagehttps://github.com/Kaggle/kaggle-environments
SummaryKaggle Environments
upload_time2024-05-13 23:33:13
maintainerNone
docs_urlNone
authorKaggle
requires_python>=3.8
licenseApache 2.0
keywords kaggle
VCS
bugtrack_url
requirements flask gym ipython jsonschema numpy requests
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # [<img src="https://kaggle.com/static/images/site-logo.png" height="50" style="margin-bottom:-15px" />](https://kaggle.com) Environments

```bash
pip install kaggle-environments
```

# TLDR;

```python
from kaggle_environments import make

# Setup a tictactoe environment.
env = make("tictactoe")

# Basic agent which marks the first available cell.
def my_agent(obs):
  return [c for c in range(len(obs.board)) if obs.board[c] == 0][0]

# Run the basic agent against a default agent which chooses a "random" move.
env.run([my_agent, "random"])

# Render an html ipython replay of the tictactoe game.
env.render(mode="ipython")
```

# Overview

Kaggle Environments was created to evaluate episodes. While other libraries have set interface precedents (such as Open.ai Gym), the emphasis of this library focuses on:

1. Episode evaluation (compared to training agents).
2. Configurable environment/agent lifecycles.
3. Simplified agent and environment creation.
4. Cross language compatible/transpilable syntax/interfaces.

## Help Documentation

```python
# Additional documentation (especially interfaces) can be found on all public functions:
from kaggle_environments import make
help(make)
env = make("tictactoe")
dir(env)
help(env.reset)
```

# Agents

> A function which given an observation generates an action.

## Writing

Agent functions can have observation and configuration parameters and must return a valid action. Details about the observation, configuration, and actions can seen by viewing the specification.

```python
from kaggle_environments import make
env = make("connectx", {"rows": 10, "columns": 8, "inarow": 5})

def agent(observation, configuration):
  print(observation) # {board: [...], mark: 1}
  print(configuration) # {rows: 10, columns: 8, inarow: 5}
  return 3 # Action: always place a mark in the 3rd column.

# Run an episode using the agent above vs the default random agent.
env.run([agent, "random"])

# Print schemas from the specification.
print(env.specification.observation)
print(env.specification.configuration)
print(env.specification.action)
```

## Loading Agents

Agents are always functions, however there are some shorthand syntax options to make generating/using them easier.

```python
# Agent def accepting an observation and returning an action.
def agent1(obs):
  return [c for c in range(len(obs.board)) if obs.board[c] == 0][0]

# Load a default agent called "random".
agent2 = "random"

# Load an agent from source.
agent3 = """
def act(obs):
  return [c for c in range(len(obs.board)) if obs.board[c] == 0][0]
"""

# Load an agent from a file.
agent4 = "C:\path\file.py"

# Return a fixed action.
agent5 = 3

# Return an action from a url.
agent6 = "http://localhost:8000/run/agent"
```

## Default Agents

Most environments contain default agents to play against. To see the list of available agents for a specific environment run:

```python
from kaggle_environments import make
env = make("tictactoe")

# The list of available default agents.
print(*env.agents)

# Run random agent vs reaction agent.
env.run(["random", "reaction"])
```

## Training

Open AI Gym interface is used to assist with training agents. The `None` keyword is used below to denote which agent to train (i.e. train as first or second player of connectx).

```python
from kaggle_environments import make

env = make("connectx", debug=True)

# Training agent in first position (player 1) against the default random agent.
trainer = env.train([None, "random"])

obs = trainer.reset()
for _ in range(100):
    env.render()
    action = 0 # Action for the agent being trained.
    obs, reward, done, info = trainer.step(action)
    if done:
        obs = trainer.reset()
```

## Debugging

There are 3 types of errors which can occur from agent execution:

1. **Timeout** - the agent runtime exceeded the allowed limit. There are 2 timeouts:
   1. `agentTimeout` - Used for initialization of an agent on first "act".
   2. `actTimeout` - Used for obtaining an action.
2. **Error** - the agent raised and error during execution.
3. **Invalid** - the agent action response didn't match the action specification or the environment deemed it invalid (i.e. playing twice in the same cell in tictactoe).

To help debug your agent and why it threw the errors above, add the `debug` flag when setting up the environment.

```python
from kaggle_environments import make

def agent():
  return "Something Bad"

env = make("tictactoe", debug=True)

env.run([agent, "random"])
# Prints: "Invalid Action: Something Bad"
```

# Environments

> A function which given a state and agent actions generates a new state.

| Name      | Description                          | Make                      |
| --------- | ------------------------------------ | ------------------------- |
| connectx  | Connect 4 in a row but configurable. | `env = make("connectx")`  |
| tictactoe | Classic Tic Tac Toe                  | `env = make("tictactoe")` |
| identity  | For debugging, action is the reward. | `env = make("identity")`  |

## Making

An environment instance can be made from an existing specification (such as those listed above).

```python
from kaggle_environments import make

# Create an environment instance.
env = make(
  # Specification or name to registered specification.
  "connectx",

  # Override default and environment configuration.
  configuration={"rows": 9, "columns": 10},

  # Initialize the environment from a prior state (episode resume).
  steps=[],

  # Enable verbose logging.
  debug=True
)
```

## Configuration

There are two types of configuration: Defaults applying to every environment and those specific to the environment. The following is a list of the default configuration:

| Name         | Description                                                     |
| ------------ | --------------------------------------------------------------- |
| episodeSteps | Maximum number of steps in the episode.                         |
| agentTimeout | Maximum runtime (seconds) to initialize an agent.               |
| actTimeout   | Maximum runtime (seconds) to obtain an action from an agent.    |
| runTimeout   | Maximum runtime (seconds) of an episode (not necessarily DONE). |
| maxLogLength | Maximum log length (number of characters, `None` -> no limit)   |

```python
env = make("connectx", configuration={
  "columns": 19, # Specific to ConnectX.
  "actTimeout": 10,
})
```

## Resetting

Environments are reset by default after "make" (unless starting steps are passed in) as well as when calling "run". Reset can be called at anytime to clear the environment.

```python
num_agents = 2
reset_state = env.reset(num_agents)
```

## Running

Execute an episode against the environment using the passed in agents until they are no longer running (i.e. status != ACTIVE).

```python
steps = env.run([agent1, agent2])
print(steps)
```

## Evaluating

Evaluation is used to run an episode (environment + agents) multiple times and just return the rewards.

```python
from kaggle_environments import evaluate

# Same definitions as "make" above.
environment = "connectx"
configuration = {"rows": 10, "columns": 8, "inarow": 5}
steps = []

# Which agents to run repeatedly.  Same as env.run(agents)
agents = ["random", agent1]

# How many times to run them.
num_episodes = 10

rewards = evaluate(environment, agents, configuration, steps, num_episodes)
```

## Stepping

Running above essentially just steps until no agent is still active. To execute a singular game loop, pass in actions directly for each agent. Note that this is normally used for training agents (most useful in a single agent setup such as using the gym interface).

```python
agent1_action = agent1(env.state[0].observation)
agent2_action = agent2(env.state[1].observation)
state = env.step([agent1_action, agent2_action])
```

## Playing

A few environments offer an interactive play against agents within jupyter notebooks. An example of this is using connectx:

```python
from kaggle_environments import make

env = make("connectx")
# None indicates which agent will be manually played.
env.play([None, "random"])
```

## Rendering

The following rendering modes are supported:

- json - Same as doing a json dump of `env.toJSON()`
- ansi - Ascii character representation of the environment.
- human - ansi just printed to stdout
- html - HTML player representation of the environment.
- ipython - html just printed to the output of a ipython notebook.

```python
out = env.render(mode="ansi")
print(out)
```

# Command Line

```sh
> python main.py -h
```

## List Registered Environments

```sh
> python main.py list
```

## Evaluate Episode Rewards

```sh
python main.py evaluate --environment tictactoe --agents random random --episodes 10
```

## Run an Episode

```sh
> python main.py run --environment tictactoe --agents random /pathtomy/agent.py --debug True
```

## Load an Episode

This is useful when converting an episode json output into html.

```sh
python main.py load --environment tictactoe --steps [...] --render '{"mode": "html"}'
```

# HTTP Server

The HTTP server contains the same interface/actions as the CLI above merging both POST body and GET params.

## Setup

```bash
python main.py http-server --port=8012 --host=0.0.0.0
```

### Running Agents on Separate Servers

```python
# How to run agent on a separate server.
import requests
import json

path_to_agent1 = "/home/ajeffries/git/playground/agent1.py"
path_to_agent2 = "/home/ajeffries/git/playground/agent2.py"

agent1_url = f"http://localhost:5001?agents[]={path_to_agent1}"
agent2_url = f"http://localhost:5002?agents[]={path_to_agent2}"

body = {
    "action": "run",
    "environment": "tictactoe",
    "agents": [agent1_url, agent2_url]
}
resp = requests.post(url="http://localhost:5000", data=json.dumps(body)).json()

# Inflate the response replay to visualize.
from kaggle_environments import make
env = make("tictactoe", steps=resp["steps"], debug=True)
env.render(mode="ipython")
print(resp)
```

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/Kaggle/kaggle-environments",
    "name": "kaggle-environments",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.8",
    "maintainer_email": null,
    "keywords": "Kaggle",
    "author": "Kaggle",
    "author_email": "support@kaggle.com",
    "download_url": "https://files.pythonhosted.org/packages/4d/dc/adda775ca744f310ef3423981751f9f8f75ea45ae3bf7df3346dbdc4f476/kaggle-environments-1.14.5.tar.gz",
    "platform": null,
    "description": "# [<img src=\"https://kaggle.com/static/images/site-logo.png\" height=\"50\" style=\"margin-bottom:-15px\" />](https://kaggle.com) Environments\n\n```bash\npip install kaggle-environments\n```\n\n# TLDR;\n\n```python\nfrom kaggle_environments import make\n\n# Setup a tictactoe environment.\nenv = make(\"tictactoe\")\n\n# Basic agent which marks the first available cell.\ndef my_agent(obs):\n  return [c for c in range(len(obs.board)) if obs.board[c] == 0][0]\n\n# Run the basic agent against a default agent which chooses a \"random\" move.\nenv.run([my_agent, \"random\"])\n\n# Render an html ipython replay of the tictactoe game.\nenv.render(mode=\"ipython\")\n```\n\n# Overview\n\nKaggle Environments was created to evaluate episodes. While other libraries have set interface precedents (such as Open.ai Gym), the emphasis of this library focuses on:\n\n1. Episode evaluation (compared to training agents).\n2. Configurable environment/agent lifecycles.\n3. Simplified agent and environment creation.\n4. Cross language compatible/transpilable syntax/interfaces.\n\n## Help Documentation\n\n```python\n# Additional documentation (especially interfaces) can be found on all public functions:\nfrom kaggle_environments import make\nhelp(make)\nenv = make(\"tictactoe\")\ndir(env)\nhelp(env.reset)\n```\n\n# Agents\n\n> A function which given an observation generates an action.\n\n## Writing\n\nAgent functions can have observation and configuration parameters and must return a valid action. Details about the observation, configuration, and actions can seen by viewing the specification.\n\n```python\nfrom kaggle_environments import make\nenv = make(\"connectx\", {\"rows\": 10, \"columns\": 8, \"inarow\": 5})\n\ndef agent(observation, configuration):\n  print(observation) # {board: [...], mark: 1}\n  print(configuration) # {rows: 10, columns: 8, inarow: 5}\n  return 3 # Action: always place a mark in the 3rd column.\n\n# Run an episode using the agent above vs the default random agent.\nenv.run([agent, \"random\"])\n\n# Print schemas from the specification.\nprint(env.specification.observation)\nprint(env.specification.configuration)\nprint(env.specification.action)\n```\n\n## Loading Agents\n\nAgents are always functions, however there are some shorthand syntax options to make generating/using them easier.\n\n```python\n# Agent def accepting an observation and returning an action.\ndef agent1(obs):\n  return [c for c in range(len(obs.board)) if obs.board[c] == 0][0]\n\n# Load a default agent called \"random\".\nagent2 = \"random\"\n\n# Load an agent from source.\nagent3 = \"\"\"\ndef act(obs):\n  return [c for c in range(len(obs.board)) if obs.board[c] == 0][0]\n\"\"\"\n\n# Load an agent from a file.\nagent4 = \"C:\\path\\file.py\"\n\n# Return a fixed action.\nagent5 = 3\n\n# Return an action from a url.\nagent6 = \"http://localhost:8000/run/agent\"\n```\n\n## Default Agents\n\nMost environments contain default agents to play against. To see the list of available agents for a specific environment run:\n\n```python\nfrom kaggle_environments import make\nenv = make(\"tictactoe\")\n\n# The list of available default agents.\nprint(*env.agents)\n\n# Run random agent vs reaction agent.\nenv.run([\"random\", \"reaction\"])\n```\n\n## Training\n\nOpen AI Gym interface is used to assist with training agents. The `None` keyword is used below to denote which agent to train (i.e. train as first or second player of connectx).\n\n```python\nfrom kaggle_environments import make\n\nenv = make(\"connectx\", debug=True)\n\n# Training agent in first position (player 1) against the default random agent.\ntrainer = env.train([None, \"random\"])\n\nobs = trainer.reset()\nfor _ in range(100):\n    env.render()\n    action = 0 # Action for the agent being trained.\n    obs, reward, done, info = trainer.step(action)\n    if done:\n        obs = trainer.reset()\n```\n\n## Debugging\n\nThere are 3 types of errors which can occur from agent execution:\n\n1. **Timeout** - the agent runtime exceeded the allowed limit. There are 2 timeouts:\n   1. `agentTimeout` - Used for initialization of an agent on first \"act\".\n   2. `actTimeout` - Used for obtaining an action.\n2. **Error** - the agent raised and error during execution.\n3. **Invalid** - the agent action response didn't match the action specification or the environment deemed it invalid (i.e. playing twice in the same cell in tictactoe).\n\nTo help debug your agent and why it threw the errors above, add the `debug` flag when setting up the environment.\n\n```python\nfrom kaggle_environments import make\n\ndef agent():\n  return \"Something Bad\"\n\nenv = make(\"tictactoe\", debug=True)\n\nenv.run([agent, \"random\"])\n# Prints: \"Invalid Action: Something Bad\"\n```\n\n# Environments\n\n> A function which given a state and agent actions generates a new state.\n\n| Name      | Description                          | Make                      |\n| --------- | ------------------------------------ | ------------------------- |\n| connectx  | Connect 4 in a row but configurable. | `env = make(\"connectx\")`  |\n| tictactoe | Classic Tic Tac Toe                  | `env = make(\"tictactoe\")` |\n| identity  | For debugging, action is the reward. | `env = make(\"identity\")`  |\n\n## Making\n\nAn environment instance can be made from an existing specification (such as those listed above).\n\n```python\nfrom kaggle_environments import make\n\n# Create an environment instance.\nenv = make(\n  # Specification or name to registered specification.\n  \"connectx\",\n\n  # Override default and environment configuration.\n  configuration={\"rows\": 9, \"columns\": 10},\n\n  # Initialize the environment from a prior state (episode resume).\n  steps=[],\n\n  # Enable verbose logging.\n  debug=True\n)\n```\n\n## Configuration\n\nThere are two types of configuration: Defaults applying to every environment and those specific to the environment. The following is a list of the default configuration:\n\n| Name         | Description                                                     |\n| ------------ | --------------------------------------------------------------- |\n| episodeSteps | Maximum number of steps in the episode.                         |\n| agentTimeout | Maximum runtime (seconds) to initialize an agent.               |\n| actTimeout   | Maximum runtime (seconds) to obtain an action from an agent.    |\n| runTimeout   | Maximum runtime (seconds) of an episode (not necessarily DONE). |\n| maxLogLength | Maximum log length (number of characters, `None` -> no limit)   |\n\n```python\nenv = make(\"connectx\", configuration={\n  \"columns\": 19, # Specific to ConnectX.\n  \"actTimeout\": 10,\n})\n```\n\n## Resetting\n\nEnvironments are reset by default after \"make\" (unless starting steps are passed in) as well as when calling \"run\". Reset can be called at anytime to clear the environment.\n\n```python\nnum_agents = 2\nreset_state = env.reset(num_agents)\n```\n\n## Running\n\nExecute an episode against the environment using the passed in agents until they are no longer running (i.e. status != ACTIVE).\n\n```python\nsteps = env.run([agent1, agent2])\nprint(steps)\n```\n\n## Evaluating\n\nEvaluation is used to run an episode (environment + agents) multiple times and just return the rewards.\n\n```python\nfrom kaggle_environments import evaluate\n\n# Same definitions as \"make\" above.\nenvironment = \"connectx\"\nconfiguration = {\"rows\": 10, \"columns\": 8, \"inarow\": 5}\nsteps = []\n\n# Which agents to run repeatedly.  Same as env.run(agents)\nagents = [\"random\", agent1]\n\n# How many times to run them.\nnum_episodes = 10\n\nrewards = evaluate(environment, agents, configuration, steps, num_episodes)\n```\n\n## Stepping\n\nRunning above essentially just steps until no agent is still active. To execute a singular game loop, pass in actions directly for each agent. Note that this is normally used for training agents (most useful in a single agent setup such as using the gym interface).\n\n```python\nagent1_action = agent1(env.state[0].observation)\nagent2_action = agent2(env.state[1].observation)\nstate = env.step([agent1_action, agent2_action])\n```\n\n## Playing\n\nA few environments offer an interactive play against agents within jupyter notebooks. An example of this is using connectx:\n\n```python\nfrom kaggle_environments import make\n\nenv = make(\"connectx\")\n# None indicates which agent will be manually played.\nenv.play([None, \"random\"])\n```\n\n## Rendering\n\nThe following rendering modes are supported:\n\n- json - Same as doing a json dump of `env.toJSON()`\n- ansi - Ascii character representation of the environment.\n- human - ansi just printed to stdout\n- html - HTML player representation of the environment.\n- ipython - html just printed to the output of a ipython notebook.\n\n```python\nout = env.render(mode=\"ansi\")\nprint(out)\n```\n\n# Command Line\n\n```sh\n> python main.py -h\n```\n\n## List Registered Environments\n\n```sh\n> python main.py list\n```\n\n## Evaluate Episode Rewards\n\n```sh\npython main.py evaluate --environment tictactoe --agents random random --episodes 10\n```\n\n## Run an Episode\n\n```sh\n> python main.py run --environment tictactoe --agents random /pathtomy/agent.py --debug True\n```\n\n## Load an Episode\n\nThis is useful when converting an episode json output into html.\n\n```sh\npython main.py load --environment tictactoe --steps [...] --render '{\"mode\": \"html\"}'\n```\n\n# HTTP Server\n\nThe HTTP server contains the same interface/actions as the CLI above merging both POST body and GET params.\n\n## Setup\n\n```bash\npython main.py http-server --port=8012 --host=0.0.0.0\n```\n\n### Running Agents on Separate Servers\n\n```python\n# How to run agent on a separate server.\nimport requests\nimport json\n\npath_to_agent1 = \"/home/ajeffries/git/playground/agent1.py\"\npath_to_agent2 = \"/home/ajeffries/git/playground/agent2.py\"\n\nagent1_url = f\"http://localhost:5001?agents[]={path_to_agent1}\"\nagent2_url = f\"http://localhost:5002?agents[]={path_to_agent2}\"\n\nbody = {\n    \"action\": \"run\",\n    \"environment\": \"tictactoe\",\n    \"agents\": [agent1_url, agent2_url]\n}\nresp = requests.post(url=\"http://localhost:5000\", data=json.dumps(body)).json()\n\n# Inflate the response replay to visualize.\nfrom kaggle_environments import make\nenv = make(\"tictactoe\", steps=resp[\"steps\"], debug=True)\nenv.render(mode=\"ipython\")\nprint(resp)\n```\n",
    "bugtrack_url": null,
    "license": "Apache 2.0",
    "summary": "Kaggle Environments",
    "version": "1.14.5",
    "project_urls": {
        "Homepage": "https://github.com/Kaggle/kaggle-environments"
    },
    "split_keywords": [
        "kaggle"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "ffcd54d2dc4e25ab48f33d570e2af93cc643f94afcc101d9314d087f9edc6079",
                "md5": "605b15f01845825eb2c4f0cb6d3b560e",
                "sha256": "0990c17be89cf04e3e60b7127cde069cd97705ef39adfa8d13b3852f52ebd760"
            },
            "downloads": -1,
            "filename": "kaggle_environments-1.14.5-py2.py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "605b15f01845825eb2c4f0cb6d3b560e",
            "packagetype": "bdist_wheel",
            "python_version": "py2.py3",
            "requires_python": ">=3.8",
            "size": 1316383,
            "upload_time": "2024-05-13T23:33:09",
            "upload_time_iso_8601": "2024-05-13T23:33:09.746291Z",
            "url": "https://files.pythonhosted.org/packages/ff/cd/54d2dc4e25ab48f33d570e2af93cc643f94afcc101d9314d087f9edc6079/kaggle_environments-1.14.5-py2.py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "4ddcadda775ca744f310ef3423981751f9f8f75ea45ae3bf7df3346dbdc4f476",
                "md5": "509760532205dd91a24b32717ad9267e",
                "sha256": "77a6ba67fbd0a6fb0b6ebdd4ad619e66ea10398d9ec2f764205384b3816ab296"
            },
            "downloads": -1,
            "filename": "kaggle-environments-1.14.5.tar.gz",
            "has_sig": false,
            "md5_digest": "509760532205dd91a24b32717ad9267e",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.8",
            "size": 1256687,
            "upload_time": "2024-05-13T23:33:13",
            "upload_time_iso_8601": "2024-05-13T23:33:13.063238Z",
            "url": "https://files.pythonhosted.org/packages/4d/dc/adda775ca744f310ef3423981751f9f8f75ea45ae3bf7df3346dbdc4f476/kaggle-environments-1.14.5.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-05-13 23:33:13",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "Kaggle",
    "github_project": "kaggle-environments",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "requirements": [
        {
            "name": "flask",
            "specs": []
        },
        {
            "name": "gym",
            "specs": []
        },
        {
            "name": "ipython",
            "specs": []
        },
        {
            "name": "jsonschema",
            "specs": []
        },
        {
            "name": "numpy",
            "specs": []
        },
        {
            "name": "requests",
            "specs": []
        }
    ],
    "lcname": "kaggle-environments"
}
        
Elapsed time: 0.27564s