| Field | Value |
| --- | --- |
| Name | mlos-bench |
| Version | 0.6.1 |
| Summary | MLOS Bench Python interface for benchmark automation and optimization. |
| home_page | None |
| author | None |
| maintainer | None |
| docs_url | None |
| upload_time | 2024-08-16 18:15:52 |
| requires_python | >=3.8 |
| license | MIT |
| keywords | autotuning, benchmarking, optimization, systems |
| requirements | No requirements were recorded. |
# mlos-bench
This [directory](https://github.com/microsoft/MLOS/tree/main/mlos_bench/./) contains the code for the `mlos-bench` experiment runner package.
It makes use of the [`mlos-core`](https://github.com/microsoft/MLOS/tree/main/mlos_bench/../mlos_core/) package for its optimizer.
It's available for `pip install` via the pypi repository at [mlos-bench](https://pypi.org/project/mlos-bench/).
## Table of Contents
<!-- markdownlint-disable MD007 -->
<!-- TOC -->
- [mlos-bench](#mlos-bench)
  - [Table of Contents](#table-of-contents)
  - [Description](#description)
  - [Features](#features)
  - [Quickstart](#quickstart)
    - [Install and activate the conda environment](#install-and-activate-the-conda-environment)
    - [Make sure that you have Azure CLI tool installed and working](#make-sure-that-you-have-azure-cli-tool-installed-and-working)
    - [Generate access tokens to interact with Azure resources](#generate-access-tokens-to-interact-with-azure-resources)
    - [Create a JSON config with DB credentials Optional](#create-a-json-config-with-db-credentials-optional)
    - [Create a top-level configuration file for your MLOS setup](#create-a-top-level-configuration-file-for-your-mlos-setup)
    - [Create another config file for the parameters specific to your experiment](#create-another-config-file-for-the-parameters-specific-to-your-experiment)
      - [Importance of the Experiment ID config](#importance-of-the-experiment-id-config)
    - [Run the benchmark](#run-the-benchmark)
  - [Optimization](#optimization)
    - [Resuming interrupted experiments](#resuming-interrupted-experiments)
  - [Analyzing Results](#analyzing-results)
<!-- /TOC -->
<!-- markdownlint-enable MD007 -->
## Description
`mlos-bench` is an end-to-end benchmarking service that can be independently launched for experimentation but is also integrated with `mlos-core` as its optimizer for OS tuning.
Given a user-provided VM configuration, `mlos-bench` provisions a configured environment and remotely executes benchmarks on the cloud.
Experiment results (benchmark results & telemetry) are stored and fed into the `mlos-core` optimization loop, which evaluates the proposed configuration parameters and suggests new ones to try.
## Features
With a [JSON5](https://spec.json5.org) [config file](https://github.com/microsoft/MLOS/tree/main/mlos_bench/./mlos_bench/config/) and command line parameters as input, `mlos-bench` streamlines workload performance measurement by automating the following benchmarking steps:
1. Set up & clean up benchmark and application configuration
   - **Ease of use:** `mlos-bench` abstracts away the controls for managing VMs in Azure, e.g., setup, teardown, stop, deprovision, and reboot. It gives visibility into VM status through the Azure Portal, ensuring that a VM is provisioned & running before commands are issued to it.
   - **Versatility:** `mlos-bench` provides a common interface to control a collection of environments (application, OS, VM), regardless of where or on which cloud they run. This allows changes to propagate easily to all environment layers when a new set of kernel parameters is applied.
   - **Efficiency:** When adapting an environment to new parameters, `mlos-bench` keeps re-configuration costs low during optimization. For example, not all OS kernel parameter adjustments require a full reboot; some can be changed at run-time.
2. Run benchmarks in the provisioned environment & standardize results for the optimizer
   - Through an Azure File Share, access Docker scripts to run benchmarks & store results as input for optimization. For example, execute a Redis benchmark uploaded to the file share by running a benchmark Docker container with the specified parameters. The file share is mounted to the VMs via remote execution, rather than ARM templates.
   - **Configurable:** Specify a Python script in the initial config to post-process & standardize benchmark results. An example post-processing script for Redis benchmarks is included.
   - **Local & remote benchmark execution:** Benchmarks can be run both locally in Hyper-V and remotely on Azure. Local execution allows better accuracy, while Azure runs are required to estimate the benchmark noise and understand the VM behavior when using cloud storage.
   - **Cloud agnostic:** `mlos-bench` can remotely execute benchmarks on clouds other than Azure, e.g., controls for EC2 instances and the ability to provision environments on AWS with Terraform.
   - **Persistence:** Storage integration is available to persist experiment parameters and track results for re-use, either for analysis during & after trials, or for warm-starting future experiments.
## Quickstart
To get started, we can adapt an example configuration to test out running `mlos-bench`.
For these instructions, we will be using Azure for our resources.
### 1. Install and activate the conda environment
From here onwards we assume we are in the project root directory.
Ensure you have a conda environment (`mlos`) set up for executing `mlos_bench`.
Create and activate the environment with:
```sh
conda env create -f conda-envs/mlos.yml
conda activate mlos
```
> Note: if you are running inside the devcontainer, this should be done automatically.
### 2. Make sure that you have Azure CLI tool installed and working
> Installation instructions for `az` (Azure CLI) [can be found here](https://docs.microsoft.com/en-us/cli/azure/install-azure-cli).
>
> Note: `az` comes preinstalled inside the devcontainer.
If necessary, log in to Azure and set your default subscription:
```shell
# If using az cli for the first time, a login will be required:
az login
# Make sure to set your default subscription, RG, and Storage Account for these experiments.
# For instance:
az account set --subscription "My Subscription Name"
az config set defaults.group=MyRG --local
az config set storage.account=MyStorageAccount --local
az account set --subscription "..."
```
### 3. Generate access tokens to interact with Azure resources
A script at `./scripts/generate-azure-credentials-config` produces a JSON config snippet with necessary Azure credentials.
```shell
./scripts/generate-azure-credentials-config > ./global_config_azure.jsonc
```
The data produced in the `global_config_azure.jsonc` file is in a format that can be used directly by our framework:
```json
{
  "subscription": "some-guid",
  "tenant": "some-other-guid",
  "storageAccountKey": "some-base-64-encoded-key"
}
```
> Note: On Linux, this script also requires `jq` to be installed (it comes preinstalled in the devcontainer).
### 4. Create a JSON config with DB credentials (Optional)
If you plan to store the information about experiments and benchmarks in a (remote) database like PostgreSQL or MySQL, create a JSON/JSONC file with the DB hostname and password.
See the [`mysql.jsonc`](https://github.com/microsoft/MLOS/tree/main/mlos_bench/./mlos_bench/config/storage/mysql.jsonc) or [`postgresql.jsonc`](https://github.com/microsoft/MLOS/tree/main/mlos_bench/./mlos_bench/config/storage/postgresql.jsonc) configuration files for examples with a more complete list of DB parameters supported by the underlying [SqlAlchemy](https://www.sqlalchemy.org/library.html#reference) library.
Save your config in a `./global_config_storage.jsonc` file.
It should look like this:
```json
{
  "host": "mysql-db.mysql.database.azure.com",
  "password": "database_password"
}
```
Any parameter that is not specified in `./global_config_storage.jsonc` will be taken from the corresponding DB's config file, e.g., [`postgresql.jsonc`](https://github.com/microsoft/MLOS/tree/main/mlos_bench/./mlos_bench/config/storage/postgresql.jsonc).
For databases like SQLite or DuckDB, there is no need for an additional config file.
The data will be stored locally in a file, e.g., `./mlos_bench.duckdb`.
See [`sqlite.jsonc`](https://github.com/microsoft/MLOS/tree/main/mlos_bench/./mlos_bench/config/storage/sqlite.jsonc) or [`duckdb.jsonc`](https://github.com/microsoft/MLOS/tree/main/mlos_bench/./mlos_bench/config/storage/duckdb.jsonc) for more details.
> Note: if no storage is specified, a basic sqlite config will be used by default.
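For reference, a local storage config of this kind is quite small. Below is a rough, hypothetical sketch of what one might look like; the `class` path and field names here are illustrative assumptions, so consult the linked `sqlite.jsonc` and `duckdb.jsonc` files for the actual schema.

```jsonc
// Hypothetical sketch of a local SQLite storage config.
// The class path and field names are assumptions; see sqlite.jsonc / duckdb.jsonc
// in the repository for the real ones.
{
  "class": "mlos_bench.storage.sql.storage.SqlStorage",
  "config": {
    "drivername": "sqlite",          // SqlAlchemy driver to use
    "database": "mlos_bench.sqlite"  // local file to store the results in
  }
}
```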
### 5. Create a top-level configuration file for your MLOS setup
We provide a few examples of such files in [`./mlos_bench/config/cli/`](https://github.com/microsoft/MLOS/tree/main/mlos_bench/./mlos_bench/config/cli).
For example, [`azure-redis-opt.jsonc`](https://github.com/microsoft/MLOS/tree/main/mlos_bench/./mlos_bench/config/cli/azure-redis-opt.jsonc) is a configuration for optimizing a Redis VM on Azure and saving the results in a local SQLite database.
Likewise, [`azure-redis-bench.jsonc`](https://github.com/microsoft/MLOS/tree/main/mlos_bench/./mlos_bench/config/cli/azure-redis-bench.jsonc) is a setup to run a single Redis benchmark (and, again, save the results in SQLite).
CLI configs like those are meant to connect several MLOS components together, namely:
- Benchmarking environment (configured in [`environments/root/root-azure-redis.jsonc`](https://github.com/microsoft/MLOS/tree/main/mlos_bench/./mlos_bench/config/environments/root/root-azure-redis.jsonc));
- Optimization engine ([`optimizers/mlos_core_flaml.jsonc`](https://github.com/microsoft/MLOS/tree/main/mlos_bench/./mlos_bench/config/optimizers/mlos_core_flaml.jsonc));
- Storage for experimental data ([`storage/sqlite.jsonc`](https://github.com/microsoft/MLOS/tree/main/mlos_bench/./mlos_bench/config/storage/sqlite.jsonc));
They also refer to other configs, e.g.
- Reusable config snippets in `"config_path"` section, and
- Additional config files containing sensitive data like DB passwords and Azure credentials.
> Make sure that the files `./global_config_azure.jsonc` and `./global_config_storage.jsonc` you created in the previous steps are included in the `"globals"` section of your CLI config.
For the purpose of this tutorial, we will assume that we reuse the existing [`azure-redis-bench.jsonc`](https://github.com/microsoft/MLOS/tree/main/mlos_bench/./mlos_bench/config/cli/azure-redis-bench.jsonc) and [`azure-redis-opt.jsonc`](https://github.com/microsoft/MLOS/tree/main/mlos_bench/./mlos_bench/config/cli/azure-redis-opt.jsonc) configurations without any changes.
In a more realistic scenario, however, you might need to change and/or create new config files for some parts of your benchmarking environment.
We'll give more details on that below.
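To make the structure more concrete, here is a minimal, hypothetical sketch of such a CLI config. It only uses the sections mentioned above (`"config_path"`, `"globals"`, `"teardown"`) plus references to the environment, optimizer, and storage configs; treat the linked `azure-redis-bench.jsonc` and `azure-redis-opt.jsonc` files as authoritative for the exact key names and values.

```jsonc
// Illustrative sketch only; see azure-redis-bench.jsonc / azure-redis-opt.jsonc
// for the actual key names and values.
{
  // Paths to search for other (relative) config files:
  "config_path": [
    "mlos_bench/config"
  ],
  // Root benchmarking environment to provision and run:
  "environment": "environments/root/root-azure-redis.jsonc",
  // Optional optimizer config (omit it to run a single benchmark):
  "optimizer": "optimizers/mlos_core_flaml.jsonc",
  // Where to persist experiment results:
  "storage": "storage/sqlite.jsonc",
  // Global configs with credentials and experiment-specific parameters:
  "globals": [
    "global_config_azure.jsonc",
    "global_config_storage.jsonc"
  ],
  // Keep the environment running after the trials finish:
  "teardown": false
}
```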
### 6. Create another config file for the parameters specific to your experiment
Copy one of our examples, e.g., [`experiment_RedisBench.jsonc`](https://github.com/microsoft/MLOS/tree/main/mlos_bench/./mlos_bench/config/experiments/experiment_RedisBench.jsonc) and name it after your experiment, e.g. `experiment_MyBenchmark.jsonc`.
In that file, you can specify any parameters that occur in your other configs, namely in `"const_args"` section of the Environment configs, or in `"config"` sections of your Service, Storage, or Optimizer configurations.
> A general rule is that the parameters from the global configs like `./global_config_azure.jsonc` or `experiment_MyBenchmark.jsonc` override the corresponding parameters in other configurations.
> That allows us to propagate the values of the parameters that are specific to the experiment into other components of the framework and keep the majority of the config files in our library immutable and reusable.
#### Importance of the Experiment ID config
An important part of this file is the value of `experiment_id` which controls the storage and retrieval of trial data.
Should the experiment be interrupted, the `experiment_id` will be used to resume the experiment from the last completed trial, reloading the optimizer with the data from previously completed trials.
As such, this value should be unique for each experiment.
Be sure to change it whenever *"incompatible"* changes are made to the experiment configuration or scripts.
Unfortunately, determining what constitutes an *"incompatible"* change for any given system is not always possible, so `mlos_bench` largely leaves this up to the user.
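Putting this together, a minimal experiment config might look like the following sketch. Only `experiment_id` reflects what the README requires; the other parameter names are purely illustrative placeholders for values that would appear in the `"const_args"` or `"config"` sections of your own configs.

```jsonc
// experiment_MyBenchmark.jsonc (illustrative sketch; parameter names other than
// experiment_id are hypothetical and depend on your environment configs).
{
  // Unique ID used to store, retrieve, and resume trial data for this experiment:
  "experiment_id": "MyBenchmark",
  // Example overrides for parameters defined elsewhere (hypothetical names):
  "vmSize": "Standard_B2s",
  "location": "westus2"
}
```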
### 7. Run the benchmark
Now we can run our configuration with `mlos_bench`:
```shell
mlos_bench --config "./mlos_bench/mlos_bench/config/cli/azure-redis-bench.jsonc" --globals "experiment_MyBenchmark.jsonc"
```
This should run a single trial with the given tunable values (loaded from one or more files listed in the `"tunable_values"` section), write the results to the log, and keep the environment running (as directed by the `"teardown": false` configuration parameter in the CLI config).
Note that using the `--globals` command line option is the same as adding `experiment_MyBenchmark.jsonc` to the `"globals"` section of the CLI config.
The same applies to all other CLI parameters, e.g., you can change the log level by adding `--log_level INFO` to the command line.
Also, note that you don't have to provide the full path to the `experiment_MyBenchmark.jsonc` file: the application will look for it in the paths specified in the `"config_path"` section of the CLI config.
## Optimization
Searching for an optimal set of tunable parameters is very similar to running a single benchmark.
All we have to do is specify the [`Optimizer`](https://github.com/microsoft/MLOS/tree/main/mlos_bench/./mlos_bench/optimizers/) in the top-level configuration, as in our [`azure-redis-opt.jsonc`](https://github.com/microsoft/MLOS/tree/main/mlos_bench/./mlos_bench/config/cli/azure-redis-opt.jsonc) example.
```sh
mlos_bench --config "./mlos_bench/mlos_bench/config/cli/azure-redis-opt.jsonc" --globals "experiment_MyBenchmark.jsonc" --max_suggestions 10 --trial-config-repeat-count 3
```
Note that we again use the `--max_suggestions` command line option, this time to override the maximum number of configurations the optimizer will suggest, as set in [`mlos_core_flaml.jsonc`](https://github.com/microsoft/MLOS/tree/main/mlos_bench/./mlos_bench/config/optimizers/mlos_core_flaml.jsonc).
We also use `--trial-config-repeat-count` to benchmark each suggested configuration 3 times.
That means we will run 30 trials in total: 3 for each of the 10 suggested configurations.
We don't have to specify the `"tunable_values"` for the optimization: the optimizer will suggest new values on each iteration and the framework will feed this data into the benchmarking environment.
### Resuming interrupted experiments
Experiments sometimes get interrupted, e.g., due to errors in automation scripts or other failures in the system.
<!-- TODO: Document retry logic once implemented: #523 -->
To resume an interrupted experiment, simply run the same command as before.
As mentioned above in the [importance of the `experiment_id` config](#importance-of-the-experiment-id-config) section, the `experiment_id` is used to resume interrupted experiments, reloading prior trial data for that `experiment_id`.
## Analyzing Results
The results of the experiment are stored in the database as specified in experiment configs (see above).
After running the experiment, you can use the [`mlos-viz`](https://github.com/microsoft/MLOS/tree/main/mlos_bench/../mlos_viz/) package to analyze the results in a Jupyter notebook, for instance.
See the [`sqlite-autotuning`](https://github.com/Microsoft-CISL/sqlite-autotuning) repository for a full example.
The `mlos-viz` package uses the `ExperimentData` and `TrialData` [`mlos_bench.storage` APIs](https://github.com/microsoft/MLOS/tree/main/mlos_bench/./mlos_bench/storage/) to load the data from the database and visualize it.
For example:
```python
from mlos_bench.storage import from_config
# Specify the experiment_id used for your experiment.
experiment_id = "YourExperimentId"
trial_id = 1
# Specify the path to your storage config file.
storage = from_config(config_file="storage/sqlite.jsonc")
# Access one of the experiments' data:
experiment_data = storage.experiments[experiment_id]
# Full experiment results are accessible in a data frame:
results_data_frame = experiment_data.results
# Individual trial results are accessible via the trials dictionary:
trial_data = experiment_data.trials[trial_id]
# Tunables used for the trial are accessible via the config property:
trial_config = trial_data.config
```
See Also: <https://microsoft.github.io/MLOS> for full API documentation.