geneticalgorithm2


Namegeneticalgorithm2 JSON
Version 6.9.2 PyPI version JSON
download
home_pagehttps://github.com/PasaOpasen/geneticalgorithm2
SummarySupported highly optimized and flexible genetic algorithm package for python
upload_time2024-04-14 13:10:04
maintainerDemetry Pascal
docs_urlNone
authorDemetry Pascal
requires_pythonNone
licenseMIT
keywords solve solver equation optimization problem genetic algorithm ga easy fast genetic-algorithm combinatorial mixed evolutionary
VCS
bugtrack_url
requirements matplotlib numpy typing_extensions OppOpPopInit
Travis-CI No Travis.
coveralls test coverage No coveralls.
            [![PyPI
version](https://badge.fury.io/py/geneticalgorithm2.svg)](https://pypi.org/project/geneticalgorithm2/)
[![Downloads](https://pepy.tech/badge/geneticalgorithm2)](https://pepy.tech/project/geneticalgorithm2)
[![Downloads](https://pepy.tech/badge/geneticalgorithm2/month)](https://pepy.tech/project/geneticalgorithm2)
[![Downloads](https://pepy.tech/badge/geneticalgorithm2/week)](https://pepy.tech/project/geneticalgorithm2)

[![Gitter](https://badges.gitter.im/geneticalgorithm2/community.svg)](https://gitter.im/geneticalgorithm2/community?utm_source=badge&utm_medium=badge&utm_campaign=pr-badge)
[![PRs Welcome](https://img.shields.io/badge/PRs-welcome-brightgreen.svg?style=flat-square)](https://github.com/PasaOpasen/geneticalgorithm2/pulls)

https://pasaopasen.github.io/geneticalgorithm2/

**geneticalgorithm2** (from [DPEA](https://github.com/PasaOpasen/PasaOpasen.github.io/blob/master/EA_packages.md)) **is the supported advanced optimized fork of non-supported package** [geneticalgorithm](https://github.com/rmsolgi/geneticalgorithm) of *Ryan (Mohammad) Solgi*

- [About](#about)
- [Installation](#installation)
- [Updates information](#updates-information)
  - [**Future**](#future)
  - [**TODO firstly**](#todo-firstly)
  - [6.9.2 bugfixes](#692-bugfixes)
  - [6.9.1 refactor](#691-refactor)
  - [6.9.0 reborn](#690-reborn)
  - [6.8.7 minor update](#687-minor-update)
  - [6.8.6 minor update](#686-minor-update)
  - [6.8.5 minor update](#685-minor-update)
  - [6.8.4 minor update](#684-minor-update)
  - [6.8.3 types update](#683-types-update)
  - [6.8.2 patch](#682-patch)
  - [6.8.1 patch](#681-patch)
  - [6.8.0 minor update](#680-minor-update)
  - [6.7.7 refactor](#677-refactor)
  - [6.7.6 bug fix](#676-bug-fix)
  - [6.7.5 refactor](#675-refactor)
  - [6.7.4 bug fix](#674-bug-fix)
  - [6.7.3 speed up](#673-speed-up)
  - [6.7.2 little update](#672-little-update)
  - [6.7.1 patch](#671-patch)
  - [6.7.0 minor update (new features)](#670-minor-update-new-features)
  - [6.6.2 patch (speed up)](#662-patch-speed-up)
  - [6.6.1 patch](#661-patch)
  - [6.6.0 minor update (refactoring)](#660-minor-update-refactoring)
  - [6.5.1 patch](#651-patch)
  - [6.5.0 minor update (refactoring)](#650-minor-update-refactoring)
  - [6.4.1 patch (bug fix)](#641-patch-bug-fix)
  - [6.4.0 minor update (refactoring)](#640-minor-update-refactoring)
  - [6.3.0 minor update (refactoring)](#630-minor-update-refactoring)
- [Working process](#working-process)
  - [Main algorithm structure](#main-algorithm-structure)
  - [Optimization process components](#optimization-process-components)
    - [Function to minimize](#function-to-minimize)
    - [Optimization space](#optimization-space)
    - [Algorithm parameters](#algorithm-parameters)
    - [Samples constructors](#samples-constructors)
    - [Callbacks](#callbacks)
  - [How to run](#how-to-run)
  - [Constructor parameters](#constructor-parameters)
  - [Genetic algorithm's parameters](#genetic-algorithms-parameters)
    - [AlgorithmParams object](#algorithmparams-object)
    - [Parameters of algorithm](#parameters-of-algorithm)
      - [**Crossover**](#crossover)
      - [**Mutation**](#mutation)
      - [**Selection**](#selection)
  - [Methods and Properties of model](#methods-and-properties-of-model)
- [Examples for beginner](#examples-for-beginner)
  - [A minimal example](#a-minimal-example)
  - [The simple example with integer variables](#the-simple-example-with-integer-variables)
  - [The simple example with Boolean variables](#the-simple-example-with-boolean-variables)
  - [The simple example with mixed variables](#the-simple-example-with-mixed-variables)
  - [Optimization problems with constraints](#optimization-problems-with-constraints)
  - [Middle example: select fixed count of objects from set](#middle-example-select-fixed-count-of-objects-from-set)
- [U should know these features](#u-should-know-these-features)
  - [Available crossovers](#available-crossovers)
  - [Available selections](#available-selections)
  - [Function timeout](#function-timeout)
  - [Standard GA vs. Elitist GA](#standard-ga-vs-elitist-ga)
  - [Standard crossover vs. stud EA crossover](#standard-crossover-vs-stud-ea-crossover)
  - [Creating better start population](#creating-better-start-population)
    - [Select best N of kN](#select-best-n-of-kn)
    - [Do local optimization](#do-local-optimization)
    - [Optimization with oppositions](#optimization-with-oppositions)
  - [Revolutions](#revolutions)
  - [Duplicates removing](#duplicates-removing)
  - [Cache](#cache)
  - [Report checker](#report-checker)
  - [Middle callbacks](#middle-callbacks)
  - [How to compare efficiency of several versions of GA optimization](#how-to-compare-efficiency-of-several-versions-of-ga-optimization)
  - [Hints on how to adjust genetic algorithm's parameters (from `geneticalgorithm` package)](#hints-on-how-to-adjust-genetic-algorithms-parameters-from-geneticalgorithm-package)
  - [How to get maximum speed](#how-to-get-maximum-speed)
    - [Don't use plotting](#dont-use-plotting)
    - [Don't use progress bar](#dont-use-progress-bar)
    - [Try to use faster optimizing function](#try-to-use-faster-optimizing-function)
    - [Specify custom optimized `mutation`, `crossover`, `selection`](#specify-custom-optimized-mutation-crossover-selection)
    - [Specify `fill_children` method](#specify-fill_children-method)
- [Examples pretty collection](#examples-pretty-collection)
  - [Optimization test functions](#optimization-test-functions)
    - [Sphere](#sphere)
    - [Ackley](#ackley)
    - [AckleyTest](#ackleytest)
    - [Rosenbrock](#rosenbrock)
    - [Fletcher](#fletcher)
    - [Griewank](#griewank)
    - [Penalty2](#penalty2)
    - [Quartic](#quartic)
    - [Rastrigin](#rastrigin)
    - [SchwefelDouble](#schwefeldouble)
    - [SchwefelMax](#schwefelmax)
    - [SchwefelAbs](#schwefelabs)
    - [SchwefelSin](#schwefelsin)
    - [Stairs](#stairs)
    - [Abs](#abs)
    - [Michalewicz](#michalewicz)
    - [Scheffer](#scheffer)
    - [Eggholder](#eggholder)
    - [Weierstrass](#weierstrass)
  - [Using GA in reinforcement learning](#using-ga-in-reinforcement-learning)
  - [Using GA with image reconstruction by polygons](#using-ga-with-image-reconstruction-by-polygons)
- [Popular questions](#popular-questions)
  - [How to disable autoplot?](#how-to-disable-autoplot)
  - [How to plot population scores?](#how-to-plot-population-scores)
  - [How to specify evaluated function for all population?](#how-to-specify-evaluated-function-for-all-population)
  - [What about parallelism?](#what-about-parallelism)
  - [How to initialize start population? How to continue optimization with new run?](#how-to-initialize-start-population-how-to-continue-optimization-with-new-run)

# About

[**geneticalgorithm2**](https://pasaopasen.github.io/geneticalgorithm2/) is very flexible and highly optimized Python library for implementing classic
[genetic-algorithm](https://towardsdatascience.com/introduction-to-optimization-with-genetic-algorithm-2f5001d9964b) (GA).

Features of this package:

* written on **pure python**
* **extremely fast**
* **no hard dependencies** (only numpy primary, can work without matplotlib)
* **easy to run**: no need to perform long task-setup process
* easy to logging, reach **support of flexible callbacks**
* **many built-in plotting functions**
* **many built-in cases of crossover, mutation and selection**
* support of integer, boolean and real (continuous/discrete) variables types
* support of mixed types of variables
* **support of classic, elitist and studEA genetic algorithm combinations**
* **support of revolutions and duplicates utilization**
* **reach support of customization**
    
# Installation

Install this package with standard light dependencies to use the base functional.
```
pip install geneticalgorithm2
```

Install this package with full dependencies to use all provided functional including plotting and built-in parallelism tools.

```
pip install geneticalgorithm2[full]
```

# Updates information

## **Future**

- duplicates removing and revolutions will be moved to `MiddleCallbacks` and removed as alone `run()` parameters
- new stop criteria callbacks (min std, max functions evaluations)
- `vartype` will support strings like `iiiiibbf`

## **TODO firstly**
- Remove old style mentions from README

## 6.9.2 bugfixes

## 6.9.1 refactor

- Finally move `function_timeout` and `function` to `run()` method and deprecate its usage in init()
- `function` is not mandatory to be non-empty 
- reduce documentation duplicates

## 6.9.0 reborn

- recreate the repository without excess heavy files materials 
- host the [code documentation](https://pasaopasen.github.io/geneticalgorithm2/)
- rename `geneticalgorithm2` class to `GeneticAlgorithm2`
- substantial package architecture refactor
- add more docstrings

## 6.8.7 minor update

- some code refactor
- fixes:
  - ensure the directory of generation file exists on save

## 6.8.6 minor update

- small package installation update: add `pip install geneticalgorithm2[full]` version
- small refactor

## 6.8.5 minor update

- update `OppOpPopInit` `2.0.0->2.0.1`
- set default `function_timeout` to `None` which means no use of function time checking
- remove `joblib` and `func_timeout` from necessary dependencies

## 6.8.4 minor update

- a bit of refactor
- little optimizations
- add empty field `fill_children(pop_matrix, parents_count)` to `geneticalgorithm2` class to specify children creating behavior (what is the most intensive part of algorithm after optimizing func calculations), see [this](#specify-fill_children-method)

## 6.8.3 types update

- much more type hints

## 6.8.2 patch

- for printing info
- fix logic: now population is always sorted before going to callbacks

## 6.8.1 patch

- printing progress bar to `'stderr'` or `'stdout'` or `None` (disable) by choice (`progress_bar_stream` argument of `run()`), deprecated `disable_progress_bar`
- little speed up
- new `geneticalgorithm2.vectorized_set_function` set function, which can be faster for big populations 

## 6.8.0 minor update

- remove `crossover_probability` model parameter because of it has no sense to exist (and 1.0 value is better than others, take a look at [results](/examples/output/sense_of_crossover_prob__no_sense.png)). This parameter came from `geneticalgorithm` old package and did`t change before.

## 6.7.7 refactor

- change some behavior about parents selection

## 6.7.6 bug fix

- fix some bug of `variable_type=='bool'`
- some refactor of progress bar
- add some dependencies to `setup.py`

## 6.7.5 refactor

- shorter progress bar (length can be controlled by setting `PROGRESS_BAR_LEN` field of `geneticalgorithm2` class)
- shorter logic of `run()`, more informative output 

## 6.7.4 bug fix

- bug fix

## 6.7.3 speed up

- refactor to make `run()` method faster

## 6.7.2 little update

- better flexible logic for report, [take a look](#report-checker)
- removed `show mean` parameter from `model.plot_result` and now model reports only best score by default, not average and so on (u can specify if u wanna report average too)
- `plot_several_lines` useful function

## 6.7.1 patch

- changes according to new [OppOpPopInit](https://github.com/PasaOpasen/opp-op-pop-init) version

## 6.7.0 minor update (new features)

- add `mutation_discrete_type` and `mutation_discrete_probability` parameters in model. It controls mutation behavior for discrete (integer) variables and works like `mutation_type` and `mutation_probability` work for continuous (real) variables. Take a look at [algorithm parameters](#parameters-of-algorithm) 

## 6.6.2 patch (speed up)

- fix and speed up mutation

## 6.6.1 patch

- removed unnecessary dependencies

## 6.6.0 minor update (refactoring)

- deprecated `variable_type_mixed`, now use `variable_type` for mixed optimization too
- deprecated `output_dict`, now it's better object with name `result`
- refactor of big part of **tests**
- refactor of README

## 6.5.1 patch

- replace `collections.Sequence` with `collections.abc.Sequence`, now it should work for `python3.10+`

## 6.5.0 minor update (refactoring)

- another form of data object using with middle callbacks (`MiddleCallbackData` dataclass instead of dictionary)
- type hints for callbacks module 

## 6.4.1 patch (bug fix)

- fix bug setting attribute to algorithm parameters (in middle callbacks) 


## 6.4.0 minor update (refactoring)

- new valid forms for `start_generation`; now it's valid to use
    * `None`
    * `str` path to saved generation
    * dictionary with structure `{'variables': variables/None, 'scores': scores/None}`
    * `Generation` object: `Generation(variables = variables, scores = scores)`
    * `np.ndarray` with shape `(samples, dim)` for only population or `(samples, dim+1)` for concatenated population and score (scores is the last matrix column)
    * `tuple(np.ndarray/None, np.ndarray/None)` for variables and scores
  
  here `variables` is 2D numpy array with shape `(samples, dim)`, `scores` is 1D numpy array with scores (function values) for each sample; [here](examples/output/start_gen.py) and [here](#how-to-initialize-start-population-how-to-continue-optimization-with-new-run) u can see examples of using these valid forms 


## 6.3.0 minor update (refactoring)

- type hints for entire part of functions
- new valid forms for function parameters (now u don't need to use numpy arrays everywhere)
- `AlgorithmParams` class for base GA algorithm parameters (instead of dictionary)
- `Generation` class for saving/loading/returning generation (instead of dictionary)

All that classes are collected [in file](geneticalgorithm2/data_types/classes.py). To maintain backward compatibility, `AlgorithmParams` and `Generation` classes have dictionary-like interface for getting fields: u can use `object.field` or `object['field']` notations.


# Working process

## Main algorithm structure

```
Pre-process: making inner functions depends on params, making/loading start population

while True:

    if reason to stop (time is elapsed / no progress / generation count is reached / min value is reached):
        break


    select parents to crossover from last population and put them to new population:
        select (elit count) best samples
        select (parents count - elit count) random samples (by selection function)

    create (total samples count - parents count) children (samples from selected parents) and put them to new population:
        while not all children are created:
            select 2 random parents
            make child1, child2 from them using crossover
            mutate child1 by mutation (model.mut)
            mutate child2 by middle mutation (model.mut_middle)
            put children to new population
    
    remove duplicates, make revolutions, sort population by scores
    use callbacks, use middle callbacks

Post-process: plotting results, saving

```

## Optimization process components

### Function to minimize

The goal of the optimization process is to find the *minimum* of the given `function (1D array) -> float` where the function argument is a vector of some values in different dimensions. 

If u want to find the *maximum*, use this idea:
```python
opt_func = lambda arr: -func(arr)

#
# ... find global min of opt_func
#

opt_minimum=opt_func(best value)
maximum = -opt_minimum
```

Also it is possible and highly recommended to create and use a *vectorized* version of this function called `set_function (2D array) -> (1D array)` which transforms several samples matrix to samples scores vector by one call. Using this way u can speed up calculations or set up more complex tasks optimization

### Optimization space

The function rates 1D arrays (vectors) where each component (dimension) means something u program it to mean. Each dimension has its `bound` (`[min; max]` cut) and `variable type` (real/discrete).

**Advice**. Genetic algorithms work much faster and efficient for discrete tasks. If high precision is not required u can split any real dimension to many discrete values (for instance, `[1.1, 1.2, 1.25, 1.44]`) and try to optimize indexes of the given array which are converted to real values inside `function` itself.

### Algorithm parameters

There are a number of hyperparameters u can probe to optimize including population size and selection/crossover/mutation types.

### Samples constructors

There are several ways to create new testing samples from zero when u start with empty population or when u need new samples after [duplicates removing](#duplicates-removing) and [revolutions](#revolutions). 


### Callbacks

Now the package supports 2 different types of highly customized callbacks:
* [simple callbacks](#callbacks)
* [middle callbacks](#middle-callbacks)

## How to run

Firstly, u should **import needed packages**. 

All available (but not always necessary) imports are:

```python
import numpy as np

# the only one required import
from geneticalgorithm2 import GeneticAlgorithm2 as ga  # for creating and running optimization model

from geneticalgorithm2 import Generation, AlgorithmParams  # classes for comfortable parameters setting and getting

from geneticalgorithm2 import Crossover, Mutations, Selection  # classes for specific mutation and crossover behavior

from geneticalgorithm2 import get_population_initializer  # for creating better start population

from geneticalgorithm2 import np_lru_cache  # for cache function (if u want)

from geneticalgorithm2 import plot_pop_scores  # for plotting population scores, if u want

from geneticalgorithm2 import Callbacks  # simple callbacks (will be deprecated)

from geneticalgorithm2 import Actions, ActionConditions, MiddleCallbacks  # middle callbacks
```

Next step: **define the function to minimize**:

```python
def function(X: np.ndarray) -> float: # X as 1d-numpy array
    return np.sum(X**2) + X.mean() + X.min() + X[0]*X[2] # some float result
```

Also u should **create the bounds for each variable** (if exist) such as:

```python
var_bound = np.array([[0,10]]*3) # 2D numpy array with shape (dim, 2)

# also u can use Sequence of Tuples (from version 6.3.0)
var_bound = [
    (0, 10),
    (0, 10),
    (0, 10)
]
```

**Important**. U don't need to use variable boundaries only if variable type of each variable is boolean. This case will be automatically converted to discrete variables with bounds `(0, 1)`.

After that u **create a `GeneticAlgorithm2` (was imported early as ga) object**:

```python
model = ga( 
    dimension = 3, 
    variable_type='real', 
    variable_boundaries = var_bound,
    algorithm_parameters={
        'max_num_iteration': None,
        'population_size':100,
        'mutation_probability': 0.1,
        'mutation_discrete_probability': None,
        'elit_ratio': 0.01,
        'parents_portion': 0.3,
        'crossover_type':'uniform',
        'mutation_type': 'uniform_by_center',
        'mutation_discrete_type': 'uniform_discrete',
        'selection_type': 'roulette',
        'max_iteration_without_improv':None
    }
)
```

**Note**: it is not mandatory to write all possible `algorithm_parameters`, here it is done only to show u defaults. Also u can use `AlgorithmParams` (with typehints and docstrings) class instead of dicts:

```python
algorithm_parameters=AlgorithmParams(
    max_num_iteration=None,
    population_size=100,
    mutation_probability=0.1,
    mutation_discrete_probability=None,
    elit_ratio=0.01,
    parents_portion=0.3,
    crossover_type='uniform',
    mutation_type='uniform_by_center',
    mutation_discrete_type='uniform_discrete',
    selection_type='roulette',
    max_iteration_without_improv=None
)
```

**Run the search method**:

```python
# all of this parameters are default
result = model.run(
    no_plot = False, 
    progress_bar_stream = 'stdout',
    disable_printing = False,

    function=function,
    function_timeout=None,

    set_function = None, 
    apply_function_to_parents = False, 
    start_generation = None,
    studEA = False,
    mutation_indexes = None,

    init_creator = None,
    init_oppositors = None,
    duplicates_oppositor = None,
    remove_duplicates_generation_step = None,
    revolution_oppositor = None,
    revolution_after_stagnation_step = None,
    revolution_part = 0.3,
    
    population_initializer = Population_initializer(select_best_of = 1, local_optimization_step = 'never', local_optimizer = None),
    
    stop_when_reached = None,
    callbacks = [],
    middle_callbacks = [],
    time_limit_secs = None, 
    save_last_generation_as = None,
    seed = None
)

# best candidate
print(result.variable)

# best score
print(result.score)

# last generation
print(result.last_generation)

```

## Constructor parameters

Have a look at https://pasaopasen.github.io/geneticalgorithm2/geneticalgorithm2/geneticalgorithm2.html#GeneticAlgorithm2.__init__

## Genetic algorithm's parameters

### AlgorithmParams object

The parameters of GA is defined as a dictionary or `AlgorithmParams` object: https://pasaopasen.github.io/geneticalgorithm2/geneticalgorithm2/data_types/algorithm_params.html

To get the global default params use code:
```python
params = ga.default_params
```

To get actual parameters of an existing model use code:
```python
params = model.param
```

### Parameters of algorithm

#### **Crossover**

https://pasaopasen.github.io/geneticalgorithm2/geneticalgorithm2/crossovers.html

#### **Mutation**

https://pasaopasen.github.io/geneticalgorithm2/geneticalgorithm2/mutations.html

#### **Selection**

https://pasaopasen.github.io/geneticalgorithm2/geneticalgorithm2/selections.html

## Methods and Properties of model

Have a look at https://pasaopasen.github.io/geneticalgorithm2/geneticalgorithm2/geneticalgorithm2.html#GeneticAlgorithm2.run

# Examples for beginner

## A minimal example 

Assume we want to find a set of `X = (x1, x2, x3)` that minimizes function `f(X) = x1 + x2 + x3` where `X` can be any real number in `[0, 10]`.

This is a trivial problem and we already know that the answer is `X = (0, 0, 0)` where `f(X) = 0`.  

We just use this simple example to show how to implement it with `geneticalgorithm2`. First we import `geneticalgorithm2` and [numpy](https://numpy.org). Next, we define 
function `f` which we want to minimize and the boundaries of the decision variables. Then simply `geneticalgorithm2` is called to solve the defined optimization problem as follows:

```python
import numpy as np
from geneticalgorithm2 import GeneticAlgorithm2 as ga


def f(X):
    return np.sum(X)


varbound = [[0, 10]] * 3

model = ga(dimension=3, variable_type='real', variable_boundaries=varbound)

model.run(function=f)
```
  
If you run the code, you should see a progress bar that shows the progress of the 
genetic algorithm (GA) and then the solution, objective function value and the convergence curve as follows:

![](https://github.com/PasaOpasen/geneticalgorithm2/blob/master/genetic_algorithm_convergence.gif)

Also we can access to the best answer of the defined optimization problem found by GA as a dictionary and a report of the progress of the genetic algorithm. 
To do so we complete the code as follows:

```python
convergence = model.report

solution = model.result
```

## The simple example with integer variables

Considering the problem given in the simple example above.
Now assume all variables are integers. So `x1, x2, x3` can be any integers in `[0, 10]`.
In this case the code is as the following:

```python
import numpy as np
from geneticalgorithm2 import GeneticAlgorithm2 as ga


def f(X):
    return np.sum(X)

varbound = [[0, 10]] * 3

model = ga(dimension=3, variable_type='int', variable_boundaries=varbound)

model.run(function=f)
```
So, as it is seen the only difference is that for `variable_type` we use string `'int'`. 

## The simple example with Boolean variables

Considering the problem given in the simple example above.
Now assume all variables are boolean instead of real or integer. So `X` can be either zero or one. Also instead of three let's have 30 variables.
In this case the code is as the following:

```python
import numpy as np
from geneticalgorithm2 import GeneticAlgorithm2 as ga


def f(X):
    return np.sum(X)

model = ga(dimension=30, variable_type='bool')

model.run(function=f)
```
Note for variable_type we use string `'bool'` when all variables are boolean.  
Note that when variable_type equal `'bool'` there is no need for `variable_boundaries` to be defined.

## The simple example with mixed variables

Considering the problem given in the the simple example above where we want to minimize `f(X) = x1 + x2 + x3`. 
Now assume `x1` is a real (continuous) variable in `[0.5; 1.5]`, `x2` is an integer variable in `[1;100]`, and `x3` is a boolean variable that can be either zero or one.
We already know that the answer is `X = (0.5, 1, 0)` where `f(X) = 1.5`.
We implement geneticalgorithm2 as the following:

```python
import numpy as np
from geneticalgorithm2 import GeneticAlgorithm2 as ga


def f(X):
    return np.sum(X)


varbound = [[0.5, 1.5], [1, 100], [0, 1]]
vartype = ('real', 'int', 'int')
model = ga(dimension=3, variable_type=vartype, variable_boundaries=varbound)

model.run(function=f)
```

## Optimization problems with constraints

In all above examples, the optimization problem was unconstrained. Now consider that we want to minimize `f(X) = x1+x2+x3` where `X` is a set of real variables in `[0; 10]`. Also we have an extra constraint so that sum of `x1` and `x2` is equal or greater than 2. The minimum of `f(X)` is 2.
In such a case, a trick is to define penalty function. Hence we use the code below:

```python
import numpy as np
from geneticalgorithm2 import GeneticAlgorithm2 as ga


def f(X):
    pen = 0
    if X[0] + X[1] < 2:
        pen = 500 + 1000 * (2 - X[0] - X[1])
    return np.sum(X) + pen


varbound = [[0, 10]] * 3

model = ga(dimension=3, variable_type='real', variable_boundaries=varbound)

model.run(function=f)

```
As seen above we add a penalty to the objective function whenever the constraint is not met.  

Some hints about how to define a penalty function:  

1. Usually you may use a constant greater than the maximum possible value of the objective function if the maximum is known or if we have a guess of that. Here the highest possible value of our function is 300 (i.e. if all variables were 10, `f(X)=300`). So I chose a constant of 500. So, if a trial solution is not in the feasible region even though its objective function may be small, the penalized objective function (fitness function) is worse than any feasible solution.
2. Use a coefficient big enough and multiply that by the amount of violation. This helps the algorithm learn how to approach feasible domain.
3. How to define penalty function usually influences the convergence rate of an evolutionary algorithm. In my [book on metaheuristics and evolutionary algorithms](https://www.wiley.com/en-us/Meta+heuristic+and+Evolutionary+Algorithms+for+Engineering+Optimization-p-9781119386995) you can learn more about that. 
4. Finally after you solved the problem test the solution to see if boundaries are met. If the solution does not meet constraints, it shows that a bigger penalty is required. However, in problems where optimum is exactly on the boundary of the feasible region (or very close to the constraints) which is common in some kinds of problems, a very strict and big penalty may prevent the genetic algorithm to approach the optimal region. In such a case designing an appropriate penalty function might be more challenging. Actually what we have to do is to design a penalty function that let the algorithm searches unfeasible domain while finally converge to a feasible solution. Hence you may need more sophisticated penalty functions. But in most cases the above formulation work fairly well.

## Middle example: select fixed count of objects from set

For some task u need to think a lot and create good specific crossover or mutation functions. For example, take a look at this problem:
```
    From set like X = {x1, x2, x3, ..., xn} u should select only k objects which get the best function value
```
U can do it using this code:

```python
import numpy as np
from geneticalgorithm2 import GeneticAlgorithm2 as ga

subset_size = 20  # how many objects we can choose

objects_count = 100  # how many objects are in set

my_set = np.random.random(objects_count) * 10 - 5  # set values


# minimized function
def f(X):
    return abs(np.mean(my_set[X == 1]) - np.median(my_set[X == 1]))


# initialize start generation and params

N = 1000  # size of population
start_generation = np.zeros((N, objects_count))
indexes = np.arange(0, objects_count, dtype=np.int8)  # indexes of variables

for i in range(N):
    inds = np.random.choice(indexes, subset_size, replace=False)
    start_generation[i, inds] = 1


def my_crossover(parent_a, parent_b):
    a_indexes = set(indexes[parent_a == 1])
    b_indexes = set(indexes[parent_b == 1])

    intersect = a_indexes.intersection(b_indexes)  # elements in both parents
    a_only = a_indexes - intersect  # elements only in 'a' parent
    b_only = b_indexes - intersect

    child_inds = np.array(list(a_only) + list(b_only), dtype=np.int8)
    np.random.shuffle(child_inds)  # mix

    children = np.zeros((2, parent_a.size))
    if intersect:
        children[:, np.array(list(intersect))] = 1
    children[0, child_inds[:int(child_inds.size / 2)]] = 1
    children[1, child_inds[int(child_inds.size / 2):]] = 1

    return children[0, :], children[1, :]


model = ga(
    dimension=objects_count,
    variable_type='bool',
    algorithm_parameters={
        'max_num_iteration': 500,
        'mutation_probability': 0,  # no mutation, just crossover
        'elit_ratio': 0.05,
        'parents_portion': 0.3,
        'crossover_type': my_crossover,
        'max_iteration_without_improv': 20
    }
)

model.run(
    function=f,
    no_plot=False, 
    start_generation=(start_generation, None)
)
```

# U should know these features

## Available crossovers

For two example parents (*one with ones* and *one with zeros*) next crossovers will give same children ([examples](examples/crossovers_examples.py)): 

* **one_point**:

|0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1|
|:---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---:|
|1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0|

* **two_point**:

|1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1|
|:---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---:|
|0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0|

* **uniform**:

|1 | 1 | 1 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0|
|:---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---:|
|0 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1|

* **uniform_window**:

|1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 1 | 1|
|:---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---:|
|0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 0 | 0 | 0|

* **shuffle**:

|0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | 1 | 0 | 1 | 0|
|:---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---:|
|1 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 1|

* **segment**:

|0 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 1|
|:---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---:|
|1 | 0 | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 1 | 0 | 0|

* **arithmetic**:

|0.13 | 0.13 | 0.13 | 0.13 | 0.13 | 0.13 | 0.13 | 0.13 | 0.13 | 0.13 | 0.13 | 0.13 | 0.13 | 0.13 | 0.13|
|:---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---:|
|0.87 | 0.87 | 0.87 | 0.87 | 0.87 | 0.87 | 0.87 | 0.87 | 0.87 | 0.87 | 0.87 | 0.87 | 0.87 | 0.87 | 0.87|

* **mixed**:

|0.63 | 0.84 | 1.1 | 0.73 | 0.67 | -0.19 | 0.3 | 0.72 | -0.18 | 0.61 | 0.84 | 1.14 | 1.36 | -0.37 | -0.19|
|:---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---:|
|0.51 | 0.58 | 0.43 | 0.42 | 0.55 | 0.49 | 0.57 | 0.48 | 0.46 | 0.56 | 0.56 | 0.54 | 0.44 | 0.51 | 0.4|

## Available selections

![](examples/output/selections.png)


## Function timeout

**geneticalgorithm2** inherited several features from `geneticalgorithm` package sush as that if the given function does not provide any output before timeout, the algorithm would be terminated and raise the appropriate error. 

In such a case make sure the given function works correctly (i.e. there is no infinite loop in the given function). Also if the given function takes more than 10 seconds to complete the work
make sure to increase `function_timeout` in arguments.

## Standard GA vs. Elitist GA

The convergence curve of an elitist genetic algorithm is always non-increasing. So, the best ever found solution is equal to the best solution of the last iteration. However, the convergence curve of a standard genetic algorithm is different. If `elit_ratio` is zero geneticalgorithm2 implements a standard GA. The output of geneticalgorithm2 for standard GA is the best ever found solution not the solution of the last iteration. The difference between the convergence curve of standard GA and elitist GA is shown below:

![](examples/output/standard_vs_elitist.png)

## Standard crossover vs. stud EA crossover

[Stud EA](https://link.springer.com/chapter/10.1007%2FBFb0056910) is the idea of using crossover always with best object. So one of two parents is always the best object of population. It can help us in a lot of tasks!

![](examples/output/studEA.png)

## Creating better start population

There is `get_population_initializer(select_best_of = 4, local_optimization_step = 'never', local_optimizer = None)` function for creating start population creators. Take a look at [its docs](https://pasaopasen.github.io/geneticalgorithm2/geneticalgorithm2/population_initializer.html)

### Select best N of kN

This little option can help u especially with multimodal tasks. 

![](examples/output/init_best_of.png)

### Do local optimization

We can apply some local optimization on start generation before starting GA search. It can be some gradient descent or hill climbing and so on. Also we can apply it before selection best objects (on entire population) or after (on best part of population) and so forth.

In next example I'm using my [DiscreteHillClimbing](https://github.com/PasaOpasen/DiscreteHillClimbing) algorithm for local optimization my discrete task:

```python
import numpy as np
import matplotlib.pyplot as plt

from DiscreteHillClimbing import Hill_Climbing_descent

from geneticalgorithm2 import GeneticAlgorithm2 as ga
from geneticalgorithm2 import get_population_initializer


def f(arr):
    arr2 = arr / 25
    return -np.sum(arr2 * np.sin(np.sqrt(np.abs(arr2)))) ** 5 + np.sum(np.abs(arr2)) ** 2


iterations = 100

varbound = [[-100, 100]] * 15

available_values = [np.arange(-100, 101)] * 15

my_local_optimizer = lambda arr, score: Hill_Climbing_descent(
    function=f, available_predictors_values=available_values,
    max_function_evals=50, start_solution=arr
)

model = ga(
    dimension=varbound.shape[0],
    variable_type='int',
    variable_boundaries=varbound,
    algorithm_parameters={
        'max_num_iteration': iterations,
        'population_size': 400
    }
)

for time in ('before_select', 'after_select', 'never'):
    model.run(
        function=f
        no_plot=True,
        population_initializer=get_population_initializer(
            select_best_of=3,
            local_optimization_step=time,
            local_optimizer=my_local_optimizer
        )
    )

    plt.plot(model.report, label=f"local optimization time = '{time}'")

plt.xlabel('Generation')
plt.ylabel('Minimized function (40 simulations average)')
plt.title('Selection best N object before running GA')
plt.legend()
```

![](examples/output/init_local_opt.png)

### Optimization with oppositions

Also u can create start population with [oppositions](https://github.com/PasaOpasen/opp-op-pop-init). See [example of code](examples/best_of_N_with_opp.py)

![](examples/output/init_best_of_opp.png)

## Revolutions

U can create [revolutions in your population](https://github.com/PasaOpasen/opp-op-pop-init) after some stagnation steps. It really can help u for some tasks. See [example](examples/revolution.py)

![](examples/output/revolution.png)


## Duplicates removing

If u remove duplicates each `k` generations, u can speed up the optimization process ([example](examples/remove_dups.py))

![](examples/output/remove_dups.png)

## Cache

It can be useful for run-speed to use cache with *some discrete tasks*. For this u can import `np_lru_cache` decorator and use it like here:

```python
import np_lru_cache

@np_lru_cache(maxsize=some_size)
def minimized_func(arr):
    # code
    return result

#
# run
#    algorithm
#


# don't forget to clear cache
minimized_func.cache_clear()
```
## Report checker

Basically the model checks best population score (minimal score of generation) each generation and saves it to `report` field. Actually this sequence of numbers u see in big part of plots. This behavior is needed for several parts and u cannot disable it. But if u want to report some other metric without using [callbacks](#middle-callbacks), there is highly simple and fast way.

After creating `model` but before running `run()` u need to append ur logic to `model.checked_reports` field. Take a look at example:

```python
import numpy as np

from geneticalgorithm2 import GeneticAlgorithm2 as ga
from geneticalgorithm2 import plot_several_lines


def f(X):
    return 50 * np.sum(X) - np.sum(np.sqrt(X) * np.sin(X))


dim = 25
varbound = [[0, 10]] * dim

model = ga(
    dimension=dim,
    variable_type='real', 
    variable_boundaries=varbound,
    algorithm_parameters={
        'max_num_iteration': 600
    }
)

# here model exists and has checked_reports field
# now u can append any functions to report

model.checked_reports.extend(
    [
        ('report_average', np.mean),
        ('report_25', lambda arr: np.quantile(arr, 0.25)),
        ('report_50', np.median)
    ]
)

# run optimization process
model.run(
    function=f,
    no_plot=False
)

# now u have not only model.report but model.report_25 and so on

# plot reports
names = [name for name, _ in model.checked_reports[::-1]]
plot_several_lines(
    lines=[getattr(model, name) for name in names],
    colors=('green', 'black', 'red', 'blue'),
    labels=['median value', '25% quantile', 'mean of population', 'best pop score'],
    linewidths=(1, 1.5, 1, 2),
    title="Several custom reports with base reports",
    save_as='./output/report.png'
)
```

![](examples/output/report.png)

As u see, u should append tuple `(name of report, func to evaluate report)` to `model.checked_report`. It's highly recommended to start this name with `report_` (e. g. `report_my_median`). And the function u use will get 1D-numpy *sorted* array of population scores.


## Middle callbacks

There is an amazing way to control optimization process using `MiddleCallbacks` class. Just learn next logic:

1. u can use several `MiddleCallbacks` callbacks as list at `middle_callbacks` parameter in `run()` method
2. each middle callback is the pair of `action` and `condition` functions
3. `condition(data)` (`Callable[[MiddleCallbackData], bool]`) function gets `data` object (dataclass `MiddleCallbackData` from version 6.5.0) about primary model parameters and makes logical decision about applying `action` function
4. `action(data)` (`Callable[[MiddleCallbackData],MiddleCallbackData]`) function modifies `data` objects as u need -- and model will be modified by new `data`
5. `data` object is the structure with several parameters u can modify:
   ```python
    data = MiddleCallbackData(
        last_generation=Generation.from_pop_matrix(pop),
        current_generation=t,
        report_list=self.report,

        mutation_prob=self.prob_mut,
        crossover_prob=self.prob_cross,
        mutation=self.real_mutation,
        crossover=self.crossover,
        selection=self.selection,

        current_stagnation=counter,
        max_stagnation=self.max_stagnations,

        parents_portion=self.param.parents_portion,
        elit_ratio=self.param.elit_ratio,

        set_function=self.set_function
    )
   ```  
   So, the `action` function gets `data` objects and returns `data` object.

It's very simple to create your own `action` and `condition` functions. But there are several popular functions contained in `Actions` and `ActionConditions` classes:
* `actions`:
  * `Stop()` -- just stop optimization process
  * `ReduceMutationProb(reduce_coef = 0.9)` -- reduce mutation probability
  * `ChangeRandomCrossover(available_crossovers: Sequence[Callable[[np.ndarray, np.ndarray], Tuple[np.ndarray, np.ndarray]]])` -- change another (random) crossover from list of crossovers
  * `ChangeRandomSelection(available_selections: Sequence[Callable[[np.ndarray, int], np.ndarray]])`
  * `ChangeRandomMutation(available_mutations: Sequence[Callable[[float, float, float], float]])`
  * `RemoveDuplicates(oppositor = None, creator = None, converter = None)`; see [doc](geneticalgorithm2/callbacks/middle.py)
  * `CopyBest(by_indexes)` -- copies best population object values (from dimensions in `by_indexes`) to all population
  * `PlotPopulationScores(title_pattern = lambda data: f"Generation {data['current_generation']}", save_as_name_pattern = None)` -- plot population scores; needs 2 functions like `data`->string for title and file name (to save)
* `conditions`:
  * `ActionConditions.EachGen(generation_step = 10)` -- do action each `generation_step` generations
  * `ActionConditions.Always()` do action each generations, equals to `ActionConditions.EachGen(1)`
  * `ActionConditions.AfterStagnation(stagnation_generations = 50)` -- do action after `stagnation_generations` stagnation generations
  * `ActionConditions.Several(list_of_conditions)` -- do action if all conditions in list are true

To combine `action` and `condition` to callback, just use `MiddleCallbacks.UniversalCallback(action, condition)` methods.


There are also next high-level useful callbacks:

* `MiddleCallbacks.ReduceMutationGen(reduce_coef = 0.9, min_mutation = 0.005, reduce_each_generation = 50, reload_each_generation = 500)`
* `MiddleCallbacks.GeneDiversityStats(step_generations_for_plotting:int = 10)` -- plots some duplicates statistics each gen ([example](/examples/plot_diversities.py))
![](diversity.gif)


See [code example](examples/small_middle_callbacks.py)

## How to compare efficiency of several versions of GA optimization

To compare efficiency of several versions of GA optimization (such as several values of several hyperparameters or including/excepting some actions like oppositions) u should make some count of simulations and compare results using some statistical test. I have realized this logic [here](https://github.com/PasaOpasen/ab-testing-results-difference) 

## Hints on how to adjust genetic algorithm's parameters (from `geneticalgorithm` package)

In general the performance of a genetic algorithm or any evolutionary algorithm
depends on its parameters. Parameter setting of an evolutionary algorithm is important. Usually these parameters are adjusted based on experience and by conducting a sensitivity analysis.
It is impossible to provide a general guideline to parameter setting but the suggestions provided below may help:  

* **Number of iterations**: Select a `max_num_iterations` sufficiently large; otherwise the reported solution may not be satisfactory. On the other hand 
selecting a very large number of iterations increases the run time significantly. So this is actually a compromise between
the accuracy you want and the time and computational cost you spend. 

* **Population size**: Given a constant number of functional evaluations (`max_num_iterations` times population_size) I would select smaller population size and greater iterations. However, a very small choice of population size is also deteriorative. For most problems I would select a population size of 100 unless the dimension of the problem is very large that needs a bigger population size.

* **elit_ratio**: Although having few elites is usually a good idea and may increase the rate of convergence in some problems, having too many elites in the population may cause the algorithm to easily trap in a local optima. I would usually select only one elite in most cases. Elitism is not always necessary and in some problems may even be deteriorative.

* **mutation_probability**: This is a parameter you may need to adjust more than the other ones. Its appropriate value heavily depends on the problem. Sometimes we may select
mutation_probability as small as 0.01 (i.e. 1 percent) and sometimes even as large as 0.5 (i.e. 50 percent) or even larger. In general if the genetic algorithm trapped 
in a local optimum increasing the mutation probability may help. On the other hand if the algorithm suffers from stagnation reducing the mutation probability may be effective. However, this rule of thumb is not always true.

* **parents_portion**: If parents_portion set zero, it means that the whole of the population is filled with the newly generated solutions. 
On the other hand having this parameter equals 1 (i.e. 100 percent) means no new solution
is generated and the algorithm would just repeat the previous values without any change which is not meaningful and effective obviously. Anything between these two may work. The exact value depends on the problem.

* **crossover_type**: Depends on the problem. I would usually use uniform crossover. But testing the other ones in your problem is recommended.

* **max_iteration_without_improv**: This is a parameter that I recommend being used cautiously. 
If this parameter is too small then the algorithm may stop while it trapped in a local optimum.
So make sure you select a sufficiently large criteria to provide enough time for the algorithm to progress and to avoid immature convergence. 

Finally to make sure that the parameter setting is fine, we usually should run the 
algorithm for several times and if convergence curves of all runs converged to the same objective function value we may accept that solution as the optimum. The number of runs
depends but usually five or ten runs is prevalent. Notice that in some problems
several possible set of variables produces the same objective function value. 
When we study the convergence of a genetic algorithm we compare the objective function values not the decision variables.

## How to get maximum speed

### Don't use plotting

```python
result = model.run(
    no_plot = True, 
)
```

### Don't use progress bar

```python
result = model.run(
    progress_bar_stream = None,
)
```

### Try to use faster optimizing function

Try to speed up your optimizing `function` using Numpy, [Numba](https://numba.pydata.org/) or [Cython](https://cython.org/). If u can, write your own `set_function` (function which applies to whole population samples matrix) with cython optimizations, parallelism and so.

### Specify custom optimized `mutation`, `crossover`, `selection`

Write faster implementations for model methods `mut`, `mut_middle`, `crossover`, `selection` and set them before running optimization process:

```python
model.mut = custom_mut
model.crossover = custom_crossover

model.run(...)
```

### Specify `fill_children` method

From version `6.8.4` there is `fill_children` model method:

```python
self.fill_children: Optional[Callable[[array2D, int], None]] = None
```

It is empty and does nothing; but if u specify it, u can get huge speed up at very intensive algorithm part. Take a look at [main algo structure](#main-algorithm-structure). There is a part with creating children from parents, this part is the most intensive because it uses python loops, calls sampling, crossover and mutations at each iteration. Using `fill_children`, u can rewrite this logic in your manner to speed up.

Suppose u have new population matrix `pop` (type `np.float64`, shape `(population_size, dim_count)`) where first `parents_count` rows are selected parents, next rows are filled by random, so inside `fill_children` method u should fill last `population_size - parents_count` rows (children) by using some your logic. Expected (but not mandatory) logic like this:

```python
for k in range(self.parents_count, self.population_size, 2):

    r1, r2 = get_parents_inds()  # get 2 random parents indexes from [0, parents_count)

    pvar1 = pop[r1]
    pvar2 = pop[r2]

    ch1, ch2 = self.crossover(pvar1, pvar2)  # crossover

    # mutations
    ch1 = self.mut(ch1)
    ch2 = self.mut_middle(ch2, pvar1, pvar2)

    # put to population
    pop[k] = ch1
    pop[k+1] = ch2
```

**Example**. In one task I use this algorithm many times (100 000 generations total), so the speed matters. Every sample item is the index of element in other array there, so `i`th sample element is always integer value from cut `[0, end[i]]`. I use uniform crossover and uniform mutation (work perfect for this task). So I specified creating children logic for this task using cython.

Content of file `fill_children.pyx`:

```cython
#!python
#cython: language_level=3

import numpy as np

cimport numpy as np

np.import_array()

cimport cython

import math
import random

@cython.boundscheck(False)
@cython.wraparound(False)
def fill_children(
    np.ndarray[np.float64_t, ndim=2] pop,  # samples are integers but always float64 type
    int parents_count, # count of already done parents

    float mut_prob,  # mutation probability
    np.ndarray[np.uint8_t, ndim=1] ends  # max elements for each dimension (min elements are 0)
):

    cdef:
        Py_ssize_t i, k, population_size = pop.shape[0], dim_count = pop.shape[1], r1, r2

        float v1, v2, tmp
        np.ndarray[np.float64_t, ndim=1] cross, mut, mut_middle

    # making 2 children at each iteration
    for k in range(parents_count, population_size, 2): # C loop, not Python
        
        #
        # 2 random parents (fast implementation)
        #

        r1 = random.randrange(parents_count)
        r2 = random.randrange(parents_count)
        if r1 == r2:
            while r1 == r2:  # C loop!
               r2 = random.randrange(parents_count) 

        #
        # I always need these 3 random probs sequences, so the fastest way to obtain them is np.random.random
        #
        cross = np.random.random(dim_count)  # crossover probabilities for each dimension
        mut = np.random.random(dim_count)
        mut_middle = np.random.random(dim_count)

        for i in range(dim_count):  # C loop for each dimension
            v1 = pop[r1, i]  # first parent value
            v2 = pop[r2, i]  # second parent value

            if cross[i] < 0.5:  # random swap (uniform crossover), copy otherwise
                tmp = v2
                v2 = v1
                v1 = tmp

            if mut[i] < mut_prob:  # random mutation for first child
                # fastest way to get random integer from [0, ends[i]]
                # random.random() calls not always but only on mut[i] < mut_prob
                v1 = math.floor(random.random() * (ends[i] + 1))

            if mut_middle[i] < mut_prob: # mut_middle for second
                tmp = random.random()
                if v1 < v2:
                    v2 = v1 + math.floor(tmp * (v2 - v1 + 1))  # integer from [v1, v2], v1 < v2
                elif v1 > v2:
                    v2 = v2 + math.floor(tmp * (v1 - v2 + 1)) # integer from [v2, v1], v2 < v1
                else:
                    v2 = math.floor(tmp * (ends[i] + 1))

            #
            # put values to children in array
            #
            pop[k, i] = v1
            pop[k + 1, i] = v2

```

After compilation this file I can call it from python file to use inside GA:

```python

mut_prob = param['mutation_probability']

def fill_children(pop: array2D, parents_count: int):
    """wrapper on fill_children.fill_children with putting local variables mut_prob, ends"""
    return fill_children.fill_children(
        pop, parents_count, mut_prob, ends
    )

model.fill_children = fill_children

model.run(...)
```


# Examples pretty collection

## Optimization test functions

Here there is the implementation of `geneticalgorithm2` for some benchmark problems. Test functions are got from my [`OptimizationTestFunctions`](https://github.com/PasaOpasen/OptimizationTestFunctions) package. 

The code for optimizations process is same for each function and is contained [in file](examples/optimization_test_functions.py).

### [Sphere](https://github.com/PasaOpasen/OptimizationTestFunctions#sphere)
![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Sphere.png)
![](examples/output/opt_test_funcs/Optimization%20process%20for%20Sphere.png)

### [Ackley](https://github.com/PasaOpasen/OptimizationTestFunctions#ackley)
![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Ackley.png)
![](examples/output/opt_test_funcs/Optimization%20process%20for%20Ackley.png)

### [AckleyTest](https://github.com/PasaOpasen/OptimizationTestFunctions#ackleytest)
![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20AckleyTest.png)
![](examples/output/opt_test_funcs/Optimization%20process%20for%20AckleyTest.png)

### [Rosenbrock](https://github.com/PasaOpasen/OptimizationTestFunctions#rosenbrock)
![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Rosenbrock.png)
![](examples/output/opt_test_funcs/Optimization%20process%20for%20Rosenbrock.png)

### [Fletcher](https://github.com/PasaOpasen/OptimizationTestFunctions#fletcher)
![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Fletcher.png)
![](examples/output/opt_test_funcs/Optimization%20process%20for%20Fletcher.png)

### [Griewank](https://github.com/PasaOpasen/OptimizationTestFunctions#griewank)
![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Griewank.png)
![](examples/output/opt_test_funcs/Optimization%20process%20for%20Griewank.png)

### [Penalty2](https://github.com/PasaOpasen/OptimizationTestFunctions#penalty2)
![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Penalty2.png)
![](examples/output/opt_test_funcs/Optimization%20process%20for%20Penalty2.png)

### [Quartic](https://github.com/PasaOpasen/OptimizationTestFunctions#quartic)
![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Quartic.png)
![](examples/output/opt_test_funcs/Optimization%20process%20for%20Quartic.png)

### [Rastrigin](https://github.com/PasaOpasen/OptimizationTestFunctions#rastrigin)
![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Rastrigin.png)
![](examples/output/opt_test_funcs/Optimization%20process%20for%20Rastrigin.png)

### [SchwefelDouble](https://github.com/PasaOpasen/OptimizationTestFunctions#schwefeldouble)
![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20SchwefelDouble.png)
![](examples/output/opt_test_funcs/Optimization%20process%20for%20SchwefelDouble.png)

### [SchwefelMax](https://github.com/PasaOpasen/OptimizationTestFunctions#schwefelmax)
![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20SchwefelMax.png)
![](examples/output/opt_test_funcs/Optimization%20process%20for%20SchwefelMax.png)

### [SchwefelAbs](https://github.com/PasaOpasen/OptimizationTestFunctions#schwefelabs)
![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20SchwefelAbs.png)
![](examples/output/opt_test_funcs/Optimization%20process%20for%20SchwefelAbs.png)

### [SchwefelSin](https://github.com/PasaOpasen/OptimizationTestFunctions#schwefelsin)
![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20SchwefelSin.png)
![](examples/output/opt_test_funcs/Optimization%20process%20for%20SchwefelSin.png)

### [Stairs](https://github.com/PasaOpasen/OptimizationTestFunctions#stairs)
![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Stairs.png)
![](examples/output/opt_test_funcs/Optimization%20process%20for%20Stairs.png)

### [Abs](https://github.com/PasaOpasen/OptimizationTestFunctions#abs)
![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Abs.png)
![](examples/output/opt_test_funcs/Optimization%20process%20for%20Abs.png)

### [Michalewicz](https://github.com/PasaOpasen/OptimizationTestFunctions#michalewicz)
![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Michalewicz.png)
![](examples/output/opt_test_funcs/Optimization%20process%20for%20Michalewicz.png)

### [Scheffer](https://github.com/PasaOpasen/OptimizationTestFunctions#scheffer)
![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Scheffer.png)
![](examples/output/opt_test_funcs/Optimization%20process%20for%20Scheffer.png)

### [Eggholder](https://github.com/PasaOpasen/OptimizationTestFunctions#eggholder)
![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Eggholder.png)
![](examples/output/opt_test_funcs/Optimization%20process%20for%20Eggholder.png)

### [Weierstrass](https://github.com/PasaOpasen/OptimizationTestFunctions#weierstrass)
![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Weierstrass.png)
![](examples/output/opt_test_funcs/Optimization%20process%20for%20Weierstrass.png)



## Using GA in reinforcement learning

See [example of using GA optimization with keras neural networks](https://www.kaggle.com/demetrypascal/opengym-tasks-using-keras-and-geneticalgorithm2) for solving OpenGym tasks.

Better example is [OpenGym using cost2fitness and geneticalgorithm2](https://www.kaggle.com/demetrypascal/opengym-using-cost2fitness-and-geneticalgorithm2) where I use also my [cost2fitness](https://github.com/PasaOpasen/cost2fitness) package for fast forward propagation


## Using GA with image reconstruction by polygons

Links:
1. https://www.kaggle.com/demetrypascal/fork-of-imagereconstruction-with-geneticalgorithm2
2. https://www.kaggle.com/demetrypascal/imagereconstructionpolygons-with-geneticalgorithm2


# Popular questions

## How to disable autoplot?

Just use `no_plot = True` param in `run` method:

```python
model.run(no_plot = True)
```

If u want, u can plot results later by using

```python
model.plot_results()
```

Also u can create your pretty plots using `model.report` object (it's a list of values):

```python
re = np.array(model.report)

plt.plot(re)
plt.xlabel('Iteration')
plt.ylabel('Objective function')
plt.title('Genetic Algorithm')
plt.show()
```

## How to plot population scores?

There are 2 ways to plot of scores of population:
* use `plot_pop_scores(scores, title = 'Population scores', save_as = None)` function from `geneticalgorithm2` environment
* use `plot_generation_scores(self, title = 'Last generation scores', save_as = None)` method of `ga` object for plotting scores of last generation (yes, it's wrapper of previous function)

Let's check example:

```python
import numpy as np

from geneticalgorithm2 import GeneticAlgorithm2 as ga

from geneticalgorithm2 import plot_pop_scores  # for plotting scores without ga object


def f(X):
    return 50 * np.sum(X) - np.sum(np.sqrt(X) * np.sin(X))


dim = 25
varbound = [[0, 10]] * dim

# create start population
start_pop = np.random.uniform(0, 10, (50, dim))
# eval scores of start population
start_scores = np.array([f(start_pop[i]) for i in range(start_pop.shape[0])])

# plot start scores using plot_pop_scores function
plot_pop_scores(start_scores, title='Population scores before beginning of searching', save_as='plot_scores_start.png')

model = ga(function=f, dimension=dim, variable_type='real', variable_boundaries=varbound)
# run optimization process
model.run(no_plot=True,
          start_generation={
              'variables': start_pop,
              'scores': start_scores
          })
# plot and save optimization process plot
model.plot_results(save_as='plot_scores_process.png')

# plot scores of last population
model.plot_generation_scores(title='Population scores after ending of searching', save_as='plot_scores_end.png')
```
![](examples/output/plot_scores_start.png)
![](examples/output/plot_scores_process.png)
![](examples/output/plot_scores_end.png)



## How to specify evaluated function for all population?

U can do it using `set_function` parameter into `run()` method.

This function should get `numpy 2D-array` (samples x dimension) and return `1D-array` with results.

By default it uses `set_function = geneticalgorithm2.default_set_function(function)`, where

```python
    def default_set_function(function_for_set):
        def func(matrix):
            return np.array([function_for_set(matrix[i,:]) for i in range(matrix.shape[0])])
        return func
```
U may want to use it for creating some specific or fast-vectorized evaluations like here:

```python

def sigmoid(z):
    return 1/(1+np.exp(-z))

matrix = np.random.random((1000,100))

def vectorised(X):
    return sigmoid(matrix.dot(X))

model.run(set_function = vectorised)
```

## What about parallelism?

By using `set_function` u can determine your own behavior for parallelism or u can use `geneticalgorithm2.set_function_multiprocess(f, n_jobs = -1)` for using just parallelism (recommended for heavy functions and big populations, not recommended for fast functions and small populations).

For example:

```python
import numpy as np
from geneticalgorithm2 import GeneticAlgorithm2 as ga


def f(X):
    import math
    a = X[0]
    b = X[1]
    c = X[2]
    s = 0
    for i in range(10000):
        s += math.sin(a * i) + math.sin(b * i) + math.cos(c * i)

    return s


algorithm_param = {'max_num_iteration': 50,
                   'population_size': 100,
                   'mutation_probability': 0.1,
                   'elit_ratio': 0.01,
                   'parents_portion': 0.3,
                   'crossover_type': 'uniform',
                   'mutation_type': 'uniform_by_center',
                   'selection_type': 'roulette',
                   'max_iteration_without_improv': None}

varbound = np.array([[-10, 10]] * 3)

model = ga(function=f, dimension=3,
           variable_type='real',
           variable_boundaries=varbound,
           algorithm_parameters=algorithm_param)

########

%time
model.run()
# Wall time: 1min 52s

%time
model.run(set_function=ga.set_function_multiprocess(f, n_jobs=6))
# Wall time: 31.7 s
```

## How to initialize start population? How to continue optimization with new run?

For this there is `start_generation` parameter in `run()` method. It's the dictionary with structure like returned `model.output_dict['last_generation']`. Let's see example how can u to use it:

```python
import numpy as np
from geneticalgorithm2 import GeneticAlgorithm2 as ga


def f(X):
    return np.sum(X)


dim = 6

varbound = [(0, 10)] * dim

algorithm_param = {'max_num_iteration': 500,
                   'population_size': 100,
                   'mutation_probability': 0.1,
                   'elit_ratio': 0.01,
                   'parents_portion': 0.3,
                   'crossover_type': 'uniform',
                   'max_iteration_without_improv': None}

model = ga(function=f,
           dimension=dim,
           variable_type='real',
           variable_boundaries=varbound,
           algorithm_parameters=algorithm_param)

# start generation
# as u see u can use any values been valid for ur function
samples = np.random.uniform(0, 50, (300, dim))  # 300 is the new size of your generation

model.run(no_plot=False, start_generation={'variables': samples, 'scores': None})
# it's not necessary to evaluate scores before
# but u can do it if u have evaluated scores and don't wanna repeat calculations


# from version 6.3.0 it's recommended to use this form
from geneticalgorithm2 import Generation

model.run(no_plot=False, start_generation=Generation(variables=samples, scores=None))

# from version 6.4.0 u also can use these forms
model.run(no_plot=False, start_generation=samples)
model.run(no_plot=False, start_generation=(samples, None))

# if u have scores array, u can put it too
scores = np.array([f(sample) for sample in samples])
model.run(no_plot=False, start_generation=(samples, scores))

##
## after first run
## best value = 0.10426190111045064
##

# okay, let's continue optimization using saved last generation
model.run(no_plot=True, start_generation=model.output_dict['last_generation'])

##
## after second run
## best value = 0.06128462776296528
##

```

Also u can save and load populations using likely code:

```python
import numpy as np

from geneticalgorithm2 import GeneticAlgorithm2 as ga

from OptimizationTestFunctions import Eggholder

dim = 2 * 15

f = Eggholder(dim)

xmin, xmax, ymin, ymax = f.bounds

varbound = np.array([[xmin, xmax], [ymin, ymax]] * 15)

model = ga(function=f,
           dimension=dim,
           variable_type='real',
           variable_boundaries=varbound,
           algorithm_parameters={
               'max_num_iteration': 300,
               'population_size': 100
           })

# first run and save last generation to file
filename = "eggholder_lastgen.npz"
model.run(save_last_generation_as=filename)

# load start generation from file and run again (continue optimization)
model.run(start_generation=filename)
```



            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/PasaOpasen/geneticalgorithm2",
    "name": "geneticalgorithm2",
    "maintainer": "Demetry Pascal",
    "docs_url": null,
    "requires_python": null,
    "maintainer_email": null,
    "keywords": "solve, solver, equation, optimization, problem, genetic, algorithm, GA, easy, fast, genetic-algorithm, combinatorial, mixed, evolutionary",
    "author": "Demetry Pascal",
    "author_email": "qtckpuhdsa@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/e5/90/f74c4e4afc528fd34e36fac8f3ed36f23b52fc34d733ed1984c7697d8e36/geneticalgorithm2-6.9.2.tar.gz",
    "platform": null,
    "description": "[![PyPI\nversion](https://badge.fury.io/py/geneticalgorithm2.svg)](https://pypi.org/project/geneticalgorithm2/)\n[![Downloads](https://pepy.tech/badge/geneticalgorithm2)](https://pepy.tech/project/geneticalgorithm2)\n[![Downloads](https://pepy.tech/badge/geneticalgorithm2/month)](https://pepy.tech/project/geneticalgorithm2)\n[![Downloads](https://pepy.tech/badge/geneticalgorithm2/week)](https://pepy.tech/project/geneticalgorithm2)\n\n[![Gitter](https://badges.gitter.im/geneticalgorithm2/community.svg)](https://gitter.im/geneticalgorithm2/community?utm_source=badge&utm_medium=badge&utm_campaign=pr-badge)\n[![PRs Welcome](https://img.shields.io/badge/PRs-welcome-brightgreen.svg?style=flat-square)](https://github.com/PasaOpasen/geneticalgorithm2/pulls)\n\nhttps://pasaopasen.github.io/geneticalgorithm2/\n\n**geneticalgorithm2** (from [DPEA](https://github.com/PasaOpasen/PasaOpasen.github.io/blob/master/EA_packages.md)) **is the supported advanced optimized fork of non-supported package** [geneticalgorithm](https://github.com/rmsolgi/geneticalgorithm) of *Ryan (Mohammad) Solgi*\n\n- [About](#about)\n- [Installation](#installation)\n- [Updates information](#updates-information)\n  - [**Future**](#future)\n  - [**TODO firstly**](#todo-firstly)\n  - [6.9.2 bugfixes](#692-bugfixes)\n  - [6.9.1 refactor](#691-refactor)\n  - [6.9.0 reborn](#690-reborn)\n  - [6.8.7 minor update](#687-minor-update)\n  - [6.8.6 minor update](#686-minor-update)\n  - [6.8.5 minor update](#685-minor-update)\n  - [6.8.4 minor update](#684-minor-update)\n  - [6.8.3 types update](#683-types-update)\n  - [6.8.2 patch](#682-patch)\n  - [6.8.1 patch](#681-patch)\n  - [6.8.0 minor update](#680-minor-update)\n  - [6.7.7 refactor](#677-refactor)\n  - [6.7.6 bug fix](#676-bug-fix)\n  - [6.7.5 refactor](#675-refactor)\n  - [6.7.4 bug fix](#674-bug-fix)\n  - [6.7.3 speed up](#673-speed-up)\n  - [6.7.2 little update](#672-little-update)\n  - [6.7.1 patch](#671-patch)\n  - [6.7.0 minor update (new features)](#670-minor-update-new-features)\n  - [6.6.2 patch (speed up)](#662-patch-speed-up)\n  - [6.6.1 patch](#661-patch)\n  - [6.6.0 minor update (refactoring)](#660-minor-update-refactoring)\n  - [6.5.1 patch](#651-patch)\n  - [6.5.0 minor update (refactoring)](#650-minor-update-refactoring)\n  - [6.4.1 patch (bug fix)](#641-patch-bug-fix)\n  - [6.4.0 minor update (refactoring)](#640-minor-update-refactoring)\n  - [6.3.0 minor update (refactoring)](#630-minor-update-refactoring)\n- [Working process](#working-process)\n  - [Main algorithm structure](#main-algorithm-structure)\n  - [Optimization process components](#optimization-process-components)\n    - [Function to minimize](#function-to-minimize)\n    - [Optimization space](#optimization-space)\n    - [Algorithm parameters](#algorithm-parameters)\n    - [Samples constructors](#samples-constructors)\n    - [Callbacks](#callbacks)\n  - [How to run](#how-to-run)\n  - [Constructor parameters](#constructor-parameters)\n  - [Genetic algorithm's parameters](#genetic-algorithms-parameters)\n    - [AlgorithmParams object](#algorithmparams-object)\n    - [Parameters of algorithm](#parameters-of-algorithm)\n      - [**Crossover**](#crossover)\n      - [**Mutation**](#mutation)\n      - [**Selection**](#selection)\n  - [Methods and Properties of model](#methods-and-properties-of-model)\n- [Examples for beginner](#examples-for-beginner)\n  - [A minimal example](#a-minimal-example)\n  - [The simple example with integer variables](#the-simple-example-with-integer-variables)\n  - [The simple example with Boolean variables](#the-simple-example-with-boolean-variables)\n  - [The simple example with mixed variables](#the-simple-example-with-mixed-variables)\n  - [Optimization problems with constraints](#optimization-problems-with-constraints)\n  - [Middle example: select fixed count of objects from set](#middle-example-select-fixed-count-of-objects-from-set)\n- [U should know these features](#u-should-know-these-features)\n  - [Available crossovers](#available-crossovers)\n  - [Available selections](#available-selections)\n  - [Function timeout](#function-timeout)\n  - [Standard GA vs. Elitist GA](#standard-ga-vs-elitist-ga)\n  - [Standard crossover vs. stud EA crossover](#standard-crossover-vs-stud-ea-crossover)\n  - [Creating better start population](#creating-better-start-population)\n    - [Select best N of kN](#select-best-n-of-kn)\n    - [Do local optimization](#do-local-optimization)\n    - [Optimization with oppositions](#optimization-with-oppositions)\n  - [Revolutions](#revolutions)\n  - [Duplicates removing](#duplicates-removing)\n  - [Cache](#cache)\n  - [Report checker](#report-checker)\n  - [Middle callbacks](#middle-callbacks)\n  - [How to compare efficiency of several versions of GA optimization](#how-to-compare-efficiency-of-several-versions-of-ga-optimization)\n  - [Hints on how to adjust genetic algorithm's parameters (from `geneticalgorithm` package)](#hints-on-how-to-adjust-genetic-algorithms-parameters-from-geneticalgorithm-package)\n  - [How to get maximum speed](#how-to-get-maximum-speed)\n    - [Don't use plotting](#dont-use-plotting)\n    - [Don't use progress bar](#dont-use-progress-bar)\n    - [Try to use faster optimizing function](#try-to-use-faster-optimizing-function)\n    - [Specify custom optimized `mutation`, `crossover`, `selection`](#specify-custom-optimized-mutation-crossover-selection)\n    - [Specify `fill_children` method](#specify-fill_children-method)\n- [Examples pretty collection](#examples-pretty-collection)\n  - [Optimization test functions](#optimization-test-functions)\n    - [Sphere](#sphere)\n    - [Ackley](#ackley)\n    - [AckleyTest](#ackleytest)\n    - [Rosenbrock](#rosenbrock)\n    - [Fletcher](#fletcher)\n    - [Griewank](#griewank)\n    - [Penalty2](#penalty2)\n    - [Quartic](#quartic)\n    - [Rastrigin](#rastrigin)\n    - [SchwefelDouble](#schwefeldouble)\n    - [SchwefelMax](#schwefelmax)\n    - [SchwefelAbs](#schwefelabs)\n    - [SchwefelSin](#schwefelsin)\n    - [Stairs](#stairs)\n    - [Abs](#abs)\n    - [Michalewicz](#michalewicz)\n    - [Scheffer](#scheffer)\n    - [Eggholder](#eggholder)\n    - [Weierstrass](#weierstrass)\n  - [Using GA in reinforcement learning](#using-ga-in-reinforcement-learning)\n  - [Using GA with image reconstruction by polygons](#using-ga-with-image-reconstruction-by-polygons)\n- [Popular questions](#popular-questions)\n  - [How to disable autoplot?](#how-to-disable-autoplot)\n  - [How to plot population scores?](#how-to-plot-population-scores)\n  - [How to specify evaluated function for all population?](#how-to-specify-evaluated-function-for-all-population)\n  - [What about parallelism?](#what-about-parallelism)\n  - [How to initialize start population? How to continue optimization with new run?](#how-to-initialize-start-population-how-to-continue-optimization-with-new-run)\n\n# About\n\n[**geneticalgorithm2**](https://pasaopasen.github.io/geneticalgorithm2/) is very flexible and highly optimized Python library for implementing classic\n[genetic-algorithm](https://towardsdatascience.com/introduction-to-optimization-with-genetic-algorithm-2f5001d9964b) (GA).\n\nFeatures of this package:\n\n* written on **pure python**\n* **extremely fast**\n* **no hard dependencies** (only numpy primary, can work without matplotlib)\n* **easy to run**: no need to perform long task-setup process\n* easy to logging, reach **support of flexible callbacks**\n* **many built-in plotting functions**\n* **many built-in cases of crossover, mutation and selection**\n* support of integer, boolean and real (continuous/discrete) variables types\n* support of mixed types of variables\n* **support of classic, elitist and studEA genetic algorithm combinations**\n* **support of revolutions and duplicates utilization**\n* **reach support of customization**\n    \n# Installation\n\nInstall this package with standard light dependencies to use the base functional.\n```\npip install geneticalgorithm2\n```\n\nInstall this package with full dependencies to use all provided functional including plotting and built-in parallelism tools.\n\n```\npip install geneticalgorithm2[full]\n```\n\n# Updates information\n\n## **Future**\n\n- duplicates removing and revolutions will be moved to `MiddleCallbacks` and removed as alone `run()` parameters\n- new stop criteria callbacks (min std, max functions evaluations)\n- `vartype` will support strings like `iiiiibbf`\n\n## **TODO firstly**\n- Remove old style mentions from README\n\n## 6.9.2 bugfixes\n\n## 6.9.1 refactor\n\n- Finally move `function_timeout` and `function` to `run()` method and deprecate its usage in init()\n- `function` is not mandatory to be non-empty \n- reduce documentation duplicates\n\n## 6.9.0 reborn\n\n- recreate the repository without excess heavy files materials \n- host the [code documentation](https://pasaopasen.github.io/geneticalgorithm2/)\n- rename `geneticalgorithm2` class to `GeneticAlgorithm2`\n- substantial package architecture refactor\n- add more docstrings\n\n## 6.8.7 minor update\n\n- some code refactor\n- fixes:\n  - ensure the directory of generation file exists on save\n\n## 6.8.6 minor update\n\n- small package installation update: add `pip install geneticalgorithm2[full]` version\n- small refactor\n\n## 6.8.5 minor update\n\n- update `OppOpPopInit` `2.0.0->2.0.1`\n- set default `function_timeout` to `None` which means no use of function time checking\n- remove `joblib` and `func_timeout` from necessary dependencies\n\n## 6.8.4 minor update\n\n- a bit of refactor\n- little optimizations\n- add empty field `fill_children(pop_matrix, parents_count)` to `geneticalgorithm2` class to specify children creating behavior (what is the most intensive part of algorithm after optimizing func calculations), see [this](#specify-fill_children-method)\n\n## 6.8.3 types update\n\n- much more type hints\n\n## 6.8.2 patch\n\n- for printing info\n- fix logic: now population is always sorted before going to callbacks\n\n## 6.8.1 patch\n\n- printing progress bar to `'stderr'` or `'stdout'` or `None` (disable) by choice (`progress_bar_stream` argument of `run()`), deprecated `disable_progress_bar`\n- little speed up\n- new `geneticalgorithm2.vectorized_set_function` set function, which can be faster for big populations \n\n## 6.8.0 minor update\n\n- remove `crossover_probability` model parameter because of it has no sense to exist (and 1.0 value is better than others, take a look at [results](/examples/output/sense_of_crossover_prob__no_sense.png)). This parameter came from `geneticalgorithm` old package and did`t change before.\n\n## 6.7.7 refactor\n\n- change some behavior about parents selection\n\n## 6.7.6 bug fix\n\n- fix some bug of `variable_type=='bool'`\n- some refactor of progress bar\n- add some dependencies to `setup.py`\n\n## 6.7.5 refactor\n\n- shorter progress bar (length can be controlled by setting `PROGRESS_BAR_LEN` field of `geneticalgorithm2` class)\n- shorter logic of `run()`, more informative output \n\n## 6.7.4 bug fix\n\n- bug fix\n\n## 6.7.3 speed up\n\n- refactor to make `run()` method faster\n\n## 6.7.2 little update\n\n- better flexible logic for report, [take a look](#report-checker)\n- removed `show mean` parameter from `model.plot_result` and now model reports only best score by default, not average and so on (u can specify if u wanna report average too)\n- `plot_several_lines` useful function\n\n## 6.7.1 patch\n\n- changes according to new [OppOpPopInit](https://github.com/PasaOpasen/opp-op-pop-init) version\n\n## 6.7.0 minor update (new features)\n\n- add `mutation_discrete_type` and `mutation_discrete_probability` parameters in model. It controls mutation behavior for discrete (integer) variables and works like `mutation_type` and `mutation_probability` work for continuous (real) variables. Take a look at [algorithm parameters](#parameters-of-algorithm) \n\n## 6.6.2 patch (speed up)\n\n- fix and speed up mutation\n\n## 6.6.1 patch\n\n- removed unnecessary dependencies\n\n## 6.6.0 minor update (refactoring)\n\n- deprecated `variable_type_mixed`, now use `variable_type` for mixed optimization too\n- deprecated `output_dict`, now it's better object with name `result`\n- refactor of big part of **tests**\n- refactor of README\n\n## 6.5.1 patch\n\n- replace `collections.Sequence` with `collections.abc.Sequence`, now it should work for `python3.10+`\n\n## 6.5.0 minor update (refactoring)\n\n- another form of data object using with middle callbacks (`MiddleCallbackData` dataclass instead of dictionary)\n- type hints for callbacks module \n\n## 6.4.1 patch (bug fix)\n\n- fix bug setting attribute to algorithm parameters (in middle callbacks) \n\n\n## 6.4.0 minor update (refactoring)\n\n- new valid forms for `start_generation`; now it's valid to use\n    * `None`\n    * `str` path to saved generation\n    * dictionary with structure `{'variables': variables/None, 'scores': scores/None}`\n    * `Generation` object: `Generation(variables = variables, scores = scores)`\n    * `np.ndarray` with shape `(samples, dim)` for only population or `(samples, dim+1)` for concatenated population and score (scores is the last matrix column)\n    * `tuple(np.ndarray/None, np.ndarray/None)` for variables and scores\n  \n  here `variables` is 2D numpy array with shape `(samples, dim)`, `scores` is 1D numpy array with scores (function values) for each sample; [here](examples/output/start_gen.py) and [here](#how-to-initialize-start-population-how-to-continue-optimization-with-new-run) u can see examples of using these valid forms \n\n\n## 6.3.0 minor update (refactoring)\n\n- type hints for entire part of functions\n- new valid forms for function parameters (now u don't need to use numpy arrays everywhere)\n- `AlgorithmParams` class for base GA algorithm parameters (instead of dictionary)\n- `Generation` class for saving/loading/returning generation (instead of dictionary)\n\nAll that classes are collected [in file](geneticalgorithm2/data_types/classes.py). To maintain backward compatibility, `AlgorithmParams` and `Generation` classes have dictionary-like interface for getting fields: u can use `object.field` or `object['field']` notations.\n\n\n# Working process\n\n## Main algorithm structure\n\n```\nPre-process: making inner functions depends on params, making/loading start population\n\nwhile True:\n\n    if reason to stop (time is elapsed / no progress / generation count is reached / min value is reached):\n        break\n\n\n    select parents to crossover from last population and put them to new population:\n        select (elit count) best samples\n        select (parents count - elit count) random samples (by selection function)\n\n    create (total samples count - parents count) children (samples from selected parents) and put them to new population:\n        while not all children are created:\n            select 2 random parents\n            make child1, child2 from them using crossover\n            mutate child1 by mutation (model.mut)\n            mutate child2 by middle mutation (model.mut_middle)\n            put children to new population\n    \n    remove duplicates, make revolutions, sort population by scores\n    use callbacks, use middle callbacks\n\nPost-process: plotting results, saving\n\n```\n\n## Optimization process components\n\n### Function to minimize\n\nThe goal of the optimization process is to find the *minimum* of the given `function (1D array) -> float` where the function argument is a vector of some values in different dimensions. \n\nIf u want to find the *maximum*, use this idea:\n```python\nopt_func = lambda arr: -func(arr)\n\n#\n# ... find global min of opt_func\n#\n\nopt_minimum=opt_func(best value)\nmaximum = -opt_minimum\n```\n\nAlso it is possible and highly recommended to create and use a *vectorized* version of this function called `set_function (2D array) -> (1D array)` which transforms several samples matrix to samples scores vector by one call. Using this way u can speed up calculations or set up more complex tasks optimization\n\n### Optimization space\n\nThe function rates 1D arrays (vectors) where each component (dimension) means something u program it to mean. Each dimension has its `bound` (`[min; max]` cut) and `variable type` (real/discrete).\n\n**Advice**. Genetic algorithms work much faster and efficient for discrete tasks. If high precision is not required u can split any real dimension to many discrete values (for instance, `[1.1, 1.2, 1.25, 1.44]`) and try to optimize indexes of the given array which are converted to real values inside `function` itself.\n\n### Algorithm parameters\n\nThere are a number of hyperparameters u can probe to optimize including population size and selection/crossover/mutation types.\n\n### Samples constructors\n\nThere are several ways to create new testing samples from zero when u start with empty population or when u need new samples after [duplicates removing](#duplicates-removing) and [revolutions](#revolutions). \n\n\n### Callbacks\n\nNow the package supports 2 different types of highly customized callbacks:\n* [simple callbacks](#callbacks)\n* [middle callbacks](#middle-callbacks)\n\n## How to run\n\nFirstly, u should **import needed packages**. \n\nAll available (but not always necessary) imports are:\n\n```python\nimport numpy as np\n\n# the only one required import\nfrom geneticalgorithm2 import GeneticAlgorithm2 as ga  # for creating and running optimization model\n\nfrom geneticalgorithm2 import Generation, AlgorithmParams  # classes for comfortable parameters setting and getting\n\nfrom geneticalgorithm2 import Crossover, Mutations, Selection  # classes for specific mutation and crossover behavior\n\nfrom geneticalgorithm2 import get_population_initializer  # for creating better start population\n\nfrom geneticalgorithm2 import np_lru_cache  # for cache function (if u want)\n\nfrom geneticalgorithm2 import plot_pop_scores  # for plotting population scores, if u want\n\nfrom geneticalgorithm2 import Callbacks  # simple callbacks (will be deprecated)\n\nfrom geneticalgorithm2 import Actions, ActionConditions, MiddleCallbacks  # middle callbacks\n```\n\nNext step: **define the function to minimize**:\n\n```python\ndef function(X: np.ndarray) -> float: # X as 1d-numpy array\n    return np.sum(X**2) + X.mean() + X.min() + X[0]*X[2] # some float result\n```\n\nAlso u should **create the bounds for each variable** (if exist) such as:\n\n```python\nvar_bound = np.array([[0,10]]*3) # 2D numpy array with shape (dim, 2)\n\n# also u can use Sequence of Tuples (from version 6.3.0)\nvar_bound = [\n    (0, 10),\n    (0, 10),\n    (0, 10)\n]\n```\n\n**Important**. U don't need to use variable boundaries only if variable type of each variable is boolean. This case will be automatically converted to discrete variables with bounds `(0, 1)`.\n\nAfter that u **create a `GeneticAlgorithm2` (was imported early as ga) object**:\n\n```python\nmodel = ga( \n    dimension = 3, \n    variable_type='real', \n    variable_boundaries = var_bound,\n    algorithm_parameters={\n        'max_num_iteration': None,\n        'population_size':100,\n        'mutation_probability': 0.1,\n        'mutation_discrete_probability': None,\n        'elit_ratio': 0.01,\n        'parents_portion': 0.3,\n        'crossover_type':'uniform',\n        'mutation_type': 'uniform_by_center',\n        'mutation_discrete_type': 'uniform_discrete',\n        'selection_type': 'roulette',\n        'max_iteration_without_improv':None\n    }\n)\n```\n\n**Note**: it is not mandatory to write all possible `algorithm_parameters`, here it is done only to show u defaults. Also u can use `AlgorithmParams` (with typehints and docstrings) class instead of dicts:\n\n```python\nalgorithm_parameters=AlgorithmParams(\n    max_num_iteration=None,\n    population_size=100,\n    mutation_probability=0.1,\n    mutation_discrete_probability=None,\n    elit_ratio=0.01,\n    parents_portion=0.3,\n    crossover_type='uniform',\n    mutation_type='uniform_by_center',\n    mutation_discrete_type='uniform_discrete',\n    selection_type='roulette',\n    max_iteration_without_improv=None\n)\n```\n\n**Run the search method**:\n\n```python\n# all of this parameters are default\nresult = model.run(\n    no_plot = False, \n    progress_bar_stream = 'stdout',\n    disable_printing = False,\n\n    function=function,\n    function_timeout=None,\n\n    set_function = None, \n    apply_function_to_parents = False, \n    start_generation = None,\n    studEA = False,\n    mutation_indexes = None,\n\n    init_creator = None,\n    init_oppositors = None,\n    duplicates_oppositor = None,\n    remove_duplicates_generation_step = None,\n    revolution_oppositor = None,\n    revolution_after_stagnation_step = None,\n    revolution_part = 0.3,\n    \n    population_initializer = Population_initializer(select_best_of = 1, local_optimization_step = 'never', local_optimizer = None),\n    \n    stop_when_reached = None,\n    callbacks = [],\n    middle_callbacks = [],\n    time_limit_secs = None, \n    save_last_generation_as = None,\n    seed = None\n)\n\n# best candidate\nprint(result.variable)\n\n# best score\nprint(result.score)\n\n# last generation\nprint(result.last_generation)\n\n```\n\n## Constructor parameters\n\nHave a look at https://pasaopasen.github.io/geneticalgorithm2/geneticalgorithm2/geneticalgorithm2.html#GeneticAlgorithm2.__init__\n\n## Genetic algorithm's parameters\n\n### AlgorithmParams object\n\nThe parameters of GA is defined as a dictionary or `AlgorithmParams` object: https://pasaopasen.github.io/geneticalgorithm2/geneticalgorithm2/data_types/algorithm_params.html\n\nTo get the global default params use code:\n```python\nparams = ga.default_params\n```\n\nTo get actual parameters of an existing model use code:\n```python\nparams = model.param\n```\n\n### Parameters of algorithm\n\n#### **Crossover**\n\nhttps://pasaopasen.github.io/geneticalgorithm2/geneticalgorithm2/crossovers.html\n\n#### **Mutation**\n\nhttps://pasaopasen.github.io/geneticalgorithm2/geneticalgorithm2/mutations.html\n\n#### **Selection**\n\nhttps://pasaopasen.github.io/geneticalgorithm2/geneticalgorithm2/selections.html\n\n## Methods and Properties of model\n\nHave a look at https://pasaopasen.github.io/geneticalgorithm2/geneticalgorithm2/geneticalgorithm2.html#GeneticAlgorithm2.run\n\n# Examples for beginner\n\n## A minimal example \n\nAssume we want to find a set of `X = (x1, x2, x3)` that minimizes function `f(X) = x1 + x2 + x3` where `X` can be any real number in `[0, 10]`.\n\nThis is a trivial problem and we already know that the answer is `X = (0, 0, 0)` where `f(X) = 0`.  \n\nWe just use this simple example to show how to implement it with `geneticalgorithm2`. First we import `geneticalgorithm2` and [numpy](https://numpy.org). Next, we define \nfunction `f` which we want to minimize and the boundaries of the decision variables. Then simply `geneticalgorithm2` is called to solve the defined optimization problem as follows:\n\n```python\nimport numpy as np\nfrom geneticalgorithm2 import GeneticAlgorithm2 as ga\n\n\ndef f(X):\n    return np.sum(X)\n\n\nvarbound = [[0, 10]] * 3\n\nmodel = ga(dimension=3, variable_type='real', variable_boundaries=varbound)\n\nmodel.run(function=f)\n```\n  \nIf you run the code, you should see a progress bar that shows the progress of the \ngenetic algorithm (GA) and then the solution, objective function value and the convergence curve as follows:\n\n![](https://github.com/PasaOpasen/geneticalgorithm2/blob/master/genetic_algorithm_convergence.gif)\n\nAlso we can access to the best answer of the defined optimization problem found by GA as a dictionary and a report of the progress of the genetic algorithm. \nTo do so we complete the code as follows:\n\n```python\nconvergence = model.report\n\nsolution = model.result\n```\n\n## The simple example with integer variables\n\nConsidering the problem given in the simple example above.\nNow assume all variables are integers. So `x1, x2, x3` can be any integers in `[0, 10]`.\nIn this case the code is as the following:\n\n```python\nimport numpy as np\nfrom geneticalgorithm2 import GeneticAlgorithm2 as ga\n\n\ndef f(X):\n    return np.sum(X)\n\nvarbound = [[0, 10]] * 3\n\nmodel = ga(dimension=3, variable_type='int', variable_boundaries=varbound)\n\nmodel.run(function=f)\n```\nSo, as it is seen the only difference is that for `variable_type` we use string `'int'`. \n\n## The simple example with Boolean variables\n\nConsidering the problem given in the simple example above.\nNow assume all variables are boolean instead of real or integer. So `X` can be either zero or one. Also instead of three let's have 30 variables.\nIn this case the code is as the following:\n\n```python\nimport numpy as np\nfrom geneticalgorithm2 import GeneticAlgorithm2 as ga\n\n\ndef f(X):\n    return np.sum(X)\n\nmodel = ga(dimension=30, variable_type='bool')\n\nmodel.run(function=f)\n```\nNote for variable_type we use string `'bool'` when all variables are boolean.  \nNote that when variable_type equal `'bool'` there is no need for `variable_boundaries` to be defined.\n\n## The simple example with mixed variables\n\nConsidering the problem given in the the simple example above where we want to minimize `f(X) = x1 + x2 + x3`. \nNow assume `x1` is a real (continuous) variable in `[0.5; 1.5]`, `x2` is an integer variable in `[1;100]`, and `x3` is a boolean variable that can be either zero or one.\nWe already know that the answer is `X = (0.5, 1, 0)` where `f(X) = 1.5`.\nWe implement geneticalgorithm2 as the following:\n\n```python\nimport numpy as np\nfrom geneticalgorithm2 import GeneticAlgorithm2 as ga\n\n\ndef f(X):\n    return np.sum(X)\n\n\nvarbound = [[0.5, 1.5], [1, 100], [0, 1]]\nvartype = ('real', 'int', 'int')\nmodel = ga(dimension=3, variable_type=vartype, variable_boundaries=varbound)\n\nmodel.run(function=f)\n```\n\n## Optimization problems with constraints\n\nIn all above examples, the optimization problem was unconstrained. Now consider that we want to minimize `f(X) = x1+x2+x3` where `X` is a set of real variables in `[0; 10]`. Also we have an extra constraint so that sum of `x1` and `x2` is equal or greater than 2. The minimum of `f(X)` is 2.\nIn such a case, a trick is to define penalty function. Hence we use the code below:\n\n```python\nimport numpy as np\nfrom geneticalgorithm2 import GeneticAlgorithm2 as ga\n\n\ndef f(X):\n    pen = 0\n    if X[0] + X[1] < 2:\n        pen = 500 + 1000 * (2 - X[0] - X[1])\n    return np.sum(X) + pen\n\n\nvarbound = [[0, 10]] * 3\n\nmodel = ga(dimension=3, variable_type='real', variable_boundaries=varbound)\n\nmodel.run(function=f)\n\n```\nAs seen above we add a penalty to the objective function whenever the constraint is not met.  \n\nSome hints about how to define a penalty function:  \n\n1. Usually you may use a constant greater than the maximum possible value of the objective function if the maximum is known or if we have a guess of that. Here the highest possible value of our function is 300 (i.e. if all variables were 10, `f(X)=300`). So I chose a constant of 500. So, if a trial solution is not in the feasible region even though its objective function may be small, the penalized objective function (fitness function) is worse than any feasible solution.\n2. Use a coefficient big enough and multiply that by the amount of violation. This helps the algorithm learn how to approach feasible domain.\n3. How to define penalty function usually influences the convergence rate of an evolutionary algorithm. In my [book on metaheuristics and evolutionary algorithms](https://www.wiley.com/en-us/Meta+heuristic+and+Evolutionary+Algorithms+for+Engineering+Optimization-p-9781119386995) you can learn more about that. \n4. Finally after you solved the problem test the solution to see if boundaries are met. If the solution does not meet constraints, it shows that a bigger penalty is required. However, in problems where optimum is exactly on the boundary of the feasible region (or very close to the constraints) which is common in some kinds of problems, a very strict and big penalty may prevent the genetic algorithm to approach the optimal region. In such a case designing an appropriate penalty function might be more challenging. Actually what we have to do is to design a penalty function that let the algorithm searches unfeasible domain while finally converge to a feasible solution. Hence you may need more sophisticated penalty functions. But in most cases the above formulation work fairly well.\n\n## Middle example: select fixed count of objects from set\n\nFor some task u need to think a lot and create good specific crossover or mutation functions. For example, take a look at this problem:\n```\n    From set like X = {x1, x2, x3, ..., xn} u should select only k objects which get the best function value\n```\nU can do it using this code:\n\n```python\nimport numpy as np\nfrom geneticalgorithm2 import GeneticAlgorithm2 as ga\n\nsubset_size = 20  # how many objects we can choose\n\nobjects_count = 100  # how many objects are in set\n\nmy_set = np.random.random(objects_count) * 10 - 5  # set values\n\n\n# minimized function\ndef f(X):\n    return abs(np.mean(my_set[X == 1]) - np.median(my_set[X == 1]))\n\n\n# initialize start generation and params\n\nN = 1000  # size of population\nstart_generation = np.zeros((N, objects_count))\nindexes = np.arange(0, objects_count, dtype=np.int8)  # indexes of variables\n\nfor i in range(N):\n    inds = np.random.choice(indexes, subset_size, replace=False)\n    start_generation[i, inds] = 1\n\n\ndef my_crossover(parent_a, parent_b):\n    a_indexes = set(indexes[parent_a == 1])\n    b_indexes = set(indexes[parent_b == 1])\n\n    intersect = a_indexes.intersection(b_indexes)  # elements in both parents\n    a_only = a_indexes - intersect  # elements only in 'a' parent\n    b_only = b_indexes - intersect\n\n    child_inds = np.array(list(a_only) + list(b_only), dtype=np.int8)\n    np.random.shuffle(child_inds)  # mix\n\n    children = np.zeros((2, parent_a.size))\n    if intersect:\n        children[:, np.array(list(intersect))] = 1\n    children[0, child_inds[:int(child_inds.size / 2)]] = 1\n    children[1, child_inds[int(child_inds.size / 2):]] = 1\n\n    return children[0, :], children[1, :]\n\n\nmodel = ga(\n    dimension=objects_count,\n    variable_type='bool',\n    algorithm_parameters={\n        'max_num_iteration': 500,\n        'mutation_probability': 0,  # no mutation, just crossover\n        'elit_ratio': 0.05,\n        'parents_portion': 0.3,\n        'crossover_type': my_crossover,\n        'max_iteration_without_improv': 20\n    }\n)\n\nmodel.run(\n    function=f,\n    no_plot=False, \n    start_generation=(start_generation, None)\n)\n```\n\n# U should know these features\n\n## Available crossovers\n\nFor two example parents (*one with ones* and *one with zeros*) next crossovers will give same children ([examples](examples/crossovers_examples.py)): \n\n* **one_point**:\n\n|0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1|\n|:---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---:|\n|1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0|\n\n* **two_point**:\n\n|1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1|\n|:---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---:|\n|0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0|\n\n* **uniform**:\n\n|1 | 1 | 1 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0|\n|:---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---:|\n|0 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1|\n\n* **uniform_window**:\n\n|1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 1 | 1|\n|:---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---:|\n|0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 0 | 0 | 0|\n\n* **shuffle**:\n\n|0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | 1 | 0 | 1 | 0|\n|:---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---:|\n|1 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 1|\n\n* **segment**:\n\n|0 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 1|\n|:---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---:|\n|1 | 0 | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 1 | 0 | 0|\n\n* **arithmetic**:\n\n|0.13 | 0.13 | 0.13 | 0.13 | 0.13 | 0.13 | 0.13 | 0.13 | 0.13 | 0.13 | 0.13 | 0.13 | 0.13 | 0.13 | 0.13|\n|:---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---:|\n|0.87 | 0.87 | 0.87 | 0.87 | 0.87 | 0.87 | 0.87 | 0.87 | 0.87 | 0.87 | 0.87 | 0.87 | 0.87 | 0.87 | 0.87|\n\n* **mixed**:\n\n|0.63 | 0.84 | 1.1 | 0.73 | 0.67 | -0.19 | 0.3 | 0.72 | -0.18 | 0.61 | 0.84 | 1.14 | 1.36 | -0.37 | -0.19|\n|:---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---:|\n|0.51 | 0.58 | 0.43 | 0.42 | 0.55 | 0.49 | 0.57 | 0.48 | 0.46 | 0.56 | 0.56 | 0.54 | 0.44 | 0.51 | 0.4|\n\n## Available selections\n\n![](examples/output/selections.png)\n\n\n## Function timeout\n\n**geneticalgorithm2** inherited several features from `geneticalgorithm` package sush as that if the given function does not provide any output before timeout, the algorithm would be terminated and raise the appropriate error. \n\nIn such a case make sure the given function works correctly (i.e. there is no infinite loop in the given function). Also if the given function takes more than 10 seconds to complete the work\nmake sure to increase `function_timeout` in arguments.\n\n## Standard GA vs. Elitist GA\n\nThe convergence curve of an elitist genetic algorithm is always non-increasing. So, the best ever found solution is equal to the best solution of the last iteration. However, the convergence curve of a standard genetic algorithm is different. If `elit_ratio` is zero geneticalgorithm2 implements a standard GA. The output of geneticalgorithm2 for standard GA is the best ever found solution not the solution of the last iteration. The difference between the convergence curve of standard GA and elitist GA is shown below:\n\n![](examples/output/standard_vs_elitist.png)\n\n## Standard crossover vs. stud EA crossover\n\n[Stud EA](https://link.springer.com/chapter/10.1007%2FBFb0056910) is the idea of using crossover always with best object. So one of two parents is always the best object of population. It can help us in a lot of tasks!\n\n![](examples/output/studEA.png)\n\n## Creating better start population\n\nThere is `get_population_initializer(select_best_of = 4, local_optimization_step = 'never', local_optimizer = None)` function for creating start population creators. Take a look at [its docs](https://pasaopasen.github.io/geneticalgorithm2/geneticalgorithm2/population_initializer.html)\n\n### Select best N of kN\n\nThis little option can help u especially with multimodal tasks. \n\n![](examples/output/init_best_of.png)\n\n### Do local optimization\n\nWe can apply some local optimization on start generation before starting GA search. It can be some gradient descent or hill climbing and so on. Also we can apply it before selection best objects (on entire population) or after (on best part of population) and so forth.\n\nIn next example I'm using my [DiscreteHillClimbing](https://github.com/PasaOpasen/DiscreteHillClimbing) algorithm for local optimization my discrete task:\n\n```python\nimport numpy as np\nimport matplotlib.pyplot as plt\n\nfrom DiscreteHillClimbing import Hill_Climbing_descent\n\nfrom geneticalgorithm2 import GeneticAlgorithm2 as ga\nfrom geneticalgorithm2 import get_population_initializer\n\n\ndef f(arr):\n    arr2 = arr / 25\n    return -np.sum(arr2 * np.sin(np.sqrt(np.abs(arr2)))) ** 5 + np.sum(np.abs(arr2)) ** 2\n\n\niterations = 100\n\nvarbound = [[-100, 100]] * 15\n\navailable_values = [np.arange(-100, 101)] * 15\n\nmy_local_optimizer = lambda arr, score: Hill_Climbing_descent(\n    function=f, available_predictors_values=available_values,\n    max_function_evals=50, start_solution=arr\n)\n\nmodel = ga(\n    dimension=varbound.shape[0],\n    variable_type='int',\n    variable_boundaries=varbound,\n    algorithm_parameters={\n        'max_num_iteration': iterations,\n        'population_size': 400\n    }\n)\n\nfor time in ('before_select', 'after_select', 'never'):\n    model.run(\n        function=f\n        no_plot=True,\n        population_initializer=get_population_initializer(\n            select_best_of=3,\n            local_optimization_step=time,\n            local_optimizer=my_local_optimizer\n        )\n    )\n\n    plt.plot(model.report, label=f\"local optimization time = '{time}'\")\n\nplt.xlabel('Generation')\nplt.ylabel('Minimized function (40 simulations average)')\nplt.title('Selection best N object before running GA')\nplt.legend()\n```\n\n![](examples/output/init_local_opt.png)\n\n### Optimization with oppositions\n\nAlso u can create start population with [oppositions](https://github.com/PasaOpasen/opp-op-pop-init). See [example of code](examples/best_of_N_with_opp.py)\n\n![](examples/output/init_best_of_opp.png)\n\n## Revolutions\n\nU can create [revolutions in your population](https://github.com/PasaOpasen/opp-op-pop-init) after some stagnation steps. It really can help u for some tasks. See [example](examples/revolution.py)\n\n![](examples/output/revolution.png)\n\n\n## Duplicates removing\n\nIf u remove duplicates each `k` generations, u can speed up the optimization process ([example](examples/remove_dups.py))\n\n![](examples/output/remove_dups.png)\n\n## Cache\n\nIt can be useful for run-speed to use cache with *some discrete tasks*. For this u can import `np_lru_cache` decorator and use it like here:\n\n```python\nimport np_lru_cache\n\n@np_lru_cache(maxsize=some_size)\ndef minimized_func(arr):\n    # code\n    return result\n\n#\n# run\n#    algorithm\n#\n\n\n# don't forget to clear cache\nminimized_func.cache_clear()\n```\n## Report checker\n\nBasically the model checks best population score (minimal score of generation) each generation and saves it to `report` field. Actually this sequence of numbers u see in big part of plots. This behavior is needed for several parts and u cannot disable it. But if u want to report some other metric without using [callbacks](#middle-callbacks), there is highly simple and fast way.\n\nAfter creating `model` but before running `run()` u need to append ur logic to `model.checked_reports` field. Take a look at example:\n\n```python\nimport numpy as np\n\nfrom geneticalgorithm2 import GeneticAlgorithm2 as ga\nfrom geneticalgorithm2 import plot_several_lines\n\n\ndef f(X):\n    return 50 * np.sum(X) - np.sum(np.sqrt(X) * np.sin(X))\n\n\ndim = 25\nvarbound = [[0, 10]] * dim\n\nmodel = ga(\n    dimension=dim,\n    variable_type='real', \n    variable_boundaries=varbound,\n    algorithm_parameters={\n        'max_num_iteration': 600\n    }\n)\n\n# here model exists and has checked_reports field\n# now u can append any functions to report\n\nmodel.checked_reports.extend(\n    [\n        ('report_average', np.mean),\n        ('report_25', lambda arr: np.quantile(arr, 0.25)),\n        ('report_50', np.median)\n    ]\n)\n\n# run optimization process\nmodel.run(\n    function=f,\n    no_plot=False\n)\n\n# now u have not only model.report but model.report_25 and so on\n\n# plot reports\nnames = [name for name, _ in model.checked_reports[::-1]]\nplot_several_lines(\n    lines=[getattr(model, name) for name in names],\n    colors=('green', 'black', 'red', 'blue'),\n    labels=['median value', '25% quantile', 'mean of population', 'best pop score'],\n    linewidths=(1, 1.5, 1, 2),\n    title=\"Several custom reports with base reports\",\n    save_as='./output/report.png'\n)\n```\n\n![](examples/output/report.png)\n\nAs u see, u should append tuple `(name of report, func to evaluate report)` to `model.checked_report`. It's highly recommended to start this name with `report_` (e. g. `report_my_median`). And the function u use will get 1D-numpy *sorted* array of population scores.\n\n\n## Middle callbacks\n\nThere is an amazing way to control optimization process using `MiddleCallbacks` class. Just learn next logic:\n\n1. u can use several `MiddleCallbacks` callbacks as list at `middle_callbacks` parameter in `run()` method\n2. each middle callback is the pair of `action` and `condition` functions\n3. `condition(data)` (`Callable[[MiddleCallbackData], bool]`) function gets `data` object (dataclass `MiddleCallbackData` from version 6.5.0) about primary model parameters and makes logical decision about applying `action` function\n4. `action(data)` (`Callable[[MiddleCallbackData],MiddleCallbackData]`) function modifies `data` objects as u need -- and model will be modified by new `data`\n5. `data` object is the structure with several parameters u can modify:\n   ```python\n    data = MiddleCallbackData(\n        last_generation=Generation.from_pop_matrix(pop),\n        current_generation=t,\n        report_list=self.report,\n\n        mutation_prob=self.prob_mut,\n        crossover_prob=self.prob_cross,\n        mutation=self.real_mutation,\n        crossover=self.crossover,\n        selection=self.selection,\n\n        current_stagnation=counter,\n        max_stagnation=self.max_stagnations,\n\n        parents_portion=self.param.parents_portion,\n        elit_ratio=self.param.elit_ratio,\n\n        set_function=self.set_function\n    )\n   ```  \n   So, the `action` function gets `data` objects and returns `data` object.\n\nIt's very simple to create your own `action` and `condition` functions. But there are several popular functions contained in `Actions` and `ActionConditions` classes:\n* `actions`:\n  * `Stop()` -- just stop optimization process\n  * `ReduceMutationProb(reduce_coef = 0.9)` -- reduce mutation probability\n  * `ChangeRandomCrossover(available_crossovers: Sequence[Callable[[np.ndarray, np.ndarray], Tuple[np.ndarray, np.ndarray]]])` -- change another (random) crossover from list of crossovers\n  * `ChangeRandomSelection(available_selections: Sequence[Callable[[np.ndarray, int], np.ndarray]])`\n  * `ChangeRandomMutation(available_mutations: Sequence[Callable[[float, float, float], float]])`\n  * `RemoveDuplicates(oppositor = None, creator = None, converter = None)`; see [doc](geneticalgorithm2/callbacks/middle.py)\n  * `CopyBest(by_indexes)` -- copies best population object values (from dimensions in `by_indexes`) to all population\n  * `PlotPopulationScores(title_pattern = lambda data: f\"Generation {data['current_generation']}\", save_as_name_pattern = None)` -- plot population scores; needs 2 functions like `data`->string for title and file name (to save)\n* `conditions`:\n  * `ActionConditions.EachGen(generation_step = 10)` -- do action each `generation_step` generations\n  * `ActionConditions.Always()` do action each generations, equals to `ActionConditions.EachGen(1)`\n  * `ActionConditions.AfterStagnation(stagnation_generations = 50)` -- do action after `stagnation_generations` stagnation generations\n  * `ActionConditions.Several(list_of_conditions)` -- do action if all conditions in list are true\n\nTo combine `action` and `condition` to callback, just use `MiddleCallbacks.UniversalCallback(action, condition)` methods.\n\n\nThere are also next high-level useful callbacks:\n\n* `MiddleCallbacks.ReduceMutationGen(reduce_coef = 0.9, min_mutation = 0.005, reduce_each_generation = 50, reload_each_generation = 500)`\n* `MiddleCallbacks.GeneDiversityStats(step_generations_for_plotting:int = 10)` -- plots some duplicates statistics each gen ([example](/examples/plot_diversities.py))\n![](diversity.gif)\n\n\nSee [code example](examples/small_middle_callbacks.py)\n\n## How to compare efficiency of several versions of GA optimization\n\nTo compare efficiency of several versions of GA optimization (such as several values of several hyperparameters or including/excepting some actions like oppositions) u should make some count of simulations and compare results using some statistical test. I have realized this logic [here](https://github.com/PasaOpasen/ab-testing-results-difference) \n\n## Hints on how to adjust genetic algorithm's parameters (from `geneticalgorithm` package)\n\nIn general the performance of a genetic algorithm or any evolutionary algorithm\ndepends on its parameters. Parameter setting of an evolutionary algorithm is important. Usually these parameters are adjusted based on experience and by conducting a sensitivity analysis.\nIt is impossible to provide a general guideline to parameter setting but the suggestions provided below may help:  \n\n* **Number of iterations**: Select a `max_num_iterations` sufficiently large; otherwise the reported solution may not be satisfactory. On the other hand \nselecting a very large number of iterations increases the run time significantly. So this is actually a compromise between\nthe accuracy you want and the time and computational cost you spend. \n\n* **Population size**: Given a constant number of functional evaluations (`max_num_iterations` times population_size) I would select smaller population size and greater iterations. However, a very small choice of population size is also deteriorative. For most problems I would select a population size of 100 unless the dimension of the problem is very large that needs a bigger population size.\n\n* **elit_ratio**: Although having few elites is usually a good idea and may increase the rate of convergence in some problems, having too many elites in the population may cause the algorithm to easily trap in a local optima. I would usually select only one elite in most cases. Elitism is not always necessary and in some problems may even be deteriorative.\n\n* **mutation_probability**: This is a parameter you may need to adjust more than the other ones. Its appropriate value heavily depends on the problem. Sometimes we may select\nmutation_probability as small as 0.01 (i.e. 1 percent) and sometimes even as large as 0.5 (i.e. 50 percent) or even larger. In general if the genetic algorithm trapped \nin a local optimum increasing the mutation probability may help. On the other hand if the algorithm suffers from stagnation reducing the mutation probability may be effective. However, this rule of thumb is not always true.\n\n* **parents_portion**: If parents_portion set zero, it means that the whole of the population is filled with the newly generated solutions. \nOn the other hand having this parameter equals 1 (i.e. 100 percent) means no new solution\nis generated and the algorithm would just repeat the previous values without any change which is not meaningful and effective obviously. Anything between these two may work. The exact value depends on the problem.\n\n* **crossover_type**: Depends on the problem. I would usually use uniform crossover. But testing the other ones in your problem is recommended.\n\n* **max_iteration_without_improv**: This is a parameter that I recommend being used cautiously. \nIf this parameter is too small then the algorithm may stop while it trapped in a local optimum.\nSo make sure you select a sufficiently large criteria to provide enough time for the algorithm to progress and to avoid immature convergence. \n\nFinally to make sure that the parameter setting is fine, we usually should run the \nalgorithm for several times and if convergence curves of all runs converged to the same objective function value we may accept that solution as the optimum. The number of runs\ndepends but usually five or ten runs is prevalent. Notice that in some problems\nseveral possible set of variables produces the same objective function value. \nWhen we study the convergence of a genetic algorithm we compare the objective function values not the decision variables.\n\n## How to get maximum speed\n\n### Don't use plotting\n\n```python\nresult = model.run(\n    no_plot = True, \n)\n```\n\n### Don't use progress bar\n\n```python\nresult = model.run(\n    progress_bar_stream = None,\n)\n```\n\n### Try to use faster optimizing function\n\nTry to speed up your optimizing `function` using Numpy, [Numba](https://numba.pydata.org/) or [Cython](https://cython.org/). If u can, write your own `set_function` (function which applies to whole population samples matrix) with cython optimizations, parallelism and so.\n\n### Specify custom optimized `mutation`, `crossover`, `selection`\n\nWrite faster implementations for model methods `mut`, `mut_middle`, `crossover`, `selection` and set them before running optimization process:\n\n```python\nmodel.mut = custom_mut\nmodel.crossover = custom_crossover\n\nmodel.run(...)\n```\n\n### Specify `fill_children` method\n\nFrom version `6.8.4` there is `fill_children` model method:\n\n```python\nself.fill_children: Optional[Callable[[array2D, int], None]] = None\n```\n\nIt is empty and does nothing; but if u specify it, u can get huge speed up at very intensive algorithm part. Take a look at [main algo structure](#main-algorithm-structure). There is a part with creating children from parents, this part is the most intensive because it uses python loops, calls sampling, crossover and mutations at each iteration. Using `fill_children`, u can rewrite this logic in your manner to speed up.\n\nSuppose u have new population matrix `pop` (type `np.float64`, shape `(population_size, dim_count)`) where first `parents_count` rows are selected parents, next rows are filled by random, so inside `fill_children` method u should fill last `population_size - parents_count` rows (children) by using some your logic. Expected (but not mandatory) logic like this:\n\n```python\nfor k in range(self.parents_count, self.population_size, 2):\n\n    r1, r2 = get_parents_inds()  # get 2 random parents indexes from [0, parents_count)\n\n    pvar1 = pop[r1]\n    pvar2 = pop[r2]\n\n    ch1, ch2 = self.crossover(pvar1, pvar2)  # crossover\n\n    # mutations\n    ch1 = self.mut(ch1)\n    ch2 = self.mut_middle(ch2, pvar1, pvar2)\n\n    # put to population\n    pop[k] = ch1\n    pop[k+1] = ch2\n```\n\n**Example**. In one task I use this algorithm many times (100 000 generations total), so the speed matters. Every sample item is the index of element in other array there, so `i`th sample element is always integer value from cut `[0, end[i]]`. I use uniform crossover and uniform mutation (work perfect for this task). So I specified creating children logic for this task using cython.\n\nContent of file `fill_children.pyx`:\n\n```cython\n#!python\n#cython: language_level=3\n\nimport numpy as np\n\ncimport numpy as np\n\nnp.import_array()\n\ncimport cython\n\nimport math\nimport random\n\n@cython.boundscheck(False)\n@cython.wraparound(False)\ndef fill_children(\n    np.ndarray[np.float64_t, ndim=2] pop,  # samples are integers but always float64 type\n    int parents_count, # count of already done parents\n\n    float mut_prob,  # mutation probability\n    np.ndarray[np.uint8_t, ndim=1] ends  # max elements for each dimension (min elements are 0)\n):\n\n    cdef:\n        Py_ssize_t i, k, population_size = pop.shape[0], dim_count = pop.shape[1], r1, r2\n\n        float v1, v2, tmp\n        np.ndarray[np.float64_t, ndim=1] cross, mut, mut_middle\n\n    # making 2 children at each iteration\n    for k in range(parents_count, population_size, 2): # C loop, not Python\n        \n        #\n        # 2 random parents (fast implementation)\n        #\n\n        r1 = random.randrange(parents_count)\n        r2 = random.randrange(parents_count)\n        if r1 == r2:\n            while r1 == r2:  # C loop!\n               r2 = random.randrange(parents_count) \n\n        #\n        # I always need these 3 random probs sequences, so the fastest way to obtain them is np.random.random\n        #\n        cross = np.random.random(dim_count)  # crossover probabilities for each dimension\n        mut = np.random.random(dim_count)\n        mut_middle = np.random.random(dim_count)\n\n        for i in range(dim_count):  # C loop for each dimension\n            v1 = pop[r1, i]  # first parent value\n            v2 = pop[r2, i]  # second parent value\n\n            if cross[i] < 0.5:  # random swap (uniform crossover), copy otherwise\n                tmp = v2\n                v2 = v1\n                v1 = tmp\n\n            if mut[i] < mut_prob:  # random mutation for first child\n                # fastest way to get random integer from [0, ends[i]]\n                # random.random() calls not always but only on mut[i] < mut_prob\n                v1 = math.floor(random.random() * (ends[i] + 1))\n\n            if mut_middle[i] < mut_prob: # mut_middle for second\n                tmp = random.random()\n                if v1 < v2:\n                    v2 = v1 + math.floor(tmp * (v2 - v1 + 1))  # integer from [v1, v2], v1 < v2\n                elif v1 > v2:\n                    v2 = v2 + math.floor(tmp * (v1 - v2 + 1)) # integer from [v2, v1], v2 < v1\n                else:\n                    v2 = math.floor(tmp * (ends[i] + 1))\n\n            #\n            # put values to children in array\n            #\n            pop[k, i] = v1\n            pop[k + 1, i] = v2\n\n```\n\nAfter compilation this file I can call it from python file to use inside GA:\n\n```python\n\nmut_prob = param['mutation_probability']\n\ndef fill_children(pop: array2D, parents_count: int):\n    \"\"\"wrapper on fill_children.fill_children with putting local variables mut_prob, ends\"\"\"\n    return fill_children.fill_children(\n        pop, parents_count, mut_prob, ends\n    )\n\nmodel.fill_children = fill_children\n\nmodel.run(...)\n```\n\n\n# Examples pretty collection\n\n## Optimization test functions\n\nHere there is the implementation of `geneticalgorithm2` for some benchmark problems. Test functions are got from my [`OptimizationTestFunctions`](https://github.com/PasaOpasen/OptimizationTestFunctions) package. \n\nThe code for optimizations process is same for each function and is contained [in file](examples/optimization_test_functions.py).\n\n### [Sphere](https://github.com/PasaOpasen/OptimizationTestFunctions#sphere)\n![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Sphere.png)\n![](examples/output/opt_test_funcs/Optimization%20process%20for%20Sphere.png)\n\n### [Ackley](https://github.com/PasaOpasen/OptimizationTestFunctions#ackley)\n![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Ackley.png)\n![](examples/output/opt_test_funcs/Optimization%20process%20for%20Ackley.png)\n\n### [AckleyTest](https://github.com/PasaOpasen/OptimizationTestFunctions#ackleytest)\n![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20AckleyTest.png)\n![](examples/output/opt_test_funcs/Optimization%20process%20for%20AckleyTest.png)\n\n### [Rosenbrock](https://github.com/PasaOpasen/OptimizationTestFunctions#rosenbrock)\n![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Rosenbrock.png)\n![](examples/output/opt_test_funcs/Optimization%20process%20for%20Rosenbrock.png)\n\n### [Fletcher](https://github.com/PasaOpasen/OptimizationTestFunctions#fletcher)\n![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Fletcher.png)\n![](examples/output/opt_test_funcs/Optimization%20process%20for%20Fletcher.png)\n\n### [Griewank](https://github.com/PasaOpasen/OptimizationTestFunctions#griewank)\n![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Griewank.png)\n![](examples/output/opt_test_funcs/Optimization%20process%20for%20Griewank.png)\n\n### [Penalty2](https://github.com/PasaOpasen/OptimizationTestFunctions#penalty2)\n![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Penalty2.png)\n![](examples/output/opt_test_funcs/Optimization%20process%20for%20Penalty2.png)\n\n### [Quartic](https://github.com/PasaOpasen/OptimizationTestFunctions#quartic)\n![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Quartic.png)\n![](examples/output/opt_test_funcs/Optimization%20process%20for%20Quartic.png)\n\n### [Rastrigin](https://github.com/PasaOpasen/OptimizationTestFunctions#rastrigin)\n![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Rastrigin.png)\n![](examples/output/opt_test_funcs/Optimization%20process%20for%20Rastrigin.png)\n\n### [SchwefelDouble](https://github.com/PasaOpasen/OptimizationTestFunctions#schwefeldouble)\n![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20SchwefelDouble.png)\n![](examples/output/opt_test_funcs/Optimization%20process%20for%20SchwefelDouble.png)\n\n### [SchwefelMax](https://github.com/PasaOpasen/OptimizationTestFunctions#schwefelmax)\n![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20SchwefelMax.png)\n![](examples/output/opt_test_funcs/Optimization%20process%20for%20SchwefelMax.png)\n\n### [SchwefelAbs](https://github.com/PasaOpasen/OptimizationTestFunctions#schwefelabs)\n![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20SchwefelAbs.png)\n![](examples/output/opt_test_funcs/Optimization%20process%20for%20SchwefelAbs.png)\n\n### [SchwefelSin](https://github.com/PasaOpasen/OptimizationTestFunctions#schwefelsin)\n![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20SchwefelSin.png)\n![](examples/output/opt_test_funcs/Optimization%20process%20for%20SchwefelSin.png)\n\n### [Stairs](https://github.com/PasaOpasen/OptimizationTestFunctions#stairs)\n![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Stairs.png)\n![](examples/output/opt_test_funcs/Optimization%20process%20for%20Stairs.png)\n\n### [Abs](https://github.com/PasaOpasen/OptimizationTestFunctions#abs)\n![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Abs.png)\n![](examples/output/opt_test_funcs/Optimization%20process%20for%20Abs.png)\n\n### [Michalewicz](https://github.com/PasaOpasen/OptimizationTestFunctions#michalewicz)\n![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Michalewicz.png)\n![](examples/output/opt_test_funcs/Optimization%20process%20for%20Michalewicz.png)\n\n### [Scheffer](https://github.com/PasaOpasen/OptimizationTestFunctions#scheffer)\n![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Scheffer.png)\n![](examples/output/opt_test_funcs/Optimization%20process%20for%20Scheffer.png)\n\n### [Eggholder](https://github.com/PasaOpasen/OptimizationTestFunctions#eggholder)\n![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Eggholder.png)\n![](examples/output/opt_test_funcs/Optimization%20process%20for%20Eggholder.png)\n\n### [Weierstrass](https://github.com/PasaOpasen/OptimizationTestFunctions#weierstrass)\n![](https://github.com/PasaOpasen/OptimizationTestFunctions/blob/main/tests/heatmap%20for%20Weierstrass.png)\n![](examples/output/opt_test_funcs/Optimization%20process%20for%20Weierstrass.png)\n\n\n\n## Using GA in reinforcement learning\n\nSee [example of using GA optimization with keras neural networks](https://www.kaggle.com/demetrypascal/opengym-tasks-using-keras-and-geneticalgorithm2) for solving OpenGym tasks.\n\nBetter example is [OpenGym using cost2fitness and geneticalgorithm2](https://www.kaggle.com/demetrypascal/opengym-using-cost2fitness-and-geneticalgorithm2) where I use also my [cost2fitness](https://github.com/PasaOpasen/cost2fitness) package for fast forward propagation\n\n\n## Using GA with image reconstruction by polygons\n\nLinks:\n1. https://www.kaggle.com/demetrypascal/fork-of-imagereconstruction-with-geneticalgorithm2\n2. https://www.kaggle.com/demetrypascal/imagereconstructionpolygons-with-geneticalgorithm2\n\n\n# Popular questions\n\n## How to disable autoplot?\n\nJust use `no_plot = True` param in `run` method:\n\n```python\nmodel.run(no_plot = True)\n```\n\nIf u want, u can plot results later by using\n\n```python\nmodel.plot_results()\n```\n\nAlso u can create your pretty plots using `model.report` object (it's a list of values):\n\n```python\nre = np.array(model.report)\n\nplt.plot(re)\nplt.xlabel('Iteration')\nplt.ylabel('Objective function')\nplt.title('Genetic Algorithm')\nplt.show()\n```\n\n## How to plot population scores?\n\nThere are 2 ways to plot of scores of population:\n* use `plot_pop_scores(scores, title = 'Population scores', save_as = None)` function from `geneticalgorithm2` environment\n* use `plot_generation_scores(self, title = 'Last generation scores', save_as = None)` method of `ga` object for plotting scores of last generation (yes, it's wrapper of previous function)\n\nLet's check example:\n\n```python\nimport numpy as np\n\nfrom geneticalgorithm2 import GeneticAlgorithm2 as ga\n\nfrom geneticalgorithm2 import plot_pop_scores  # for plotting scores without ga object\n\n\ndef f(X):\n    return 50 * np.sum(X) - np.sum(np.sqrt(X) * np.sin(X))\n\n\ndim = 25\nvarbound = [[0, 10]] * dim\n\n# create start population\nstart_pop = np.random.uniform(0, 10, (50, dim))\n# eval scores of start population\nstart_scores = np.array([f(start_pop[i]) for i in range(start_pop.shape[0])])\n\n# plot start scores using plot_pop_scores function\nplot_pop_scores(start_scores, title='Population scores before beginning of searching', save_as='plot_scores_start.png')\n\nmodel = ga(function=f, dimension=dim, variable_type='real', variable_boundaries=varbound)\n# run optimization process\nmodel.run(no_plot=True,\n          start_generation={\n              'variables': start_pop,\n              'scores': start_scores\n          })\n# plot and save optimization process plot\nmodel.plot_results(save_as='plot_scores_process.png')\n\n# plot scores of last population\nmodel.plot_generation_scores(title='Population scores after ending of searching', save_as='plot_scores_end.png')\n```\n![](examples/output/plot_scores_start.png)\n![](examples/output/plot_scores_process.png)\n![](examples/output/plot_scores_end.png)\n\n\n\n## How to specify evaluated function for all population?\n\nU can do it using `set_function` parameter into `run()` method.\n\nThis function should get `numpy 2D-array` (samples x dimension) and return `1D-array` with results.\n\nBy default it uses `set_function = geneticalgorithm2.default_set_function(function)`, where\n\n```python\n    def default_set_function(function_for_set):\n        def func(matrix):\n            return np.array([function_for_set(matrix[i,:]) for i in range(matrix.shape[0])])\n        return func\n```\nU may want to use it for creating some specific or fast-vectorized evaluations like here:\n\n```python\n\ndef sigmoid(z):\n    return 1/(1+np.exp(-z))\n\nmatrix = np.random.random((1000,100))\n\ndef vectorised(X):\n    return sigmoid(matrix.dot(X))\n\nmodel.run(set_function = vectorised)\n```\n\n## What about parallelism?\n\nBy using `set_function` u can determine your own behavior for parallelism or u can use `geneticalgorithm2.set_function_multiprocess(f, n_jobs = -1)` for using just parallelism (recommended for heavy functions and big populations, not recommended for fast functions and small populations).\n\nFor example:\n\n```python\nimport numpy as np\nfrom geneticalgorithm2 import GeneticAlgorithm2 as ga\n\n\ndef f(X):\n    import math\n    a = X[0]\n    b = X[1]\n    c = X[2]\n    s = 0\n    for i in range(10000):\n        s += math.sin(a * i) + math.sin(b * i) + math.cos(c * i)\n\n    return s\n\n\nalgorithm_param = {'max_num_iteration': 50,\n                   'population_size': 100,\n                   'mutation_probability': 0.1,\n                   'elit_ratio': 0.01,\n                   'parents_portion': 0.3,\n                   'crossover_type': 'uniform',\n                   'mutation_type': 'uniform_by_center',\n                   'selection_type': 'roulette',\n                   'max_iteration_without_improv': None}\n\nvarbound = np.array([[-10, 10]] * 3)\n\nmodel = ga(function=f, dimension=3,\n           variable_type='real',\n           variable_boundaries=varbound,\n           algorithm_parameters=algorithm_param)\n\n########\n\n%time\nmodel.run()\n# Wall time: 1min 52s\n\n%time\nmodel.run(set_function=ga.set_function_multiprocess(f, n_jobs=6))\n# Wall time: 31.7 s\n```\n\n## How to initialize start population? How to continue optimization with new run?\n\nFor this there is `start_generation` parameter in `run()` method. It's the dictionary with structure like returned `model.output_dict['last_generation']`. Let's see example how can u to use it:\n\n```python\nimport numpy as np\nfrom geneticalgorithm2 import GeneticAlgorithm2 as ga\n\n\ndef f(X):\n    return np.sum(X)\n\n\ndim = 6\n\nvarbound = [(0, 10)] * dim\n\nalgorithm_param = {'max_num_iteration': 500,\n                   'population_size': 100,\n                   'mutation_probability': 0.1,\n                   'elit_ratio': 0.01,\n                   'parents_portion': 0.3,\n                   'crossover_type': 'uniform',\n                   'max_iteration_without_improv': None}\n\nmodel = ga(function=f,\n           dimension=dim,\n           variable_type='real',\n           variable_boundaries=varbound,\n           algorithm_parameters=algorithm_param)\n\n# start generation\n# as u see u can use any values been valid for ur function\nsamples = np.random.uniform(0, 50, (300, dim))  # 300 is the new size of your generation\n\nmodel.run(no_plot=False, start_generation={'variables': samples, 'scores': None})\n# it's not necessary to evaluate scores before\n# but u can do it if u have evaluated scores and don't wanna repeat calculations\n\n\n# from version 6.3.0 it's recommended to use this form\nfrom geneticalgorithm2 import Generation\n\nmodel.run(no_plot=False, start_generation=Generation(variables=samples, scores=None))\n\n# from version 6.4.0 u also can use these forms\nmodel.run(no_plot=False, start_generation=samples)\nmodel.run(no_plot=False, start_generation=(samples, None))\n\n# if u have scores array, u can put it too\nscores = np.array([f(sample) for sample in samples])\nmodel.run(no_plot=False, start_generation=(samples, scores))\n\n##\n## after first run\n## best value = 0.10426190111045064\n##\n\n# okay, let's continue optimization using saved last generation\nmodel.run(no_plot=True, start_generation=model.output_dict['last_generation'])\n\n##\n## after second run\n## best value = 0.06128462776296528\n##\n\n```\n\nAlso u can save and load populations using likely code:\n\n```python\nimport numpy as np\n\nfrom geneticalgorithm2 import GeneticAlgorithm2 as ga\n\nfrom OptimizationTestFunctions import Eggholder\n\ndim = 2 * 15\n\nf = Eggholder(dim)\n\nxmin, xmax, ymin, ymax = f.bounds\n\nvarbound = np.array([[xmin, xmax], [ymin, ymax]] * 15)\n\nmodel = ga(function=f,\n           dimension=dim,\n           variable_type='real',\n           variable_boundaries=varbound,\n           algorithm_parameters={\n               'max_num_iteration': 300,\n               'population_size': 100\n           })\n\n# first run and save last generation to file\nfilename = \"eggholder_lastgen.npz\"\nmodel.run(save_last_generation_as=filename)\n\n# load start generation from file and run again (continue optimization)\nmodel.run(start_generation=filename)\n```\n\n\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "Supported highly optimized and flexible genetic algorithm package for python",
    "version": "6.9.2",
    "project_urls": {
        "Homepage": "https://github.com/PasaOpasen/geneticalgorithm2"
    },
    "split_keywords": [
        "solve",
        " solver",
        " equation",
        " optimization",
        " problem",
        " genetic",
        " algorithm",
        " ga",
        " easy",
        " fast",
        " genetic-algorithm",
        " combinatorial",
        " mixed",
        " evolutionary"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "920e3ef899c35c486e7eb735d384fc8bc76b8616bf4bf8b6b8952b2b74cd9013",
                "md5": "9d7043bd3a994bb0d2a4c3fdadfa065a",
                "sha256": "7c5971817a6c77cfb20fb7fa92c0aca7145a9de70234a42dcee27c3b7a7a2773"
            },
            "downloads": -1,
            "filename": "geneticalgorithm2-6.9.2-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "9d7043bd3a994bb0d2a4c3fdadfa065a",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": null,
            "size": 53230,
            "upload_time": "2024-04-14T13:10:00",
            "upload_time_iso_8601": "2024-04-14T13:10:00.508350Z",
            "url": "https://files.pythonhosted.org/packages/92/0e/3ef899c35c486e7eb735d384fc8bc76b8616bf4bf8b6b8952b2b74cd9013/geneticalgorithm2-6.9.2-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "e590f74c4e4afc528fd34e36fac8f3ed36f23b52fc34d733ed1984c7697d8e36",
                "md5": "a67b745ccd8884c62bd04d9190067864",
                "sha256": "43da664025ff38e56c16231b83fcb479d28df368879f89a539a671368814a15b"
            },
            "downloads": -1,
            "filename": "geneticalgorithm2-6.9.2.tar.gz",
            "has_sig": false,
            "md5_digest": "a67b745ccd8884c62bd04d9190067864",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": null,
            "size": 66899,
            "upload_time": "2024-04-14T13:10:04",
            "upload_time_iso_8601": "2024-04-14T13:10:04.656367Z",
            "url": "https://files.pythonhosted.org/packages/e5/90/f74c4e4afc528fd34e36fac8f3ed36f23b52fc34d733ed1984c7697d8e36/geneticalgorithm2-6.9.2.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-04-14 13:10:04",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "PasaOpasen",
    "github_project": "geneticalgorithm2",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "requirements": [
        {
            "name": "matplotlib",
            "specs": []
        },
        {
            "name": "numpy",
            "specs": []
        },
        {
            "name": "typing_extensions",
            "specs": []
        },
        {
            "name": "OppOpPopInit",
            "specs": [
                [
                    ">=",
                    "2.0.1"
                ]
            ]
        }
    ],
    "lcname": "geneticalgorithm2"
}
        
Elapsed time: 0.51851s