.. image:: /docs/source/img/golem_logo-02.png
:alt: Logo of GOLEM framework
:align: center
:width: 500
.. class:: center
Graph Optimization and Learning by Evolutionary Methods
-------------------------------------------------------
GOLEM is an open-source AI framework for optimization and learning of structured graph-based models with meta-heuristic
methods. It is centered around 2 ideas:
1. The potential of meta-heuristic methods in complex problem spaces.
The focus on meta-heuristics allows approaching the kinds of problems where gradient-based learning methods (notably, neural networks)
can't be easily applied, like optimization problems with multiple conflicting objectives or having a combinatorial nature.
2. The importance of structured models in multiple problem domains.
Graph-based learning enables solutions in the form of structured and hybrid probabilistic models, not to mention
that a wide range of domain-specific problems have a natural formulation in the form of graphs.
Together this constitutes an approach to AI that potentially leads to structured, intuitive, interpretable methods and
solutions for a wide range of tasks.
Core Features
=============
- **Structured** models with joint optimization of graph structure and properties (node attributes).
- **Metaheuristic** methods (mainly evolutionary) applicable to any task with a well-defined objective.
- **Multi-objective** optimization that can take into account both quality and complexity.
- **Constrained** optimization with support for arbitrary domain-specific constraints.
- **Extensible** to new domains.
- **Interpretable** thanks to meta-heuristics, structured models, and visualisation tools.
- **Reproducible** thanks to rich optimization history and model serialization.
Applications
============
GOLEM is potentially applicable to any optimization problem structures:
- that can be represented as directed graphs;
- that have some clearly defined fitness function on them.
Graph models can represent fixed structures (e.g. physical models such as truss structures) or functional models that
define a data-flow or inference process (e.g. bayesian networks that can be fitted and queried).
Examples of GOLEM applications:
- Automatic Machine Learning (AutoML) with optimal ML pipelines search in `FEDOT framework <https://github.com/aimclub/FEDOT>`_
- Bayesian network structure search in `BAMT framework <https://github.com/aimclub/BAMT>`_
- Differential equation discovery for physical models in `EPDE framework <https://github.com/ITMO-NSS-team/EPDE>`_
- Geometric design of physical objects in `GEFEST framework <https://github.com/aimclub/GEFEST>`_
- `Neural architecture search <https://github.com/ITMO-NSS-team/nas-fedot>`_
As GOLEM is a general-purpose framework, it's easy to imagine potential applications, for example, finite state automata search
for robotics control or molecular graph learning for drug discovery, and more.
Installation
============
GOLEM can be installed with ``pip``:
.. code-block::
$ pip install thegolem
Quick Start Example
===================
Following example demonstrates graph search using reference graph & edit distance metric. Optimizer is set up with a minimal set of parameters and simple single-point mutations. For more details see examples `simple_run.py <https://github.com/aimclub/GOLEM/blob/main/examples/synthetic_graph_evolution/simple_run.py>`_, `graph_search.py <https://github.com/aimclub/GOLEM/blob/main/examples/synthetic_graph_evolution/graph_search.py>`_ and `tree_search.py <https://github.com/aimclub/GOLEM/blob/main/examples/synthetic_graph_evolution/tree_search.py>`_ in directory `examples/synthetic_graph_evolution <https://github.com/aimclub/GOLEM/tree/main/examples/synthetic_graph_evolution>`_.
.. code-block:: python
def run_graph_search(size=16, timeout=8):
# Generate target graph sought by optimizer using edit distance objective
node_types = ('a', 'b') # Available node types that can appear in graphs
target_graph = generate_labeled_graph('tree', size, node_types)
objective = Objective(partial(tree_edit_dist, target_graph))
initial_population = [generate_labeled_graph('tree', 5, node_types) for _ in range(10)]
# Setup optimization parameters
requirements = GraphRequirements(timeout=timedelta(minutes=timeout))
gen_params = GraphGenerationParams(adapter=BaseNetworkxAdapter(), available_node_types=node_types)
algo_params = GPAlgorithmParameters(pop_size=30)
# Build and run the optimizer
optimiser = EvoGraphOptimizer(objective, initial_population, requirements, gen_params, algo_params)
found_graphs = optimiser.optimise(objective)
# Visualize results
found_graph = gen_params.adapter.restore(found_graphs[0]) # Transform back to NetworkX graph
draw_graphs_subplots(target_graph, found_graph, titles=['Target Graph', 'Found Graph'])
optimiser.history.show.fitness_line()
return found_graph
One can also notice that despite the fact that the edit distance generally decreases along the genealogical path, the optimizer sometimes sacrifices local fitness gain of some graphs in order to achieve diversity and thus obtain the best possible solution at the end.
Project Structure
=================
The repository includes the following packages and directories:
- Package ``core`` contains the main classes and scripts.
- Package ``core.adapter`` is responsible for transformation between domain graphs and internal graph representation used by optimisers.
- Package ``core.dag`` contains classes and algorithms for representation and processing of graphs.
- Package ``core.optimisers`` contains graph optimisers and all related classes (like those representing fitness, individuals, populations, etc.), including optimization history.
- Package ``core.optimisers.genetic`` contains genetic (also called evolutionary) graph optimiser and operators (mutation, selection, and so on).
- Package ``core.utilities`` contains utilities and data structures used by other modules.
- Package ``serializers`` contains class ``Serializer`` with required facilities, and is responsible for serialization of project classes (graphs, optimization history, and everything related).
- Package ``visualisation`` contains classes that allow to visualise optimization history, graphs, and certain plots useful for analysis.
- Package ``examples`` includes several use-cases where you can start to discover how the framework works.
- All unit and integration tests are contained in the ``test`` directory.
- The sources of the documentation are in the ``docs`` directory.
Current R&D and future plans
============================
Any contribution is welcome. Our R&D team is open for cooperation with other scientific teams as well as with industrial partners.
Contribution Guide
==================
- The contribution guide is available in the `repository </docs/source/contribution.rst>`__.
Acknowledgments
===============
We acknowledge the contributors for their important impact and the participants of the numerous scientific conferences and
workshops for their valuable advice and suggestions.
Supported by
============
The study is supported by the Research `Center Strong Artificial Intelligence in Industry <https://sai.itmo.ru/>`_
of `ITMO University <https://itmo.ru/>`_ as part of the plan of the center's program:
Development and testing of an experimental prototype of the library of strong AI algorithms
in terms of basic algorithms of automatic ML for structural training of composite AI models,
including automation of feature selection
Contacts
========
- `Telegram channel <https://t.me/FEDOT_helpdesk>`_ for solving problems and answering questions about FEDOT
- `Natural System Simulation Team <https://itmo-nss-team.github.io/>`_
- `Nikolay Nikitin <https://scholar.google.com/citations?user=eQBTGccAAAAJ&hl=ru>`_, AutoML Lead (nnikitin@itmo.ru)
- `Newsfeed <https://t.me/NSS_group>`_
- `Youtube channel <https://www.youtube.com/channel/UC4K9QWaEUpT_p3R4FeDp5jA>`_
Raw data
{
"_id": null,
"home_page": "https://github.com/aimclub/GOLEM",
"name": "thegolem",
"maintainer": null,
"docs_url": null,
"requires_python": ">=3.8",
"maintainer_email": null,
"keywords": null,
"author": "NSS Lab",
"author_email": "itmo.nss.team@gmail.com",
"download_url": "https://files.pythonhosted.org/packages/f4/24/a7b93e05627a97a53a5c2ef00aa5b9c82dfc8bccdb36c40e474d96b12097/thegolem-0.4.1.tar.gz",
"platform": null,
"description": ".. image:: /docs/source/img/golem_logo-02.png\n :alt: Logo of GOLEM framework\n :align: center\n :width: 500\n\n.. class:: center\n\n\nGraph Optimization and Learning by Evolutionary Methods\n-------------------------------------------------------\n\nGOLEM is an open-source AI framework for optimization and learning of structured graph-based models with meta-heuristic\nmethods. It is centered around 2 ideas:\n\n1. The potential of meta-heuristic methods in complex problem spaces.\n\nThe focus on meta-heuristics allows approaching the kinds of problems where gradient-based learning methods (notably, neural networks)\ncan't be easily applied, like optimization problems with multiple conflicting objectives or having a combinatorial nature.\n\n2. The importance of structured models in multiple problem domains.\n\nGraph-based learning enables solutions in the form of structured and hybrid probabilistic models, not to mention\nthat a wide range of domain-specific problems have a natural formulation in the form of graphs.\n\nTogether this constitutes an approach to AI that potentially leads to structured, intuitive, interpretable methods and\nsolutions for a wide range of tasks.\n\n\nCore Features\n=============\n\n- **Structured** models with joint optimization of graph structure and properties (node attributes).\n- **Metaheuristic** methods (mainly evolutionary) applicable to any task with a well-defined objective.\n- **Multi-objective** optimization that can take into account both quality and complexity.\n- **Constrained** optimization with support for arbitrary domain-specific constraints.\n- **Extensible** to new domains.\n- **Interpretable** thanks to meta-heuristics, structured models, and visualisation tools.\n- **Reproducible** thanks to rich optimization history and model serialization.\n\n\nApplications\n============\n\nGOLEM is potentially applicable to any optimization problem structures:\n\n- that can be represented as directed graphs;\n- that have some clearly defined fitness function on them.\n\nGraph models can represent fixed structures (e.g. physical models such as truss structures) or functional models that\ndefine a data-flow or inference process (e.g. bayesian networks that can be fitted and queried).\n\nExamples of GOLEM applications:\n\n- Automatic Machine Learning (AutoML) with optimal ML pipelines search in `FEDOT framework <https://github.com/aimclub/FEDOT>`_\n- Bayesian network structure search in `BAMT framework <https://github.com/aimclub/BAMT>`_\n- Differential equation discovery for physical models in `EPDE framework <https://github.com/ITMO-NSS-team/EPDE>`_\n- Geometric design of physical objects in `GEFEST framework <https://github.com/aimclub/GEFEST>`_\n- `Neural architecture search <https://github.com/ITMO-NSS-team/nas-fedot>`_\n\nAs GOLEM is a general-purpose framework, it's easy to imagine potential applications, for example, finite state automata search\nfor robotics control or molecular graph learning for drug discovery, and more.\n\n\nInstallation\n============\n\nGOLEM can be installed with ``pip``:\n\n.. code-block::\n\n $ pip install thegolem\n\n\nQuick Start Example\n===================\n\nFollowing example demonstrates graph search using reference graph & edit distance metric. Optimizer is set up with a minimal set of parameters and simple single-point mutations. For more details see examples `simple_run.py <https://github.com/aimclub/GOLEM/blob/main/examples/synthetic_graph_evolution/simple_run.py>`_, `graph_search.py <https://github.com/aimclub/GOLEM/blob/main/examples/synthetic_graph_evolution/graph_search.py>`_ and `tree_search.py <https://github.com/aimclub/GOLEM/blob/main/examples/synthetic_graph_evolution/tree_search.py>`_ in directory `examples/synthetic_graph_evolution <https://github.com/aimclub/GOLEM/tree/main/examples/synthetic_graph_evolution>`_.\n\n.. code-block:: python\n\n def run_graph_search(size=16, timeout=8):\n # Generate target graph sought by optimizer using edit distance objective\n node_types = ('a', 'b') # Available node types that can appear in graphs\n target_graph = generate_labeled_graph('tree', size, node_types)\n objective = Objective(partial(tree_edit_dist, target_graph))\n initial_population = [generate_labeled_graph('tree', 5, node_types) for _ in range(10)]\n\n # Setup optimization parameters\n requirements = GraphRequirements(timeout=timedelta(minutes=timeout))\n gen_params = GraphGenerationParams(adapter=BaseNetworkxAdapter(), available_node_types=node_types)\n algo_params = GPAlgorithmParameters(pop_size=30)\n\n # Build and run the optimizer\n optimiser = EvoGraphOptimizer(objective, initial_population, requirements, gen_params, algo_params)\n found_graphs = optimiser.optimise(objective)\n\n # Visualize results\n found_graph = gen_params.adapter.restore(found_graphs[0]) # Transform back to NetworkX graph\n draw_graphs_subplots(target_graph, found_graph, titles=['Target Graph', 'Found Graph'])\n optimiser.history.show.fitness_line()\n return found_graph\n\n\n\n\nOne can also notice that despite the fact that the edit distance generally decreases along the genealogical path, the optimizer sometimes sacrifices local fitness gain of some graphs in order to achieve diversity and thus obtain the best possible solution at the end.\n\nProject Structure\n=================\n\nThe repository includes the following packages and directories:\n\n- Package ``core`` contains the main classes and scripts.\n- Package ``core.adapter`` is responsible for transformation between domain graphs and internal graph representation used by optimisers.\n- Package ``core.dag`` contains classes and algorithms for representation and processing of graphs.\n- Package ``core.optimisers`` contains graph optimisers and all related classes (like those representing fitness, individuals, populations, etc.), including optimization history.\n- Package ``core.optimisers.genetic`` contains genetic (also called evolutionary) graph optimiser and operators (mutation, selection, and so on).\n- Package ``core.utilities`` contains utilities and data structures used by other modules.\n- Package ``serializers`` contains class ``Serializer`` with required facilities, and is responsible for serialization of project classes (graphs, optimization history, and everything related).\n- Package ``visualisation`` contains classes that allow to visualise optimization history, graphs, and certain plots useful for analysis.\n- Package ``examples`` includes several use-cases where you can start to discover how the framework works.\n- All unit and integration tests are contained in the ``test`` directory.\n- The sources of the documentation are in the ``docs`` directory.\n\n\nCurrent R&D and future plans\n============================\n\nAny contribution is welcome. Our R&D team is open for cooperation with other scientific teams as well as with industrial partners.\n\nContribution Guide\n==================\n\n- The contribution guide is available in the `repository </docs/source/contribution.rst>`__.\n\nAcknowledgments\n===============\n\nWe acknowledge the contributors for their important impact and the participants of the numerous scientific conferences and\nworkshops for their valuable advice and suggestions.\n\nSupported by\n============\n\nThe study is supported by the Research `Center Strong Artificial Intelligence in Industry <https://sai.itmo.ru/>`_\nof `ITMO University <https://itmo.ru/>`_ as part of the plan of the center's program: \nDevelopment and testing of an experimental prototype of the library of strong AI algorithms \nin terms of basic algorithms of automatic ML for structural training of composite AI models, \nincluding automation of feature selection\n\nContacts\n========\n- `Telegram channel <https://t.me/FEDOT_helpdesk>`_ for solving problems and answering questions about FEDOT\n- `Natural System Simulation Team <https://itmo-nss-team.github.io/>`_\n- `Nikolay Nikitin <https://scholar.google.com/citations?user=eQBTGccAAAAJ&hl=ru>`_, AutoML Lead (nnikitin@itmo.ru)\n- `Newsfeed <https://t.me/NSS_group>`_\n- `Youtube channel <https://www.youtube.com/channel/UC4K9QWaEUpT_p3R4FeDp5jA>`_\n\n",
"bugtrack_url": null,
"license": "BSD 3-Clause",
"summary": "Framework for Graph Optimization and Learning by Evolutionary Methods",
"version": "0.4.1",
"project_urls": {
"Homepage": "https://github.com/aimclub/GOLEM"
},
"split_keywords": [],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "78be9d2bf45ff44a860af23cf413d6feee4e72b28016e6d7b828a3c66b706a51",
"md5": "3057d00f732b64bd02010ce3dedff8aa",
"sha256": "176c4b7792d7b994b5ad34d98101aa2a0a0818531d19977e725bedf45bc99769"
},
"downloads": -1,
"filename": "thegolem-0.4.1-py3-none-any.whl",
"has_sig": false,
"md5_digest": "3057d00f732b64bd02010ce3dedff8aa",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">=3.8",
"size": 296234,
"upload_time": "2025-01-21T17:08:33",
"upload_time_iso_8601": "2025-01-21T17:08:33.966012Z",
"url": "https://files.pythonhosted.org/packages/78/be/9d2bf45ff44a860af23cf413d6feee4e72b28016e6d7b828a3c66b706a51/thegolem-0.4.1-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "f424a7b93e05627a97a53a5c2ef00aa5b9c82dfc8bccdb36c40e474d96b12097",
"md5": "3bb28056ebba1696f31183c0f2f3f18b",
"sha256": "de0a11ea82a858767431683ef63a3fa238e83eec2d906cca450fab36112aeee4"
},
"downloads": -1,
"filename": "thegolem-0.4.1.tar.gz",
"has_sig": false,
"md5_digest": "3bb28056ebba1696f31183c0f2f3f18b",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3.8",
"size": 216505,
"upload_time": "2025-01-21T17:08:36",
"upload_time_iso_8601": "2025-01-21T17:08:36.566353Z",
"url": "https://files.pythonhosted.org/packages/f4/24/a7b93e05627a97a53a5c2ef00aa5b9c82dfc8bccdb36c40e474d96b12097/thegolem-0.4.1.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2025-01-21 17:08:36",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "aimclub",
"github_project": "GOLEM",
"travis_ci": false,
"coveralls": false,
"github_actions": true,
"requirements": [
{
"name": "numpy",
"specs": [
[
">=",
"1.16.0"
],
[
"<",
"2.0.0"
],
[
"!=",
"1.24.0"
]
]
},
{
"name": "pandas",
"specs": [
[
">=",
"1.3.0"
]
]
},
{
"name": "networkx",
"specs": [
[
"!=",
"2.8.2"
],
[
"!=",
"2.8.3"
],
[
">=",
"2.4"
],
[
"!=",
"2.8.1"
],
[
"<",
"3.3"
],
[
"!=",
"2.7.*"
]
]
},
{
"name": "scipy",
"specs": [
[
"<",
"1.13.0"
]
]
},
{
"name": "zss",
"specs": [
[
">=",
"1.2.0"
]
]
},
{
"name": "matplotlib",
"specs": [
[
">=",
"3.3.1"
]
]
},
{
"name": "pyvis",
"specs": [
[
"==",
"0.2.1"
]
]
},
{
"name": "MarkupSafe",
"specs": [
[
"==",
"2.1.1"
]
]
},
{
"name": "seaborn",
"specs": [
[
">=",
"0.9.0"
]
]
},
{
"name": "imageio",
"specs": [
[
">=",
"2.28.1"
]
]
},
{
"name": "Pillow",
"specs": [
[
">=",
"9.5.0"
]
]
},
{
"name": "func_timeout",
"specs": [
[
"==",
"4.3.5"
]
]
},
{
"name": "joblib",
"specs": [
[
">=",
"0.17.0"
]
]
},
{
"name": "requests",
"specs": [
[
">=",
"2.0"
]
]
},
{
"name": "tqdm",
"specs": [
[
"~=",
"4.66.3"
]
]
},
{
"name": "typing",
"specs": [
[
">=",
"3.7.0"
]
]
},
{
"name": "psutil",
"specs": [
[
">=",
"5.9.2"
]
]
},
{
"name": "hyperopt",
"specs": [
[
">=",
"0.2.7"
]
]
},
{
"name": "iOpt",
"specs": [
[
"==",
"0.2.22"
]
]
},
{
"name": "optuna",
"specs": [
[
">=",
"3.2.0"
]
]
},
{
"name": "pytest",
"specs": [
[
">=",
"6.2.0"
]
]
},
{
"name": "testfixtures",
"specs": [
[
">=",
"6.18.0"
]
]
},
{
"name": "mabwiser",
"specs": [
[
">=",
"2.7.0"
]
]
}
],
"lcname": "thegolem"
}