psyki

Name: psyki
Version: 0.4.3
Summary: PSyKI: a (Python) platform for symbolic knowledge injection
Upload time: 2024-11-28 12:08:26
Author: Matteo Magnini
Requires Python: ==3.9.13
License: Apache-2.0
<h1 align="center"> PSyKI </h1>
<h2 align="center">Platform for Symbolic Knowledge Injection</h2>

[![PyPI version](https://badge.fury.io/py/psyki.svg)](https://badge.fury.io/py/psyki)
[![Test](https://github.com/psykei/psyki-python/actions/workflows/check.yml/badge.svg?event=push)]()
![Codecov](https://img.shields.io/codecov/c/github/psykei/psyki-python)
[![Release](https://github.com/psykei/psyki-python/actions/workflows/deploy.yml/badge.svg?event=push)]()
[![Black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)
[![License](https://img.shields.io/badge/License-Apache_2.0-blue.svg)](https://opensource.org/licenses/Apache-2.0)
[![Stand With Ukraine](https://raw.githubusercontent.com/vshymanskyy/StandWithUkraine/main/badges/StandWithUkraine.svg)](https://stand-with-ukraine.pp.ua)

PSyKI (<u><b>P</b></u>latform for <u><b>Sy</b></u>mbolic <u><b>K</b></u>nowledge <u><b>I</b></u>njection) is a Python library for symbolic knowledge injection (<b>SKI</b>).
SKI is a particular subclass of neuro-symbolic (<b>NeSy</b>) integration techniques.
PSyKI offers SKI algorithms (a.k.a. <b>injectors</b>) along with quality of service metrics (<b>QoS</b>).

Some quick links:
<!-- * [Home Page](https://apice.unibo.it/xwiki/bin/view/PSyKI/) -->
* [GitHub Repository](https://github.com/psykei/psyki-python)
* [PyPi Repository](https://pypi.org/project/psyki/)
* [Issues](https://github.com/psykei/psyki-python/issues)


**✨ Update [Nov. 2024]**: from version 0.4.1, PSyKI supports **FaUCI**, a novel fairness method to reduce bias in machine learning models.


## Reference papers

Cite the **PSyKI** paper if you use this library in your research.
Additionally, cite the respective paper if you use one of the original injectors, fairness methods, or QoS metrics.

### PSyKI (main paper)
> Matteo Magnini, Giovanni Ciatto, Andrea Omicini. "On the Design of PSyKI: A Platform for Symbolic Knowledge Injection into Sub-Symbolic Predictors", in: Proceedings of the 4th International Workshop on EXplainable and TRAnsparent AI and Multi-Agent Systems, 2022.

Bibtex: 
```bibtex
@incollection{psyki-extraamas2022,
    author = {Magnini, Matteo and Ciatto, Giovanni and Omicini, Andrea},
    booktitle = {Explainable and Transparent AI and Multi-Agent Systems},
    chapter = 6,
    dblp = {conf/atal/MagniniCO22},
    doi = {10.1007/978-3-031-15565-9_6},
    editor = {Calvaresi, Davide and Najjar, Amro and Winikoff, Michael and Främling, Kary},
    eisbn = {978-3-031-15565-9},
    eissn = {1611-3349},
    iris = {11585/899511},
    isbn = {978-3-031-15564-2},
    issn = {0302-9743},
    keywords = {Symbolic Knowledge Injection, Explainable AI, XAI, Neural Networks, PSyKI},
    note = {4th International Workshop, EXTRAAMAS 2022, Virtual Event, May 9--10, 2022, Revised Selected Papers},
    pages = {90--108},
    publisher = {Springer},
    scholar = {7587528289517313138},
    scopus = {2-s2.0-85138317005},
    series = {Lecture Notes in Computer Science},
    title = {On the Design of {PSyKI}: a Platform for Symbolic Knowledge Injection into Sub-Symbolic Predictors},
    url = {https://link.springer.com/chapter/10.1007/978-3-031-15565-9_6},
    urlpdf = {https://link.springer.com/content/pdf/10.1007/978-3-031-15565-9_6.pdf},
    volume = 13283,
    wos = {000870042100006},
    year = 2022
}
```

### FaUCI fairness method

> Matteo Magnini, Giovanni Ciatto, Roberta Calegari, and Andrea Omicini, “Enforcing fairness via constraint injection with fauci,” in Proceedings of the 2nd Workshop on Fairness and Bias in AI co-located with 27th European Conference on Artificial Intelligence (ECAI 2024), Santiago de Compostela, Spain, October 20th, 2024, R. Calegari, V. Dignum, and B. O’Sullivan, Eds., ser. CEUR Workshop Proceedings, vol. 3808, CEUR-WS.org, 2024.

Bibtex: 
```bibtex
@inproceedings{DBLP:conf/aequitas/MagniniCCO24,
  author       = {Matteo Magnini and
                  Giovanni Ciatto and
                  Roberta Calegari and
                  Andrea Omicini},
  editor       = {Roberta Calegari and
                  Virginia Dignum and
                  Barry O'Sullivan},
  title        = {Enforcing Fairness via Constraint Injection with FaUCI},
  booktitle    = {Proceedings of the 2nd Workshop on Fairness and Bias in {AI} co-located
                  with 27th European Conference on Artificial Intelligence {(ECAI} 2024),
                  Santiago de Compostela, Spain, October 20th, 2024},
  series       = {{CEUR} Workshop Proceedings},
  volume       = {3808},
  publisher    = {CEUR-WS.org},
  year         = {2024},
  url          = {https://ceur-ws.org/Vol-3808/paper8.pdf},
  timestamp    = {Fri, 08 Nov 2024 15:21:04 +0100},
  biburl       = {https://dblp.org/rec/conf/aequitas/MagniniCCO24.bib},
  bibsource    = {dblp computer science bibliography, https://dblp.org}
}
```

### KINS injector

> Matteo Magnini, Giovanni Ciatto, and Andrea Omicini, “Knowledge injection of datalog rules via neural network structuring with KINS,” J. Log. Comput., vol. 33, no. 8, pp. 1832–1850, 2023. doi: 10.1093/LOGCOM/EXAD037.

Bibtex: 
```bibtex
@article{DBLP:journals/logcom/MagniniCO23,
  author       = {Matteo Magnini and
                  Giovanni Ciatto and
                  Andrea Omicini},
  title        = {Knowledge injection of Datalog rules via Neural Network Structuring
                  with {KINS}},
  journal      = {J. Log. Comput.},
  volume       = {33},
  number       = {8},
  pages        = {1832--1850},
  year         = {2023},
  url          = {https://doi.org/10.1093/logcom/exad037},
  doi          = {10.1093/LOGCOM/EXAD037},
  timestamp    = {Tue, 02 Jan 2024 12:25:17 +0100},
  biburl       = {https://dblp.org/rec/journals/logcom/MagniniCO23.bib},
  bibsource    = {dblp computer science bibliography, https://dblp.org}
}
```

### KILL injector

> Matteo Magnini, Giovanni Ciatto, and Andrea Omicini, “A view to a KILL: knowledge injection via lambda layer,” in Proceedings of the 23rd Workshop "From Objects to Agents", Genova, Italy, September 1-3, 2022, A. Ferrando and V. Mascardi, Eds., ser. CEUR Workshop Proceedings, vol. 3261, CEUR-WS.org, 2022, pp. 61–76.

Bibtex: 
```bibtex
@inproceedings{DBLP:conf/woa/MagniniCO22,
  author       = {Matteo Magnini and
                  Giovanni Ciatto and
                  Andrea Omicini},
  editor       = {Angelo Ferrando and
                  Viviana Mascardi},
  title        = {A view to a {KILL:} knowledge injection via lambda layer},
  booktitle    = {Proceedings of the 23rd Workshop "From Objects to Agents", Genova,
                  Italy, September 1-3, 2022},
  series       = {{CEUR} Workshop Proceedings},
  volume       = {3261},
  pages        = {61--76},
  publisher    = {CEUR-WS.org},
  year         = {2022},
  url          = {https://ceur-ws.org/Vol-3261/paper5.pdf},
  timestamp    = {Fri, 10 Mar 2023 16:22:40 +0100},
  biburl       = {https://dblp.org/rec/conf/woa/MagniniCO22.bib},
  bibsource    = {dblp computer science bibliography, https://dblp.org}
}
```

### QoS metrics

> Andrea Agiollo, Andrea Rafanelli, Matteo Magnini, Giovanni Ciatto, Andrea Omicini. "Symbolic knowledge injection meets intelligent agents: QoS metrics and experiments", in:  Autonomous Agents and Multi-Agent Systems, 2023.

Bibtex: 
```bibtex
@article{ski-qos23,
  author       = {Andrea Agiollo and
                  Andrea Rafanelli and
                  Matteo Magnini and
                  Giovanni Ciatto and
                  Andrea Omicini},
  title        = {Symbolic knowledge injection meets intelligent agents: QoS metrics
                  and experiments},
  journal      = {Auton. Agents Multi Agent Syst.},
  volume       = {37},
  number       = {2},
  pages        = {27},
  year         = {2023},
  url          = {https://doi.org/10.1007/s10458-023-09609-6},
  doi          = {10.1007/S10458-023-09609-6},
  timestamp    = {Tue, 12 Sep 2023 07:57:44 +0200},
  bibsource    = {dblp computer science bibliography, https://dblp.org}
}
```

### Robustness metrics for symbolic knowledge injection

> Andrea Rafanelli, Matteo Magnini, Andrea Agiollo, Giovanni Ciatto, and Andrea Omicini, “An empirical study on the robustness of knowledge injection techniques against data degradation,” in Proceedings of the 25th Workshop "From Objects to Agents", Bard (Aosta), Italy, July 8-10, 2024, M. Alderighi, M. Baldoni, C. Baroglio, R. Micalizio, and S. Tedeschi, Eds., ser. CEUR Workshop Proceedings, vol. 3735, CEUR-WS.org, 2024, pp. 20–32.

Bibtex: 
```bibtex
@inproceedings{DBLP:conf/woa/RafanelliMACO24,
  author       = {Andrea Rafanelli and
                  Matteo Magnini and
                  Andrea Agiollo and
                  Giovanni Ciatto and
                  Andrea Omicini},
  editor       = {Marco Alderighi and
                  Matteo Baldoni and
                  Cristina Baroglio and
                  Roberto Micalizio and
                  Stefano Tedeschi},
  title        = {An Empirical Study on the Robustness of Knowledge Injection Techniques
                  Against Data Degradation},
  booktitle    = {Proceedings of the 25th Workshop "From Objects to Agents", Bard (Aosta),
                  Italy, July 8-10, 2024},
  series       = {{CEUR} Workshop Proceedings},
  volume       = {3735},
  pages        = {20--32},
  publisher    = {CEUR-WS.org},
  year         = {2024},
  url          = {https://ceur-ws.org/Vol-3735/paper\_02.pdf},
  timestamp    = {Fri, 26 Jul 2024 22:35:03 +0200},
  biburl       = {https://dblp.org/rec/conf/woa/RafanelliMACO24.bib},
  bibsource    = {dblp computer science bibliography, https://dblp.org}
}
```

## More in detail

An `Injector` is a SKI algorithm that may -- or may not -- take a sub-symbolic predictor in conjunction with prior symbolic knowledge to create a new predictor through the `inject` method.
We refer to the new predictor as **educated**, while predictors that are not affected by symbolic knowledge are called **uneducated**.

Knowledge can be represented in many ways.
The most common is the representation via logic formulae.
PSyKI integrates [`2ppy`](https://github.com/tuProlog/2ppy), a python porting of [`2p-kt`](https://github.com/tuProlog/2p-kt) (a multi-paradigm logic programming framework).
Thanks to this integration, PSyKI supports logic formulae written with the formalism of Prolog.
Therefore, all subsets of the Prolog language (including Prolog itself) are potentially supported (e.g., propositional logic, Datalog, etc.).
It is worth noting that each injector may have its own requirements on the knowledge representation.
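As a concrete illustration, a knowledge file could contain a Datalog-style rule such as the one below. The predicate name, variables, and threshold are purely hypothetical and only meant to show what a `.pl` file might hold; each injector may expect a different format.

```python
import os
import tempfile

# Hypothetical Datalog-style rule: classify a flower as setosa when its petal
# length is at most 2.0 (predicate name and threshold are illustrative only).
KNOWLEDGE = "class(PetalLength, PetalWidth, setosa) :- PetalLength =< 2.0."

# Write the rule to a .pl file, the kind of file a theory can be built from.
with tempfile.NamedTemporaryFile("w", suffix=".pl", delete=False) as f:
    f.write(KNOWLEDGE)
    knowledge_file = f.name

print(knowledge_file.endswith(".pl"))  # True
os.unlink(knowledge_file)
```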

List of available injectors:

 - [`KBANN`](https://www.sciencedirect.com/science/article/pii/0004370294901058), one of the first injectors introduced in the literature;
 - [`KILL`](http://ceur-ws.org/Vol-3261/paper5.pdf), which constrains a NN by altering its predictions according to the knowledge;
 - [`KINS`](http://ceur-ws.org/Vol-3204/paper_25.pdf), which structures the knowledge as ad-hoc layers added to a NN.

### High level architecture

![PSyKI class diagram](https://www.plantuml.com/plantuml/svg/TLF1Rjim3BthAnvyweQvGA_1C7Gx32YMvTBUYc9WPDdH8ec1H6z8i_pxP3bnd64z9NmKtoCVwVia5ANtJgcqjM57aJoS3KRsEmEEic6bTgItr1auxgm-A0NGEaaaBVZAqVUE3XbJm541WSLWpIBimUtvWGA0XeIG2tijVJG5QZc2HcB4tWsW2KqXKOEGTfGIdZQ6u_vGAfnDydnYVS4sy6zdciwC0bRBSnRu01la1QsXGUY7fzt_qeNxb3mgXPCCghiAx-iQLQYczjNnOaCswjg4X_3JQE5O6lpEZN7OHJEeSHoWHube-zTNsrfJ05iARavwKdxUBSRIkOtHTXi1jvF2Od55Z3wOfjSaffaRD_dsxM7rEBfcWy3HliWVvm-MoyCy_l9vjHehMiSaO6ywciKTUK_p5gjDFfHObyCnOc82jyD48DTnjBBngG8bhEuKHdhStfQeT3S6fG4RjSjyAC-rmZGqFlwfwu9erALg_3lIJV1oURMboV3qpyMUyN5C_BB9oiqRLvMNGc7_ncNFDugdI26rcI0XxVsQtUcWqzb-1Y7rwthANdyDc2smp74vnkpHfyaCTN4bMvUpipwKkiyKlNT_0G00)
*Class diagram representing the relations between `Injector`, `Theory` and `Fuzzifier` classes*

<!--
To generate/edit the class diagram browse the URL above, after replacing `svg` with `uml`
-->

The core abstractions of PSyKI are the following:

 - `Injector`: a SKI algorithm;
 - `Theory`: symbolic knowledge plus additional information about the domain;
 - `Fuzzifier`: an entity that transforms (fuzzifies) symbolic knowledge into a sub-symbolic data structure.

The class `Theory` is built upon the symbolic knowledge and the metadata of the dataset (extracted from a pandas DataFrame).
The knowledge can be generated by an adapter that parses the Prolog theory (e.g., a `.pl` file, a string) and generates a list of `Formula` objects.
Each `Injector` has one `Fuzzifier`.
The `Fuzzifier` is used to transform the `Theory` into a sub-symbolic data structure (e.g., ad-hoc layers of a NN).
Different fuzzifiers encode the knowledge in different ways.
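To make fuzzification concrete, here is a toy sketch (not the PSyKI API): a crisp constraint such as `X =< 2.0` is relaxed into a continuous penalty that a loss function could use.

```python
# Toy fuzzification sketch (not the PSyKI API): relax the crisp constraint
# "x =< threshold" into a ReLU-style penalty that is 0 when the constraint
# holds and grows linearly with the violation.
def fuzzify_leq(threshold: float):
    return lambda x: max(0.0, x - threshold)

penalty = fuzzify_leq(2.0)
print(penalty(1.5))  # 0.0 -> constraint satisfied
print(penalty(3.0))  # 1.0 -> violated by 1.0
```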

To avoid confusion, we use the following terminology:

 - **rule** is a single logic clause;
 - **knowledge** is the set of rules;
 - **theory** is the knowledge plus metadata.

### Hello world

The following example shows how to use PSyKI to inject knowledge into a NN.

```python
import pandas as pd
from tensorflow.keras.models import Model
from psyki.logic import Theory
from psyki.ski import Injector


def create_uneducated_predictor() -> Model:
    ...

dataset = pd.read_csv("path_to_dataset.csv")  # load dataset
knowledge_file = "path_to_knowledge_file.pl"  # load knowledge
theory = Theory(knowledge_file, dataset)      # create a theory
uneducated = create_uneducated_predictor()    # create a NN
injector = Injector.kins(uneducated)          # create an injector
educated = injector.inject(theory)            # inject knowledge into the NN

# From now on you can use the educated predictor as you would use the uneducated one
```

For more detailed examples, please refer to the demos in the [demo-psyki-python](https://github.com/psykei/demo-psyki-python) repository.

---------------------------------------------------------------------------------

## Injection Assessment

Knowledge injection methods aim to make machine learning more sustainable by reducing the learning time required. This accelerated learning may also improve model accuracy. Additionally, incorporating prior knowledge can loosen the data requirements for effective training, allowing smaller datasets to be used. Injecting knowledge also has the potential to increase model interpretability by preventing the predictor from becoming a black box.

Most existing works on knowledge injection techniques showcase standard evaluation metrics that aim to quantify these potential benefits. However, accurately quantifying sustainability, accuracy improvements, dataset needs, and interpretability in a consistent manner remains an open challenge.

<!--
**Efficiency metrics**: 

- **Memory footprint**, i.e., the size of the predictor under examination;
- **Data efficiency**, i.e., the amount of data required to train the predictor;
- **Latency**, i.e., the time required to run a predictor for inference;
- **Energy consumption**, i.e., the amount of energy required to train/run the predictor;
-->

### <b>💾 Memory Footprint </b>
<u><i>Intuition:</i></u> injected knowledge can reduce the amount of notions to be learned from data.

<u><i>Idea:</i></u> measure the total number of operations (FLOPs or MACs) required by the model.
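As a rough sketch of the idea, the MACs of a fully-connected network can be counted directly from its layer widths (the architectures below are made up for illustration):

```python
# Sketch: count multiply-accumulate operations (MACs) of one forward pass
# through a fully-connected network, given its layer widths.
def dense_macs(layer_sizes):
    # Each Dense layer costs in_features * out_features MACs.
    return sum(i * o for i, o in zip(layer_sizes, layer_sizes[1:]))

uneducated = [4, 64, 64, 3]    # plain NN
educated = [4, 64, 32, 3]      # e.g., injected knowledge let us shrink a layer

print(dense_macs(uneducated))  # 4*64 + 64*64 + 64*3 = 4544
print(dense_macs(educated))    # 4*64 + 64*32 + 32*3 = 2400
```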

### <b>🗂️ Data Efficiency </b>
<u><i>Intuition:</i></u> several concepts are injected → some portions of the training data are no longer required.

<u><i>Idea:</i></u> reduce the size of the training set $D$ for the educated predictor, letting the injected knowledge $K$ compensate for the missing data.
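The idea amounts to plain subsampling of the training set; a minimal sketch (fraction and seed are arbitrary):

```python
import random

# Sketch: train the educated predictor on a fraction of the data and let the
# injected knowledge compensate for the rest.
def subsample(dataset, fraction, seed=42):
    rng = random.Random(seed)
    k = int(len(dataset) * fraction)
    return rng.sample(dataset, k)

D = list(range(1000))         # stand-in for the full training set
D_small = subsample(D, 0.25)  # educated predictor sees only 25% of D
print(len(D_small))           # 250
```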

### <b>⏳ Latency </b>
<u><i>Intuition:</i></u> knowledge injection removes unnecessary computations required to draw a prediction.

<u><i>Idea:</i></u> measure the average time required to draw a single prediction on a dataset $T$.
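A minimal latency sketch with a stand-in predictor (in practice `predict` would be the model's inference call):

```python
import time

# Sketch: average wall-clock latency of a predictor over a test set T.
def mean_latency(predict, T):
    start = time.perf_counter()
    for x in T:
        predict(x)
    return (time.perf_counter() - start) / len(T)

# Trivial stand-in predictor; replace with the model's inference call.
latency = mean_latency(lambda x: x * 2, list(range(10_000)))
print(f"{latency:.2e} s per prediction")
```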

### <b>⚡ Energy Consumption </b> 
<u><i>Intuition:</i></u> knowledge injection reduces learning complexity → fewer computations are needed to train and run a model.

<u><i>Idea:</i></u> i) measure the average energy consumption (mW) of a single weight update on a training dataset $T$; ii) measure the average energy consumption (mW) of a single forward pass on a test dataset $T$ composed of several samples.

## Example
The following example shows how to use QoS metrics.

```python

from psyki.qos.energy import Energy

# Model training
model1 = ... # create model 1
model2 = ... # create model 2

X_train, y_train = ... # get training data

model1.fit(X_train, y_train) 
model2.fit(X_train, y_train)

# Compute energy
params = {'x': X_train, 'y': y_train, 'epochs': 100}
training_energy = Energy.compute_during_training(model1, model2, params)


# Model inference
X_test, y_test = ... # get test data 

model1_preds = model1.predict(X_test)
model2_preds = model2.predict(X_test)

# Compute energy
params = {'x': X_test}  
inference_energy = Energy.compute_during_inference(model1, model2, params)

# The process for the other metrics (data efficiency, latency, and memory) is the same.
```


------------------------------------------------------------------


## Users

PSyKI is deployed as a library on PyPI, and can therefore be installed as a Python package by running:
```text
pip install psyki
```

### Requirements

- python 3.9+
- java 11
- 2ppy 0.4.0
- tensorflow 2.13.1
- numpy 1.22.3
- scikit-learn 1.0.2
- pandas 1.4.2
- codecarbon 2.1.4


## Developers

Working with PSyKI codebase requires a number of tools to be installed:
* Python 3.9+
* JDK 11+ (please ensure the `JAVA_HOME` environment variable is properly configured)
* Git 2.20+

### Develop PSyKI with PyCharm

To participate in the development of PSyKI, we suggest the [PyCharm](https://www.jetbrains.com/pycharm/) IDE.

#### Importing the project

1. Clone this repository in a folder of your preference using `git clone`
2. Open PyCharm
3. Select `Open`
4. Navigate your file system and find the folder where you cloned the repository
5. Click `Open`

### Developing the project

Contributions to this project are welcome. Just some rules:
* We use [git flow](https://github.com/nvie/gitflow), so if you write new features, please do so in a separate `feature/` branch
* We recommend forking the project, developing your stuff, then contributing back via pull request
* Commit often
* Stay in sync with the `develop` (or `main | master`) branch (pull frequently if the build passes)
* Do not introduce low quality or untested code

#### Issue tracking
If you meet some problem in using or developing PSyKI, you are encouraged to signal it through the project
["Issues" section](https://github.com/psykei/psyki-python/issues) on GitHub.



            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "psyki",
    "maintainer": null,
    "docs_url": null,
    "requires_python": "==3.9.13",
    "maintainer_email": null,
    "keywords": null,
    "author": "Matteo Magnini",
    "author_email": "matteo.magnini@unibo.it",
    "download_url": "https://files.pythonhosted.org/packages/48/7f/1a3ee379e05c11107ca5599368f2d8a0217850a6cc51829a64d0a84840ee/psyki-0.4.3.tar.gz",
    "platform": null,
    "description": "<h1 align=\"center\"> PSyKI </h1>\n<h2 align=\"center\">Platform for Symbolic Knowledge Injection</h2>\n\n[![PyPI version](https://badge.fury.io/py/psyki.svg)](https://badge.fury.io/py/psyki)\n[![Test](https://github.com/psykei/psyki-python/actions/workflows/check.yml/badge.svg?event=push)]()\n![Codecov](https://img.shields.io/codecov/c/github/psykei/psyki-python)\n[![Release](https://github.com/psykei/psyki-python/actions/workflows/deploy.yml/badge.svg?event=push)]()\n[![Black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)\n[![License](https://img.shields.io/badge/License-Apache_2.0-blue.svg)](https://opensource.org/licenses/Apache-2.0)\n[![Stand With Ukraine](https://raw.githubusercontent.com/vshymanskyy/StandWithUkraine/main/badges/StandWithUkraine.svg)](https://stand-with-ukraine.pp.ua)\n\nPSyKI (<u><b>P</b></u>latform for <u><b>Sy</b></u>mbolic <u><b>K</b></u>nowledge <u><b>I</b></u>njection) is a python library for symbolic knowledge injection (<b>SKI</b>).\nSKI is a particular subclass of neuro-symbolic (<b>NeSy</b>) integration techniques.\nPSyKI offers SKI algorithms (a.k.a. <b>injectors</b>) along with quality of service metrics (<b>QoS</b>).\n\nSome quick links:\n<!-- * [Home Page](https://apice.unibo.it/xwiki/bin/view/PSyKI/) -->\n* [GitHub Repository](https://github.com/psykei/psyki-python)\n* [PyPi Repository](https://pypi.org/project/psyki/)\n* [Issues](https://github.com/psykei/psyki-python/issues)\n\n\n**\u2728 Update [Nov. 2024]**: from version 0.4.1, PSyKI supports **FaUCI**, a novel fairness method to reduce bias in machine learning models.\n\n\n## Reference papers\n\nCite the **PSyKI** paper ff you use this library in your research.\nAdditionally, cite the respective paper if you use one of the original injectors, fairness methods or QoS metrics.\n\n### PSyKI (main paper)\n> Matteo Magnini, Giovanni Ciatto, Andrea Omicini. 
\"On the Design of PSyKI: A Platform for Symbolic Knowledge Injection into Sub-Symbolic Predictors\", in: Proceedings of the 4th International Workshop on EXplainable and TRAnsparent AI and Multi-Agent Systems, 2022.\n\nBibtex: \n```bibtex\n@incollection{psyki-extraamas2022,\n    author = {Magnini, Matteo and Ciatto, Giovanni and Omicini, Andrea},\n    booktitle = {Explainable and Transparent AI and Multi-Agent Systems},\n    chapter = 6,\n    dblp = {conf/atal/MagniniCO22},\n    doi = {10.1007/978-3-031-15565-9_6},\n    editor = {Calvaresi, Davide and Najjar, Amro and Winikoff, Michael and Fr\u00e4mling, Kary},\n    eisbn = {978-3-031-15565-9},\n    eissn = {1611-3349},\n    iris = {11585/899511},\n    isbn = {978-3-031-15564-2},\n    issn = {0302-9743},\n    keywords = {Symbolic Knowledge Injection, Explainable AI, XAI, Neural Networks, PSyKI},\n    note = {4th International Workshop, EXTRAAMAS 2022, Virtual Event, May 9--10, 2022, Revised Selected Papers},\n    pages = {90--108},\n    publisher = {Springer},\n    scholar = {7587528289517313138},\n    scopus = {2-s2.0-85138317005},\n    series = {Lecture Notes in Computer Science},\n    title = {On the Design of {PSyKI}: a Platform for Symbolic Knowledge Injection into Sub-Symbolic Predictors},\n    url = {https://link.springer.com/chapter/10.1007/978-3-031-15565-9_6},\n    urlpdf = {https://link.springer.com/content/pdf/10.1007/978-3-031-15565-9_6.pdf},\n    volume = 13283,\n    wos = {000870042100006},\n    year = 2022\n}\n```\n\n### FaUCI fairness method\n\n> Matteo Magnini, Giovanni Ciatto, Roberta Calegari, and Andrea Omicini, \u201cEnforcing fairness via constraint injection with fauci,\u201d in Proceedings of the 2nd Workshop on Fairness and Bias in AI co-located with 27th European Conference on Artificial Intelligence (ECAI 2024), Santiago de Compostela, Spain, October 20th, 2024, R. Calegari, V. Dignum, and B. O\u2019Sullivan, Eds., ser. CEUR Workshop Proceedings, vol. 
3808, CEUR-WS.org, 2024.\n\nBibtex: \n```bibtex\n@inproceedings{DBLP:conf/aequitas/MagniniCCO24,\n  author       = {Matteo Magnini and\n                  Giovanni Ciatto and\n                  Roberta Calegari and\n                  Andrea Omicini},\n  editor       = {Roberta Calegari and\n                  Virginia Dignum and\n                  Barry O'Sullivan},\n  title        = {Enforcing Fairness via Constraint Injection with FaUCI},\n  booktitle    = {Proceedings of the 2nd Workshop on Fairness and Bias in {AI} co-located\n                  with 27th European Conference on Artificial Intelligence {(ECAI} 2024),\n                  Santiago de Compostela, Spain, October 20th, 2024},\n  series       = {{CEUR} Workshop Proceedings},\n  volume       = {3808},\n  publisher    = {CEUR-WS.org},\n  year         = {2024},\n  url          = {https://ceur-ws.org/Vol-3808/paper8.pdf},\n  timestamp    = {Fri, 08 Nov 2024 15:21:04 +0100},\n  biburl       = {https://dblp.org/rec/conf/aequitas/MagniniCCO24.bib},\n  bibsource    = {dblp computer science bibliography, https://dblp.org}\n}\n```\n\n### KINS injector\n\n> Matteo Magnini, Giovanni Ciatto, and Andrea Omicini \u201cKnowledge injection of datalog rules via neural network structuring with KINS,\u201d J. Log. Comput., vol. 33, no. 8, pp. 1832\u20131850, 2023. doi: 10.1093/LOGCOM/\nEXAD037.\n\nBibtex: \n```bibtex\n@article{DBLP:journals/logcom/MagniniCO23,\n  author       = {Matteo Magnini and\n                  Giovanni Ciatto and\n                  Andrea Omicini},\n  title        = {Knowledge injection of Datalog rules via Neural Network Structuring\n                  with {KINS}},\n  journal      = {J. Log. 
Comput.},\n  volume       = {33},\n  number       = {8},\n  pages        = {1832--1850},\n  year         = {2023},\n  url          = {https://doi.org/10.1093/logcom/exad037},\n  doi          = {10.1093/LOGCOM/EXAD037},\n  timestamp    = {Tue, 02 Jan 2024 12:25:17 +0100},\n  biburl       = {https://dblp.org/rec/journals/logcom/MagniniCO23.bib},\n  bibsource    = {dblp computer science bibliography, https://dblp.org}\n}\n```\n\n### KILL injector\n\n> Matteo Magnini, Giovanni Ciatto, and Andrea Omicini, \u201cA view to a KILL: knowledge injection via lambda layer,\u201d in Proceedings of the 23rd Workshop \"From Objects to Agents\", Genova, Italy, September 1-3, 2022, A. Ferrando and V. Mascardi, Eds., ser. CEUR Workshop Proceedings, vol. 3261, CEUR-WS.org, 2022, pp. 61\u201376.\n\nBibtex: \n```bibtex\n@inproceedings{DBLP:conf/woa/MagniniCO22,\n  author       = {Matteo Magnini and\n                  Giovanni Ciatto and\n                  Andrea Omicini},\n  editor       = {Angelo Ferrando and\n                  Viviana Mascardi},\n  title        = {A view to a {KILL:} knowledge injection via lambda layer},\n  booktitle    = {Proceedings of the 23rd Workshop \"From Objects to Agents\", Genova,\n                  Italy, September 1-3, 2022},\n  series       = {{CEUR} Workshop Proceedings},\n  volume       = {3261},\n  pages        = {61--76},\n  publisher    = {CEUR-WS.org},\n  year         = {2022},\n  url          = {https://ceur-ws.org/Vol-3261/paper5.pdf},\n  timestamp    = {Fri, 10 Mar 2023 16:22:40 +0100},\n  biburl       = {https://dblp.org/rec/conf/woa/MagniniCO22.bib},\n  bibsource    = {dblp computer science bibliography, https://dblp.org}\n}\n```\n\n### QoS metrics\n\n> Andrea Agiollo, Andrea Rafanelli, Matteo Magnini, Giovanni Ciatto, Andrea Omicini. 
\"Symbolic knowledge injection meets intelligent agents: QoS metrics and experiments\", in:  Autonomous Agents and Multi-Agent Systems, 2023.\n\nBibtex: \n```bibtex\n@article{ski-qos23,\n  author       = {Andrea Agiollo and\n                  Andrea Rafanelli and\n                  Matteo Magnini and\n                  Giovanni Ciatto and\n                  Andrea Omicini},\n  title        = {Symbolic knowledge injection meets intelligent agents: QoS metrics\n                  and experiments},\n  journal      = {Auton. Agents Multi Agent Syst.},\n  volume       = {37},\n  number       = {2},\n  pages        = {27},\n  year         = {2023},\n  url          = {https://doi.org/10.1007/s10458-023-09609-6},\n  doi          = {10.1007/S10458-023-09609-6},\n  timestamp    = {Tue, 12 Sep 2023 07:57:44 +0200},\n  bibsource    = {dblp computer science bibliography, https://dblp.org}\n}\n```\n\n### Robustness metrics for symbolic knowledge injection\n\n> Andrea Rafanelli, Matteo Magnini, Andrea Agiollo, Giovanni Ciatto, and Andrea Omicini, \u201cAn empirical study on the robustness of knowledge injection techniques against data degradation,\u201d in Proceedings of the 25th Workshop \"From Objects to Agents\", Bard (Aosta), Italy, July 8-10, 2024, M. Alderighi, M. Baldoni, C. Baroglio, R. Micalizio, and S. Tedeschi, Eds., ser. CEURWorkshop Proceedings, vol. 3735, CEUR-WS.org, 2024, pp. 
20–32.

BibTeX:
```bibtex
@inproceedings{DBLP:conf/woa/RafanelliMACO24,
  author       = {Andrea Rafanelli and
                  Matteo Magnini and
                  Andrea Agiollo and
                  Giovanni Ciatto and
                  Andrea Omicini},
  editor       = {Marco Alderighi and
                  Matteo Baldoni and
                  Cristina Baroglio and
                  Roberto Micalizio and
                  Stefano Tedeschi},
  title        = {An Empirical Study on the Robustness of Knowledge Injection Techniques
                  Against Data Degradation},
  booktitle    = {Proceedings of the 25th Workshop "From Objects to Agents", Bard (Aosta),
                  Italy, July 8-10, 2024},
  series       = {{CEUR} Workshop Proceedings},
  volume       = {3735},
  pages        = {20--32},
  publisher    = {CEUR-WS.org},
  year         = {2024},
  url          = {https://ceur-ws.org/Vol-3735/paper\_02.pdf},
  timestamp    = {Fri, 26 Jul 2024 22:35:03 +0200},
  biburl       = {https://dblp.org/rec/conf/woa/RafanelliMACO24.bib},
  bibsource    = {dblp computer science bibliography, https://dblp.org}
}
```

## More in detail

An `Injector` is a SKI algorithm that may -- or may not -- take a sub-symbolic predictor in conjunction with prior symbolic knowledge to create a new predictor through the `inject` method.
We refer to the new predictor as **educated**, while predictors that are not affected by symbolic knowledge are called **uneducated**.

Knowledge can be represented in many ways.
The most common representation is via logic formulae.
PSyKI integrates [`2ppy`](https://github.com/tuProlog/2ppy), a Python port of [`2p-kt`](https://github.com/tuProlog/2p-kt) (a multi-paradigm logic programming framework).
Thanks to this integration, PSyKI supports logic formulae written in the Prolog formalism.
Therefore, all subsets of the Prolog language (including Prolog itself) are potentially supported (e.g., propositional logic, Datalog, etc.).
It is worth noting that each injector may have its own requirements on the knowledge representation.

List of available injectors:

 - [`KBANN`](https://www.sciencedirect.com/science/article/pii/0004370294901058), one of the first injectors introduced in the literature;
 - [`KILL`](http://ceur-ws.org/Vol-3261/paper5.pdf), constrains a NN by altering its predictions using the knowledge;
 - [`KINS`](http://ceur-ws.org/Vol-3204/paper_25.pdf), structures the knowledge by adding ad-hoc layers into a NN.

### High-level architecture

![PSyKI class diagram](https://www.plantuml.com/plantuml/svg/TLF1Rjim3BthAnvyweQvGA_1C7Gx32YMvTBUYc9WPDdH8ec1H6z8i_pxP3bnd64z9NmKtoCVwVia5ANtJgcqjM57aJoS3KRsEmEEic6bTgItr1auxgm-A0NGEaaaBVZAqVUE3XbJm541WSLWpIBimUtvWGA0XeIG2tijVJG5QZc2HcB4tWsW2KqXKOEGTfGIdZQ6u_vGAfnDydnYVS4sy6zdciwC0bRBSnRu01la1QsXGUY7fzt_qeNxb3mgXPCCghiAx-iQLQYczjNnOaCswjg4X_3JQE5O6lpEZN7OHJEeSHoWHube-zTNsrfJ05iARavwKdxUBSRIkOtHTXi1jvF2Od55Z3wOfjSaffaRD_dsxM7rEBfcWy3HliWVvm-MoyCy_l9vjHehMiSaO6ywciKTUK_p5gjDFfHObyCnOc82jyD48DTnjBBngG8bhEuKHdhStfQeT3S6fG4RjSjyAC-rmZGqFlwfwu9erALg_3lIJV1oURMboV3qpyMUyN5C_BB9oiqRLvMNGc7_ncNFDugdI26rcI0XxVsQtUcWqzb-1Y7rwthANdyDc2smp74vnkpHfyaCTN4bMvUpipwKkiyKlNT_0G00)
*Class diagram representing the relations between `Injector`, `Theory` and `Fuzzifier` classes*

<!--
To generate/edit the class diagram browse the URL above, after replacing `svg` with `uml`
-->

The core abstractions of PSyKI are the following:

 - `Injector`: a SKI algorithm;
 - `Theory`: symbolic knowledge plus additional information about the domain;
 - `Fuzzifier`: an entity that transforms (fuzzifies) symbolic knowledge into a sub-symbolic data structure.

The class `Theory` is built upon the symbolic knowledge and the metadata of the dataset (extracted from a pandas DataFrame).
The knowledge can be generated by an adapter that parses the Prolog theory (e.g., a `.pl` file, a string) and generates a list of `Formula` objects.
Each `Injector` has one `Fuzzifier`.
The `Fuzzifier` is used to transform the `Theory` into a sub-symbolic data structure (e.g., ad-hoc layers of a NN).
Different fuzzifiers encode the knowledge in different ways.

To avoid confusion, we use the following terminology:

 - **rule** is a single logic clause;
 - **knowledge** is the set of rules;
 - **theory** is the knowledge plus metadata.

### Hello world

The following example shows how to use PSyKI to inject knowledge into a NN.

```python
import pandas as pd
from tensorflow.keras.models import Model
from psyki.logic import Theory
from psyki.ski import Injector


def create_uneducated_predictor() -> Model:
    ...

dataset = pd.read_csv("path_to_dataset.csv")  # load dataset
knowledge_file = "path_to_knowledge_file.pl"  # load knowledge
theory = Theory(knowledge_file, dataset)      # create a theory
uneducated = create_uneducated_predictor()    # create a NN
injector = Injector.kins(uneducated)          # create an injector
educated = injector.inject(theory)            # inject knowledge into the NN

# From now on you can use the educated predictor as you would use the uneducated one
```

For more detailed examples, please refer to the demos in the [demo-psyki-python](https://github.com/psykei/demo-psyki-python) repository.

---------------------------------------------------------------------------------

## Injection Assessment

Knowledge injection methods aim to enhance the sustainability of machine learning by minimizing the learning time required. This accelerated learning can also lead to improvements in model accuracy. Additionally, by incorporating prior knowledge, the data requirements for effective training may be loosened, allowing smaller datasets to be used.
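The data-efficiency claim can be made concrete with a deliberately tiny, pure-Python sketch (a hypothetical toy, not the PSyKI API): when the governing rule of a concept is available as prior knowledge, it needs no training data at all, whereas a purely data-driven learner must estimate it from samples.

```python
import random

random.seed(42)

# Hypothetical toy task (not PSyKI API): the true concept is "positive iff x > 0",
# and we assume this very rule is the prior knowledge available for injection.
xs = [random.uniform(-1.0, 1.0) for _ in range(1000)]
ys = [int(x > 0) for x in xs]

def fit_threshold(sample_x, sample_y):
    """'Uneducated' learner: estimate a decision threshold from data alone."""
    pos = [x for x, y in zip(sample_x, sample_y) if y == 1] or [1.0]
    neg = [x for x, y in zip(sample_x, sample_y) if y == 0] or [-1.0]
    return (min(pos) + max(neg)) / 2  # midpoint between the observed classes

def accuracy(threshold):
    return sum(int(x > threshold) == y for x, y in zip(xs, ys)) / len(xs)

# With only 4 training points the data-driven threshold may drift away from 0,
# while the "injected" rule (threshold = 0) is exact without any data.
uneducated_threshold = fit_threshold(xs[:4], ys[:4])
educated_threshold = 0.0
print(accuracy(educated_threshold) >= accuracy(uneducated_threshold))  # True
```

Real SKI methods are far subtler, of course; the sketch only illustrates why injected knowledge can substitute for training samples.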
Injecting knowledge also has the potential to increase model interpretability by preventing the predictor from becoming a black box.

Most existing works on knowledge injection techniques showcase standard evaluation metrics that aim to quantify these potential benefits. However, accurately quantifying sustainability, accuracy improvements, dataset needs, and interpretability in a consistent manner remains an open challenge.

<!--
**Efficiency metrics**: 

- **Memory footprint**, i.e., the size of the predictor under examination;
- **Data efficiency**, i.e., the amount of data required to train the predictor;
- **Latency**, i.e., the time required to run a predictor for inference;
- **Energy consumption**, i.e., the amount of energy required to train/run the predictor;
-->

### <b>💾 Memory Footprint</b>
<u><i>Intuition:</i></u> injected knowledge can reduce the number of notions that must be learned from data.

<u><i>Idea:</i></u> measure the total number of operations (FLOPs or MACs) required by the model.

### <b>🗂️ Data Efficiency</b>
<u><i>Intuition:</i></u> several concepts are injected → some portions of the training data are not required.

<u><i>Idea:</i></u> reduce the size of the training set $D$ for the educated predictor, letting the input knowledge $K$ compensate for the missing data.

### <b>⏳ Latency</b>
<u><i>Intuition:</i></u> knowledge injection removes unnecessary computations required to draw a prediction.

<u><i>Idea:</i></u> measure the average time required to draw a single prediction on a dataset $T$.

### <b>⚡ Energy Consumption</b>
<u><i>Intuition:</i></u> knowledge injection reduces learning complexity → it reduces the amount of computation needed to train and run a model.

<u><i>Idea:</i></u> i) measure the average energy consumption (mW) of a single update on a training dataset $T$; ii) measure the average energy consumption (mW) of a single forward pass on a test dataset $T$ composed of several samples.

## Example
The following example shows how to use the QoS metrics.

```python
from psyki.qos.energy import Energy

# Model training
model1 = ...  # create model 1
model2 = ...  # create model 2

X_train, y_train = ...  # get training data

model1.fit(X_train, y_train)
model2.fit(X_train, y_train)

# Compute the energy consumed during training
params = {'x': X_train, 'y': y_train, 'epochs': 100}
training_energy = Energy.compute_during_training(model1, model2, params)

# Model inference
X_test, y_test = ...  # get test data

model1_preds = model1.predict(X_test)
model2_preds = model2.predict(X_test)

# Compute the energy consumed during inference
params = {'x': X_test}
inference_energy = Energy.compute_during_inference(model1, model2, params)

# The process for the other metrics (data efficiency, latency, and memory) is the same.
```

------------------------------------------------------------------

## Users

PSyKI is deployed as a library on PyPI, and it can therefore be installed as a Python package by running:
```text
pip install psyki
```

### Requirements

- python 3.9+
- java 11
- 2ppy 0.4.0
- tensorflow 2.13.1
- numpy 1.22.3
- scikit-learn 1.0.2
- pandas 1.4.2
- codecarbon 2.1.4

## Developers

Working with the PSyKI codebase requires a number of tools to be installed:
* Python 3.9+
* JDK 11+ (please ensure the `JAVA_HOME` environment variable is properly configured)
* Git 2.20+

### Develop PSyKI with PyCharm

To participate in the development of PSyKI, we suggest the [PyCharm](https://www.jetbrains.com/pycharm/) IDE.

#### Importing the project

1. Clone this repository in a folder of your preference using `git clone`
2. Open PyCharm
3. Select `Open`
4. Navigate your file system and find the folder where you cloned the repository
5. Click `Open`

### Developing the project

Contributions to this project are welcome.
Just some rules:
* We use [git flow](https://github.com/nvie/gitflow), so if you write new features, please do so in a separate `feature/` branch
* We recommend forking the project, developing your stuff, then contributing back via pull request
* Commit often
* Stay in sync with the `develop` (or `main | master`) branch (pull frequently if the build passes)
* Do not introduce low-quality or untested code

#### Issue tracking
If you encounter a problem while using or developing PSyKI, you are encouraged to report it through the project's
["Issues" section](https://github.com/psykei/psyki-python/issues) on GitHub.