# ODT-AI-Filters
This repository is an AI plug-ins for the Oligo Designer Toolsuite package. Here we collect all the required functionalities and implementation to train and run machine learning models in the Oligo Designer Toolsuite pipelines.
For each task we provide a pretrained model, but also the code impelemtation to train you own model with the architecture and hypeparameters you prefer.
In general, the model training pipeline performs a grid hyperparameters seach and stores all the models trained in a folder [filter_type]/[model_architecture]/[dataset_name]. Then the best model is saved inthe same folder under the filter_type name.
## Available AI models.
### Hybridization Probability
This model is used to improove the specificity estiamtion of the oligo sequences.
With a Recurrent Neural Network we estimate the hybridization probability between off-target sites and oligos, and use it to determine if the sites represent a real threat. To predict hybridization probability, our models use the genomic sequences of the oligo and the off-target site. In addition, several manually extracted sequence features, such as the GC content and the melting temperature, were fed into the model.
For generating the ground-truths of the hybridization-probability filters we use the [NUPACK](https://docs.nupack.org/) pakage, which estimasates the equilibrium cooncentrations of DNA complexes.
In particualr, the score is obtained from the final concentration of DNA complexes in NUPACK tube experiment simulation
that contains the oligo sequence, the exact on-target region and the off-target. The oligo, on-target and off-target
strands are initially set at the same concentration $C_{in}$ and we define the duplexing score as:
$log( \dfrac{C_{oligo + off-t}}{C_{oligo + off-t} + C_{oligo + on-t} })$.
add how to install nupack
## Pretrained models
The best performing architecture for the different taks that are available as pretrained models are the following:
| AI filter | Architecture |
|-|-|
| hybridization probability | lstm |
Raw data
{
"_id": null,
"home_page": null,
"name": "oligo-designer-toolsuite-ai-filters",
"maintainer": null,
"docs_url": null,
"requires_python": "<3.11",
"maintainer_email": "Francesco Campi <francesco.campi@helmholtz-munich.de>",
"keywords": "bioinformatics, probes, oligo nucleotides, probe design, oligo design",
"author": null,
"author_email": "Francesco Campi <francesco.campi@helmholtz-munich.de>, Lisa Barros de Andrade e Sousa <lisa.barros.andrade.sousa@gmail.com>",
"download_url": "https://files.pythonhosted.org/packages/b9/9d/78f972c2b98adaef64485f464da41f1a0cf5083e1ddc6bfe488c52ba22a0/oligo_designer_toolsuite_ai_filters-0.0.3.tar.gz",
"platform": null,
"description": "# ODT-AI-Filters\nThis repository is an AI plug-ins for the Oligo Designer Toolsuite package. Here we collect all the required functionalities and implementation to train and run machine learning models in the Oligo Designer Toolsuite pipelines.\n\nFor each task we provide a pretrained model, but also the code impelemtation to train you own model with the architecture and hypeparameters you prefer.\nIn general, the model training pipeline performs a grid hyperparameters seach and stores all the models trained in a folder [filter_type]/[model_architecture]/[dataset_name]. Then the best model is saved inthe same folder under the filter_type name.\n\n\n## Available AI models.\n\n\n### Hybridization Probability\n\nThis model is used to improove the specificity estiamtion of the oligo sequences.\nWith a Recurrent Neural Network we estimate the hybridization probability between off-target sites and oligos, and use it to determine if the sites represent a real threat. To predict hybridization probability, our models use the genomic sequences of the oligo and the off-target site. In addition, several manually extracted sequence features, such as the GC content and the melting temperature, were fed into the model.\n\nFor generating the ground-truths of the hybridization-probability filters we use the [NUPACK](https://docs.nupack.org/) pakage, which estimasates the equilibrium cooncentrations of DNA complexes.\nIn particualr, the score is obtained from the final concentration of DNA complexes in NUPACK tube experiment simulation\nthat contains the oligo sequence, the exact on-target region and the off-target. The oligo, on-target and off-target\nstrands are initially set at the same concentration $C_{in}$ and we define the duplexing score as: \n\n$log( \\dfrac{C_{oligo + off-t}}{C_{oligo + off-t} + C_{oligo + on-t} })$. \n \nadd how to install nupack\n\n## Pretrained models\n\nThe best performing architecture for the different taks that are available as pretrained models are the following:\n\n| AI filter | Architecture |\n|-|-|\n| hybridization probability | lstm |\n",
"bugtrack_url": null,
"license": null,
"summary": "AI plug-ins for the Oligo Designer Toolsuite package",
"version": "0.0.3",
"project_urls": {
"repository": "https://github.com/HelmholtzAI-Consultants-Munich/oligo-designer-toolsuite-AI-filters"
},
"split_keywords": [
"bioinformatics",
" probes",
" oligo nucleotides",
" probe design",
" oligo design"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "d21eae0b19ec9ed732f0706caf30c9016ed117f58cf2397a270362864e522e86",
"md5": "fb088a0c3e3c2f6f0f13fa28618ffe04",
"sha256": "896a4bb3d0fb3f775d1409b68e6ab63b28c9ab50bb7989d9193cba898586d2f2"
},
"downloads": -1,
"filename": "oligo_designer_toolsuite_ai_filters-0.0.3-py3-none-any.whl",
"has_sig": false,
"md5_digest": "fb088a0c3e3c2f6f0f13fa28618ffe04",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": "<3.11",
"size": 1291573,
"upload_time": "2024-05-23T07:34:49",
"upload_time_iso_8601": "2024-05-23T07:34:49.530323Z",
"url": "https://files.pythonhosted.org/packages/d2/1e/ae0b19ec9ed732f0706caf30c9016ed117f58cf2397a270362864e522e86/oligo_designer_toolsuite_ai_filters-0.0.3-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "b99d78f972c2b98adaef64485f464da41f1a0cf5083e1ddc6bfe488c52ba22a0",
"md5": "05455fa9dc96cc925043d3be8b57ca81",
"sha256": "6c3c9d99cd5cb0d369b22bf6778f58f24204f0a0bb9f911272f8f23f4080c437"
},
"downloads": -1,
"filename": "oligo_designer_toolsuite_ai_filters-0.0.3.tar.gz",
"has_sig": false,
"md5_digest": "05455fa9dc96cc925043d3be8b57ca81",
"packagetype": "sdist",
"python_version": "source",
"requires_python": "<3.11",
"size": 15616687,
"upload_time": "2024-05-23T07:34:52",
"upload_time_iso_8601": "2024-05-23T07:34:52.338859Z",
"url": "https://files.pythonhosted.org/packages/b9/9d/78f972c2b98adaef64485f464da41f1a0cf5083e1ddc6bfe488c52ba22a0/oligo_designer_toolsuite_ai_filters-0.0.3.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-05-23 07:34:52",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "HelmholtzAI-Consultants-Munich",
"github_project": "oligo-designer-toolsuite-AI-filters",
"travis_ci": false,
"coveralls": false,
"github_actions": false,
"lcname": "oligo-designer-toolsuite-ai-filters"
}