# FindSimilar
User-friendly library to find similar objects
You can find **Full Project Documentation** [here][documentation_path]
<hr>
#### Workflows
[![Tests](https://github.com/findsimilar/find-similar/actions/workflows/run-tests.yml/badge.svg?branch=main)](https://github.com/findsimilar/find-similar/actions/workflows/run-tests.yml)
[![Pylint](https://github.com/findsimilar/find-similar/actions/workflows/lint.yml/badge.svg?branch=main)](https://github.com/findsimilar/find-similar/actions/workflows/lint.yml)
#### PyPI
[![Version](https://img.shields.io/pypi/v/find-similar.svg)](https://pypi.python.org/pypi/find-similar/)
[![Development Status](https://img.shields.io/pypi/status/find-similar.svg)](https://pypi.python.org/pypi/find-similar)
[![Python version](https://img.shields.io/pypi/pyversions/find-similar.svg)](https://pypi.python.org/pypi/find-similar/)
[![Wheel](https://img.shields.io/pypi/wheel/find-similar.svg)](https://pypi.python.org/pypi/find-similar/)
#### Anaconda
[![Version](https://anaconda.org/quillcraftsman/find-similar/badges/version.svg)](https://anaconda.org/quillcraftsman/find-similar/)
[![Last Updated](https://anaconda.org/quillcraftsman/find-similar/badges/latest_release_date.svg)](https://anaconda.org/quillcraftsman/find-similar/)
[![Platforms](https://anaconda.org/quillcraftsman/find-similar/badges/platforms.svg)](https://anaconda.org/quillcraftsman/find-similar/)
#### License
[![License](https://img.shields.io/pypi/l/find-similar)](https://github.com/findsimilar/find-similar/blob/main/LICENSE)
#### Support
[![Documentation](https://img.shields.io/badge/docs-0094FF.svg)][documentation_path]
[![Discussions](https://img.shields.io/badge/discussions-ff0068.svg)](https://github.com/findsimilar/find-similar/discussions/)
[![Issues](https://img.shields.io/badge/issues-11AE13.svg)](https://github.com/findsimilar/find-similar/issues/)
#### PyPI Downloads
[![Day Downloads](https://img.shields.io/pypi/dd/find-similar)](https://pepy.tech/project/find-similar)
[![Week Downloads](https://img.shields.io/pypi/dw/find-similar)](https://pepy.tech/project/find-similar)
[![Month Downloads](https://img.shields.io/pypi/dm/find-similar)](https://pepy.tech/project/find-similar)
#### Anaconda Downloads
[![Anaconda](https://anaconda.org/quillcraftsman/find-similar/badges/downloads.svg)](https://anaconda.org/quillcraftsman/find-similar/)
#### Languages
[![Languages](https://img.shields.io/github/languages/count/findsimilar/find-similar)](https://github.com/findsimilar/find-similar)
[![Top Language](https://img.shields.io/github/languages/top/findsimilar/find-similar)](https://github.com/findsimilar/find-similar)
#### Development
- [![Release date](https://img.shields.io/github/release-date/findsimilar/find-similar
)](https://github.com/findsimilar/find-similar/releases)
[![Last Commit](https://img.shields.io/github/last-commit/findsimilar/find-similar/main
)](https://github.com/findsimilar/find-similar)
- [![Issues](https://img.shields.io/github/issues/findsimilar/find-similar
)](https://github.com/findsimilar/find-similar/issues/)
[![Closed Issues](https://img.shields.io/github/issues-closed/findsimilar/find-similar
)](https://github.com/findsimilar/find-similar/issues/)
- [![Pull Requests](https://img.shields.io/github/issues-pr/findsimilar/find-similar
)](https://github.com/findsimilar/find-similar/pulls)
[![Closed Pull Requests](https://img.shields.io/github/issues-pr-closed-raw/findsimilar/find-similar
)](https://github.com/findsimilar/find-similar/pulls)
- [![Discussions](https://img.shields.io/github/discussions/findsimilar/find-similar
)](https://github.com/findsimilar/find-similar/discussions/)
#### Repository Stats
[![Stars](https://img.shields.io/github/stars/findsimilar/find-similar
)](https://github.com/findsimilar/find-similar)
[![Contributors](https://img.shields.io/github/contributors/findsimilar/find-similar
)](https://github.com/findsimilar/find-similar/graphs/contributors)
[![Forks](https://img.shields.io/github/forks/findsimilar/find-similar
)](https://github.com/findsimilar/find-similar)
<hr>
## Menu
- [Mission](#mission)
- [Open Source Project](#open-source-project)
- [Features](#features)
- [Requirements](#requirements)
- [Development Status](#development-status)
- [Install](#install)
- [Quickstart](#quickstart)
- [Contributing](#contributing)
## Mission
The mission of the **FindSimilar** project is to provide a powerful and versatile open source library that empowers
developers to efficiently find similar objects and perform comparisons across a variety of data types.
Whether dealing with texts, images, audio, or more,
our project aims to simplify the process of identifying similarities and enhancing decision-making.
## Open Source Project
This is an open source project released under the [MIT license](LICENSE).
Feel free to use, fork, clone, and contribute.
## Features
Find similar texts:
- in different languages
- with or without stopwords
- with or without a dictionary
- with or without keywords
## Requirements
- nltk, pymorphy3
- See more in [Full Documentation](https://findsimilar.craftsman.lol/about.html#requirements)
## Development Status
- The package is available on [PyPI](https://pypi.org/project/find-similar/)
- See more in [Full Documentation](https://findsimilar.craftsman.lol/about.html#development-status)
## Install
### with pip
```commandline
pip install find-similar
```
See more in [Full Documentation](https://findsimilar.craftsman.lol/install.html)
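
To confirm that the package is visible to the current interpreter after installation, you can query its distribution metadata. The sketch below uses only the standard library (`importlib.metadata`, Python 3.8+). The helper name `installed_version` is hypothetical and is not part of FindSimilar.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name):
    """Return the installed version of a distribution, or None if it is not installed."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


# Prints the installed version string, or None if find-similar is missing.
print(installed_version("find-similar"))
```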
## Quickstart
```python
from find_similar import find_similar
texts = ['one two', 'two three', 'three four']
text_to_compare = 'one four'
find_similar(text_to_compare, texts, count=10)
```
```commandline
[TokenText(text="one two", len(tokens)=2, cos=0.5), TokenText(text="three four", len(tokens)=2, cos=0.5), TokenText(text="two three", len(tokens)=2, cos=0)]
```
- The result is a list of `TokenText` instances ordered by `cos`, highest first
- `cos` is the similarity score between each text and `text_to_compare`
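
For intuition about the scores above, here is a minimal standalone sketch that reproduces the same numbers. It is not the library's implementation: it assumes whitespace tokenization and a cosine similarity over token sets, and the names `tokenize`, `cosine`, and `rank` are hypothetical. FindSimilar itself relies on nltk and pymorphy3 and may process text differently.

```python
import math


def tokenize(text):
    # Assumption: lowercase and split on whitespace, keeping unique tokens.
    return set(text.lower().split())


def cosine(a, b):
    # Cosine similarity of two token sets viewed as binary vectors:
    # shared tokens divided by the geometric mean of the set sizes.
    if not a or not b:
        return 0.0
    return len(a & b) / math.sqrt(len(a) * len(b))


def rank(query, texts, count=10):
    # Score every text against the query and return the best `count` matches.
    query_tokens = tokenize(query)
    scored = [(text, cosine(query_tokens, tokenize(text))) for text in texts]
    # The sort is stable, so texts with equal scores keep their input order.
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:count]


texts = ['one two', 'two three', 'three four']
ranked = rank('one four', texts)
print(ranked)
```

Under these assumptions, `'one two'` and `'three four'` each share one of two tokens with `'one four'` and score 0.5, while `'two three'` shares none and scores 0.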
### See more examples in [Full Documentation][documentation_path]
## Contributing
Contributions are welcome! To get started, please check:
- [Full Documentation][documentation_path]
- [Contributing](CONTRIBUTING.md)
- [Developer Documentation](https://findsimilar.craftsman.lol/dev_documentation.html)
- [Code of Conduct](CODE_OF_CONDUCT.md)
- [Security Policy](SECURITY.md)
- [Governance](GOVERNANCE.md)
- [Support](SUPPORT.md)
[documentation_path]: https://findsimilar.craftsman.lol
Raw data
{
"_id": null,
"home_page": "https://github.com/findsimilar/find-similar",
"name": "find-similar",
"maintainer": null,
"docs_url": null,
"requires_python": ">=3",
"maintainer_email": null,
"keywords": "python, search, machine-learning, natural-language-processing, find, words, texts, similar",
"author": "findsimilar",
"author_email": "quill@craftsman.lol",
"download_url": "https://files.pythonhosted.org/packages/1f/3b/b34e9121f75a17404960a06b8c7f22f6066b09e056284a1d6be3c7b8dacf/find_similar-2.2.1.tar.gz",
"platform": null,
"bugtrack_url": null,
"license": "MIT",
"summary": "User-friendly library to find similar objects",
"version": "2.2.1",
"project_urls": {
"Changelog": "https://github.com/findsimilar/find-similar/releases",
"Documentation": "https://findsimilar.craftsman.lol",
"Download": "https://pypi.org/project/find-similar/",
"Homepage": "https://github.com/findsimilar/find-similar",
"Release notes": "https://github.com/findsimilar/find-similar/releases",
"Source": "https://github.com/findsimilar/find-similar",
"Tracker": "https://github.com/findsimilar/find-similar/issues"
},
"split_keywords": [
"python",
" search",
" machine-learning",
" natural-language-processing",
" find",
" words",
" texts",
" similar"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "9e4b1cc9bd5e993fb81861e39808eb7546e4400e66f602c50eb59f6fbbca50d3",
"md5": "53e48de67794c4e4bc8364571a9ff808",
"sha256": "abbc99c2ada7bd14ab8d2baa6c92d45f9a53b15620817796f097f09b4c0ddbd5"
},
"downloads": -1,
"filename": "find_similar-2.2.1-py3-none-any.whl",
"has_sig": false,
"md5_digest": "53e48de67794c4e4bc8364571a9ff808",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">=3",
"size": 16871,
"upload_time": "2024-04-16T14:35:44",
"upload_time_iso_8601": "2024-04-16T14:35:44.810005Z",
"url": "https://files.pythonhosted.org/packages/9e/4b/1cc9bd5e993fb81861e39808eb7546e4400e66f602c50eb59f6fbbca50d3/find_similar-2.2.1-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "1f3bb34e9121f75a17404960a06b8c7f22f6066b09e056284a1d6be3c7b8dacf",
"md5": "abae8be8844b2ed4ba25df78d1b52647",
"sha256": "0741407aa3a38a49f5ae3a783220afcc03addcf32e45887f6287d3667f97e4b5"
},
"downloads": -1,
"filename": "find_similar-2.2.1.tar.gz",
"has_sig": false,
"md5_digest": "abae8be8844b2ed4ba25df78d1b52647",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3",
"size": 16364,
"upload_time": "2024-04-16T14:35:46",
"upload_time_iso_8601": "2024-04-16T14:35:46.579182Z",
"url": "https://files.pythonhosted.org/packages/1f/3b/b34e9121f75a17404960a06b8c7f22f6066b09e056284a1d6be3c7b8dacf/find_similar-2.2.1.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-04-16 14:35:46",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "findsimilar",
"github_project": "find-similar",
"travis_ci": false,
"coveralls": false,
"github_actions": true,
"requirements": [],
"lcname": "find-similar"
}