cleverminer


Namecleverminer JSON
Version 1.0.11 PyPI version JSON
download
home_pagehttps://cleverminer.org
SummaryBeyond apriori. Cleverminer is the package for enhanced association rule mining (eARM). Comparing to standard association rules, it is very enhanced, because the package implements the GUHA procedures that generalises apriori and association rules in many ways. Rules are based on categorical data that can be easily visualized and interpreted. Their if-then with probability allows easy deployment by human realized processes. Trully explainable knowledge mining.
upload_time2024-03-31 19:34:44
maintainerNone
docs_urlNone
author(C) Copyright 2020 - 2024 Little Big Company, s.r.o.
requires_python>=3.8
licenseNone
keywords
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # CleverMiner

<img alt="PyPI - Python Version" src="https://img.shields.io/pypi/pyversions/cleverminer">
<img alt="PyPI - Wheel" src="https://img.shields.io/pypi/wheel/cleverminer">
<img alt="PyPI - Status" src="https://img.shields.io/pypi/status/cleverminer">
<img alt="PyPI - Downloads" src="https://img.shields.io/pypi/dm/cleverminer">

## The CleverMiner is an enhanced association rule mining library 

Beyond apriori. Cleverminer is the package for enhanced association rule mining (eARM), that are INTERPRETABLE, so by definition   one of few intristic methods of explainable AI (XAI). Comparing to standard association rules, it is very enhanced, because the package implements the GUHA procedures that generalises apriori and association rules in many ways. Rules are based on categorical data that can be easily visualized and interpreted. Their if-then with probability allows easy deployment by human realized processes. Trully explainable knowledge mining.

## The CleverMiner in more detail

In general, apriori is looking for rules {ItemSet} -> {Item} (Base, prob). GUHA goes further and instead of items (boolean attributes), list of categorial attributes and combination of values (nominal and several strategies for ordinal -- joining categories) is searched on left and right hand side. Moreover, GUHA has much more possibilites and several other procedures, like mining interesting histograms, finding couples of rules etc.

To run cleverminer procedures, use dataframe with categorical variables only. Cleverminer prepares all variables and values for future reuse.

## Optimized in many ways

CleverMiner has optimized space search in several ways. 

- first, it encodes dataframe into internal format that is optimized for frequent querying for similar pattern, then it queries the dataframe many (typically several thousand or tens/hunderds thousand) times. 
- the algorithm has also optimizations by the derived properties of individual procedures (e.g. when procedure A finds that expanding rule will in every case lead into rules that does not meet requirements, it skips entire branch). This optimization typically reduces the mining time significantly.


## CleverMiner documentation

Documentation for CleverMiner can be found at [cleverminer.org](https://cleverminer.org)

## What's new

1.0.11
 - fixed wording in UIC miner output
 - fixed bug in dataset description

1.0.10
 - get_ruletext method
 - result contains rowcont of original dataframe to increase possibilities of post-processing

1.0.9
 - fixed bugs - error message when displaying of results that has not been calculated
 - Python 3.12 regression tests passed ok

1.0.8
 - data preparation enhanced in many ways
 - able to work with series like 1,2,3, 4-20, 21 and more, 0.0,1.0, ..., Temp 17-25oC,...


1.0.7
 - CF miner supports step size and range between relmax & relmin
 - categories printed in CF output
 - bugfix: checking number of categories fixed
 - can return also adjusted df (ordered categories, ...) - functionality not guaranteed to future

1.0.6
 - progressbar

1.0.5

 - supports missing pandas functionalities and implements several automated data preprocessing
 - listing of variables, ordering and labels
 - automatically process conversion to numeric, integers and order float & integer variables
 - fixed verbosity level prints

1.0.4
 - sorting output rules

1.0.3
 - UIC Miner introduced

1.0.2
 - merge changes from 0.91 (data structure checks; as 1.0.0 was build from 0.0.90 so remaining features are merged now)

1.0.1
 - new procedures get4fold, gethist, getquantifiers, getrulecount

1.0.0 - Major release, major rebuild from all views:
 - data import reworked and fastened significantly
 - much faster calculation (rule mining) in Py3.10 + next optimizations for rule mining are in place
 - output structure is enhanced, fully structured output is available for post-processing (trace_cedent, cedent_struct in output)
 - data can be read once and multiple tasks can be performed (.mine method)
 - optimizations for sd4ft miner
 - verbosity options available (run progress output has been changed)
 - additional options available (able to override maximum number of categories)
 - better formatting outputs (bugfix)
 - data structure in output has changed

0.0.91 - detect error in datatypes in input data and correctly report it

0.0.90 - fix in displaying rules for 4ft-Miner, in CF-Miner: allowing relmax to be bounded from both sides (leq introduced), in SD4ft-Miner: allowing ratioconf to be bounded from both sides (leq introduced)

0.0.89 - quantifiers and output dictionary names change in favor of rules terminology (output: hypotheses->rules; hypo_id -> rule_id, quantifiers kept 
for compatibility old and new names, including variability (like frstbase -> also base1 is possible)

0.0.88 - print of task summary, hypo listing and individual hypothesis

0.0.87 - support for 'one category' added

0.0.86 - bugfixes (space search for optimized branch, able to switch off optimization, minimal cedent length bug for optimized search)

0.0.85 - bugfixes (row_count), checking input structure

0.0.84 - optimizations for conjunctions

 



            

Raw data

            {
    "_id": null,
    "home_page": "https://cleverminer.org",
    "name": "cleverminer",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.8",
    "maintainer_email": null,
    "keywords": null,
    "author": "(C) Copyright 2020 - 2024 Little Big Company, s.r.o.",
    "author_email": "code@cleverminer.org",
    "download_url": "https://files.pythonhosted.org/packages/87/fb/5bdedd578750bb4536baa81ab87f4965c4c43deb57c19db596257d1e4a95/cleverminer-1.0.11.tar.gz",
    "platform": null,
    "description": "# CleverMiner\r\n\r\n<img alt=\"PyPI - Python Version\" src=\"https://img.shields.io/pypi/pyversions/cleverminer\">\r\n<img alt=\"PyPI - Wheel\" src=\"https://img.shields.io/pypi/wheel/cleverminer\">\r\n<img alt=\"PyPI - Status\" src=\"https://img.shields.io/pypi/status/cleverminer\">\r\n<img alt=\"PyPI - Downloads\" src=\"https://img.shields.io/pypi/dm/cleverminer\">\r\n\r\n## The CleverMiner is an enhanced association rule mining library \r\n\r\nBeyond apriori. Cleverminer is the package for enhanced association rule mining (eARM), that are INTERPRETABLE, so by definition   one of few intristic methods of explainable AI (XAI). Comparing to standard association rules, it is very enhanced, because the package implements the GUHA procedures that generalises apriori and association rules in many ways. Rules are based on categorical data that can be easily visualized and interpreted. Their if-then with probability allows easy deployment by human realized processes. Trully explainable knowledge mining.\r\n\r\n## The CleverMiner in more detail\r\n\r\nIn general, apriori is looking for rules {ItemSet} -> {Item} (Base, prob). GUHA goes further and instead of items (boolean attributes), list of categorial attributes and combination of values (nominal and several strategies for ordinal -- joining categories) is searched on left and right hand side. Moreover, GUHA has much more possibilites and several other procedures, like mining interesting histograms, finding couples of rules etc.\r\n\r\nTo run cleverminer procedures, use dataframe with categorical variables only. Cleverminer prepares all variables and values for future reuse.\r\n\r\n## Optimized in many ways\r\n\r\nCleverMiner has optimized space search in several ways. \r\n\r\n- first, it encodes dataframe into internal format that is optimized for frequent querying for similar pattern, then it queries the dataframe many (typically several thousand or tens/hunderds thousand) times. \r\n- the algorithm has also optimizations by the derived properties of individual procedures (e.g. when procedure A finds that expanding rule will in every case lead into rules that does not meet requirements, it skips entire branch). This optimization typically reduces the mining time significantly.\r\n\r\n\r\n## CleverMiner documentation\r\n\r\nDocumentation for CleverMiner can be found at [cleverminer.org](https://cleverminer.org)\r\n\r\n## What's new\r\n\r\n1.0.11\r\n - fixed wording in UIC miner output\r\n - fixed bug in dataset description\r\n\r\n1.0.10\r\n - get_ruletext method\r\n - result contains rowcont of original dataframe to increase possibilities of post-processing\r\n\r\n1.0.9\r\n - fixed bugs - error message when displaying of results that has not been calculated\r\n - Python 3.12 regression tests passed ok\r\n\r\n1.0.8\r\n - data preparation enhanced in many ways\r\n - able to work with series like 1,2,3, 4-20, 21 and more, 0.0,1.0, ..., Temp 17-25oC,...\r\n\r\n\r\n1.0.7\r\n - CF miner supports step size and range between relmax & relmin\r\n - categories printed in CF output\r\n - bugfix: checking number of categories fixed\r\n - can return also adjusted df (ordered categories, ...) - functionality not guaranteed to future\r\n\r\n1.0.6\r\n - progressbar\r\n\r\n1.0.5\r\n\r\n - supports missing pandas functionalities and implements several automated data preprocessing\r\n - listing of variables, ordering and labels\r\n - automatically process conversion to numeric, integers and order float & integer variables\r\n - fixed verbosity level prints\r\n\r\n1.0.4\r\n - sorting output rules\r\n\r\n1.0.3\r\n - UIC Miner introduced\r\n\r\n1.0.2\r\n - merge changes from 0.91 (data structure checks; as 1.0.0 was build from 0.0.90 so remaining features are merged now)\r\n\r\n1.0.1\r\n - new procedures get4fold, gethist, getquantifiers, getrulecount\r\n\r\n1.0.0 - Major release, major rebuild from all views:\r\n - data import reworked and fastened significantly\r\n - much faster calculation (rule mining) in Py3.10 + next optimizations for rule mining are in place\r\n - output structure is enhanced, fully structured output is available for post-processing (trace_cedent, cedent_struct in output)\r\n - data can be read once and multiple tasks can be performed (.mine method)\r\n - optimizations for sd4ft miner\r\n - verbosity options available (run progress output has been changed)\r\n - additional options available (able to override maximum number of categories)\r\n - better formatting outputs (bugfix)\r\n - data structure in output has changed\r\n\r\n0.0.91 - detect error in datatypes in input data and correctly report it\r\n\r\n0.0.90 - fix in displaying rules for 4ft-Miner, in CF-Miner: allowing relmax to be bounded from both sides (leq introduced), in SD4ft-Miner: allowing ratioconf to be bounded from both sides (leq introduced)\r\n\r\n0.0.89 - quantifiers and output dictionary names change in favor of rules terminology (output: hypotheses->rules; hypo_id -> rule_id, quantifiers kept \r\nfor compatibility old and new names, including variability (like frstbase -> also base1 is possible)\r\n\r\n0.0.88 - print of task summary, hypo listing and individual hypothesis\r\n\r\n0.0.87 - support for 'one category' added\r\n\r\n0.0.86 - bugfixes (space search for optimized branch, able to switch off optimization, minimal cedent length bug for optimized search)\r\n\r\n0.0.85 - bugfixes (row_count), checking input structure\r\n\r\n0.0.84 - optimizations for conjunctions\r\n\r\n \r\n\r\n\r\n",
    "bugtrack_url": null,
    "license": null,
    "summary": "Beyond apriori. Cleverminer is the package for enhanced association rule mining (eARM). Comparing to standard association rules, it is very enhanced, because the package implements the GUHA procedures that generalises apriori and association rules in many ways. Rules are based on categorical data that can be easily visualized and interpreted. Their if-then with probability allows easy deployment by human realized processes. Trully explainable knowledge mining.",
    "version": "1.0.11",
    "project_urls": {
        "Homepage": "https://cleverminer.org"
    },
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "524873546980a429db758a8ed196b72daff04a098d51c01e47c23b456f193a90",
                "md5": "b7d824b712c1de127c19417a52433085",
                "sha256": "3aaaf5c23043f9a9a28534660af5ebc8e76ceda102864155715626180efe1565"
            },
            "downloads": -1,
            "filename": "cleverminer-1.0.11-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "b7d824b712c1de127c19417a52433085",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.8",
            "size": 26160,
            "upload_time": "2024-03-31T19:34:43",
            "upload_time_iso_8601": "2024-03-31T19:34:43.046806Z",
            "url": "https://files.pythonhosted.org/packages/52/48/73546980a429db758a8ed196b72daff04a098d51c01e47c23b456f193a90/cleverminer-1.0.11-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "87fb5bdedd578750bb4536baa81ab87f4965c4c43deb57c19db596257d1e4a95",
                "md5": "0a24ebdee865a6866b943684e4c48688",
                "sha256": "42411ba7a959bd706d110ef3f31bfaf850900ee500d38a2f61031400a69ab8c6"
            },
            "downloads": -1,
            "filename": "cleverminer-1.0.11.tar.gz",
            "has_sig": false,
            "md5_digest": "0a24ebdee865a6866b943684e4c48688",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.8",
            "size": 27683,
            "upload_time": "2024-03-31T19:34:44",
            "upload_time_iso_8601": "2024-03-31T19:34:44.954864Z",
            "url": "https://files.pythonhosted.org/packages/87/fb/5bdedd578750bb4536baa81ab87f4965c4c43deb57c19db596257d1e4a95/cleverminer-1.0.11.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-03-31 19:34:44",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "cleverminer"
}
        
Elapsed time: 0.23625s