fstoolbox


Namefstoolbox JSON
Version 0.0.2 PyPI version JSON
download
home_pagehttps://github.com/Craigacp/FEAST
SummaryAn information theoretic feature selection toolbox.
upload_time2022-12-10 16:17:34
maintainerBrent Barbachem
docs_urlNone
authorAdam Pocock
requires_python>=3.9, <4
licenseMIT
keywords feast feature selection fstoolbox
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            FEAST
=====

A FEAture Selection Toolbox for C/C++, Java, Python, &amp; MATLAB/Octave, v2.1.0.

FEAST provides implementations of common mutual information based filter
feature selection algorithms, and an implementation of RELIEF for Matlab. All
functions expect discrete inputs (except RELIEF, which does not depend on the
MIToolbox), and they return the selected feature indices. These implementations
were developed to help our research into the similarities between these
algorithms, and our results are presented in the following paper:

```
 Conditional Likelihood Maximisation: A Unifying Framework for Information Theoretic Feature Selection
 G. Brown, A. Pocock, M.-J. Zhao, M. Lujan
 Journal of Machine Learning Research, 13:27-66 (2012)
```

The weighted feature selection algorithms are described in Chapter 7 of:

```
 Feature Selection via Joint Likelihood
 A. Pocock
 PhD Thesis, University of Manchester, 2012
```

If you use these implementations for academic research please cite the relevant paper
above.  All FEAST code is licensed under the BSD 3-Clause License.

Contains implementations of:
   mim, mrmr, mifs, cmim, jmi, disr, cife, icap, condred, cmi, relief, fcbf, betagamma

And weighted implementations of:
   mim, cmim, jmi, disr, cmi

References for these algorithms are provided in the accompanying feast.bib file
(in BibTeX format).

FEAST works on discrete inputs, and all continuous values **must** be
discretised before use with FEAST.  In our experiments we've found that using
10 equal width bins is suitable for many problems, though this is data set size
dependent. FEAST produces unreliable results when used with continuous inputs,
runs slowly and uses much more memory than usual. The discrete inputs should
have small cardinality, FEAST will treat values {1,10,100} the same way it
treats {1,2,3} and the latter will be both faster and use less memory.

MATLAB Example (using "data" as our feature matrix, and "labels" as the class label vector):

```
>> size(data)
ans = 
     (569,30)                                     %% denoting 569 examples, and 30 features
```
```
>> selectedIndices = feast('jmi',5,data,labels) %% selecting the top 5 features using the jmi algorithm
selectedIndices =

    28
    21
     8
    27
    23
```
```
>> selectedIndices = feast('mrmr',10,data,labels) %% selecting the top 10 features using the mrmr algorithm
selectedIndices =

    28
    24
    22
     8
    27
    21
    29
     4
     7
    25
```
```
>> selectedIndices = feast('mifs',5,data,labels,0.7) %% selecting the top 5 features using the mifs algorithm with beta = 0.7
selectedIndices =

    28
    24
    22
    20
    29
```

The library is written in ANSI C for compatibility with the MATLAB mex
compiler, except for MIM, FCBF and RELIEF, which are written in MATLAB/OCTAVE
script. There is a different implementation of MIM available for use in the C
library. It depends on MIToolbox which is incorporated as a git submodule.

MIToolbox is developed on [GitHub](http://www.github.com/Craigacp/MIToolbox/).

The C library expects all matrices in column-major format (i.e. Fortran style).
This is for two reasons, a) MATLAB generates Fortran-style arrays, and b)
feature selection iterates over columns rather than rows, unlike most other ML
processes. 

Compilation instructions:
Run `git submodule init` then,
 - MATLAB/OCTAVE 
    - run `CompileFEAST.m` in the `matlab` folder.
 - Linux C shared library 
    - run `make x86` or `make x64` for a 32-bit or 64-bit library.
 - Windows C dll (expects pre built libMIToolbox.dll)
	- install MinGW from https://sourceforge.net/projects/mingw-w64/
	- add MinGW binaries folders to PATH, e.g. mingw/bin, mingw/msys/bin 
	- run `make x64_win`.
 - Java (requires Java 8)
    - run `make x64`, `sudo make install` to build and install the C library.
    - then `make java` to build the JNI wrapper.
    - then run `mvn package` in the `java` directory to build the jar file.
    - Note: the Java code should work on all platforms and future versions of Java, but the included Makefile only works on Ubuntu & Java 8.
 - Python
    - run `python setup.py` in the `python` folder.

Update History
 - xx/xx/xxxx - v2.1.0 - Added a python API and refactored the package structure.
 - 07/01/2017 - v2.0.0 - Added weighted feature selection, major refactoring of the code to improve speed and portability. FEAST functions now return the internal scores assigned by each criteria as well. Added a Java API via JNI. FEAST v2 is approximately 30% faster when called from Matlab.
 - 12/03/2016 - v1.1.4 - Fixed an issue where Matlab would segfault if all features had zero MI with the label.
 - 12/10/2014 - v1.1.2 - Updated documentation to note that FEAST expects column-major matrices.
 - 11/06/2014 - v1.1.1 - Fixed an issue where MIM wasn't compiled into libFSToolbox.
 - 22/02/2014 - v1.1.0 - Bug fixes in memory allocation, added a C implementation of MIM, moved the selected feature increment into the mex code.
 - 12/02/2013 - v1.0.1 - Bug fix for 32-bit Windows MATLAB's lcc.
 - 08/11/2011 - v1.0.0 - Public Release to complement the JMLR publication.


            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/Craigacp/FEAST",
    "name": "fstoolbox",
    "maintainer": "Brent Barbachem",
    "docs_url": null,
    "requires_python": ">=3.9, <4",
    "maintainer_email": "barbacbd@gmail.com",
    "keywords": "feast,feature,selection,fstoolbox",
    "author": "Adam Pocock",
    "author_email": "adam@adampocock.com",
    "download_url": "",
    "platform": null,
    "description": "FEAST\n=====\n\nA FEAture Selection Toolbox for C/C++, Java, Python, &amp; MATLAB/Octave, v2.1.0.\n\nFEAST provides implementations of common mutual information based filter\nfeature selection algorithms, and an implementation of RELIEF for Matlab. All\nfunctions expect discrete inputs (except RELIEF, which does not depend on the\nMIToolbox), and they return the selected feature indices. These implementations\nwere developed to help our research into the similarities between these\nalgorithms, and our results are presented in the following paper:\n\n```\n Conditional Likelihood Maximisation: A Unifying Framework for Information Theoretic Feature Selection\n G. Brown, A. Pocock, M.-J. Zhao, M. Lujan\n Journal of Machine Learning Research, 13:27-66 (2012)\n```\n\nThe weighted feature selection algorithms are described in Chapter 7 of:\n\n```\n Feature Selection via Joint Likelihood\n A. Pocock\n PhD Thesis, University of Manchester, 2012\n```\n\nIf you use these implementations for academic research please cite the relevant paper\nabove.  All FEAST code is licensed under the BSD 3-Clause License.\n\nContains implementations of:\n   mim, mrmr, mifs, cmim, jmi, disr, cife, icap, condred, cmi, relief, fcbf, betagamma\n\nAnd weighted implementations of:\n   mim, cmim, jmi, disr, cmi\n\nReferences for these algorithms are provided in the accompanying feast.bib file\n(in BibTeX format).\n\nFEAST works on discrete inputs, and all continuous values **must** be\ndiscretised before use with FEAST.  In our experiments we've found that using\n10 equal width bins is suitable for many problems, though this is data set size\ndependent. FEAST produces unreliable results when used with continuous inputs,\nruns slowly and uses much more memory than usual. The discrete inputs should\nhave small cardinality, FEAST will treat values {1,10,100} the same way it\ntreats {1,2,3} and the latter will be both faster and use less memory.\n\nMATLAB Example (using \"data\" as our feature matrix, and \"labels\" as the class label vector):\n\n```\n>> size(data)\nans = \n     (569,30)                                     %% denoting 569 examples, and 30 features\n```\n```\n>> selectedIndices = feast('jmi',5,data,labels) %% selecting the top 5 features using the jmi algorithm\nselectedIndices =\n\n    28\n    21\n     8\n    27\n    23\n```\n```\n>> selectedIndices = feast('mrmr',10,data,labels) %% selecting the top 10 features using the mrmr algorithm\nselectedIndices =\n\n    28\n    24\n    22\n     8\n    27\n    21\n    29\n     4\n     7\n    25\n```\n```\n>> selectedIndices = feast('mifs',5,data,labels,0.7) %% selecting the top 5 features using the mifs algorithm with beta = 0.7\nselectedIndices =\n\n    28\n    24\n    22\n    20\n    29\n```\n\nThe library is written in ANSI C for compatibility with the MATLAB mex\ncompiler, except for MIM, FCBF and RELIEF, which are written in MATLAB/OCTAVE\nscript. There is a different implementation of MIM available for use in the C\nlibrary. It depends on MIToolbox which is incorporated as a git submodule.\n\nMIToolbox is developed on [GitHub](http://www.github.com/Craigacp/MIToolbox/).\n\nThe C library expects all matrices in column-major format (i.e. Fortran style).\nThis is for two reasons, a) MATLAB generates Fortran-style arrays, and b)\nfeature selection iterates over columns rather than rows, unlike most other ML\nprocesses. \n\nCompilation instructions:\nRun `git submodule init` then,\n - MATLAB/OCTAVE \n    - run `CompileFEAST.m` in the `matlab` folder.\n - Linux C shared library \n    - run `make x86` or `make x64` for a 32-bit or 64-bit library.\n - Windows C dll (expects pre built libMIToolbox.dll)\n\t- install MinGW from https://sourceforge.net/projects/mingw-w64/\n\t- add MinGW binaries folders to PATH, e.g. mingw/bin, mingw/msys/bin \n\t- run `make x64_win`.\n - Java (requires Java 8)\n    - run `make x64`, `sudo make install` to build and install the C library.\n    - then `make java` to build the JNI wrapper.\n    - then run `mvn package` in the `java` directory to build the jar file.\n    - Note: the Java code should work on all platforms and future versions of Java, but the included Makefile only works on Ubuntu & Java 8.\n - Python\n    - run `python setup.py` in the `python` folder.\n\nUpdate History\n - xx/xx/xxxx - v2.1.0 - Added a python API and refactored the package structure.\n - 07/01/2017 - v2.0.0 - Added weighted feature selection, major refactoring of the code to improve speed and portability. FEAST functions now return the internal scores assigned by each criteria as well. Added a Java API via JNI. FEAST v2 is approximately 30% faster when called from Matlab.\n - 12/03/2016 - v1.1.4 - Fixed an issue where Matlab would segfault if all features had zero MI with the label.\n - 12/10/2014 - v1.1.2 - Updated documentation to note that FEAST expects column-major matrices.\n - 11/06/2014 - v1.1.1 - Fixed an issue where MIM wasn't compiled into libFSToolbox.\n - 22/02/2014 - v1.1.0 - Bug fixes in memory allocation, added a C implementation of MIM, moved the selected feature increment into the mex code.\n - 12/02/2013 - v1.0.1 - Bug fix for 32-bit Windows MATLAB's lcc.\n - 08/11/2011 - v1.0.0 - Public Release to complement the JMLR publication.\n\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "An information theoretic feature selection toolbox.",
    "version": "0.0.2",
    "split_keywords": [
        "feast",
        "feature",
        "selection",
        "fstoolbox"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "md5": "9fd28f66a0d372b73a3b40b519d8dcd7",
                "sha256": "c360eb5b9b7a61d00355bba4c7f2efb69eff2f0f050e6686f173f7cbab332a38"
            },
            "downloads": -1,
            "filename": "fstoolbox-0.0.2-cp310-cp310-manylinux_2_35_x86_64.whl",
            "has_sig": false,
            "md5_digest": "9fd28f66a0d372b73a3b40b519d8dcd7",
            "packagetype": "bdist_wheel",
            "python_version": "cp310",
            "requires_python": ">=3.9, <4",
            "size": 52772,
            "upload_time": "2022-12-10T16:17:34",
            "upload_time_iso_8601": "2022-12-10T16:17:34.210348Z",
            "url": "https://files.pythonhosted.org/packages/04/4d/7ee46852ec099ec86975a64e0a23e68b9ffcd7323cc67fb335fa987bda1c/fstoolbox-0.0.2-cp310-cp310-manylinux_2_35_x86_64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "md5": "56bd49d733cd0d2aebdd0e6f605ef4b6",
                "sha256": "1cf17fce605bf9822535fc369eb3824190c22225914b35092615bd65e82bb362"
            },
            "downloads": -1,
            "filename": "fstoolbox-0.0.2-cp311-cp311-manylinux_2_35_x86_64.whl",
            "has_sig": false,
            "md5_digest": "56bd49d733cd0d2aebdd0e6f605ef4b6",
            "packagetype": "bdist_wheel",
            "python_version": "cp311",
            "requires_python": ">=3.9, <4",
            "size": 52768,
            "upload_time": "2022-12-10T16:17:36",
            "upload_time_iso_8601": "2022-12-10T16:17:36.884751Z",
            "url": "https://files.pythonhosted.org/packages/52/a3/a09d4cc60215c69dd19527a274b4cbdb26bf6236633b1d691f6ea9c23ed6/fstoolbox-0.0.2-cp311-cp311-manylinux_2_35_x86_64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "md5": "54f2899347aef4f488f9e5b41093d0c9",
                "sha256": "e67b6edcc6ae1990d7c8d6fb0f5a8919235ccaf9643e10d347c9ace62253bcf1"
            },
            "downloads": -1,
            "filename": "fstoolbox-0.0.2-cp39-cp39-manylinux_2_35_x86_64.whl",
            "has_sig": false,
            "md5_digest": "54f2899347aef4f488f9e5b41093d0c9",
            "packagetype": "bdist_wheel",
            "python_version": "cp39",
            "requires_python": ">=3.9, <4",
            "size": 52760,
            "upload_time": "2022-12-10T16:17:33",
            "upload_time_iso_8601": "2022-12-10T16:17:33.929068Z",
            "url": "https://files.pythonhosted.org/packages/7b/68/af758071bec34225a2e535730e1a88359c19caa5560bdae7c98fd43f921b/fstoolbox-0.0.2-cp39-cp39-manylinux_2_35_x86_64.whl",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2022-12-10 16:17:34",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "github_user": "Craigacp",
    "github_project": "FEAST",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "lcname": "fstoolbox"
}
        
Elapsed time: 0.03049s