Name | scikit-plot JSON |

Version | 0.3.1 JSON |

download | |

home_page | https://github.com/reiinakano/scikit-plot |

Summary | An intuitive library to add plotting functionality to scikit-learn objects. |

upload_time | 2017-09-17 03:25:43 |

maintainer | |

docs_url | None |

author | Reiichiro Nakano |

requires_python | |

license | MIT License |

keywords | |

VCS | |

bugtrack_url | |

requirements | matplotlib scikit-learn scipy joblib |

Travis-CI | |

coveralls test coverage | No coveralls. |

# Welcome to Scikit-plot [![PyPI version](https://badge.fury.io/py/scikit-plot.svg)](https://badge.fury.io/py/scikit-plot) [![license](https://img.shields.io/github/license/mashape/apistatus.svg)]() [![Build Status](https://travis-ci.org/reiinakano/scikit-plot.svg?branch=master)](https://travis-ci.org/reiinakano/scikit-plot) [![PyPI](https://img.shields.io/pypi/pyversions/scikit-plot.svg)]() [![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.293191.svg)](https://doi.org/10.5281/zenodo.293191) ### Single line functions for detailed visualizations ### The quickest and easiest way to go from analysis... ![roc_curves](docs/_static/readme_collage.jpg) ### ...to this. Scikit-plot is the result of an unartistic data scientist's dreadful realization that *visualization is one of the most crucial components in the data science process, not just a mere afterthought*. Gaining insights is simply a lot easier when you're looking at a colored heatmap of a confusion matrix complete with class labels rather than a single-line dump of numbers enclosed in brackets. Besides, if you ever need to present your results to someone (virtually any time anybody hires you to do data science), you show them visualizations, not a bunch of numbers in Excel. That said, there are a number of visualizations that frequently pop up in machine learning. Scikit-plot is a humble attempt to provide aesthetically-challenged programmers (such as myself) the opportunity to generate quick and beautiful graphs and plots with as little boilerplate as possible. ## Okay then, prove it. Show us an example. Say we use Naive Bayes in multi-class classification and decide we want to visualize the results of a common classification metric, the Area under the Receiver Operating Characteristic curve. Since the ROC is only valid in binary classification, we want to show the respective ROC of each class if it were the positive class. As an added bonus, let's show the micro-averaged and macro-averaged curve in the plot as well. Let's use scikit-plot with the sample digits dataset from scikit-learn. ```python # The usual train-test split mumbo-jumbo from sklearn.datasets import load_digits from sklearn.model_selection import train_test_split from sklearn.naive_bayes import GaussianNB X, y = load_digits(return_X_y=True) X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.33) nb = GaussianNB() nb.fit(X_train, y_train) predicted_probas = nb.predict_proba(X_test) # The magic happens here import matplotlib.pyplot as plt import scikitplot as skplt skplt.metrics.plot_roc_curve(y_test, predicted_probas) plt.show() ``` ![roc_curves](examples/roc_curves.png) Pretty. And... That's it. Encaptured in that small example is the entire philosophy of Scikit-plot: **single line functions for detailed visualization**. You simply browse the plots available in the documentation, and call the function with the necessary arguments. Scikit-plot tries to stay out of your way as much as possible. No unnecessary bells and whistles. And when you *do* need the bells and whistles, each function offers a myriad of parameters for customizing various elements in your plots. Finally, compare and [view the non-scikit-plot way of plotting the multi-class ROC curve](http://scikit-learn.org/stable/auto_examples/model_selection/plot_roc.html). Which one would you rather do? ## Maximum flexibility. Compatibility with non-scikit-learn objects. Although Scikit-plot is loosely based around the scikit-learn interface, you don't actually need Scikit-learn objects to use the available functions. As long as you provide the functions what they're asking for, they'll happily draw the plots for you. Here's a quick example to generate the precision-recall curves of a Keras classifier on a sample dataset. ```python # Import what's needed for the Functions API import matplotlib.pyplot as plt import scikitplot as skplt # This is a Keras classifier. We'll generate probabilities on the test set. keras_clf.fit(X_train, y_train, batch_size=64, nb_epoch=10, verbose=2) probas = keras_clf.predict_proba(X_test, batch_size=64) # Now plot. skplt.metrics.plot_precision_recall_curve(y_test, probas) plt.show() ``` ![p_r_curves](examples/p_r_curves.png) You can see clearly here that `skplt.metrics.plot_precision_recall_curve` needs only the ground truth y-values and the predicted probabilities to generate the plot. This lets you use *anything* you want as the classifier, from Keras NNs to NLTK Naive Bayes to that groundbreaking classifier algorithm you just wrote. The possibilities are endless. ## Installation Installation is simple! First, make sure you have the dependencies [Scikit-learn](http://scikit-learn.org) and [Matplotlib](http://matplotlib.org/) installed. Then just run: ```bash pip install scikit-plot ``` Or if you want the latest development version, clone this repo and run ```bash python setup.py install ``` at the root folder. If using conda, you can install Scikit-plot by running: ```bash conda install -c conda-forge scikit-plot ``` ## Documentation and Examples Explore the full features of Scikit-plot. You can find detailed documentation [here](http://scikit-plot.readthedocs.io). Examples are found in the [examples folder of this repo](examples/). ## Contributing to Scikit-plot Reporting a bug? Suggesting a feature? Want to add your own plot to the library? Visit our [contributor guidelines](CONTRIBUTING.md). ## Citing Scikit-plot Are you using Scikit-plot in an academic paper? You should be! Reviewers love eye candy. If so, please consider citing Scikit-plot with DOI [![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.293191.svg)](https://doi.org/10.5281/zenodo.293191) #### APA > Reiichiro Nakano. (2017). reiinakano/scikit-plot: 0.3.1 [Data set]. Zenodo. http://doi.org/10.5281/zenodo.293191 #### IEEE > [1]Reiichiro Nakano, “reiinakano/scikit-plot: 0.3.1”. Zenodo, 19-Feb-2017. #### ACM > [1]Reiichiro Nakano 2017. reiinakano/scikit-plot: 0.3.1. Zenodo. Happy plotting!

{ "maintainer": "", "docs_url": null, "requires_python": "", "maintainer_email": "", "cheesecake_code_kwalitee_id": null, "keywords": "", "upload_time": "2017-09-17 03:25:43", "requirements": [ { "name": "matplotlib", "specs": [ [ ">=", "1.3.1" ] ] }, { "name": "scikit-learn", "specs": [ [ ">=", "0.18" ] ] }, { "name": "scipy", "specs": [ [ ">=", "0.9" ] ] }, { "name": "joblib", "specs": [ [ ">=", "0.10" ] ] } ], "author": "Reiichiro Nakano", "home_page": "https://github.com/reiinakano/scikit-plot", "github_user": "reiinakano", "download_url": "https://pypi.python.org/packages/36/f5/fa6ce1ec48c65ed20f2831b0ccf2442297040effe651ea17b25dab72455b/scikit-plot-0.3.1.tar.gz", "platform": "any", "version": "0.3.1", "cheesecake_documentation_id": null, "description": "# Welcome to Scikit-plot\n\n[![PyPI version](https://badge.fury.io/py/scikit-plot.svg)](https://badge.fury.io/py/scikit-plot)\n[![license](https://img.shields.io/github/license/mashape/apistatus.svg)]()\n[![Build Status](https://travis-ci.org/reiinakano/scikit-plot.svg?branch=master)](https://travis-ci.org/reiinakano/scikit-plot)\n[![PyPI](https://img.shields.io/pypi/pyversions/scikit-plot.svg)]()\n[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.293191.svg)](https://doi.org/10.5281/zenodo.293191)\n\n### Single line functions for detailed visualizations\n\n### The quickest and easiest way to go from analysis...\n\n![roc_curves](docs/_static/readme_collage.jpg)\n\n### ...to this.\n\nScikit-plot is the result of an unartistic data scientist's dreadful realization that *visualization is one of the most crucial components in the data science process, not just a mere afterthought*.\n\nGaining insights is simply a lot easier when you're looking at a colored heatmap of a confusion matrix complete with class labels rather than a single-line dump of numbers enclosed in brackets. Besides, if you ever need to present your results to someone (virtually any time anybody hires you to do data science), you show them visualizations, not a bunch of numbers in Excel.\n\nThat said, there are a number of visualizations that frequently pop up in machine learning. Scikit-plot is a humble attempt to provide aesthetically-challenged programmers (such as myself) the opportunity to generate quick and beautiful graphs and plots with as little boilerplate as possible.\n\n## Okay then, prove it. Show us an example.\n\nSay we use Naive Bayes in multi-class classification and decide we want to visualize the results of a common classification metric, the Area under the Receiver Operating Characteristic curve. Since the ROC is only valid in binary classification, we want to show the respective ROC of each class if it were the positive class. As an added bonus, let's show the micro-averaged and macro-averaged curve in the plot as well.\n\nLet's use scikit-plot with the sample digits dataset from scikit-learn.\n\n```python\n# The usual train-test split mumbo-jumbo\nfrom sklearn.datasets import load_digits\nfrom sklearn.model_selection import train_test_split\nfrom sklearn.naive_bayes import GaussianNB\n\nX, y = load_digits(return_X_y=True)\nX_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.33)\nnb = GaussianNB()\nnb.fit(X_train, y_train)\npredicted_probas = nb.predict_proba(X_test)\n\n# The magic happens here\nimport matplotlib.pyplot as plt\nimport scikitplot as skplt\nskplt.metrics.plot_roc_curve(y_test, predicted_probas)\nplt.show()\n```\n![roc_curves](examples/roc_curves.png)\n\nPretty.\n\nAnd... That's it. Encaptured in that small example is the entire philosophy of Scikit-plot: **single line functions for detailed visualization**. You simply browse the plots available in the documentation, and call the function with the necessary arguments. Scikit-plot tries to stay out of your way as much as possible. No unnecessary bells and whistles. And when you *do* need the bells and whistles, each function offers a myriad of parameters for customizing various elements in your plots.\n\nFinally, compare and [view the non-scikit-plot way of plotting the multi-class ROC curve](http://scikit-learn.org/stable/auto_examples/model_selection/plot_roc.html). Which one would you rather do?\n\n## Maximum flexibility. Compatibility with non-scikit-learn objects.\n\nAlthough Scikit-plot is loosely based around the scikit-learn interface, you don't actually need Scikit-learn objects to use the available functions. As long as you provide the functions what they're asking for, they'll happily draw the plots for you.\n\nHere's a quick example to generate the precision-recall curves of a Keras classifier on a sample dataset.\n\n```python\n# Import what's needed for the Functions API\nimport matplotlib.pyplot as plt\nimport scikitplot as skplt\n\n# This is a Keras classifier. We'll generate probabilities on the test set.\nkeras_clf.fit(X_train, y_train, batch_size=64, nb_epoch=10, verbose=2)\nprobas = keras_clf.predict_proba(X_test, batch_size=64)\n\n# Now plot.\nskplt.metrics.plot_precision_recall_curve(y_test, probas)\nplt.show()\n```\n![p_r_curves](examples/p_r_curves.png)\n\nYou can see clearly here that `skplt.metrics.plot_precision_recall_curve` needs only the ground truth y-values and the predicted probabilities to generate the plot. This lets you use *anything* you want as the classifier, from Keras NNs to NLTK Naive Bayes to that groundbreaking classifier algorithm you just wrote.\n\nThe possibilities are endless.\n\n## Installation\n\nInstallation is simple! First, make sure you have the dependencies [Scikit-learn](http://scikit-learn.org) and [Matplotlib](http://matplotlib.org/) installed.\n\nThen just run:\n```bash\npip install scikit-plot\n```\n\nOr if you want the latest development version, clone this repo and run\n```bash\npython setup.py install\n```\nat the root folder.\n\nIf using conda, you can install Scikit-plot by running:\n```bash\nconda install -c conda-forge scikit-plot\n```\n\n## Documentation and Examples\n\nExplore the full features of Scikit-plot.\n\nYou can find detailed documentation [here](http://scikit-plot.readthedocs.io).\n\nExamples are found in the [examples folder of this repo](examples/).\n\n## Contributing to Scikit-plot\n\nReporting a bug? Suggesting a feature? Want to add your own plot to the library? Visit our [contributor guidelines](CONTRIBUTING.md).\n\n## Citing Scikit-plot\n\nAre you using Scikit-plot in an academic paper? You should be! Reviewers love eye candy.\n\nIf so, please consider citing Scikit-plot with DOI [![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.293191.svg)](https://doi.org/10.5281/zenodo.293191)\n\n#### APA\n\n> Reiichiro Nakano. (2017). reiinakano/scikit-plot: 0.3.1 [Data set]. Zenodo. http://doi.org/10.5281/zenodo.293191\n\n#### IEEE\n\n> [1]Reiichiro Nakano, \u201creiinakano/scikit-plot: 0.3.1\u201d. Zenodo, 19-Feb-2017.\n\n#### ACM\n\n> [1]Reiichiro Nakano 2017. reiinakano/scikit-plot: 0.3.1. Zenodo.\n\nHappy plotting!", "tox": true, "lcname": "scikit-plot", "bugtrack_url": null, "github": true, "coveralls": false, "name": "scikit-plot", "license": "MIT License", "travis_ci": true, "github_project": "scikit-plot", "summary": "An intuitive library to add plotting functionality to scikit-learn objects.", "split_keywords": [], "author_email": "reiichiro.s.nakano@gmail.com", "urls": [ { "has_sig": false, "upload_time": "2017-09-17T03:25:43", "comment_text": "", "python_version": "source", "url": "https://pypi.python.org/packages/36/f5/fa6ce1ec48c65ed20f2831b0ccf2442297040effe651ea17b25dab72455b/scikit-plot-0.3.1.tar.gz", "md5_digest": "f7e605d29c6357fe3e849e95baa74b64", "downloads": 0, "filename": "scikit-plot-0.3.1.tar.gz", "packagetype": "sdist", "path": "36/f5/fa6ce1ec48c65ed20f2831b0ccf2442297040effe651ea17b25dab72455b/scikit-plot-0.3.1.tar.gz", "size": 26306 } ], "_id": null, "cheesecake_installability_id": null }