🤖⚡ scikit-learn tip #20 (video)¶

New in scikit-learn 0.22: Plot a confusion matrix in one line of code!

Highly customizable, including the colormap, display labels, and value formatting.

See example 👇

In [1]:

import pandas as pd
df = pd.read_csv('http://bit.ly/kaggletrain')

In [2]:

cols = ['Pclass', 'Fare']
X = df[cols]
y = df['Survived']

In [3]:

from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import plot_confusion_matrix

In [4]:

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

In [5]:

clf = LogisticRegression()
clf.fit(X_train, y_train);

In [6]:

# pass it a trained model: it makes predictions for X_test and compares them to y_test
disp = plot_confusion_matrix(clf, X_test, y_test, cmap='Blues', values_format='d')

In [7]:

# print the "normal" confusion matrix
disp.confusion_matrix

Out[7]:

array([[122,  17],
       [ 48,  36]])

🤖⚡ scikit-learn tip #20 (video)¶

Want more tips? View all tips on GitHub or Sign up to receive 2 tips by email every week 💌¶