TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial intelligence writing from the former Towards Data Science Medium publication.

Follow publication

PyCaret 2.3.6 is Here! Learn What’s New?

Moez Ali
TDS Archive
Published in
6 min readJan 14, 2022

--

Official Documentation — https://pycaret.gitbook.io

🚀 Introduction

PyCaret is an open-source, low-code machine learning library in Python that automates machine learning workflows. It is an end-to-end machine learning and model management tool that speeds up the experiment cycle exponentially and makes you more productive.

By far PyCaret 2.3.6 is the biggest release in terms of the new features and functionalities. This article demonstrates the use of new functionalities added in the recent release of PyCaret 2.3.6.

Important Links

📚 Official Docs: The bible of PyCaret. Everything is here.
🌐 Official Web: Check out our official website
😺 GitHub Check out our Git
Tutorials New to PyCaret? Check out our official notebooks!
📋 Example Notebooks created by the community.
📙 Blog Tutorials and articles by contributors.
FAQs Check out frequently asked questions.
📺 Video Tutorials Our video tutorial from various events.
📢 Discussions Have questions? Engage with community and contributors.
🛠️ Changelog Changes and version history.
🙌 User Group Join our Meetup user group.

💻 Installation

Installation is easy and will only take a few minutes. PyCaret’s default installation from pip only installs hard dependencies as listed in the requirements.txt file.

pip install pycaret

To install the full version:

pip install pycaret[full]

📈 Dashboard

This function will generate the interactive dashboard for a trained model. The dashboard is implemented using the ExplainerDashboard.

# load dataset
from pycaret.datasets import get_data
data = get_data('iris')
# init setup
from pycaret.classification import *
s = setup(data, target = 'species', session_id = 123)
# train model
lr = create_model('lr')
# generate dashboard
dashboard(lr)
Explainer Dashboard

Video Demo:

📊 Exploratory Data Analysis (EDA)

This function will generate automated EDA using the AutoViz integration.

# load dataset
from pycaret.datasets import get_data
data = get_data('iris')
# init setup
from pycaret.classification import *
s = setup(data, target = 'species', session_id = 123)
# generate EDA
eda()
Exploratory Data Analysis (EDA) in PyCaret

Video Demo:

🚊 Convert Model

This function will transpile trained machine learning models into native inference scripts in different programming languages (Python, C, Java, Go, JavaScript, Visual Basic, C#, PowerShell, R, PHP, Dart, Haskell, Ruby, F#). This functionality is very useful if you want to deploy models into environments where you can’t install your normal Python stack to support model inference.

# load dataset
from pycaret.datasets import get_data
data = get_data('iris')
# init setup
from pycaret.classification import *
s = setup(data, target = 'species', session_id = 123)
# train model
lr = create_model('lr')
# convert model
lr_java = convert_model(lr, language = 'java')
print(lr_java)
Convert Model in PyCaret

Video Demo:

☑️ Check Fairness

There are many approaches to conceptualizing fairness. This new function follows the approach known as group fairness, which asks: Which groups of individuals are at risk for experiencing harm. This function provides fairness-related metrics between different groups (also called subpopulations).

# load dataset
from pycaret.datasets import get_data
data = get_data('income')
# init setup
from pycaret.classification import *
s = setup(data, target = 'income >50K', session_id = 123)
# train model
lr = create_model('lr')
# check fairness
check_fairness(lr, sensitive_features = ['race'])
Check Fairness in PyCaret

Video Demo:

📩 Create Web API

This function will create a POST API for the ML pipeline for inference using FastAPI framework. It only creates the API and doesn’t run it automatically.

# load dataset
from pycaret.datasets import get_data
data = get_data('iris')
# init setup
from pycaret.classification import *
s = setup(data, target = 'species', session_id = 123)
# train model
lr = create_model('lr')
# create API
create_api(lr, 'my_first_api')
# Run the API
!python my_first_api.py
Create API Function in PyCaret
Create API Function in PyCaret (Testing on localhost)

Video Demo:

🚢 Create Docker

This function will create a Dockerfileand requirementsfile for your API end-point.

# load dataset
from pycaret.datasets import get_data
data = get_data('iris')
# init setup
from pycaret.classification import *
s = setup(data, target = 'species', session_id = 123)
# train model
lr = create_model('lr')
# create API
create_api(lr, 'my_first_api')
# create Docker
create_docker('my_first_api')
Create Docker Function in PyCaret

Video Demo:

💻 Create Web Application

This function creates a basic Gradio web app for inference. It will later be expanded for other app types such as Streamlit.

# load dataset
from pycaret.datasets import get_data
data = get_data('iris')
# init setup
from pycaret.classification import *
s = setup(data, target = 'species', session_id = 123)
# train model
lr = create_model('lr')
# create gradio app
create_app(lr)
Create App Function in PyCaret

Video Demo:

🎰 Monitor Drift of ML Models

A new parameter called drift_report is added to the predict_model function that generates the drift report using Evidently AI framework. At the moment this functionality is in experimental mode and will only work on test data. Later on, it will be expanded for production use.

# load dataset
from pycaret.datasets import get_data
data = get_data('iris')
# init setup
from pycaret.classification import *
s = setup(data, target = 'species', session_id = 123)
# train model
lr = create_model('lr')
# generate report
preds = predict_model(lr, drift_report = True)
Monitoring Drift in PyCaret
Drift Report in PyCaret (Generated using Evidently AI)

Video Demo:

🔨 Plot Model is now more configurable

plot_model function is PyCaret is now more configurable. For example, previously if you wanted to see percentages in Confusion Matrix instead of absolute numbers, it wasn’t possible, or if you want to change the color map of visuals, it wasn’t possible. Now it is possible with the new parameter plot_kwargs in the plot_model function. See example:

# load dataset
from pycaret.datasets import get_data
data = get_data('iris')
# init setup
from pycaret.classification import *
s = setup(data, target = 'species', session_id = 123)
# train model
lr = create_model('lr')
# plot model (without plot kwargs)
plot_model(lr, plot = 'confusion_matrix')
# plot model (with plot kwargs)
plot_model(lr, plot = 'confusion_matrix', plot_kwargs = {'percent' : True})
plot_model function in PyCaret

🏆Optimize Threshold

This is not a new function but it was completely revamped in 2.3.6. This function is to optimize the probability threshold for binary classification problems. Previously you had to pass cost function as true_positive , false_positive , true_negative , false_negative in this function and now it automatically picks up all the metrics including the custom ones from your active experiment run.

# load dataset
from pycaret.datasets import get_data
data = get_data('blood')
# init setup
from pycaret.classification import *
s = setup(data, target = 'Class', session_id = 123)
# train model
lr = create_model('lr')
# optimize threshold
optimize_threshold(lr)
Optimize Threshold Function in PyCaret

📚 New Documentation

The biggest and hardest of all is the completely new documentation. This is a single source of truth for everything related to PyCaret, from official tutorials to release notes and from API ref to community contributions. Take a video tour:

Finally, if you want to take the tour of all new functionalities added in 2.3.6, watch this 10 minutes video:

To learn about all the other changes, bug fixes, and minor updates in PyCaret 2.3.6, check out the detailed release notes.

Thank you for reading.

Author:

I write about PyCaret and its use-cases in the real world, If you would like to be notified automatically, you can follow me on Medium, LinkedIn, and Twitter.

Sign up to discover human stories that deepen your understanding of the world.

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

--

--

TDS Archive
TDS Archive

Published in TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial intelligence writing from the former Towards Data Science Medium publication.

Moez Ali
Moez Ali

Written by Moez Ali

Data Scientist, Founder & Creator of PyCaret

Responses (9)

Write a response