Total 29 Posts

Generating Synthetic Data with Numpy and Scikit-Learn

Introduction

In this tutorial, we'll discuss the details of generating different synthetic datasets using the NumPy and Scikit-Learn libraries. We'll see how samples can be generated from various distributions with known parameters.

We'll also discuss generating datasets for different purposes, such as regression, classification, and clustering. At the end, we'll...
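As a quick taste of what the post covers, here's a minimal sketch of both ideas: drawing samples from a distribution with known parameters in NumPy, and generating a labeled dataset with scikit-learn (all parameter values here are illustrative):

```python
import numpy as np
from sklearn.datasets import make_classification

# Draw 1,000 samples from a normal distribution with a known
# mean (loc) and standard deviation (scale)
rng = np.random.default_rng(seed=0)
normal_samples = rng.normal(loc=5.0, scale=2.0, size=1000)

# Generate a small labeled dataset suitable for classification
X, y = make_classification(n_samples=200, n_features=4,
                           n_informative=2, n_redundant=0,
                           random_state=0)
```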

Kernel Density Estimation in Python Using Scikit-Learn

Introduction

This article is an introduction to kernel density estimation using Python's machine learning library scikit-learn.

Kernel density estimation (KDE) is a non-parametric method for estimating the probability density function of a given random variable. It is also referred to by its traditional name, the Parzen-Rosenblatt Window method, after its...
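In scikit-learn, KDE is provided by `sklearn.neighbors.KernelDensity`. A minimal sketch (the bandwidth and sample size are illustrative choices):

```python
import numpy as np
from sklearn.neighbors import KernelDensity

# Sample data drawn from a standard normal distribution
rng = np.random.default_rng(seed=0)
data = rng.normal(loc=0.0, scale=1.0, size=(500, 1))

# Fit a Gaussian-kernel density estimate to the samples
kde = KernelDensity(kernel='gaussian', bandwidth=0.5).fit(data)

# score_samples returns log-densities; exponentiate to get densities
grid = np.linspace(-3, 3, 61).reshape(-1, 1)
density = np.exp(kde.score_samples(grid))
```

As expected for data centered at zero, the estimated density peaks near the middle of the grid and falls off toward the edges.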

One-Hot Encoding in Python with Pandas and Scikit-Learn

Introduction

In computer science, data can be represented in a lot of different ways, and naturally, every single one of them has its advantages as well as disadvantages in certain fields.

Since computers are unable to process categorical data directly, as these categories hold no inherent meaning for them, this information has...
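Both libraries named in the title can perform the encoding. A minimal sketch with an illustrative toy column:

```python
import pandas as pd
from sklearn.preprocessing import OneHotEncoder

df = pd.DataFrame({'color': ['red', 'green', 'blue', 'green']})

# pandas: one indicator column per category
dummies = pd.get_dummies(df['color'])

# scikit-learn: OneHotEncoder returns a sparse matrix by default,
# so convert it to a dense array for inspection
encoder = OneHotEncoder()
encoded = encoder.fit_transform(df[['color']]).toarray()
```

Each row of the result contains exactly one 1, in the column of its category.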

Grid Search Optimization Algorithm in Python

Introduction

In this tutorial, we are going to talk about a very powerful optimization (or automation) technique: the grid search algorithm. It is most commonly used for hyperparameter tuning in machine learning models. We will learn how to implement it in Python, as well as apply it in...
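scikit-learn ships grid search as `GridSearchCV`, which exhaustively evaluates every combination in a parameter grid using cross-validation. A minimal sketch on the built-in iris dataset (the grid values are illustrative):

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)

# Every combination of these values is evaluated with 5-fold CV
param_grid = {'C': [0.1, 1, 10], 'kernel': ['linear', 'rbf']}
search = GridSearchCV(SVC(), param_grid, cv=5)
search.fit(X, y)

best = search.best_params_  # the best-scoring combination
```

The cost grows multiplicatively with the grid: here, 3 values of `C` times 2 kernels means 6 candidate models, each fit 5 times.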

Ensemble/Voting Classification in Python with Scikit-Learn

Introduction

Ensemble classification models can be powerful machine learning tools capable of achieving excellent performance and generalizing well to new, unseen datasets.

The value of an ensemble classifier is that, in joining together the predictions of multiple classifiers, it can correct for errors made by any individual classifier, leading to...
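scikit-learn's `VotingClassifier` captures this idea directly: several base models vote, and the majority decides. A minimal sketch (the base estimators and dataset are illustrative):

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# Hard voting: each classifier casts one vote, majority wins
ensemble = VotingClassifier(estimators=[
    ('lr', LogisticRegression(max_iter=1000)),
    ('dt', DecisionTreeClassifier(random_state=0)),
    ('knn', KNeighborsClassifier()),
], voting='hard')

scores = cross_val_score(ensemble, X, y, cv=5)
```

Switching to `voting='soft'` averages predicted probabilities instead of counting votes, which often helps when the base models are well calibrated.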

Dimensionality Reduction in Python with Scikit-Learn

Introduction

In machine learning, a model's performance benefits from additional features only up to a certain point. The more features are fed into a model, the higher the dimensionality of the data becomes, and as dimensionality increases, overfitting grows more likely.

There are multiple techniques that can be...
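One of the most common such techniques is principal component analysis (PCA); a minimal sketch on scikit-learn's built-in digits dataset (the choice of 10 components is illustrative):

```python
from sklearn.datasets import load_digits
from sklearn.decomposition import PCA

# The digits dataset has 64 features (one per pixel)
X, _ = load_digits(return_X_y=True)

# Project onto the 10 directions of highest variance
pca = PCA(n_components=10)
X_reduced = pca.fit_transform(X)

# Fraction of the original variance the 10 components retain
retained = pca.explained_variance_ratio_.sum()
```

A handful of components can retain a large share of the variance while shrinking the feature space dramatically.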

Gradient Boosting Classifiers in Python with Scikit-Learn

Introduction

Gradient boosting classifiers are a group of machine learning algorithms that combine many weak learning models to create a strong predictive model, usually with decision trees as the weak learners. Gradient boosting models are becoming popular because of their effectiveness at classifying complex datasets, and have recently...
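A minimal sketch of the scikit-learn implementation, `GradientBoostingClassifier`, on an illustrative synthetic dataset (hyperparameter values shown are the library defaults, spelled out for clarity):

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import train_test_split

# A synthetic binary classification problem
X, y = make_classification(n_samples=500, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Each of the 100 shallow trees corrects the errors of the ones before it
gbc = GradientBoostingClassifier(n_estimators=100, learning_rate=0.1,
                                 max_depth=3, random_state=0)
gbc.fit(X_train, y_train)
accuracy = gbc.score(X_test, y_test)
```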

Multiple Linear Regression with Python

Introduction

Linear regression is one of the most commonly used algorithms in machine learning. It's worth getting familiar with, since you'll need it whenever you want to measure the relationship between two or more continuous values.

A deep dive into the theory and implementation of...
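The core idea can be sketched in a few lines with scikit-learn's `LinearRegression`; the data here is illustrative, generated without noise from known coefficients so the model recovers them exactly:

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Noiseless data generated from y = 3*x1 + 2*x2 + 1
rng = np.random.default_rng(seed=0)
X = rng.uniform(size=(100, 2))
y = 3.0 * X[:, 0] + 2.0 * X[:, 1] + 1.0

# With two predictors this is multiple (rather than simple) regression
model = LinearRegression().fit(X, y)
```

After fitting, `model.coef_` holds the slope for each predictor and `model.intercept_` the constant term.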