Machine Learning Mastery

How to Choose Data Preparation Methods for Machine Learning
5-7-2020 Artificial Intelligence Machine Learning Mastery 786

Data preparation is an important part of a predictive modeling project. Correct application of data preparation will transform raw data into a...
8 Top Books on Data Cleaning and Feature Engineering
1-7-2020 Artificial Intelligence Machine Learning Mastery 1014

Data preparation is the transformation of raw data into a form that is more appropriate for modeling. It is a challenging topic to discuss as...
Feature Engineering and Selection (Book Review)
28-6-2020 Artificial Intelligence Machine Learning Mastery 873

Data preparation is the process of transforming raw data into a form suitable for learning algorithms. In some cases, data preparation is a required step in order...
kNN Imputation for Missing Values in Machine Learning
24-6-2020 Artificial Intelligence Machine Learning Mastery 963

Datasets may have missing values, and this can cause problems for many machine learning algorithms. As such, it is good practice to identify...
How to Avoid Data Leakage When Performing Data Preparation
22-6-2020 Artificial Intelligence Machine Learning Mastery 2376

Data preparation is the process of transforming raw data into a form that is appropriate for modeling. A naive approach to preparing data...
Tour of Data Preparation Techniques for Machine Learning
21-6-2020 Artificial Intelligence Machine Learning Mastery 2647

Predictive modeling projects, such as classification and regression, always involve some form of data preparation. The...
What Is Data Preparation in a Machine Learning Project
18-6-2020 Artificial Intelligence Machine Learning Mastery 991

Data preparation may be one of the most difficult steps in any machine learning project. The reason is that each dataset is different and...
Why Data Preparation Is So Important in Machine Learning
15-6-2020 Artificial Intelligence Machine Learning Mastery 2363

On a predictive modeling project, machine learning algorithms learn a mapping from input variables to a target variable. The most common form...
Ordinal and One-Hot Encodings for Categorical Data
13-6-2020 Artificial Intelligence Machine Learning Mastery 1270

Machine learning models require all input and output variables to be numeric. This means that if your data contains categorical data, you must...
How to Use StandardScaler and MinMaxScaler Transforms in Python
11-6-2020 Artificial Intelligence Machine Learning Mastery 1435

Many machine learning algorithms perform better when numerical input variables are scaled to a standard range. This includes algorithms that...
How to Perform Feature Selection With Numerical Input Data
5-6-2020 Artificial Intelligence Machine Learning Mastery 2216

Feature selection is the process of identifying and selecting a subset of input features that are most relevant to the target variable. Feature...
Iterative Imputation for Missing Values in Machine Learning
4-6-2020 Artificial Intelligence Machine Learning Mastery 1268

Datasets may have missing values, and this can cause problems for many machine learning algorithms. As such, it is good practice to identify...
Test-Time Augmentation For Structured Data With Scikit-Learn
2-6-2020 Artificial Intelligence Machine Learning Mastery 2543

Test-time augmentation, or TTA for short, is a technique for improving the skill of predictive models. It is typically used to improve the...
How to Use Polynomial Feature Transforms for Machine Learning
31-5-2020 Artificial Intelligence Machine Learning Mastery 1415

The input features for a predictive modeling task often interact in unexpected and frequently nonlinear ways. These interactions can be identified...
How to Scale Data With Outliers for Machine Learning
28-5-2020 Artificial Intelligence Machine Learning Mastery 1358

Many machine learning algorithms perform better when numerical input variables are scaled to a standard range. This includes algorithms that...
Recursive Feature Elimination (RFE) for Feature Selection in Python
25-5-2020 Artificial Intelligence Machine Learning Mastery 1539

Recursive Feature Elimination, or RFE for short, is a popular feature selection algorithm. RFE is popular because it is easy to configure and...