What Upsampling Meaning, Applications & Example

The process of increasing the resolution or size of data.

What is Upsampling?

Upsampling is a technique used to increase the number of instances in the minority class in an imbalanced dataset. This is typically done by duplicating existing data points or generating synthetic data . The goal of upsampling is to create a more balanced dataset that can help improve the performance of machine learning models, especially in classification tasks where one class is underrepresented.

Methods of Upsampling

  1. Random Upsampling: Randomly duplicates instances from the minority class to increase its representation.
  2. SMOTE (Synthetic Minority Over-sampling Technique): Generates synthetic samples by interpolating between existing instances of the minority class.
  3. ADASYN (Adaptive Synthetic Sampling): Similar to SMOTE, but it focuses more on generating synthetic data for difficult-to-classify instances.

Applications of Upsampling

Example of Upsampling

In a spam email classifier, if the dataset contains 90% non-spam emails and only 10% spam emails, upsampling can be applied to increase the number of spam emails in the dataset, either by duplicating existing spam samples or generating synthetic samples. This helps the model learn to better identify spam, as it will have more balanced exposure to both classes during training.

Read the Governor's Letter

Stay ahead with Governor's Letter, the newsletter delivering expert insights, AI updates, and curated knowledge directly to your inbox.

By subscribing to the Governor's Letter, you consent to receive emails from AI Guv.
We respect your privacy - read our Privacy Policy to learn how we protect your information.

A

B

C

D

E

F

G

H

I

J

K

L

M

N

O

P

Q

R

S

T

U

V

W

X

Y

Z