Hashing Trick

2024 | AI Dictionary

Technique for reducing dimensionality of categorical variables.

What is Hashing Trick?

The Hashing Trick is a technique used to convert categorical data or high-dimensional features into fixed-length vectors. By applying a hash function, the technique maps data into a smaller vector space, reducing memory usage and computational complexity while maintaining most of the original information.

Types of Hashing Trick

  1. Feature Hashing: Hashes input features into a fixed-size vector, often used in natural language processing tasks to handle large vocabularies.
  2. Hashing Vectorization: Converts high-dimensional text or categorical data into a lower-dimensional vector representation.

Applications of Hashing Trick

Example of Hashing Trick

In a text classification task, a model might use the Hashing Trick to convert a large vocabulary into a fixed-length vector of size 10,000. Instead of using a one-hot encoding for each word, which would require a huge amount of memory, the Hashing Trick reduces the dimensionality while still capturing the essential features of the words.

Did you liked the Hashing Trick gist?

Learn about 250+ need-to-know artificial intelligence terms in the AI Dictionary.

Read the Governor's Letter

Stay ahead with Governor's Letter, the newsletter delivering expert insights, AI updates, and curated knowledge directly to your inbox.

By subscribing to the Governor's Letter, you consent to receive emails from AI Guv.
We respect your privacy - read our Privacy Policy to learn how we protect your information.

A

B

C

D

E

F

G

H

I

J

K

L

M

N

O

P

Q

R

S

T

U

V

W

X

Y

Z