Data Science Articles

Sorted By: Most Recent
András Gefferth András Gefferth
Updated on February 15, 2024

One Hot Encoding Explained

One hot encoding is a machine learning technique that encodes categorical data into numerical ones. Here’s how to apply it in Scikit-Learn and Pandas.

Image: Shutterstock / Built In
Ashwin Raj Ashwin Raj
Updated on February 14, 2024

How to Build a Logistic Regression Model for Classification

Logistic regression is a classification technique that identifies the best fitting model to describe the relationship between the dependent and independent variables in a data set.

Image: Shutterstock / Built In
Ellen Glover Ellen Glover
Updated on February 13, 2024

Machine Translation: How It Works and Tools to Choose From

Machine translation breaks down language barriers using artificial intelligence.

Image: Shutterstock
Cory Maklin Cory Maklin
Updated on February 08, 2024

Fast Fourier Transform Explained

Fast Fourier transform is an algorithm that can speed up the training process for a convolutional neural network. Here’s how it works.

Image: Shutterstock / Built In
Anthony Corbo Anthony Corbo
Updated on February 08, 2024

What Is Data Wrangling?

Data wrangling is the process of transforming raw data into easily understandable formats and organizing sets into a single structure for further processing.

Image: Shutterstock / Built In
Shivam Chaudhary Shivam Chaudhary
Updated on January 24, 2024

Why 1.5 Is Used in the IQR Rule for Outlier Detection

The 1.5 IQR rule determines that any data point that’s 1.5 IQR points below the first quartile of data or above the third quartile is an outlier. Here’s why it’s 1.5 times IQR.  

Image: Shutterstock / Built In
Akshay Kumar Akshay Kumar
Updated on January 18, 2024

A Guide to HashMap in Java With Examples

HashMap is a popular data structure in Java that uses the Map interface and a hash table to provide efficient access and manipulation of data based unique keys.

Image: Shutterstock / Built In
Dhilip Subramanian Dhilip Subramanian
Updated on January 18, 2024

An Introduction to Apache Kafka

Kafka is a distributed streaming platform built using the publish/subscribe model to help companies receive large amounts of real-time data and process it. Here’s what you need to know.

Image: Shutterstock / Built In
Ahmed Yahya Khaled Ahmed Yahya Khaled
Updated on January 17, 2024

Data Transformations in R

Data transformation is the process of cleaning and organizing data from one format into another. Here’s how to do it in R.

Image: Shutterstock / Built In
Jeremy Zhang Jeremy Zhang
Updated on January 17, 2024

Importance Sampling Explained

Importance sampling is an approximation method that uses a mathematical transformation to take the average of all samples to estimate an expectation. Here’s how to do it.   

Image: Shutterstock / Built In

Related Topics