Data Science Articles

Sorted By: Most Recent
Sanskar Wagavkar Sanskar Wagavkar
Updated on March 17, 2023

Introduction to the Correlation Matrix

The correlation matrix is a matrix that shows the correlation between variables. It gives the correlation between all the possible pairs of values in a matrix format.

Image: Shutterstock / Built In
Alex Williams Alex Williams
Updated on March 16, 2023

What Is a Knowledge Graph? Examples, Uses and More.

Knowledge graphs are becoming increasingly common thanks to their wide range of applications across industries. This guide introduces you to their basic principles and some examples.

Image: Shutterstock / Built In
Catherine Gitau Catherine Gitau
Updated on March 16, 2023

Fuzzy String Matching in Python: Introduction to FuzzyWuzzy

Fuzzy string matching is the process of finding strings that approximately match a pattern. FuzzyWuzzy is a Python library that calculates the differences between sequences and patterns.

Image: Shutterstock / Built In
Abdishakur Hassan Abdishakur Hassan
Updated on March 15, 2023

What Is Statistical Analysis?

Statistical analysis helps you pull meaningful insights from data. The process involves working with data and deducing numbers to tell quantitative stories.

Image: Shutterstock / Built In
Jye Sawtell-Rickson Jye Sawtell-Rickson
Updated on March 15, 2023

What Are Web Analytics?

Web analytics refer to the practice of understanding the online behavior of people and turning that behavior into actionable insights to optimize a website or product.

Image: Shutterstock / Built In
Chris Dowsett Chris Dowsett
Updated on March 15, 2023

What Is a Data Pipeline?

A data pipeline is a series of data processing steps. A data pipeline might move a data set from one data storage location to another data storage location.

Image: Shutterstock / Built In
Artturi Jalli Artturi Jalli
Updated on March 15, 2023

What Is HBase?

HBase is a non-relational database management system for real-time data processing that runs on top of the Hadoop distributed file system

Image: Shutterstock / Built In
Julia Zolotarev Julia Zolotarev
Updated on March 15, 2023

What Is a Non-Relational Database?

Non-relational databases (NoSQL databases) are data stores that are either schema-free, or have relaxed schemas that allow for changes in the data structure.

Image: Shutterstock / Built In
Sara A. Metwalli Sara A. Metwalli
Updated on March 15, 2023

What Is MySQL?

MySQL is a relational database management system that stores all the information you need to run complex websites. Here’s how it works.

Image: Shutterstock / Built In
Nicolai Berg Andersen Nicolai Berg Andersen
Updated on March 15, 2023

What Is Pandas?

Pandas is a Python package built for a broad range of data analysis and manipulation including tabular data, time series and many types of data sets.

Image: Shutterstock / Built In

Related Topics