Remove data notebook how-to-check-if-any-value-is-nan-in-a-pandas-dataframe
article thumbnail

Data Exploration with Pandas Profiler and D-Tale

Domino Data Lab

We all have heard how data is the new oil. For data, this refinement includes doing some cleaning and manipulations that provide a better understanding of the information that we are dealing with. The purpose of Data Exploration. Data exploration is a very important step before jumping onto the machine learning wagon.

article thumbnail

The importance of structure, coding style, and refactoring in notebooks

Domino Data Lab

Notebooks are increasingly crucial in the data scientist’s toolbox. This form of interactive workflow was introduced to assist data scientists in documenting their work, facilitating reproducibility, and prompting collaboration with their team members. Notebook Structure.

Testing 116
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Techniques for Collecting, Prepping, and Plotting Data: Predicting Social Media-Influence in the NBA

Domino Data Lab

It covers questions to consider as well as collecting, prepping and plotting data. Collecting and prepping data are core research tasks. While the most ideal situation is to start a project with clean well-labeled data, the reality is that data scientists spend countless hours on obtaining and prepping data.

article thumbnail

Credit Card Fraud Detection using XGBoost, SMOTE, and threshold moving

Domino Data Lab

In this article, we’ll discuss the challenge organizations face around fraud detection, how machine learning can be used to identify and spot anomalies that the human eye might not catch. Historically, fraud detection has been performed by analysts who are responsible for identifying and tracing suspicious patterns in the data.