Data Leaders Brief

Build efficient, cross-Regional, I/O-intensive workloads with Dask on AWS

AWS Big Data

MAY 4, 2023

Welcome to the era of data. The sheer volume of data captured daily continues to grow, calling for platforms and solutions to evolve. The Amazon Sustainability Data Initiative (ASDI) uses the capabilities of Amazon S3 to provide a no-cost solution for you to store and share climate science workloads across the globe.

Data Processing

Data Processing Metadata Informatics Interactive

The state of data quality in 2020

O'Reilly on Data

FEBRUARY 11, 2020

We suspected that data quality was a topic brimming with interest. The responses show a surfeit of concerns around data quality and some uncertainty about how best to address those concerns. Key survey results: The C-suite is engaged with data quality. Data quality might get worse before it gets better.

Data Quality

Data Quality Metadata Data Governance Publishing

Webinars

The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Communication

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Manufacturing Sustainability Surge: Your Guide to Data-Driven Energy Optimization & Decarbonization

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Understanding User Needs and Satisfying Them

Trending Sources

How to Easily Understand Your Python Objects

Insight

JULY 23, 2019

I frequently run into this issue in my data science workflow with complex objects in libraries, like TensorFlow. kwonlydefaults is a dictionary with keyword-only arg default values. annotations is a dictionary specifying any type annotations. args contains the argument names. kwonlyargs lists names of keyword-only args.
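The field names listed above match those returned by Python's standard `inspect.getfullargspec`. As a minimal sketch, assuming that is the call the article has in mind, the fields can be read like this. The `resize` function is hypothetical and exists only to give the inspector something to describe.

```python
import inspect


def resize(image, width: int, height: int = 100, *, keep_ratio: bool = True, mode="bilinear"):
    """Hypothetical function, used only to illustrate the inspected fields."""
    return image


spec = inspect.getfullargspec(resize)

print(spec.args)            # names of the positional-or-keyword arguments
print(spec.defaults)        # defaults for the trailing positional arguments
print(spec.kwonlyargs)      # names of the keyword-only arguments
print(spec.kwonlydefaults)  # dict of keyword-only argument defaults
print(spec.annotations)     # dict of type annotations, keyed by argument name
```

The same call works on many functions from third-party libraries, although callables implemented in C may not expose a signature and can raise `TypeError`.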

Testing

Testing Data Science IT Interactive

How to supercharge data exploration with Pandas Profiling

Domino Data Lab

JANUARY 21, 2021

Producing insights from raw data is a time-consuming process. The Importance of Exploratory Analytics in the Data Science Lifecycle. Exploratory analysis is a critical component of the data science lifecycle. For one, Python remains the leading language for data science research.

Statistics

Statistics Unstructured Data Data Science Predictive Modeling

Switching from CPUs to GPUs for NYC Taxi Fare Predictions with NVIDIA RAPIDS

Cloudera

NOVEMBER 3, 2021

Have you ever asked a data scientist if they wanted their code to run faster? According to a poll in Kaggle’s State of Machine Learning and Data Science 2020, a convolutional neural network was the most popular deep learning algorithm among polled individuals, but it was not even in the top 3. In fact, only 43.2%

Deep Learning

Deep Learning Machine Learning Data Science Cost-Benefit

Building a Named Entity Recognition model using a BiLSTM-CRF network

Domino Data Lab

JULY 1, 2021

The model achieves relatively high accuracy, and all data and code are freely available in the article. The drawback with statistical model-based techniques is that the automated extraction of a comprehensive set of rules requires a large amount of labeled training data. Data exploration and preparation.

Modeling

Modeling Statistics Testing Metrics

Manual Feature Engineering

Domino Data Lab

AUGUST 20, 2019

Many thanks to AWP Pearson for the permission to excerpt “Manual Feature Engineering: Manipulating Data for Fun and Profit” from the book, Machine Learning with Python for Everyone by Mark E. Feature engineering is useful for data scientists when assessing tradeoff decisions regarding the impact of their ML models.

Testing

Testing Modeling Interactive Measurement

Towards Predictive Accuracy: Tuning Hyperparameters and Pipelines

Domino Data Lab

AUGUST 26, 2019

Data scientists, machine learning (ML) researchers, and business stakeholders have a high-stakes investment in the predictive accuracy of models. Data scientists and researchers ascertain predictive accuracy of models using different techniques, methodologies, and settings, including model parameters and hyperparameters. Introduction.
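To make the distinction between model parameters and hyperparameters concrete, here is a minimal sketch that is not taken from the article. It assumes a one-variable ridge regression without an intercept: the weight `w` is the parameter learned from training data, and the penalty `lam` is the hyperparameter chosen by comparing validation error over a small grid. The data, the grid values, and the helper names are all invented for illustration.

```python
from itertools import product

# Toy data, invented for illustration only: (x, y) pairs.
train = [(1.0, 2.1), (2.0, 3.9), (3.0, 6.2), (4.0, 7.8)]
valid = [(5.0, 10.1), (6.0, 11.9)]


def fit_ridge(data, lam):
    """Learn the parameter w for y ~ w * x with an L2 penalty lam (closed form)."""
    sxy = sum(x * y for x, y in data)
    sxx = sum(x * x for x, _ in data)
    return sxy / (sxx + lam)


def mse(data, w):
    """Mean squared error of the predictions w * x on the given data."""
    return sum((y - w * x) ** 2 for x, y in data) / len(data)


# Hyperparameter grid; product() extends naturally to several hyperparameters.
grid = {"lam": [0.0, 0.1, 1.0, 10.0]}

results = []
for (lam,) in product(*grid.values()):
    w = fit_ridge(train, lam)  # parameter: fitted on the training split
    results.append({"lam": lam, "w": w, "valid_mse": mse(valid, w)})

# Hyperparameter: selected by the lowest error on the validation split.
best = min(results, key=lambda r: r["valid_mse"])
print(best)
```

In practice a library routine with cross-validation would usually replace the hand-written loop; the sketch only shows where each kind of setting is decided.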

Testing

Testing Modeling Machine Learning Metrics

Your Effective Roadmap To Implement A Successful Business Intelligence Strategy

datapine

FEBRUARY 22, 2022

Over the past 5 years, big data and BI have become more than just data science buzzwords. Without real-time insight into their data, businesses remain reactive, miss strategic growth opportunities, lose their competitive edge, fail to take advantage of cost-saving options, don’t ensure customer satisfaction… the list goes on.

Business Intelligence

Business Intelligence Strategy Cost-Benefit Key Performance Indicator

Deep Learning Illustrated: Building Natural Language Processing Models

Domino Data Lab

AUGUST 22, 2019

Data scientists and researchers require an extensive array of techniques, packages, and tools to accelerate core workflow tasks, including prepping, processing, and analyzing data. Utilizing NLP helps researchers and data scientists complete core tasks faster. Preprocessing Natural Language Data. Example 11.4

Deep Learning

Deep Learning Modeling Metrics Testing

AWS Professional Services scales by improving performance and democratizing data with Amazon QuickSight

AWS Big Data

JUNE 14, 2023

The AWS Professional Services (ProServe) Insights team builds global operational data products that serve over 8,000 users within Amazon. In this post, we discuss how QuickSight has helped us improve our performance, democratize our data, and provide insights to our internal customers at scale.

Dashboards

Dashboards KPI Scorecard Consulting
