October, 2021

article thumbnail

Tech workers warned they were going to quit. Now, the problem is spiralling out of control

DataKitchen

The post Tech workers warned they were going to quit. Now, the problem is spiralling out of control first appeared on DataKitchen.

363
363
article thumbnail

Databricks Lakehouse Platform Streamlines Big Data Processing

David Menninger's Analyst Perspectives

Databricks is a data engineering and analytics cloud platform built on top of Apache Spark that processes and transforms huge volumes of data and offers data exploration capabilities through machine learning models. It can enable data engineers, data scientists, analysts and other workers to process big data and unify analytics through a single interface.

Big Data 318
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

5 Features Of Snowflake That Data Engineers Must Know

Analytics Vidhya

This article was published as a part of the Data Science Blogathon Snowflake is a cloud data platform that comes with a lot of unique features when compared to traditional on-premise RDBMS systems. In this tutorial, you will see the top 5 features that developers should know before implementing a solution on the Snowflake data […]. The post 5 Features Of Snowflake That Data Engineers Must Know appeared first on Analytics Vidhya.

article thumbnail

The Quality of Auto-Generated Code

O'Reilly on Data

Kevlin Henney and I were riffing on some ideas about GitHub Copilot , the tool for automatically generating code base on GPT-3’s language model, trained on the body of code that’s in GitHub. This article poses some questions and (perhaps) some answers, without trying to present any conclusions. First, we wondered about code quality. There are lots of ways to solve a given programming problem; but most of us have some ideas about what makes code “good” or “bad.”

Testing 300
article thumbnail

Beyond the Basics of A/B Tests: Innovative Experimentation Tactics You Need to Know as a Data or Product Professional

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

Humans and AI: Bargaining Power

DataRobot

I have a confession to make—I’m a back-seat driver! When sitting in a taxi, I can’t help but grumble when the ride isn’t smooth, or the driver chooses the slowest lane of traffic. I have to fight the urge to take control. When it comes to shopping, I passively accept what is offered for sale. But my wife, who grew up in Asia where haggling is part of the culture, is different.

article thumbnail

3 Practices to Help Build a Strong Data Culture

Dataiku

Let’s be frank — creating a lasting data culture in your company isn’t going to happen overnight. No technology you install or datasets you gather will do that for you. You need time and, as we’ve seen across pop culture, it usually takes a new idea or innovation (or an old idea packaged as new) to change culture. This change usually falls on data leaders to drive because they have a unique perspective across data, technology, and the organization.

More Trending

article thumbnail

Alteryx Tackles Analytics Ops

David Menninger's Analyst Perspectives

Alteryx is a data analytics software company that offers data preparation and analytics tools to simplify and automate data wrangling, data cleaning and modeling processes, enabling line-of-business personnel to quickly access, manipulate, analyze and output data. The platform features tools to run a variety of analytic functions such as diagnostic, predictive, prescriptive and geospatial analytics in a unified platform, and can connect to various data warehouses, cloud applications, spreadsheet

Analytics 274
article thumbnail

An Easy introduction to Flask Framework for beginners

Analytics Vidhya

This article was published as a part of the Data Science Blogathon Overview What is a Framework FrontEnd vs BackEnd What is Flask Framework Installation of Flask Creating our first Flask app Routing Static Route vs Dynamic Route HTML Injections HTML Escaping Hypertext Transfer Protocol GET and POST Methods What is a Framework? The framework […].

article thumbnail

Be the Best – 9 Ways to Market Your Business with Big Data

Smart Data Collective

Big data technology has been a highly valuable asset for many companies around the world. Countless companies are utilizing big data to improve many aspects of their business. Some of the best applications of data analytics and AI technology has been in the field of marketing. Data-Driven Marketing is More Important than Ever. The competition out there is fierce, so it is vital that you find ways to make your business stand out from the crowd.

Big Data 137
article thumbnail

Diversity Report Template

Juice Analytics

Pressure is building for companies to provide more transparency into the diversity of their workforce. Along with the #MeToo and BLM social movements, there are economic reasons why diversity data can be an indicator of company health. “A McKinsey study found that companies in the top quartile for gender diversity in corporate leadership had a 21% likelihood of outperforming bottom-quartile industry peers on profitability.

Reporting 135
article thumbnail

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

article thumbnail

Deep Learning for Time Series Forecasting: Is It Worth It?

Dataiku

Using RNNs & DeepAR Models to Find Out. Time series forecasting use cases are certainly the most common time series use cases, as they can be found in all types of industries and in various contexts. Whether it is forecasting future sales to optimize inventory, predicting energy consumption to adapt production levels, or estimating the number of airline passengers to ensure high-quality services, time is a key variable.

article thumbnail

How Predictive and Prescriptive Analytics Improve the Call Center Experience

DataKitchen

The post How Predictive and Prescriptive Analytics Improve the Call Center Experience first appeared on DataKitchen.

article thumbnail

Use External Data Platform to Improve Analytics

David Menninger's Analyst Perspectives

Access to external data can provide a competitive advantage. Our research shows that more than three-quarters (77%) of participants consider external data to be an important part of their machine learning (ML) efforts. The most important external data source identified is social media, followed by demographic data from data brokers. Organizations also identified government data, market data, environmental data and location data as important external data sources.

Data Lake 260
article thumbnail

End-to-End Introduction to Handling Missing Values

Analytics Vidhya

This article was published as a part of the Data Science Blogathon Overview Data provides us with the power to analyze and forecast the events of the future. With each day, more and more companies are adopting data science techniques like predictive forecasting, clustering, and so on. While it’s very intriguing to keep learning about complex […].

article thumbnail

Entity Resolution Checklist: What to Consider When Evaluating Options

Are you trying to decide which entity resolution capabilities you need? It can be confusing to determine which features are most important for your project. And sometimes key features are overlooked. Get the Entity Resolution Evaluation Checklist to make sure you’ve thought of everything to make your project a success! The list was created by Senzing’s team of leading entity resolution experts, based on their real-world experience.

article thumbnail

4 Reasons to Hire a Data Science Company

Smart Data Collective

1.145 trillion megabytes! Unbelievably, this is the amount of data that was created every day in 2021. That’s a lot of data and a lot of work for experts working in the field of data science services. Amidst growing competition, businesses are under increasing pressure to come up with unique and more cost-effective ways to manufacture and market their products.

article thumbnail

A Trick, a Tip and a Thing to Try in Your Next Presentation

Depict Data Studio

Depict Data Studio full courses always end with a graduation ceremony where participants share the progress they’ve made in the course. I’m always amazed by the transformations that take place and I can’t help but want to share their wonderful work! In this blog post, you’ll learn from Elizabeth Dove. Elizabeth is a professor at the University of Montana who teaches art and design.

article thumbnail

Dataiku’s Role in the Modern Data Stack

Dataiku

Any small or midsize business (SMB) that’s serious about making the use of data, analytics, and AI everyday behavior for everyone is using a version of the modern data stack architecture. It can even make sense in the enterprise context for teams just getting started on their AI journey.

article thumbnail

Data Quality: Volume, interdependencies can create big problems

DataKitchen

The post Data Quality: Volume, interdependencies can create big problems first appeared on DataKitchen.

article thumbnail

Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity

Speaker: Nicholas Zeisler, CX Strategist & Fractional CXO

The first step in a successful Customer Experience endeavor (or for that matter, any business proposition) is to find out what’s wrong. If you can’t identify it, you can’t fix it! 💡 That’s where the Voice of the Customer (VoC) comes in. Today, far too many brands do VoC simply because that’s what they think they’re supposed to do; that’s what all their competitors do.

article thumbnail

Data Virtualization Brings Data Together Quickly and Easily

David Menninger's Analyst Perspectives

The technology industry throws around a lot of similar terms with different meanings as well as entirely different terms with similar meanings. In this post, I don’t want to debate the meanings and origins of different terms; rather, I’d like to highlight a technology weapon that you should have in your data management arsenal. We currently refer to this technology as data virtualization.

article thumbnail

Introduction to Deep Learning in Julia

Analytics Vidhya

This article was published as a part of the Data Science Blogathon Overview In the current scenario, the Data science field is dominated by Python/R but there is another competition added not so long ago, Julia! which we will be exploring in this guide. The famous quote (motto) of Julia is – Looks like Python, runs […]. The post Introduction to Deep Learning in Julia appeared first on Analytics Vidhya.

article thumbnail

Using Machine Learning to Improve Cryptocurrency Mining Profitability

Smart Data Collective

Satoshi Nakamoto introduced the world to bitcoin in 2008. Many people speculated that the virtual currency would never gain traction and become extinct. However, it has grown faster than even some of the staunchest supporters originally predicted. Advances in AI and machine learning technology have been important in setting the trend for bitcoin. It’s been over a decade since the cryptocurrencies were introduced to the world and since it has become increasingly popular.

article thumbnail

Introducing Self-Service, No-Code Airflow Authoring UI in Cloudera Data Engineering

Cloudera

Airflow has been adopted by many Cloudera Data Platform (CDP) customers in the public cloud as the next generation orchestration service to setup and operationalize complex data pipelines. Today, customers have deployed 100s of Airflow DAGs in production performing various data transformation and preparation tasks, with differing levels of complexity.

article thumbnail

Monetizing Analytics Features

Think your customers will pay more for data visualizations in your application? Five years ago, they may have. But today, dashboards and visualizations have become table stakes. Turning analytics into a source of revenue means integrating advanced features in unique, hard-to-steal ways. Download this white paper to discover which features will differentiate your application and maximize the ROI of your analytics.

article thumbnail

How These Female Sales Leaders Are Blazing Their Own Trail: Meet Doreen, Natália, and Rachel

Dataiku

Ever wanted to know more about the people behind your favorite Everyday AI platform? You're in luck — every few weeks, meet some of the humans at Dataiku working to ensure customers and users find success as they systemize the use of data and AI in their organizations. This week, we spoke with three sales leaders: Doreen, RVP Inside Sales EMEA; Natália, RVP UK&I; and Rachel, VP Sales and General Manager Central Europe.

Sales 119
article thumbnail

Data Engineers are Burned Out and Calling for DataOps

DataKitchen

The post Data Engineers are Burned Out and Calling for DataOps first appeared on DataKitchen.

246
246
article thumbnail

Creating a Powerful Presentation: 3 Easy Changes to Revamp your PowerPoint

Depict Data Studio

Depict Data Studio full courses always end with a graduation ceremony where students share the progress they’ve made in the course. I’m always amazed by the transformations that take place and I can’t help but want to share their wonderful work! Today you’ll learn from Kelsey Watterson, an evaluator at the Centerstone Research Institute. Thanks for sharing Kelsey!

article thumbnail

A Comprehensive Guide to Time Series Analysis

Analytics Vidhya

This article was published as a part of the Data Science Blogathon This article was published as a part of the Data Science Blogathon Synopsis of Time Series Analysis A Time-Series represents a series of time-based orders. It would be Years, Months, Weeks, Days, Horus, Minutes, and Seconds A time series is an observation […]. The post A Comprehensive Guide to Time Series Analysis appeared first on Analytics Vidhya.

article thumbnail

The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data and AI

Speaker: Aindra Misra, Sr. Staff Product Manager of Data & AI at BILL (Previously PM Lead at Twitter/X)

Embark on a transformation journey into the heart of the data ecosystem! This webinar is your gateway to a deeper comprehension of the foundations that drive the data industry and will equip you with the knowledge needed to navigate the evolving landscape. Delve into the diverse use cases where data analytics plays a pivotal role. We’ll explore how these applications are transforming with the introduction of Gen AI, and discuss the anticipated use cases for 2024 and beyond.

article thumbnail

Important Steps to Take to Address the Bias in AI

Smart Data Collective

We mentioned previously that bias is a big problem in machine learning that has to be mitigated. People need to take important steps to help mitigate it for the future. Regardless of how culturally, socially, or environmentally aware people consider themselves to be, bias is an inherent trait that everyone has. We are naturally attracted to facts that confirm our own beliefs.

Modeling 130
article thumbnail

Our 2021 Data Impact Awards Finalists

Cloudera

It’s that time of year again… Award season! We are thrilled to announce the finalists of the 2021 Data Impact Awards. This year’s entrants have excelled at demonstrating how innovative data solutions can help solve real-time challenges and positively impact people around the world. . The entries are some of the most remarkable we’ve seen, giving our judges the tough task of selecting an award worthy shortlist.

article thumbnail

Maintaining and Improving Predictive Models With Dataiku

Dataiku

Managing one model at a time is pretty easy. But how do you go about managing tens of models, or even more? Vincent Gallmann, Senior Data Scientist at French bank FLOA , answered this question in a 2021 Product Days Session on managing data science projects with Dataiku.

article thumbnail

DataOps Lowers The Cost Of Asking Analytic Questions

DataKitchen

The post DataOps Lowers The Cost Of Asking Analytic Questions first appeared on DataKitchen.

Analytics 246
article thumbnail

How to Build an Experimentation Culture for Data-Driven Product Development

Speaker: Margaret-Ann Seger, Head of Product, Statsig

Experimentation is often seen as an aspirational practice, especially at smaller, fast-moving companies who are strapped for time and resources. So, how can you get your team making decisions in a more data-driven way while continuing to remain lean and maintaining ship velocity? In this webinar, Margaret-Ann Seger, Head of Product at Statsig, will teach you how to build an experimentation culture from the ground-up, graduating from just getting started with data-driven development to operating