Sat. Feb 05, 2022 - Fri. Feb 11, 2022


Building the Business Case for DataOps

DataKitchen

The post Building the Business Case for DataOps first appeared on DataKitchen.


Different Types of Cross-Validations in Machine Learning

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction: Model Development is a critical stage in the life cycle of a Data Science project. We attempt to train our data set using various forms of Machine Learning models, either supervised or unsupervised, depending on the Business Problem. Given many models available for […].


Why AI Is Important for Automating Travel Policy Compliance

Smart Data Collective

Artificial intelligence (AI) is a trending topic commonly spoken about globally. It has come to a point where all the repetitive work that we have to do manually is taken care of by the AI. 37% of businesses and organizations employ AI, and about 15% claim to use its capabilities. So, the real question is how can AI help businesses in travel? Well, as far as many businesses are concerned, AI has many advanced capabilities in managing expenses, optimizing travel programs, and improving the overal


Managing Your Reusable Python Code as a Data Scientist

KDnuggets

Here are a few approaches that I have settled on for managing my own reusable Python code as a data scientist, presented from most to least general code use, and aimed at beginners.


Get Better Network Graphs & Save Analysts Time

Many organizations today are unlocking the power of their data by using graph databases to feed downstream analytics, enhance visualizations, and more. Yet, when different graph nodes represent the same entity, graphs get messy. Watch this essential video with Senzing CEO Jeff Jonas on how adding entity resolution to a graph database condenses network graphs to improve analytics and save your analysts time.


New Data Horizons: Data Prep, Data Visualization, and Data Catalogs Are Ready for Prime Time

DataKitchen

The post New Data Horizons: Data Prep, Data Visualization, and Data Catalogs Are Ready for Prime Time first appeared on DataKitchen.


Workflow of MLOps: Part 2 | Model Building

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. This is the 2nd blog of the MLOps series. Introduction: This article is part of an ongoing blog series on Machine Learning Operations (MLOps). In the previous article, we went through the introduction of MLOps. We have seen differences in traditional software development in […].

Modeling 322

More Trending


The Complete Collection of Data Science Cheat Sheets – Part 1

KDnuggets

A collection of cheat sheets that will help you prepare for a technical interview, assessment tests, class presentation, and help you revise core data science concepts.


Doing Power BI The Right Way – for Enterprise Reporting

Paul Turley

I started a series of blog posts back in 2020 about best-practice guidelines for planning and designing enterprise reporting solutions with Power BI. To make the topics covered in this series of posts easier to find and follow, they are listed on this page: Doing Power BI The Right Way – for Enterprise Reporting | Paul Turley's SQL Server BI Blog which you can access from the main menu on the blog.

Reporting 128

Optimal Resource Allocation using Python

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Objective: “True optimization is the revolutionary contribution of modern research to decision processes” – George Dantzig. This article discusses solving a resource allocation problem using linear programming in Python. We will find an optimal value for a linear equation with different linear constraints.


Cloud Technology Makes Virtual Assistants More Beneficial than Ever

Smart Data Collective

More companies are relying on cloud technology than ever before. They are discovering the benefits of using the cloud to utilize data and facilitate communications between employees, customers, contractors and other stakeholders. One of the underappreciated benefits of cloud technology is that it makes it easier to work with virtual assistants. Savvy executives and small business owners realize that virtual assistants can perform many important tasks a lot more efficiently.


Understanding User Needs and Satisfying Them

Speaker: Scott Sehlhorst

We know we want to create products which our customers find to be valuable. Whether we label it as customer-centric or product-led depends on how long we've been doing product management. There are three challenges we face when doing this. The obvious challenge is figuring out what our users need; the non-obvious challenges are in creating a shared understanding of those needs and in sensing if what we're doing is meeting those needs.


How to Learn Math for Machine Learning

KDnuggets

So how much math do you need to know in order to work in the data science industry? The answer: Not as much as you think.


#ClouderaLife Spotlight: Marque Blackman, Director of Global Workplace

Cloudera

As we celebrate Black History Month, for this Employee Spotlight I sat down with Marque Blackman, co-lead of the Cloudera Black Employee Network (CBEN). We discussed his experience at Cloudera, his career transitions, and what he learned along the way. We also discussed his work with CBEN and his perspective on Black History Month. Meet Marque Blackman, Director of Global Workplace.


11 Extensions to Power Up your Jupyter Notebook

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. […]. The post 11 Extensions to Power Up your Jupyter Notebook appeared first on Analytics Vidhya.


5 Data Security Strategies Businesses Should Implement

Smart Data Collective

We have witnessed some horrifying data breaches over the last year. One of the worst was when a team of Chinese hackers penetrated the security of Microsoft Exchange and accessed the accounts of over 250,000 global organizations. The Colonial Pipeline and SolarWinds were also victims of hackers. While large corporations like these will continue to be targets for data breaches, small businesses are also at risk.

Strategy 115

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.


Junior Data Scientist: The Next Level

KDnuggets

Junior, Mid-Level, and Senior Data Scientists differ in their level of experience. This article will go through the expectations for each job role and what is required to move up the ladder.


Getting Started with Machine Learning

Cloudera

In recent years, Ethical AI has become an area of increased importance to organisations. Advances in the development and application of Machine Learning (ML) and Deep Learning (DL) algorithms require greater care to ensure that the ethics embedded in previous rule-based systems are not lost. This has led to Ethical AI being an increasingly popular search term and the subject of many industry analyst reports and papers.


Folder Management in Python

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Overview: “You’re either the one that creates the automation or you’re getting automated.” – Tom Preston-Werner. Automation affects almost every aspect of modern life, and it can be used in any industry. Automation minimizes human input and eliminates repetitive tasks.


Doing Power BI the Right Way: 8. Delivery options

Paul Turley

Part of the series: Doing Power BI the Right Way. When you sign up for the Power BI service at PowerBI.com (this address redirects to App.PowerBI.com), you establish a tenant for your organization, hosted in the Azure cloud. Even if you set up a 90-day trial account, you have a tenant that you can upgrade later on.


Manufacturing Sustainability Surge: Your Guide to Data-Driven Energy Optimization & Decarbonization

Speaker: Kevin Kai Wong, President of Emergent Energy Solutions

In today's industrial landscape, the pursuit of sustainable energy optimization and decarbonization has become paramount. Manufacturing corporations across the U.S. are facing the urgent need to align with decarbonization goals while enhancing efficiency and productivity. Unfortunately, the lack of comprehensive energy data poses a significant challenge for manufacturing managers striving to meet their targets.


The Not-so-Sexy SQL Concepts to Make You Stand Out

KDnuggets

Databases are the houses of our data and data scientists HAVE TO HAVE A KEY! In this article, I discuss some lesser known concepts of SQL that data scientists do not familiarize themselves with.


IP Scores Are Crucial to the Future of Data Security in 2022

Smart Data Collective

Have you stopped to think about the state of the Internet and the role it plays in our daily lives? We are more connected today than ever before. The Internet has unquestionably brought a lot of benefits to our lives. However, it has also created a lot of risks. As more data is stored over the Internet, we are more vulnerable than ever. In the first six months of 2019, over 4.1 billion records were exposed in data breaches.

Risk 75

Heart Disease Prediction using Machine Learning

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Overview: In this article, we will be working closely with heart disease prediction, and for that we will be looking into the heart disease dataset. From that dataset we will derive various insights that help us know the weightage of each feature and […]. The post Heart Disease Prediction using Machine Learning appeared first on Analytics Vidhya.


A Visual Tool for Exploring Word Embeddings

Edwin Chen

I built a visualization to explore embeddings a few years ago, but never posted it more broadly. So here it is! [link]. These are GloVe embeddings projected into 2D, colorized via k-means in the original space. You can see, for example, that the cluster in pink ….


Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.


5 Ways to Apply AI to Small Data Sets

KDnuggets

It is better to use AI algorithms on small data sets for results free of human errors and false results when applied correctly. Here are some methods to apply AI to small data sets.

IT 114

Where ML Research Meets Data Science Practice: Data Changes & Model Drift

Dataiku

Every year, our Dataiku Lab team presents their annual findings for up-and-coming machine learning (ML) trends in a webinar, based on the extensive work they do in ML research all year. This year, though, we wanted to take a new approach and, instead of solely highlighting the cutting-edge research trends in the space for 2022, we wanted to root that research in reality with real-life data science and AI projects from 2021.


Exploratory Data Analysis in Python

Analytics Vidhya

Overview: Understanding how EDA is done in Python; various steps involved in exploratory data analysis; performing EDA on a given dataset. Introduction: Exploratory data analysis, popularly known as EDA, is a process of performing some initial investigations on the dataset to discover the structure and the content of the given dataset. It is often […].

Analytics 295

Kai Ming makes more data-driven decisions with IBM Cognos Analytics

IBM Big Data Hub

In the 1960s, emerging research on the effects of poverty and its impact on education came to light. This research indicated an obligation to help disadvantaged groups, compensating for inequality in social or economic conditions. In January 1964, a former teacher and then-President Lyndon B. Johnson declared a “war on poverty.” They established Head Start, a program to promote the school readiness of infants, toddlers and preschool-aged children from low-income families as part of t


The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Communication

Speaker: David Bard, Principal at VP Product Coaching

In the fast-paced world of digital innovation, success is often accompanied by a multitude of challenges - like the pitfalls lurking at every turn, threatening to derail the most promising projects. But fret not, this webinar is your key to effective product development! Join us for an enlightening session to empower you to lead your team to greater heights.


Build a Web Scraper with Python in 5 Minutes

KDnuggets

In this article, I will show you how to create a web scraper from scratch in Python.


Building Dash Webapps in Dataiku for Self-Service Analytics

Dataiku

To facilitate full systemization of data and AI, it is important to allow as many users to access, interact with, and gather insights from relevant data as possible. Many existing processes only allow IT administrators or analysts to extract data from databases or data warehouses, and users who are not familiar with SQL face challenges in extracting and analyzing the data they require for their business needs.


Guide On Customer Churn: Don’t Just Predict, Prevent it!

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction: Phonepe and Google Pay (Tez) are ubiquitous names in the Indian payment ecosystem and the top two players in the area. According to the Phonepe pulse report, it has 133 million monthly active users as of July’21. For the Q3-21 quarter, the total transactions were 526.8 Cr […].

IT 295

We must check for racial bias in our machine learning models

IBM Big Data Hub

As a data scientist for IBM Consulting, I’ve been fortunate enough to work on several projects to fulfill the various needs of IBM clients. Over my time at IBM, I have seen technology applied to various use cases that I would have never originally considered possible, which is why I was thrilled to steward the implementation of artificial intelligence to address one of the most insidious societal issues we face today, racial injustice.


Entity Resolution Checklist: What to Consider When Evaluating Options

Are you trying to decide which entity resolution capabilities you need? It can be confusing to determine which features are most important for your project. And sometimes key features are overlooked. Get the Entity Resolution Evaluation Checklist to make sure you’ve thought of everything to make your project a success! The list was created by Senzing’s team of leading entity resolution experts, based on their real-world experience.