Tue. Nov 30, 2021

Data Sovereignty & Cross-Border Movement of Sensitive Data

Alation

One of the 14 key controls released with the EDM Council’s new Cloud Data Management Capability (CDMC) framework focuses on data sovereignty and cross-border movement. It’s critically needed, but highly complex and difficult to fully comprehend, let alone solve. Local laws matter. The focus of the capability is compliance with all laws and regulations for the handling of sensitive data within a specific jurisdiction where data resides.

Good ETL Practices with Apache Airflow

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction to ETL: ETL is a three-step data integration process (Extraction, Transformation, Load) used to combine data from multiple sources. It is commonly used to build Big Data. In this process, data is pulled (extracted) from a source system, to […]. The post Good ETL Practices with Apache Airflow appeared first on Analytics Vidhya.
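
The three ETL steps named in this teaser can be sketched in plain Python. This is a minimal illustration only and not the article's Airflow code: the sample CSV text, the `sales` table, and the function names `extract`, `transform`, and `load` are all assumptions made for the example, and it uses only the standard library.

```python
import csv
import io
import sqlite3

# Hypothetical raw input standing in for a source system.
RAW_CSV = "name,amount\nalice, 10 \nbob,not_a_number\ncarol,7.5\n"


def extract(text):
    """Extract: read rows from the source (here, an in-memory CSV string)."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: normalize names, parse amounts, and skip unparseable rows."""
    clean = []
    for row in rows:
        try:
            amount = float(row["amount"].strip())
        except ValueError:
            continue  # drop rows whose amount is not a number
        clean.append((row["name"].strip().title(), amount))
    return clean


def load(records, conn):
    """Load: write the cleaned records into the target table."""
    conn.execute("CREATE TABLE IF NOT EXISTS sales (name TEXT, amount REAL)")
    conn.executemany("INSERT INTO sales VALUES (?, ?)", records)
    conn.commit()


conn = sqlite3.connect(":memory:")  # throwaway in-memory target database
load(transform(extract(RAW_CSV)), conn)
rows = conn.execute("SELECT name, amount FROM sales ORDER BY name").fetchall()
print(rows)
```

Keeping each step as its own function mirrors how an orchestrator would typically schedule them as separate tasks, but that mapping is left out here.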

Big Data 382

TIBCO Broadens Portfolio for Improved Analytics Efficiency

David Menninger's Analyst Perspectives

TIBCO is a large, independent cloud-computing and data analytics software company that offers integration, analytics, business intelligence and events processing software. It enables organizations to analyze streaming data in real time and provides the capability to automate analytics processes. It offers more than 200 connectors, more than 200 enterprise cloud computing and application adapters, and more than 30 non-relational structured query language databases, relational database management

Analytics 158

String Data Structure in Python | Complete Case study

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. The string is a standard data type in Python. You will find a string data type in every application programming language, such as Java, Python, and C++, because while developing an application you need to talk to the user, and that is done in strings. Whereas […]. The post String Data Structure in Python | Complete Case study appeared first on Analytics Vidhya.

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.

KDnuggets: Personal History and Nuggets of Experience

KDnuggets

After 28+ years of publishing and editing KDnuggets, I am retiring and transitioning KDnuggets to Matthew Mayo, who will become the new editor-in-chief. I want to share with you my story of KDnuggets and highlight some of the useful nuggets of experience I learned along this amazing journey.

Tune ML Models in No Time with Optuna

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. A comprehensive guide to finding the best hyperparameters for your model efficiently. Tuning hyperparameters is more efficient with Bayesian optimization algorithms than with brute-force algorithms. You will see how to find the best hyperparameters for an XGBoost regressor in this article.

Modeling 334

More Trending

How To Containerize Your Data Science Workflow With Docker System

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. In my case, I was first introduced to Docker when I had to containerize a Machine Learning workflow in my organization. At that time I realized that Docker makes Machine Learning Engineers’ lives much easier. Not only that we can deploy and run applications […]. The post How To Containerize Your Data Science Workflow With Docker System appeared first on Analytics Vidhya.

Clustering in Crowdsourcing: Methodology and Applications

KDnuggets

As a result of the efforts outlined in this article, we confirmed that clustering through crowdsourcing is indeed possible and works impressively well.

Basic understanding of Time Series Modelling with Auto ARIMAX

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction: Data science is associated with a huge variety of problems in our daily life. One major problem we see every day involves examining a situation over time. Time series forecasting is extensively used in various scenarios like sales, weather, and prices, where the […].

Modeling 344

Put Responsible AI into Practice—attend the digital event on December 7

KDnuggets

Learn best practice guidelines for building AI solutions responsibly. Join AI experts from Microsoft and BCG at Put Responsible AI into Practice—a free Azure digital event on December 7.

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

A Comprehensive Guide on Inferrd : The easiest way to deploy ML models

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Introduction: Deployment is a way to integrate your machine learning model into your existing production environment and make practical business decisions based on your data. This is one of the final stages of the machine learning life cycle and can be one of the […]. The post A Comprehensive Guide on Inferrd : The easiest way to deploy ML models appeared first on Analytics Vidhya.

Modeling 324

Intro to R and Power BI Presentation and a Presenting Secret

Jen Stirrup

I was all set to present this session at the European Collaboration Summit in November 2021, but the organizers needed to change the time and date of my session which was rescheduled to take place after I’d left to go back home. So, I was sorry not to be able to present the session but my plane was all organized and I could not change it at the last moment.

The Cloudera Enterprise Data Cloud Maturity Report: Uncovering progressive steps towards a hybrid future

Cloudera

This guest blog was written by Shanice Omare, Research Manager, Vanson Bourne. Organizations’ resiliency in the wake of the pandemic. So much has changed for organizations in recent times, with the pandemic accelerating shifts toward a more digital world. Some organizations have taken this as an opportunity for positive change by moving workloads to the cloud and utilizing enterprise data strategies that are key to their business resiliency.

How to Build an Experimentation Culture for Data-Driven Product Development

Speaker: Margaret-Ann Seger, Head of Product, Statsig

Experimentation is often seen as an aspirational practice, especially at smaller, fast-moving companies that are strapped for time and resources. So, how can you get your team making decisions in a more data-driven way while remaining lean and maintaining ship velocity? In this webinar, Margaret-Ann Seger, Head of Product at Statsig, will teach you how to build an experimentation culture from the ground up, graduating from just getting started with data-driven development to operating

Sentiment Analysis API vs Custom Text Classification: Which one to choose?

KDnuggets

In this article, we are going to compare the sentiment extraction performance between Sentiment Analysis engines and Custom Text classification engines. The idea is to show pros and cons of these two types of engines on a concrete dataset.

Top Mistakes in Data Migration Projects

TDAN

Having been involved in several large-scale data migration projects, my company has come across some common recurring themes that have the potential to derail not only the data migration itself, but the entire parent program. In this series, I will provide an overview of the top five mistakes that should be avoided […].

Bring New People and Roles Into AI Projects With Dataiku 10

Dataiku

Analysts, data engineers, and data scientists have always been core roles that contribute to advanced analytics projects. But in order for an organization to scale AI initiatives with a more systematic approach, the development, operationalization, and oversight of AI projects must also include contributors from different parts of the organization, including IT operators, project managers, risk managers, and subject matter experts (SMEs).

Reimagined: Building Products with Generative AI

“Reimagined: Building Products with Generative AI” is an extensive guide for integrating generative AI into product strategy and careers featuring over 150 real-world examples, 30 case studies, and 20+ frameworks, and endorsed by over 20 leading AI and product executives, inventors, entrepreneurs, and researchers.

The Data-Centric Revolution: Data-Centric Accounting

TDAN

I didn’t set out to rewrite the rules of accounting. It just sort of happened. It is a tale of emergence and synchronicity. So far, everyone we’ve reviewed our tentative findings with is enthused and eager for us to finish our experiments and publish. This blog is the first sneak preview of what we believe […].

How do you identify an expert?

3AG Systems

How do you identify an expert? Let’s look at what defines an expert. Training and experience, right? But how do you tell an expert apart from a non-expert? What does an expert look like? That’s easy to answer, right? A doctor wears a white coat. A mechanic wears overalls. And a plumber wears, well, let's say low-cut jeans, shall we? So if it looks like an expert, and talks like an expert, it’s probably an expert, right?

Through the Looking Glass: Caught in the Web of Data

TDAN

“There is another undefined frontier, that of time. A living language is in a continuous state of change, of ‘slow but incessant dissolution and renovation’…Where then, does one fix the dates of entry and termination of a word’s ‘current usage’?”[1] As data professionals, we’ve all been there: preparing a set of data for migration from […].

Nutanix Clusters on AWS - New Improvements to Hibernate & Resume Feature

Nutanix

An enhanced architecture now makes it faster and more cost-efficient to hibernate and resume your Nutanix cluster on public clouds like AWS.

IT 34

Entity Resolution Checklist: What to Consider When Evaluating Options

Are you trying to decide which entity resolution capabilities you need? It can be confusing to determine which features are most important for your project. And sometimes key features are overlooked. Get the Entity Resolution Evaluation Checklist to make sure you’ve thought of everything to make your project a success! The list was created by Senzing’s team of leading entity resolution experts, based on their real-world experience.

The Power of the Subject Matter Expert (SME)

TDAN

The knowledge of a Subject Matter Expert, or SME, can make or break any project of any type. It does not matter if the project is architecture, construction, business strategy, or one of the many facets of data and information management. To become a SME, a team needs to have lived and worked in […].

Traverse Trees Using Level Order Traversal in Python

Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Overview: Trees are a non-linear data structure type. The trees are composed of nodes grouped in a hierarchical fashion. It begins with a single root node that may have child nodes of its own. All nodes are linked by edges. We can use trees […]. The post Traverse Trees Using Level Order Traversal in Python appeared first on Analytics Vidhya.
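
As a rough illustration of the tree structure and traversal this teaser describes, the sketch below visits a tree level by level using a queue. It is not taken from the linked article: the `Node` class, the `level_order` function, and the sample tree are assumptions made for the example.

```python
from collections import deque
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    """A tree node holding a value and any number of child nodes."""
    value: int
    children: List["Node"] = field(default_factory=list)


def level_order(root: Optional[Node]) -> List[List[int]]:
    """Return node values grouped by depth, starting from the root."""
    if root is None:
        return []
    levels = []
    queue = deque([root])
    while queue:
        level = []
        # Everything currently in the queue belongs to the same depth.
        for _ in range(len(queue)):
            node = queue.popleft()
            level.append(node.value)
            queue.extend(node.children)  # children are visited on the next pass
        levels.append(level)
    return levels


# Hypothetical sample tree: 1 is the root, 2 and 3 are its children.
root = Node(1, [Node(2, [Node(4), Node(5)]), Node(3, [Node(6)])])
print(level_order(root))  # [[1], [2, 3], [4, 5, 6]]
```

A `deque` is used because removing from the left end of a plain list costs time proportional to its length.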

Accessibility Quick Wins: Remove Legends and Directly Label

Depict Data Studio

How do we make our graphs more accessible? There’s a misconception that accessibility takes all day, that it’s costly, or that it’s complicated. Those are all false. Accessibility is woven into all my trainings, but since this is a topic I get asked about a lot, I decided to make a new talk that’s focused just on accessibility for dataviz.

Building Massively Scalable Machine Learning Pipelines with Microsoft Synapse ML

KDnuggets

The new platform provides a single API to abstract dozens of ML frameworks and databases.

Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity

Speaker: Nicholas Zeisler, CX Strategist & Fractional CXO

The first step in a successful Customer Experience endeavor (or for that matter, any business proposition) is to find out what’s wrong. If you can’t identify it, you can’t fix it! 💡 That’s where the Voice of the Customer (VoC) comes in. Today, far too many brands do VoC simply because that’s what they think they’re supposed to do; that’s what all their competitors do.