article thumbnail

Why you should care about debugging machine learning models

O'Reilly on Data

If a model is going to be used on all kinds of people, it’s best to ensure the training data has a representative distribution of all kinds of people as well. Interpretable ML models and explainable ML. The debugging techniques we propose should work on almost any kind of ML-based predictive model.

article thumbnail

Proposals for model vulnerability and security

O'Reilly on Data

The objective here is to brainstorm on potential security vulnerabilities and defenses in the context of popular, traditional predictive modeling systems, such as linear and tree-based models trained on static data sets. Applying data integrity constraints on live, incoming data streams could have the same benefits.

Modeling 227
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

How Financial Services and Insurance Streamline AI Initiatives with a Hybrid Data Platform

Cloudera

Perhaps the biggest challenge of all is that AI solutions—with their complex, opaque models, and their appetite for large, diverse, high-quality datasets—tend to complicate the oversight, management, and assurance processes integral to data management and governance. Train and upskill employees.

article thumbnail

How SikSin improved customer engagement with AWS Data Lab and Amazon Personalize

AWS Big Data

An AWS Glue crawler populates the AWS Glue Data Catalog with the data schema definitions (in a landing folder). AWS Glue is a serverless data integration service that makes it easier to discover, prepare, move, and integrate data from multiple sources for analytics, ML, and application development.