article thumbnail

What Are the Most Important Steps to Protect Your Organization’s Data?

Smart Data Collective

Based on figures from Statista , the volume of data breaches increased from 2005 to 2008, then dropped in 2009 and rose again in 2010 until it dropped again in 2011. They can use AI and data-driven cybersecurity technology to address these risks. The instances of data breaches in the United States are rather interesting. In summary.

Testing 122
article thumbnail

New Thinking, Old Thinking and a Fairytale

Peter James Thomas

Of course it can be argued that you can use statistics (and Google Trends in particular) to prove anything [1] , but I found the above figures striking. Feel free to substitute Data Lake for Data Warehouse if you want a more modern vibe, sadly it won’t change the failure statistics. . [5]. – CIO.com 2010. “61%

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Changing assignment weights with time-based confounders

The Unofficial Google Data Science Blog

One reason to do ramp-up is to mitigate the risk of never before seen arms. For example, imagine a fantasy football site is considering displaying advanced player statistics. A ramp-up strategy may mitigate the risk of upsetting the site’s loyal users who perhaps have strong preferences for the current statistics that are shown.

article thumbnail

Proposals for model vulnerability and security

O'Reilly on Data

Like many others, I’ve known for some time that machine learning models themselves could pose security risks. An attacker could use an adversarial example attack to grant themselves a large loan or a low insurance premium or to avoid denial of parole based on a high criminal risk score. 2010): 121-148. Barreno, Marco, et al.

Modeling 222
article thumbnail

Estimating the prevalence of rare events — theory and practice

The Unofficial Google Data Science Blog

But importance sampling in statistics is a variance reduction technique to improve the inference of the rate of rare events, and it seems natural to apply it to our prevalence estimation problem. High Risk 10% 5% 33.3% Statistical Science. Statistics in Biopharmaceutical Research, 2010. [4] 16 (2): 101–133. [3]

Metrics 98
article thumbnail

10 Fundamental Web Analytics Truths: Embrace 'Em & Win Big

Occam's Razor

Part of it is fueled by a vocal minority genuinely upset that 10 years on we are still not a statistically powered bunch doing complicated analysis that is shifting paradigms. Yet case studies in some sense reduced risk, even if they were simply over blown marketing fluff written by the vendor. Part of it fueled by some Consultants.

Analytics 118
article thumbnail

Unintentional data

The Unofficial Google Data Science Blog

1]" Statistics, as a discipline, was largely developed in a small data world. More people than ever are using statistical analysis packages and dashboards, explicitly or more often implicitly, to develop and test hypotheses. This question is statistical or methodological in nature. Know what matters.