Remove 2017 Remove Blog Remove Data Collection Remove Risk
article thumbnail

Top 7 Data Governance Blog Posts of 2018

erwin

The driving factors behind data governance adoption vary. Whether implemented as preventative measures (risk management and regulation) or proactive endeavors (value creation and ROI), the benefits of a data governance initiative is becoming more apparent. Defining Data Governance. The Top 6 Benefits of Data Governance.

article thumbnail

Leveraging Data Analytics in the Fight Against Prescription Opioid Abuse

Cloudera

Since the 1990s, opioid abuse in the US skyrocketed to the point that in 2017 the Department of Health and Human Services declared the opioid crisis a public health emergency. With the Controlled Substance Analytics platform online, KMC has eliminated manual data collection and streamlined data processing.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Strengthening cybersecurity in life sciences with IBM and AWS

IBM Big Data Hub

According to a report on mapping the cloud maturity curve from the EIU , 48% of industry executives said cloud has improved data access, analysis and utilization, 45% say cloud has sped up delivery of new IT services and capabilities, and 44% say cloud has expanded sales channels across digital avenues.

article thumbnail

Our quest for robust time series forecasting at scale

The Unofficial Google Data Science Blog

Facebook in a recent blog post unveiled Prophet , which is also a regression-based forecasting tool. They can arise from data collection errors or other unlikely-to-repeat causes such as an outage somewhere on the Internet. The regression-based bsts framework can handle predictor variables, in contrast to our approach.

article thumbnail

Themes and Conferences per Pacoid, Episode 9

Domino Data Lab

The lens of reductionism and an overemphasis on engineering becomes an Achilles heel for data science work. Instead, consider a “full stack” tracing from the point of data collection all the way out through inference. Finale Doshi-Velez, Been Kim (2017-02-28) ; see also the Domino blog article about TCAV.

article thumbnail

ML internals: Synthetic Minority Oversampling (SMOTE) Technique

Domino Data Lab

Insufficient training data in the minority class — In domains where data collection is expensive, a dataset containing 10,000 examples is typically considered to be fairly large. This carries the risk of this modification performing worse than simpler approaches like majority under-sampling. Chawla et al.

article thumbnail

Themes and Conferences per Pacoid, Episode 6

Domino Data Lab

People who attended JupyterCon 2017–2018 can attest, an “industry poster session” includes an open bar, catered hors d’oeuvres, lots of mingling … to paraphrase feedback from JupyterCon, “As a tech person, would I get up extra early to meet strangers for coffee at 8:00 am? The ability to measure results (risk-reducing evidence).