Remove 2001 Remove Interactive Remove Risk Remove Statistics
article thumbnail

Speed up queries with the cost-based optimizer in Amazon Athena

AWS Big Data

Amazon Athena is a serverless, interactive analytics service built on open source frameworks, supporting open table file formats. Starting today, the Athena SQL engine uses a cost-based optimizer (CBO), a new feature that uses table and column statistics stored in the AWS Glue Data Catalog as part of the table’s metadata.

article thumbnail

Reclaiming the stories that algorithms tell

O'Reilly on Data

In 2001, just as the Lexile system was rolling out state-wide, a professor of education named Stephen Krashen took to the pages of the California School Library Journal to raise an alarm. The report has pages of careful caveats, but in the end it treats these risk-adjusted ratios as a good measure of a surgeon’s performance.

Risk 354
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Data Science, Past & Future

Domino Data Lab

He was saying this doesn’t belong just in statistics. It involved a lot of work with applied math, some depth in statistics and visualization, and also a lot of communication skills. You see these drivers involving risk and cost, but also opportunity. I can point to the year 2001. Tukey did this paper. All righty.

article thumbnail

Themes and Conferences per Pacoid, Episode 5

Domino Data Lab

What are the projected risks for companies that fall behind for internal training in data science? In terms of teaching and learning data science, Project Jupyter is probably the biggest news over the past decade – even though Jupyter’s origins go back to 2001! In business terms, why does this matter ?

article thumbnail

Themes and Conferences per Pacoid, Episode 12

Domino Data Lab

Consider the following timeline: 2001 – Physics grad students are getting hired in quantity by hedge funds to work on Wall St. The probabilistic nature changes the risks and process required. They tend to use less machine learning, but more advanced statistical practices, since the outcomes (government policies, etc.)

article thumbnail

Data Science at The New York Times

Domino Data Lab

In 2001, Bill Cleveland writes this article saying, “You are doing it wrong.” Here is a picture of The New York Times on its birthday in 1851, and for the vast majority of its lifespan this is pretty much what the user experience of interacting with The New York Times looks like. Editors can interact with this bot.