Remove 09 24 open-source-data-science-projects-2019
article thumbnail

Themes and Conferences per Pacoid, Episode 11

Domino Data Lab

Paco Nathan ‘s latest article covers program synthesis, AutoPandas, model-driven data queries, and more. In other words, using metadata about data science work to generate code. In this case, code gets generated for data preparation, where so much of the “time and labor” in data science work is concentrated.

Metadata 105
article thumbnail

Themes and Conferences per Pacoid, Episode 9

Domino Data Lab

Software engineering made major breakthroughs two decades ago by applying reductionist techniques to project planning and management. Admittedly, throughout large swaths of computer science, reductionism serves quite well. The lens of reductionism and an overemphasis on engineering becomes an Achilles heel for data science work.

article thumbnail

Natural Language in Python using spaCy: An Introduction

Domino Data Lab

The complementary Domino project is also available. This article and paired Domino project provide a brief introduction to working with natural language (sometimes called “text analytics”) in Python using spaCy and related libraries. Check out the Domino project to run the code. Introduction. Getting Started.