Sat. Dec 15, 2018

Data Scientist’s Dilemma – The Cold Start Problem

Rocket-Powered Data Science

The ancient philosopher Confucius has been credited with saying “study your past to know your future.” This wisdom applies not only to life but also to machine learning. Specifically, the availability and application of labeled data (things past) for labeling previously unseen data (things future) are fundamental to supervised machine learning.

Using the SSIS Multiple Flat Files Connection Manager

Tim Mitchell

When building an ETL pipeline to import data from a text file, it’s very common to have the incoming data spread across multiple files. For example, if you are ingesting files generated on a periodic basis (per day, per hour, etc.), you could have dozens or hundreds of files with identical structure. This is an ideal setup for building a…
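
The multi-file pattern in the excerpt above can also be illustrated outside SSIS. What follows is a minimal Python sketch, not the SSIS Multiple Flat Files Connection Manager itself. It assumes the incoming files are CSVs that share one header row; the function name `read_matching_files` and the `source_file` column are hypothetical names chosen for illustration.

```python
import csv
import glob
import os


def read_matching_files(pattern):
    """Read every CSV file matching `pattern` and return the combined rows.

    Assumption: all matching files share the same header row.
    """
    rows = []
    expected_header = None
    # Sort the matches so files are processed in a predictable order.
    for path in sorted(glob.glob(pattern)):
        with open(path, newline="") as handle:
            reader = csv.DictReader(handle)
            if expected_header is None:
                expected_header = reader.fieldnames
            elif reader.fieldnames != expected_header:
                raise ValueError(f"{path} has a different header")
            for row in reader:
                # Record the source file so each row can be traced back.
                row["source_file"] = os.path.basename(path)
                rows.append(row)
    return rows
```

A call such as `read_matching_files("incoming/sales_*.csv")` (a hypothetical path) would load every matching file through one wildcard pattern, which is the same idea the excerpt describes: one definition of the file layout applied to many files.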