The Data Foundation for Machine Learning