The Role of Data in Statistical Modeling and Machine Learning

Statistical modeling.jpg

Oliver Schabenberger, EVP and CTO, SAS, on The difference between Statistical Modeling and Machine Learning:

We can… distinguish statistical modeling, classical machine learning and modern machine learning by the role of the data.

In statistical modeling, the data guide us to the selection of a stochastic model which serves as the abstraction for making probabilistic statements about questions of interest, such as hypotheses, predictions and forecasts.

In classical machine learning, the data drive the selection of the analytic technique to best perform the task at hand. The data trains the algorithms.

In modern machine learning, the data drive systems based on neural nets that self-determine the regularities in the data in order to learn a task. The process of training the neural network on the data learns the task. As someone put it, “The data does the programming.”

About GilPress

I'm Managing Partner at gPress, a marketing, publishing, research and education consultancy. Also a Senior Contributor forbes.com/sites/gilpress/. Previously, I held senior marketing and research management positions at NORC, DEC and EMC. Most recently, I was Senior Director, Thought Leadership Marketing at EMC, where I launched the Big Data conversation with the “How Much Information?” study (2000 with UC Berkeley) and the Digital Universe study (2007 with IDC). Twitter: @GilPress
This entry was posted in AI, deep learning, Machine Learning, Statistics and tagged . Bookmark the permalink.

Leave a Reply

Your email address will not be published. Required fields are marked *