Category Archives: Data Science
Statistics Without the Agonizing Pain: John Rauser Keynote at Strata + Hadoop 2014 (Video)
[youtube https://www.youtube.com/watch?v=5Dnw46eC-0o?rel=0] There are two essential skills for the data scientist: engineering and statistics. A great many data scientists are very strong engineers but feel like impostors when it comes to statistics. In this talk John will argue that the … Continue reading
A Statistician’s View on Big Data and Data Science in Pharmaceutical Development
[slideshare id=40244839&doc=bigdatadatasciencepharmaoct2014-141014064735-conversion-gate01]
Gartner Hype Cycle for Emerging Technologies 2014
Source: Gartner. Note that the Internet of Things has replaced big data at the top of the “peak of inflated expectations” (see Gartner’s 2013 Hype Cycle). Both will take another 5 to 10 years to reach the … Continue reading
Josh Wills on Machine Learning in a Business Setting
[youtube https://www.youtube.com/watch?v=IgfRdDjLxe0?rel=0] Academic machine learning is all about optimization. Machine learning in a business setting is all about understanding: “My focus is always on how do I understand what the system is doing, come up with new hypotheses about this … Continue reading
Big Data and Statisticians, Revisited (Video)
[vimeo 91502942 w=500 h=257] Data Science, Big Data and Statistics – can we all live together? from Chalmers Internal on Vimeo. Terry Speed on how (and a bit on why) statisticians have been left out of the big data movement. … Continue reading
The Tom Davenport Guide to Big Data
Big Data at Work: Dispelling the Myths, Uncovering the Opportunities is a new book from Tom Davenport, a veteran observer of the data analysis scene. It’s required reading for managers who need a straightforward, hype-free introduction to big data, … Continue reading
Doing Data Science at Manheim
As ones and zeros eat the world, data is the new product and data science is the new process of innovation. The International Institute for Analytics predicts that in 2014 companies in a variety of industries will increasingly use analytics … Continue reading
Big Data Debates: Individuals Vs. Teams
Gregory Piatetsky recently ran a poll on his popular KDnuggets website where he asked his readers to vote for the preferred way to build data science capabilities in their organizations. The poll was prompted by the strong reaction to a … Continue reading
Joe Hellerstein and Tutti Taygerly on Big Data Moonshots and Predictive Interaction (Video)
[youtube http://www.youtube.com/watch?v=p-ZPnKnt5So] “Automation is not the solution to data transformation and data cleaning.” “Predictive Interaction is a new model for high-level human-data interaction that radically improves the productivity and accessibility of the most time-consuming work in the analytics lifecycle.”
Joe Hellerstein on Washing Data: Bringing Productivity Technology to Data Science (Video)
[youtube http://www.youtube.com/watch?v=3r4hQlpPZLw] Joe Hellerstein: “We are washing our data on the side of the river on stones, we are really in the early early stages of productivity technology in data science.”