Why Apache Hadoop for data science? - RSVP Now
April 11, 7am - 7:45am, Sheraton 3
Speaker(s): Ofer Mendelevitch, Director of Data Sciences at Hortonworks
Duration: 45 minutes
Data scientists are using tools like R, Matlab and SAS for advanced data analysis and to build machine-learning models. With the advent of big data, and the integration of Apache Hadoop into the data architecture of enterprise IT, it is now possible to apply data science to much larger datasets than ever before. In this session, Ofer will discuss the advantages of using Hadoop for data science and describe some typical usage patterns like large-scale pre-processing, exploratory data analysis, and online personalization.