The aim of this course is to study and apply the most relevant statistical models for the analysis of large data sets.
The perspective is mainly applied: choosing and applying suitable models to exploit the full information content of (large) data sets, with particular attention to the correct and contextualized interpretation of the final results. The course also covers some frameworks for managing large data sets, such as MapReduce for data clustering.
The course will be taught through the interactive use of open-source software such as R and Python, so that students learn the complete analysis workflow in practice.
Particular emphasis will be given to social network data, textual data, and business and financial case studies.
For further information, please refer to the Online Courses Catalogue.
For lecture materials, please refer to KIRO.