4408155: C15 Data Mining & Predictive Analytics

zurück zur Übersicht
Semester:WS 17/18
Art:Modul
Sprache:Englisch
ECTS-Credits:6.0
Plansemester:3
Lektionen / Semester:52.0 L / 39.0 h
Selbststudium:141.0 h

Modulleitung/Dozierende

Studiengang

Masterstudium Information Systems (01.09.2015)

Beschreibung

Short description
The course covers various statistical techniques for making sense of the vast and complex data sets that have emerged in business in the past twenty years. Students will learn to detect patterns in large data sets of various formats (quantitative and qualitative) and translate them into actionable insights.

Topics

  • Supervised learning techniques for regression (e.g. linear regression)
  • Supervised learning techniques for classification (e.g. classification trees)
  • Unsupervised learning techniques (e.g. clustering, dimensionality reduction)
  • Text mining (e.g. topic modeling)
  • Hands-on labs with R

Ziele

  • Students will know and understand the basic concepts and methods of data mining and predictive analyticsStudents will assess the assumptions and quality of statistical modelsStudents will select and apply the right statistical models for a given task or data setStudents will derive actionable insights from statistical results

Kompetenzen

Lehrmethoden

  • The module integrates theoretical knowledge and practical skills in an interactive lecture. The e-learning platform Moodle will be used throughout the course for the dissemination of course material and discussions.

Voraussetzungen (inhaltlich)

  • Recommended previous knowledgeModule “Business Statistics I”Module “Business Statistics II”Basic knowledge of statistical software R - online course available: tryr.codeschool.com

Literatur

  • Compulsory readingJames, G., Witten, D., Hastie, T., & Tibshirani, R. (2013). An Introduction to Statistical Learning. With Applications in R. New York: Springer (a free online version is available at http://www-bcf.usc.edu/~gareth/ISL/) - Further readingProvost, F. & Fawcett, T. (2013). Data Science for Business. Sebastopol: O'Reilly Media

Prüfungsmodalitäten

Written exam (90min)