4608155: C15 Data Mining & Predictive Analytics

back to overview
Semester:WS 18/19
Type:Module
Language:English
ECTS-Credits:6.0
Scheduled in semester:3
Semester Hours per Week / Contact Hours:52.0 L / 39.0 h
Self-directed study time:141.0 h

Module coordination/Lecturers

Curricula

Master's degree programme in Information Systems (01.09.2015)

Description

Short description
The course covers various statistical techniques for making sense of the vast and complex data sets that have emerged in business in the past twenty years. Students will learn to detect patterns in large data sets of various formats (quantitative and qualitative) and translate them into actionable insights.

Topics

  • Data Visualization and Exploration
  • Supervised learning techniques for regression (e.g. linear regression)
  • Supervised learning techniques for classification (e.g. classification trees)
  • Unsupervised learning techniques (e.g. clustering, dimensionality reduction)
  • Deep Learning Fundamentals
  • Text mining (e.g. topic modeling)
  • Hands-on labs with Python

Lecture Goals

  • Students will know and understand the basic concepts and methods of data mining and predictive analyticsStudents will assess the assumptions and quality of statistical modelsStudents will select and apply the right statistical models for a given task or data setStudents will derive actionable insights from statistical results

Qualifications

Lectures Method

The module integrates theoretical knowledge and practical skills in an interactive lecture. The e-learning platform Moodle will be used throughout the course for the dissemination of course material and discussions.

Admission Requirements

  • Recommended previous knowledgeModule “Business Statistics I”Module “Business Statistics II”Basic knowledge of statistical software R - online course available: tryr.codeschool.com

Literature

  • Compulsory readingJames, G., Witten, D., Hastie, T., & Tibshirani, R. (2013). An Introduction to Statistical Learning. With Applications in R. New York: Springer (a free online version is available at http://www-bcf.usc.edu/~gareth/ISL/) - Further readingProvost, F. & Fawcett, T. (2013). Data Science for Business. Sebastopol: O'Reilly Media

Exam Modalities

Written exam (90min)