25 August 2017
Item Nonresponse and Multiple Imputation
his course provides an introduction to the theory and application of Multiple Imputation (MI) (Rubin 1987) which has become a very popular way for handling missing data, because it allows for correct statistical
inference in the presence of missing data. With the advent of MI algorithms implemented in statistical standard software (SAS, Stata, SPSS,…), the method has become more accessible to data analysts. For didactic purposes, we start by introducing some naive ways of handling missing data, and the investigation of their weaknesses is used to create an understanding of the framework of Multiple Imputation. The first half of this course is of a somewhat theoretical nature, but we believe that a fundamental understanding of the MI principle helps to adapt to a wider range of practical problems than focusing on a few select situations. However, at the latter stages of the course, frequent problems like regression with missing data will be addressed, and further typical situations will be covered by the lab session throughout the course, which is predominantly based on the statistical language R. We recommend basic R skills for this course, but it can be attended without prior knowledge in R, as the main MI algorithms are almost identical across all major software packages.
Therefore, by the end of the course, we hope that people will be able to use MI algorithms in general (irrespective of the software package) and to apply Rubin's combining rules correctly, as well as to explain to readers of their work how and why they used the method.
Dr. Florian Meinfelder is senior lecturer at the Department for Statistics and Econometrics at the University of Bamberg, Germany;
Prof. Dr. Susanne Rässler holds the Chair of Statistics and Econometrics at the University of Bamberg, Germany
Participants will find the course useful if they are:
- survey methodologists working with incomplete data;
- researchers who want to learn more about the analysis of incomplete data in general;
- already aware of MI and its benefits, but feel insecure about the available parameter settings in MI algorithms implemented in their preferred statistical software.
- profound understanding of sampling theory;
- an advanced understanding of the (generalized) linear model;
- familiarity with statistical distributions;
- basic knowledge of matrix algebra;
- solid skills in either R, SPSS or Stata (recommended for exercises).
Knowledge of R can be acquired in the course “Introduction to Data Analysis Using R” by Steinhauer/Würbach in the previous week.
By the end of the course participants will:
- be familiar with the theoretical implications of the MI framework and will be aware of the explicit and implicit assumptions (e.g. will be able to explain within an article why MAR was assumed, etc.);
- know when to use MI (and when not);
- be aware how to specify a "good" imputation model and how to use diagnostics;
- be familiar with the availability of the various MI algorithms;
- be able to not only replicate situations akin to the case studies covered in the course, but also know how to handle incomplete data in general.
- Certificate of attendance issued upon completion.
- 2 ECTS points via the University of Mannheim for regular attendance and satisfactory work on daily assignments (EUR 20).
- 4 ECTS points via the University of Mannheim for regular attendance and satisfactory work on daily assignments and for submitting a paper/report of about 5000 words to the lecturer(s) up to 4 weeks after the end of the summer school (EUR 50).
EUR 250: Student/PhD student rate.
EUR 350: Academic/non-profit rate.
The rates include the tuition fee, course materials, the academic program, access to library and IT facilities, coffee/tea, and a number of social activities.
10 DAAD scholarships are available via the Center for Doctoral Studies in Social and Behavioral Sciences (CDSS) at the University of Mannheim.
5 ESRA scholarships are available for participation in one main course.