Social Sciences
When:
12 August - 16 August 2024
School:
GESIS Summer School in Survey Methodology
Institution:
GESIS-Leibniz Institute for the Social Sciences
Language:
English
Credits:
4.0 EC
Fee:
550 EUR
A variety of digital data sources are providing new avenues for empirical social science research. In order to effectively utilize these data for answering substantive research questions, a modern methodological toolkit paired with a critical perspective on data quality is needed. This course introduces state-of-the-art data science techniques that are suited for collecting and analyzing digital behavioral data, so-called "big data", and traditional survey data. In addition, aspects of data quality and error frameworks for digital (big) data sources are discussed.
The course will cover the following topics and techniques:
- Overview of Big Data: What is it and why does it matter?
- Total Survey Error for Big Data
- Web Scraping
- Machine Learning for Social Scientists
- Regularized regression
- Tree-based methods
- Support vector machines
- LLMs
After the course, you will have a profound understanding of important methods from the data science toolkit for collecting and analyzing the data types mentioned. You will be able to apply these methods and techniques in your research using statistical software.
Anna-Carolina Haensch is an assistant research professor at the University of Maryland (UMD) and a senior researcher at LMU Munich.
You will find the course useful if:
- you are interested in learning some fundamental techniques in data science,
- you want to collect and work with digital behavioral data, be it administrative data or data found online,
- you want to understand what machine learning is.
Prerequisites
- General knowledge of statistics and statistical modelling (e.g., regression)
- Prior experience with syntax-based software (like R, Stata, or Python)
Some basic experience with programming in R is very helpful, but not strictly necessary. For those without prior exposure to R, we will ensure everyone is able to execute R Markdown files. If you have no previous R knowledge, we encourage you to work through one or more R tutorials prior to the course. Some resources can be found here:
https://rstudio.cloud/learn/primers
http://www.statmethods.net/
https://swirlstats.com/
https://rmarkdown.rstudio.com/lesson-1.html (for R Markdown)
By the end of the course, you will:
- understand the challenges when analyzing digital behavioral data,
- know the promises and benefits of (supervised) machine learning,
- be able to use (supervised) machine learning for data analysis,
- know some of the metrics used to assess the quality of the data types collected.
Fee
550 EUR, Student/PhD student rate.
825 EUR, Academic/non-profit rate.
The rates include the tuition fee, course materials, the academic program, and coffee/tea breaks.