Leeds, United Kingdom

Data Mining and Text Analytics

online course
when 19 July 2021 - 30 July 2021
language English
duration 2 weeks
credits 5 EC
fee GBP 975

Big data is becoming more and more important in fields from science to marketing, engineering, medicine and government. This module introduces the principles of data mining and text analytics. You'll apply these principles in practical exercises with a data mining toolkit and real data. You'll compare a range of different techniques and algorithms and evaluate their performance.

The module introduces:

Data mining inputs and outputs, instances, attributes, classes and concepts.
Machine learning and data mining with the Waikato Environment for Knowledge Analysis (WEKA) toolkit.
Real-world data-sets and competitions with Kaggle.com.
Cross Industry Standard Process for Data Mining (CRISP-DM).
Evaluation of data mining and text analytics results.
Text classification.
Text search and information retrieval.
You'll take part in practical team work as part of a data mining and text analytics challenge. You're not expected to have previous expertise in data mining but you should be familiar with using and creating data files. For example, Word documents, Excel spreadsheets, PowerPoint presentations, YouTube videos, Wikipedia web pages, Twitter/Facebook or other social media data. Please note you're expected to use your own laptop for this module. Before the summer school, you must have set up user accounts at kaggle.com and a video streaming site eg YouTube. You'll also need to download the free WEKA data mining toolkit from the WEKA open-access website.

Please note, the Leeds International Summer School runs for four weeks (5 - 30 July 2021). You can choose to study with us for two weeks or for the full four-week programme. Each module lasts two weeks so you study one module if you choose to attend for two weeks and two modules if you choose to attend for four weeks.

Course leader

Professor Eric Atwell

Target group

Undergraduate/Bachelor students

Course aim

You will learn the principles of data mining and text analytics; apply these principles in practical exercises with a data mining toolkit and real data; compare a range of different techniques and algorithms and evaluate their performance.

Credits info

5 EC
The module is worth 10 Leeds credits = 5 ECTS. You can transfer the credits earned back to your home degree subject to approval.

Fee info

GBP 975: The fee includes high quality tuition delivered by University of Leeds staff, dedicated time with your tutor each week, all learning materials and software needed for the programme, access to digital University library resources, support to help you get the most out of studying online, academic skills support services, a full social and cultural programme as well as wellbeing and support services.

Scholarships

Scholarships worth £300 for the full four week programme or £150 for a two week block. Scholarships are available for students with good academic standing to help pay your fees. You can apply as part of your application to LISS.