Tampere University Student’s Guide

Course unit, academic year 2025–2026
DATA.ML.370

Mining of Big Datasets, 5 cr

Tampere University
Teaching periods
Active in period 3 (1.1.2026–1.3.2026)
Active in period 4 (2.3.2026–31.5.2026)
Code
DATA.ML.370
Language of instruction
English, Finnish
Academic years
2024–2025, 2025–2026, 2026–2027
Level of studies
Intermediate studies
Grading scale
General scale, 0–5
Responsible person
Responsible teacher:
Tarmo Lipping
Responsible teacher:
Jari Turunen
Responsible organisation
Faculty of Information Technology and Communication Sciences 100 %
Coordinating organisation
Computing Sciences Studies 100 %
Core content
  • The concept and terminology of data mining
  • Understanding the principles of processing large, unstructured datasets
  • Basic methods and algorithms for the analysis of large datasets
  • Common tasks in mining large datasets, such as similarity analysis, link analysis, finding frequent itemsets, and clustering
  • Common applications of mining large datasets, such as recommendation systems, web search, and mining of social network graphs
Complementary knowledge
  • Mining data streams
  • Special challenges of processing large datasets, such as memory usage and data formats
  • Deep learning methods in mining large datasets
Specialist knowledge
  • MapReduce
  • Locality-sensitive hashing
  • Distance measures
  • More advanced algorithms for mining large datasets
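As a rough illustration of two of the topics listed above (similarity analysis and distance measures), the sketch below compares exact Jaccard similarity with a MinHash estimate, the kind of compact signature that locality-sensitive hashing typically builds on. It is not taken from the course material: the function names, the number of hash functions, the linear hash family, and the use of CRC32 to turn strings into integers are all assumptions made for this example.

```python
import random
import zlib

PRIME = 2_147_483_647  # 2^31 - 1, modulus for the illustrative hash family


def jaccard(a, b):
    """Exact Jaccard similarity |A ∩ B| / |A ∪ B| of two sets."""
    if not a and not b:
        return 1.0  # convention chosen here: two empty sets count as identical
    return len(a & b) / len(a | b)


def minhash_signature(items, num_hashes=200, seed=0):
    """MinHash signature of a non-empty set of strings.

    Each hash function has the form h(x) = (a * x + b) mod PRIME. This is an
    assumed, illustrative choice, not a prescribed one. The same seed must be
    used for every set that is going to be compared.
    """
    rng = random.Random(seed)
    coeffs = [(rng.randrange(1, PRIME), rng.randrange(0, PRIME))
              for _ in range(num_hashes)]
    # CRC32 gives a stable integer per string (the built-in hash() of str
    # varies between Python processes).
    codes = [zlib.crc32(item.encode("utf-8")) for item in items]
    return [min((a * x + b) % PRIME for x in codes) for a, b in coeffs]


def estimate_similarity(sig_a, sig_b):
    """Fraction of signature positions that agree; approximates Jaccard."""
    matches = sum(1 for x, y in zip(sig_a, sig_b) if x == y)
    return matches / len(sig_a)


if __name__ == "__main__":
    doc_a = {"big", "data", "mining", "course"}
    doc_b = {"big", "data", "analysis", "course"}
    print("exact Jaccard:", jaccard(doc_a, doc_b))
    print("MinHash estimate:",
          estimate_similarity(minhash_signature(doc_a), minhash_signature(doc_b)))
```

The signature has a fixed length regardless of how large the sets are, which is why this kind of approximation is attractive when the datasets no longer fit comfortably in memory. The Jaccard distance is simply one minus the similarity.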
Completion method 1
The course will involve exercises and Teams discussions

Independent study

Online or distance self learning
7.1.2026–30.5.2026
Active in period 3 (1.1.2026–1.3.2026)
Active in period 4 (2.3.2026–31.5.2026)