Course unit, academic year 2025–2026
DATA.ML.370
Mining of Big Datasets, 5 cr
Tampere University
Teaching periods
Active in period 3 (1.1.2026–1.3.2026)
Active in period 4 (2.3.2026–31.5.2026)
Code
DATA.ML.370
Language of instruction
English, Finnish
Academic years
2024–2025, 2025–2026, 2026–2027
Level of the course unit
Intermediate studies
Grading scale
General scale, 0-5
Responsible person
Responsible teacher:
Tarmo Lipping
Responsible teacher:
Jari Turunen
Responsible organisation
Faculty of Information Technology and Communication Sciences 100 %
Coordinating organisation
Computing Sciences Studies 100 %
Core content
- The concept and terminology of data mining
- Understanding the principles of processing large, unstructured datasets
- Basic methods and algorithms for the analysis of large datasets
- Common tasks of mining large datasets, such as similarity analysis, link analysis, finding frequent itemsets, and clustering
- Common applications of mining large datasets, such as recommendation systems, web search, and mining of social network graphs
Complementary knowledge
- Mining data streams
- Special challenges of processing large datasets: memory usage and data formats
- Deep learning methods in mining large datasets
Specialist knowledge
- MapReduce algorithm
- Locality-sensitive hashing
- Distance measures
- More advanced algorithms for mining large datasets
Completion method 1
The course will involve exercises and Teams discussions.
Independent study
Online or distance self-learning
07.01.2026 – 30.05.2026
Active in period 3 (1.1.2026–1.3.2026)
Active in period 4 (2.3.2026–31.5.2026)