// Kajaani University of Applied Sciences
DATA โ AI Engineering
We train engineers who build the full pipeline โ from raw data ingestion to production AI systems. Linux-native. CSC-powered. FOSS-first.
4
Years of Study
CSC
Supercomputers
100%
Linux Environment
FOSS
Open Source First
What You'll Learn
Data Engineering Fundamentals
ETL pipelines, SQL/NoSQL, data lakes, Apache Spark on Linux.
Machine Learning in Practice
Scikit-learn, PyTorch, model training, evaluation, and deployment.
HPC & CSC Computing
Slurm job scheduling, Puhti/Mahti/LUMI access, large-scale training.
MLOps & Production AI
CI/CD for ML, monitoring, model registries, containerization.
Tools of the Trade
Linux
Native environment for everything
Python
Primary language from day one
PyTorch
Deep learning framework
CSC / HPC
Supercomputers
Docker
Containerize all the things
Git / GitHub / GitLab
FOSS collaboration workflow
Terraform
IaC โ Infrastructure as Code
Jupyter
Interactive data science
Train on LUMI
Our students get hands-on access to CSC's national HPC infrastructure โ including LUMI, one of the world's most powerful supercomputers. Real GPU clusters, real scale.
CSC Tools & Access โ$ squeue --me
JOBID PARTITION NAME ST TIME
12801 gpu train_llm R 2:14:07
12802 cpu preprocess RUN 0:03:21
$ sinfo | grep lumi
gpu-lumi up 12:00:00 idle lumi[001-128]
_