Projects

Real projects built by students and staff. FOSS where possible; check the repos.

FinBERT Fine-tuning on LUMI

โ— active

Fine-tuning a Finnish BERT model for domain-specific named-entity recognition (NER) on CSC LUMI-G nodes (AMD MI250X GPUs). Multi-GPU training with PyTorch DistributedDataParallel (DDP), logged to Weights & Biases.

PyTorch · HuggingFace · LUMI · NLP
3 contributors → repo

Real-time Sensor Data Pipeline

โ— active

An Apache Kafka + Spark Streaming pipeline that ingests IoT sensor data from the KAMK campus building and flags anomalies with Isolation Forest.

Kafka · Spark · Python · IoT
4 contributors → repo

Open LLM Benchmark Suite

✓ done

Benchmarking open-source LLMs (Mistral, LLaMA, Phi) on Finnish language tasks. Results are published as an open dataset.

LLM · Benchmark · Finnish NLP · FOSS
2 contributors → repo

MLOps Reference Platform

✓ done

A containerized, self-hosted MLOps stack using MLflow, DVC, and Prefect, deployable on any Linux server or CSC cPouta.

MLOps · Docker · MLflow · DVC
2 contributors → repo

Nordic Energy Demand Forecasting

✓ done

Time-series forecasting of Finnish electricity demand using Temporal Fusion Transformers. Data comes from Fingrid's open API.

Time Series · PyTorch · Forecasting
3 contributors → repo

CSC Jupyter Environment Templates

โ— active

Ready-to-use Jupyter environment configs for CSC Notebooks, pre-installed with common data science and ML libraries.

CSC · Jupyter · FOSS · Tools
1 contributor → repo

Have a project idea? Talk to us.

Get in Touch →