Blog

Less polished, a bit more opinionated, and with more emojis. 🙃

2025

2024

September 11, 2024 European Tweeties
Creating language models for all 24 EU languages
August 30, 2024 Setting up a decent SentencePiece tokenizer
Reasonable monolingual tokenization from noisy data
January 05, 2024 Evaluating Dutch LLMs
SQuAD-NL

2023

December 27, 2023 Dutch Chat Toolkit
Creating retrieval-augmented chatbots
November 04, 2023 Building a language learning app
Day 3: Prompting and basic UI
November 01, 2023 Building a language learning app
Day 2: setting up the app
October 30, 2023 Building a language learning app
Day 1: planning
October 24, 2023 Migrating from HuggingFace AdamW
Drop-in replacement optimizer with learning schedule

2022

November 10, 2022 Updating RobBERT (part 2)
Bringing a language model to 2022
August 12, 2022 Updating RobBERT
Bringing a language model to 2022