Working on large language models and fairness in AI systems.
Led R&D of translation and explainability techniques, demonstrated to EU parliament. Contributed to research on bias mitigation and LLM steering. Developed various inference-related features, e.g. multi-node Deepseek over Infiniband.
Led workshops on the workings of LLMs and AI safety for clients, e.g. KBC senior management. Developed LLMQ Python package for scheduling large LLM inference jobs. NLP Expert for the EU AI Office's code of practice for foundation models.
Led development of tokenization research and released BübleLM-2B with HU Berlin. Research visit to Alan Akbik's group at HU Berlin. Lecturer for the NLP course at KULAK. Member of the KU Leuven GenAI committee.
Led AurA project on controlling LLMs for toxicity reduction, published at ICML 2024. Worked with state-of-the-art models: Falcon 7B/40B, OPT, MPT, Mistral. Collaborated with researchers in Barcelona, Paris and Cambridge.
PhD on fairer foundation models at the DTAI lab with Prof. Luc De Raedt and Prof. Bettina Berendt. Developed RobBERT, a Dutch language model ranking in the global top 80 on Hugging Face. Experience with major language models (Falcon, OPT, MPT) and research visits to MilaNLP and Weizenbaum Institute.
Conducted research on fault analysis in a distributed stream processing system.
Thesis on bias mitigation in large language models, supervised by Prof. Luc De Raedt and Prof. Bettina Berendt.
Focus on machine learning and computer vision. Thesis on obstacle detection with Prof. Johan Suykens and CNH Industrial.
Focus on distributed systems. Thesis on failure detection in stream processing at Nokia Bell Labs.
Organized the LLM session at the Flanders AI Research Day. Received Best Student Paper at FAccT 2022. Previously lecturer of NLP (2024) and TA for AI fundamentals courses (2019-2024).
Active in the research community as program committee member and reviewer for top-tier conferences.
Venues: FAccT (2020, 2022), ACL (2023), IJCAI (2022, 2023), ECML-PKDD, ARR (2022-present)
Contributing expertise as NLP Expert for the EU AI Office's code of practice and member of the KU Leuven GenAI expert committee. Industry collaborations with VDAB and language learning centers.
Currently (co-)supervising Thomas Bauwens on tokenization and cross-lingual transfer for his PhD. Previously supervised 15+ master's theses, leading to an ICCC'21 publication and collaborations with TU Berlin and Acapela on resume writing and prosody prediction.