Every Accent Counts: Flemish Citizens Crowdsource Speech to Train AI

KU Leuven and Scivil launch Maarallee to collect Flemish speech and cut ASR errors from NL-trained models. Record sentences and verify transcripts at maarallee.be.

Categorized in: AI News Science and Research

Published on: Oct 18, 2025

Help AI catch Flemish accents: KU Leuven and Scivil launch Maarallee

KU Leuven and the Flemish Knowledge Centre for Citizen Science (Scivil) have launched Maarallee, a citizen science platform that collects speech from across Flanders to improve automatic speech recognition (ASR). The goal is simple: reduce errors caused by models trained mostly on data from the Netherlands.

If your GPS struggles with "Turnhout" in a Kempen accent, this is the fix. Record short sentences, help validate transcriptions, and move ASR closer to real Flemish speech coverage.

Why a Flemish dataset matters

Most commercial ASR models lean on Dutch data from the Netherlands. That skews pronunciation, word usage, and prosody away from Flemish speech. The result: higher error rates, frustrating user experiences, and avoidable accessibility gaps.

Broad, diverse speech data changes that. As the project partners note, without it, AI stays useful for a few and unreliable for many.

How Maarallee works

Contributors read short sentences on maarallee.be.
The platform auto-transcribes each clip; volunteers correct the text.
KU Leuven uses the cleaned data to build and evaluate a Flemish-focused ASR model.

Every accent, region, and age group adds signal. Even small batches of clips help close dialect gaps that standard datasets miss.

Where this matters now

Education and media: automatic subtitling that reflects Flemish speech.
Healthcare: hands-free note-taking and dictation that hear patients and clinicians correctly.
Industry and services: voice interfaces that work on the shop floor and in vehicles.
Accessibility: voice-controlled tools that respond reliably for people with disabilities.

For researchers and R&D teams

Expect gains in word error rate on Flemish benchmarks by expanding phonetic and dialect coverage.
Use the data to stress-test ASR across regions (e.g., Kempen, West-Flemish, Antwerp) and age cohorts.
Pair contributions with clear evaluation sets to quantify accent-specific improvements.

Participation is open to everyone. Add your voice, review a few transcripts, and help build ASR that works for all Flemish speakers.

Get involved: Record at maarallee.be. Learn more about the academic partner at KU Leuven.

Get Daily AI News

Your membership also unlocks:

700+ AI Courses

700+ Certifications

Personalized AI Learning Plan

6500+ AI Tools (no Ads)

Daily AI News by job industry (no Ads)

Advertisement

Every Accent Counts: Flemish Citizens Crowdsource Speech to Train AI

Help AI catch Flemish accents: KU Leuven and Scivil launch Maarallee

Why a Flemish dataset matters

How Maarallee works

Where this matters now

For researchers and R&D teams

Related AI News for Science and Research

AI boosts individual research output-yet may slow scientific progress

Why AI Is Now Essential to Research-and National Competitiveness

AI Can Crunch Data, But Science Still Needs Humans

BD Unveils Research Cloud 7.0 and Horizon Panel Maker to Speed Immunology and Cancer Research

About Complete AI:

Latest AI News for your Job:

Courses by AI Skill:

Courses by Job Field:

Courses by AI Company:

AI Tools for your Job:

AI Tools by Type:

AI Certifications by Skill:

AI Certifications by Job Field:

AI Certifications by Company: