SaameAI – Speech recognition for the Saami languages

Project

SaameAI – Speech recognition for the Saami languages is a co-project of Aalto University and the University of Lapland. The aim of the project is to develop an automatic speech recognition (ASR) system for the Saami languages. First, the system is developed for North Saami, but in the future we will also expand for other Saami languages. The project is estimated to last for three years, and it has received funding from the Finnish Cultural Foundation. The University of Helsinki is collaborating with the project.

Speech recognition tools for the Saami languages

There is a growing demand for different technological solutions and tools among the Saami languages. An automatic speech recognition, or speech-to-text, tool can be beneficial, for instance, in fields such as public services, education, and research. Transcribing speech to text manually is a slow and time-consuming job that often requires specialized skills or training. Automatic tools designed for this task will spare time and resources.

Developing a speech recognition system for the Saami languages is also a question of linguistic equity. Language technology solutions have been developed for only a few of the world’s languages, which also happen to be the ones with the most resources – both material and human – available. While smaller and endangered languages do not have the same resources, the benefit that different language technology tools may offer for these languages is significant. Besides the time and resources saved by automating some of the work stages, language technology tools may offer possibilities or support to use languages in new and different settings.

Updates and progress

A speech recognition model has been built at Aalto University using data provided by the Finnish National Audiovisual Institute (KAVI) and the Norwegian Saami Parliament. A demo version of the model is publicly available at HuggingFace .

Anyone may test the tool on this website, but please be aware that we do not recommend uploading any private or sensitive data on this page, since we cannot ensure information security on this page. An ASR program that runs in an information secure environment is currently under development at Aalto University.

The accuracy of the model will be further improved in the future by training it with more data. The training and fine-tuning require audio material with corresponding text.

Contact

Project manager

Mikko Kurimo

Aalto-university, Professor, Information and communication engineering, speech recognition

E-mail mikko.kurimo@aalto.fi

Phone +358503476221

Project partner

Pigga Keskitalo

University of Lapland

Professor, Education sciences

E-mail
pigga.keskitalo@ulapland.fi

Doctoral researcher

Kristiina Ojala

University of Helsinki

Doctoral researcher, Language studies program

Partners and collaborators

Mikko Kurimo (Aalto-university), project leader

Pigga Keskitalo (University of Lapland), project partner

Kristiina Ojala (University of Helsinki), doctoral researcher

Yaroslav Getman (Aalto-university), doctoral researcher

Riho Grünthal (University of Helsinki), PhD supervisor