Who are you?
I am Tuukka Törö. I have been working as a doctoral researcher at the University of Helsinki’s Phonetics and Speech Synthesis Research Group since the beginning of this year. My background is in linguistics and phonetics, and I hold a BA in English studies from the University of Malmö and an MA in Phonetics from the University of Helsinki. After writing my Master’s thesis on controlling speaking styles in speech synthesis, I spent some time working with YLE on AI radio projects where we created synthetic ‘actors’ for radio features.
In my current position, I work in the Academy of Finland funded project
While my focus has become more technically inclined, the primary inspiration behind my work stems from a fascination with how social structures influence speech, from macro level variation to how people convey social dynamics in specific contexts.
What is your research topic?
Currently I am researching macro level language variation using neural-network models built for TTS and speech recognition. While the models’ original purpose is in technological applications, they enable us to analyze speech in new ways. As the models are trained with large amounts of audio, they can be used to model ’wild’ data of varying quality on a large scale instead of picking apart specific acoustic features from small, professionally recorded datasets.
Within the academy project, my aim is to tie together sociolinguistic variation with the predictive processing and speech synthesis side of things. Hopefully, in the coming years we will learn something new about how humans perceive social cues in speech and how high-level social predictions can be utilized to improve speech synthesis.
How is your research related to Kielipankki?
I often use corpora from Kielipankki such as
Recent publications
Törö, T., Suni, A. and Šimko, J. (2024).
Šimko, J., Törö, T., Vainio M., and Suni, A. (2023).
Other related work
(AI radio drama, Yle, 2023) (AI radio drama, Yle, 2023)
Corpora
The