About this project
Digi Sami is a research project at the University of Helsinki which aims to support content generation in less-resourced languages with the help of language technology.
We are currently developing a version of WikiTalk in the Sami language. WikiTalk is a spoken dialogue system which allows the user to use Wikipedia by having a conversation with a humanoid robot. Our partners at Aalto University are developing the necessary Sami speech technology components for the robot. We have collected and are annotating a Sami spoken language corpus.
You can contact the project leader Kristiina Jokinen by emailing Kristiina dot Jokinen at helsinki dot fi.
People working on this project
Principal investigator and the project leader
Adjunct professor at the Institute of Behavioral Sciences, University of Helsinki.
Adjunct professor at the Department of Modern Languages, University of Helsinki.
Doctoral student at the Department of Modern Languages, University of Helsinki.
13-16, Saariselkä, Finland. The whole team attended the Seventh International Workshop on Spoken Dialogue Systems (IWSDS 2016) and gave three presentations: Towards SamiTalk: a Sami-speaking Robot linked to Sami Wikipedia; Internationalisation and localisation of spoken dialogue systems; and DigiSami and Digital Natives: Interaction Technology for the North Sami language.
6-10, Dresden, Germany. Kristiina Jokinen attended Interspeech 2015, where she gave a talk on Multimodal engagement in the WikiTalk robot application at the International Workshop on Speech Robotics (IWSR 2015).
2-4, Prague, Czech Republic. Kristiina Jokinen and Graham Wilcock attended SIGDIAL 2015, where they gave a presentation on Multilingual WikiTalk: Wikipedia-based talking robots that switch languages.
17-21, Oulu, Finland. Ilona Rauhala gave a presentation at CIFU XII on The variation of adjective attributes in Saami.
Helsinki, Finland. Kristiina Jokinen held an intensive course on human-robot interaction.
11-13, Antalya, Turkey. Niklas Laxström attended EAMT2015 and presented Content Translation: Computer assisted translation tool for Wikipedia articles.
6, Helsinki, Finland. Kristiina Jokinen gave a presentation, Social Robotics – from Fancy Interface to Interactive Agents, at the POP-ROBOTICS Helsinki Think Tank event.
15, Kitakyushu, Japan. Kristiina Jokinen gave a presentation, Multimodal Interaction in the Nao WikiTalk Application, at Waseda University Kitakyushu Campus.
25-28, Shonan, Japan. Kristiina Jokinen was an invited participant at the NII Shonan Meeting Seminar The Future of Human-Robot Spoken Dialogue: from Information Services to Virtual Assistants.
11, Helsinki, Finland. Kristiina Jokinen and Graham Wilcock demonstrated the Nao robot at the Digital.Finland.Go! – Boosting Business with Digitalisation event at Finlandia Hall, where three new Tekes programmes utilizing digitalisation were launched.
Kyoto, Japan. Graham Wilcock was a Visiting Professor at Doshisha University in March–April.
16, Tromsø, Norway. Two of our papers were accepted to the First International Workshop on Computational Linguistics for Uralic Languages.
Kristiina was interviewed on Radio Vega: God morgon Svenskfinland.
19, Finland. Kristiina and Graham were featured in a two-page article, Moro, sanoi robotti, in the Yliopisto magazine.
22-28, Helsinki, Finland. Graham, Kristiina and Niklas attended the Finnish Robotics Week event (Robottiviikko 2014) and presented MoroTalk, Finnish WikiTalk and English WikiTalk with Nao robots. We were interviewed and recorded by Iltalehti and Robottiviikko, and Graham appeared in a news article published in Turun Sanomat.
18-22, Turku, Finland. Niklas attended the Langnet Summer School.
26-31, Reykjavik, Iceland. Kristiina attended the 9th Language Resources and Evaluation Conference (LREC-2014).
14-16, St Petersburg, Russia. Kristiina attended the 4th International Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU'14).
The DigiSami project was mentioned in the Sami language news (Article on YLE website).
Data collection events were organised in Enontekiö, Kautokeino, Inari, Utsjoki, and Ivalo. Article in school blog.
18-20, Napa, CA, USA. Niklas attended IWSDS 2014 and presented a joint paper by Laxström, Jokinen and Wilcock, Situated Interaction in a Multilingual Spoken Information Access Framework.
2-5, Budapest, Hungary. Graham attended the CogInfoCom 2013 conference and presented a paper Towards Cloud-based Speech Interfaces for Open-Domain CogInfoCom Systems.
9-13, Sydney. Kristiina co-chaired the ACM-ICMI 2013 conference and presented a joint paper by Wilcock and Jokinen at the main conference. She also co-authored a paper with Lim Kai Keats, Max Friedrich and Jenny Radun at the GAZE-IN workshop related to the main conference.
29, Yle. Kristiina was a panelist in the Yle Robottiviikko (robot week) panel: Robotit: ohjelmoitavasta oppijaksi.
17–18, Valletta, Malta. Kristiina was an invited speaker at the 1st European Symposium on Multimodal Interaction. She gave a keynote, Studying multimodal communication with eye-tracking.
14, Nagoya, Japan. Kristiina and Graham gave a half-day tutorial at IJCNLP 2013 on Open-domain Conversations with Humanoid Robots.
Hokkaido, Japan. Graham was on a bilateral exchange visit to the University of Hokkaido.
27–29, Inari, Finland. Kristiina attended the Oovtâst – Together conference and gave a talk, Finno-Ugric Digital Natives - prospects for open-domain interaction with online content.
Our research partner in the project Finno-Ugric Digital Natives: Linguistic support for Finno-Ugric digital communities in generating online content is the Department of Language Technology, Research Institute for Linguistics, Hungarian Academy of Sciences, Budapest, Hungary, led by Tamas Varadi.
In Finland, we collaborate with Mikko Kurimo from Aalto University and his group on Sami speech technology.
We also collaborate with Jack Rueter on small Finno-Ugric languages.
Jokinen, Kristiina, Trung Ngo Trong and Ville Hautamäki. Variation in Spoken North Sami Language, Interspeech, San Francisco, USA, 2016 (accepted). Fulltext.
Trong, Trung Ngo, Ville Hautamäki and Kong Aik Lee. Deep Language: a comprehensive deep learning approach to end-to-end language recognition, Speaker Odyssey, Bilbao, Spain, 2016 (accepted). Fulltext.
Jokinen, Kristiina, Katri Hiovain, Niklas Laxström, Ilona Rauhala, and Graham Wilcock. DigiSami and Digital Natives: Interaction Technology for the North Sami language, International Workshop on Spoken Dialogue Systems (IWSDS 2016), 2016. Fulltext.
Wilcock, Graham, Niklas Laxström, Juho Leinonen, Peter Smit, Mikko Kurimo, and Kristiina Jokinen. Towards SamiTalk: a Sami-speaking robot linked to Sami Wikipedia, International Workshop on Spoken Dialogue Systems (IWSDS 2016), 2016. Fulltext.
Laxström, Niklas, Graham Wilcock and Kristiina Jokinen. Internationalisation and localisation of spoken dialogue systems, International Workshop on Spoken Dialogue Systems (IWSDS 2016), 2016. Fulltext.
Rauhala, Ilona. The variation of adjective attributes in Saami, XII International Congress for Finno-Ugric Studies, 2015. Slides.
Wilcock, Graham, and Kristiina Jokinen. Multilingual WikiTalk: Wikipedia-based talking robots that switch languages, 16th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL 2015), 2015.
Jokinen, Kristiina. Multimodal engagement in the WikiTalk robot application, International Workshop on Speech Robotics (IWSR 2015), Dresden, 2015.
Laxström, Niklas, Pau Giner, and Santhosh Thottingal. Content Translation: Computer assisted translation tool for Wikipedia articles, 18th Annual Conference of the European Association for Machine Translation, 194–197, 2015. Fulltext.
Grönroos, Stig-Arne, Kristiina Jokinen, Katri Hiovain, Mikko Kurimo, and Sami Virpioja. Pohjoissaamen morfologisen segmentaation aktiivinen oppiminen pienin resurssein, XXIX Fonetiikan päivät, 2015.
Laxström, Niklas, and Antti Kanner. Multilingual Semantic MediaWiki for Finno-Ugric dictionaries, First International Workshop on Computational Linguistics for Uralic Languages, 75–86, 2015. Fulltext.
Grönroos, Stig-Arne, Kristiina Jokinen, Katri Hiovain, Mikko Kurimo, and Sami Virpioja. Low-Resource Active Learning of North Sámi Morphological Segmentation, First International Workshop on Computational Linguistics for Uralic Languages, 20–33, 2015. Fulltext.
Jokinen, Kristiina. Open-domain Interaction and Online Content in the Sami Language, Proceedings of the Language Resources and Evaluation Conference (LREC 2014), 2014. Fulltext.
Jokinen, Kristiina, and Graham Wilcock. Community-based Resource Building and Data Collection, Proceedings of 4th International Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU 2014), 201–206, 2014. Fulltext.
Laxström, Niklas, Kristiina Jokinen, and Graham Wilcock. Situated Interaction in a Multilingual Spoken Information Access Framework, Proceedings of 5th International Workshop on Spoken Dialog Systems, 161–171, 2014.