Digi
Sami
Sápmi
Saame

About this project

Digi Sami is a research project at the University of Helsinki that aims to support content generation for less-resourced languages with the help of language technology.

We are currently developing a version of WikiTalk in the Sami language. WikiTalk is a spoken dialogue system that allows the user to use Wikipedia by having a conversation with a humanoid robot. Our partners at Aalto University are developing the necessary Sami speech technology components for the robot. We have collected and are annotating a Sami spoken language corpus.

A new book has been published: Dialogues with Social Robots – Enablements, Analyses, and Evaluation. Lecture Notes in Electrical Engineering, Vol 427, Springer, DOI: 10.1007/978-981-10-2585-3

Contact

You can contact the project leader Kristiina Jokinen by emailing Kristiina dot Jokinen at helsinki dot fi

People working on this project

Kristiina Jokinen

Principal investigator and the project leader
Adjunct professor at the Institute of Behavioral Sciences, University of Helsinki.
Research activities

Graham Wilcock

Principal investigator
Adjunct professor at the Department of Modern Languages, University of Helsinki.
Research activities

Niklas Laxström

Doctoral student at the Department of Modern Languages, University of Helsinki.

Katri Hiovain

Research assistant

Trung Ngo Trong

Research assistant

Former members

Ilona Rauhala
Hanna Kellokoski

Research assistant

Jani Koskinen

Research assistant

Event calendar

August 2017

20-24, Stockholm, Sweden. Kristiina is the area chair for Spoken Dialog Systems and Analysis of Conversation and co-organiser for the Special Session Digital Revolution for Under-resourced Languages (DigRev-URL) at Interspeech 2017

15-17, Saarbrücken, Germany. Kristiina is General Chair, together with Manfred Stede, of SIGDIAL 2017.

February 2017

Trung Ngo Trong finished his M.Sc. thesis, entitled A comprehensive deep learning approach to end-to-end language identification (Fulltext), and applied for PhD status with a thesis plan entitled End-to-end deep learning for interactive multimodal learning.

January 2017

The book Dialogues with Social Robots – Enablements, Analyses, and Evaluation (Springer) was published.

Helsinki, Finland. Kristiina held an intensive course, Robo-Ope (Robot Teacher), at the Department of Teacher Education at the University of Helsinki.

December 2016

11-16, Osaka, Japan. Graham and Kristiina attended COLING 2016, where they gave a talk Double Topic Shifts in Open Domain Conversations: Natural Language Interface for a Wikipedia-based Robot Application at the COLING Workshop Open Knowledge Base and Question Answering (OKBQA), and a demonstration What topic do you want to hear about? A bilingual talking robot using English and Japanese Wikipedias.

November 2016

Kyoto, Japan. Graham Wilcock was a Visiting Professor at Doshisha University in November–December. He was the external evaluator for Ms Xiaoyun Wang's PhD thesis.

Yokosuka, Japan. Kristiina visited NTT Media Intelligence Labs and gave a talk What Topic Do You Want To Hear About? Topic Shifts in Open Domain Conversations.

Saitama, Japan. Kristiina visited KDDI Research Labs and gave a talk Engagement and Social Interaction in Human-Robot Interactions.

12-16, Tokyo, Japan. Kristiina also attended the 18th ACM International Conference on Multimodal Interaction (ICMI 2016) and gave a talk Body movements and laughter recognition: experiments in first encounter dialogues at the satellite workshop Multimodal Analyses enabling Artificial Agents in Human–Machine Interaction (MA3HMI).

September 2016

29-30, Copenhagen, Denmark. Kristiina was the organiser of The 4th European and 7th Nordic Symposium on Multimodal Communication (MMSYM 2016), where she gave a talk Laughing and co-construction of common ground in human conversations.

20-23, Los Angeles, USA. Kristiina attended the 16th Conference on Intelligent Virtual Agents (IVA 2016) and gave a talk Automated Questions for Chat Dialogues with a Student Office Virtual Agent at WOCHAT, the Workshop on Chatbots and Conversational Agents.

13-15, Los Angeles, USA. Kristiina attended SIGDial 2016 and the Young Researchers’ Roundtable on Spoken Dialog Systems (YRRSDS 2016).

8-12, San Francisco, USA. Kristiina attended Interspeech 2016 where she presented the paper Variation in Spoken North Sami Language together with Ville Hautamäki.

August 2016

Trung and Kristiina attended the eNTERFACE Summer School in Enschede, The Netherlands, and participated in the project The Roberta IRONSIDE project: A dialog capable humanoid personal assistant in a wheelchair for dependent persons. Fulltext.

Kristiina was also an invited speaker and gave a talk Social Engagement via Eye-Gaze in Multimodal Robot Applications.

The work at the Summer School resulted in a paper LifeLine Dialogues with Roberta at the Conference Future and Emerging Trends in Language Technology, Machine Learning and Big Data 2016 (FETLT’16) in Seville, Spain.

June 2016

21-24, Bilbao, Spain. Trung Ngo Trong attended Odyssey 2016: The Speaker and Language Recognition Workshop and presented a paper Deep Language: a comprehensive deep learning approach to end-to-end language recognition

May 2016

23-28, Portorož, Slovenia. Kristiina attended the 10th Language Resources and Evaluation Conference (LREC-2016) and presented a paper Acoustic Features of Different Types of Laughter in North Sami Conversational Speech at the LREC Workshop Just talking – casual talk among humans and machines.

January 2016

13-16, Saariselkä, Finland. The Seventh International Workshop on Spoken Dialogue Systems (IWSDS 2016) was held in Saariselkä, Finland. The whole team attended the meeting and gave three presentations: Towards SamiTalk: a Sami-speaking Robot linked to Sami Wikipedia; Internationalisation and localisation of spoken dialogue systems; and DigiSami and Digital Natives: Interaction Technology for the North Sami language.

October 2015

Kristiina visited Moscow State Linguistic University and gave an invited talk Engagement and Autonomous Robot Agents – Social Interaction in the Wikitalk Application at the International Symposium Gesture research applied to human-computer interaction: The case of robots and virtual agents.

September 2015

6-10, Dresden, Germany. Kristiina Jokinen attended Interspeech 2015, where she gave a talk on Multimodal engagement in the WikiTalk robot application at the International Workshop on Speech Robotics (IWSR 2015).

2-4, Prague, Czech Republic. Kristiina Jokinen and Graham Wilcock attended SIGDIAL 2015, where they gave a presentation on Multilingual WikiTalk: Wikipedia-based talking robots that switch languages.

August 2015

17-21, Oulu, Finland. Ilona Rauhala gave a presentation at CIFU XII on The variation of adjective attributes in Saami.

June 2015

Helsinki, Finland. Kristiina Jokinen held an intensive course on human-robot interaction.

May 2015

11-13, Antalya, Turkey. Niklas Laxström attended EAMT2015 and presented Content Translation: Computer assisted translation tool for Wikipedia articles.

6, Helsinki, Finland. Kristiina Jokinen gave a presentation Social Robotics – from Fancy Interface to Interactive Agents at the POP-ROBOTICS Helsinki Think Tank event.

April 2015

15, Kitakyushu, Japan. Kristiina Jokinen gave a presentation Multimodal Interaction in the Nao WikiTalk Application at Waseda University Kitakyushu Campus.

March 2015

25-28, Shonan, Japan. Kristiina Jokinen was an invited participant at the NII Shonan Meeting Seminar The Future of Human-Robot Spoken Dialogue: from Information Services to Virtual Assistants.

11, Helsinki, Finland. Kristiina Jokinen and Graham Wilcock demonstrated the Nao robot at the Digital.Finland.Go! – Boosting Business with Digitalisation event at Finlandia Hall, where three new Tekes programmes utilising digitalisation were launched.

Kyoto, Japan. Graham Wilcock was a Visiting Professor at Doshisha University in March–April.

January 2015

16, Tromsø, Norway. Two of our papers were accepted to the First International Workshop on Computational Linguistics for Uralic Languages.

December 2014

Kristiina was interviewed on Radio Vega: God morgon Svenskfinland.

19, Finland. Kristiina and Graham were featured in a two-page article Moro, sanoi robotti in the Yliopisto magazine.

November 2014

22-28, Helsinki, Finland. Graham, Kristiina and Niklas attended the Finnish Robotics Week event (Robottiviikko 2014) and presented MoroTalk, Finnish WikiTalk and English WikiTalk with Nao robots. We were interviewed and recorded by Iltalehti and Robottiviikko and Graham made an appearance in a news article published in Turun Sanomat.

16, Istanbul, Turkey. Graham attended the Multimodal, Multi-Party, Real-World Human-Robot Interaction workshop (HRI) at the 16th ACM International Conference on Multimodal Interaction (ICMI 2014).

August 2014

18-22, Turku, Finland. Niklas attended the Langnet Summer School.

May 2014

26-31, Reykjavik, Iceland. Kristiina attended the 9th Language Resources and Evaluation Conference (LREC-2014).

14-16, St Petersburg, Russia. Kristiina attended the 4th International Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU'14).

February 2014

The DigiSami project was mentioned in the Sami language news (Article on YLE website).

Data collection events were organised in Enontekiö, Kautokeino, Inari, Utsjoki, and Ivalo. Article in school blog.

January 2014

18-20, Napa, CA, USA. Niklas attended IWSDS 2014 and presented a joint paper by Laxström, Jokinen and Wilcock Situated Interaction in a Multilingual Spoken Information Access Framework.

December 2013

2-5, Budapest, Hungary. Graham attended the CogInfoCom 2013 conference and presented a paper Towards Cloud-based Speech Interfaces for Open-Domain CogInfoCom Systems.

9-13, Sydney, Australia. Kristiina co-chaired the ACM-ICMI 2013 conference and presented a joint paper by Wilcock and Jokinen at the main conference. She also co-authored a paper with Lim Kai Keats, Max Friedrich, and Jenny Radun at the GAZE-IN workshop associated with the main conference.

November 2013

29, Yle. Kristiina was a panelist in the Yle Robottiviikko (robot week) panel: Robotit: ohjelmoitavasta oppijaksi.

October 2013

17–18, Valletta, Malta. Kristiina was an invited speaker at the 1st European Symposium on Multimodal Interaction. She gave a keynote Studying multimodal communication with eye-tracking.

14, Nagoya, Japan. Kristiina and Graham gave a half-day tutorial at IJCNLP 2013 on Open-domain Conversations with Humanoid Robots.

Hokkaido, Japan. Graham was on a bilateral exchange visit to the University of Hokkaido.

September 2013

27–29, Inari, Finland. Kristiina attended the Oovtâst – Together conference and gave a talk Finno-Ugric Digital Natives - prospects for open-domain interaction with online content.

Partners

Our research partner in the project Finno-Ugric Digital Natives: Linguistic support for Finno-Ugric digital communities in generating online content is the Department of Language Technology, Research Institute for Linguistics, Hungarian Academy of Sciences, Budapest, Hungary, led by Tamas Varadi.

In Finland, we collaborate with Mikko Kurimo from Aalto University and his group on Sami speech technology.

We also collaborate with Jack Rueter on small Finno-Ugric languages.

Publications

Jokinen, K. and Wilcock, G. (2017, eds.) Dialogues with Social Robots – Enablements, Analyses, and Evaluation. Springer. http://www.springer.com/gb/book/9789811025846 DOI: 10.1007/978-981-10-2585-3

Grönroos, S-A., Hiovain, K., Smit, P., Rauhala, I., Jokinen, K., Kurimo, M., Virpioja, S. Low-Resource Active Learning of Morphological Segmentation, Northern European Journal of Language Technology (NEJLT), 2016.

Jokinen, K., Wilcock, G. Double Topic Shifts in Open Domain Conversations: Natural Language Interface for a Wikipedia-based Robot Application, COLING Workshop Open Knowledge Base and Question Answering (OKBQA), Osaka, Japan, December 2016.

Wilcock, G., Jokinen, K., Yamamoto, S. What topic do you want to hear about? A bilingual talking robot using English and Japanese Wikipedias, Proceedings of the COLING Demos, Osaka, Japan, December 2016.

Lopez, A., Ratni, A., Ngo Trong, T., Olaso, J.M., Montenegro, S., Lee, M., Haider, F., Schlögl, S., Chollet, G., Jokinen, K., Petrovska, D., Sansen, H., Torres, M. I. LifeLine Dialogues with Roberta, Proceedings of the Future and Emerging Trends in Language Technology, Machine Learning and Big Data 2016 (FETLT’16), Seville, Spain, 2016.

Jokinen, K., Ngo Trong, T., Wilcock, G. Body movements and laughter recognition: experiments in first encounter dialogues, Proceedings of the Workshop on Multimodal Analyses Enabling Artificial Agents in Human-Machine Interaction (MA3HMI '16), held at the 18th ACM International Conference on Multimodal Interaction (ICMI), Tokyo, Japan, 2016.

Ngo Trong, T., Hiovain, K., Jokinen, K. Laughing and co-construction of common ground in human conversations, The 4th European and 7th Nordic Symposium on Multimodal Communication, Copenhagen, Denmark, 2016.

Muron, M., Jokinen, K. Automated Questions for Chat Dialogues with a Student Office Virtual Agent, The Second Workshop on Chatbots and Conversational Agent Technologies, held at IVA 2016, Los Angeles, USA, 2016.

Jokinen, K., Ngo Trong, T., Hautamäki, V. Variation in Spoken North Sami Language, Interspeech 2016.

Hiovain, K., Jokinen, K. Acoustic Features of Different Types of Laughter in North Sami Conversational Speech, Proceedings of the LREC Workshop Just talking – casual talk among humans and machines, Portorož, Slovenia, 2016.

Jokinen, Kristiina, Trung Ngo Trong and Ville Hautamäki. Variation in Spoken North Sami Language, Interspeech, San Francisco, USA, 2016. Fulltext.

Trong, Trung Ngo, Ville Hautamäki and Kong Aik Lee. Deep Language: a comprehensive deep learning approach to end-to-end language recognition, Speaker Odyssey, Bilbao, Spain, 2016. Fulltext.

Jokinen, Kristiina, Katri Hiovain, Niklas Laxström, Ilona Rauhala, and Graham Wilcock. DigiSami and Digital Natives: Interaction Technology for the North Sami language, International Workshop on Spoken Dialogue Systems (IWSDS 2016), 2016. Fulltext.

Wilcock, Graham, Niklas Laxström, Juho Leinonen, Peter Smit, Mikko Kurimo, and Kristiina Jokinen. Towards SamiTalk: a Sami-speaking robot linked to Sami Wikipedia, International Workshop on Spoken Dialogue Systems (IWSDS 2016), 2016. Fulltext.

Laxström, Niklas, Graham Wilcock and Kristiina Jokinen. Internationalisation and localisation of spoken dialogue systems, International Workshop on Spoken Dialogue Systems (IWSDS 2016), 2016. Fulltext.

Rauhala, Ilona. The variation of adjective attributes in Saami, XII International Congress for Finno-Ugric Studies, 2015. Slides.

Wilcock, Graham, and Kristiina Jokinen. Multilingual WikiTalk: Wikipedia-based talking robots that switch languages, 16th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL 2015), 2015.

Jokinen, Kristiina. Multimodal engagement in the WikiTalk robot application, International Workshop on Speech Robotics (IWSR 2015), Dresden, 2015.

Laxström, Niklas, Pau Giner, and Santhosh Thottingal. Content Translation: Computer assisted translation tool for Wikipedia articles, 18th Annual Conference of the European Association for Machine Translation, 194–197, 2015. Fulltext.

Grönroos, Stig-Arne, Kristiina Jokinen, Katri Hiovain, Mikko Kurimo, and Sami Virpioja. Pohjoissaamen morfologisen segmentaation aktiivinen oppiminen pienin resurssein, XXIX Fonetiikan päivät, 2015.

Laxström, Niklas, and Antti Kanner. Multilingual Semantic MediaWiki for Finno-Ugric dictionaries, First International Workshop on Computational Linguistics for Uralic Languages, 75–86, 2015. Fulltext.

Grönroos, Stig-Arne, Kristiina Jokinen, Katri Hiovain, Mikko Kurimo, and Sami Virpioja. Low-Resource Active Learning of North Sámi Morphological Segmentation, First International Workshop on Computational Linguistics for Uralic Languages, 20–33, 2015. Fulltext.

Jokinen, Kristiina. Open-domain Interaction and Online Content in the Sami Language, Proceedings of the Language Resources and Evaluation Conference (LREC 2014), 2014. Fulltext.

Jokinen, Kristiina, and Graham Wilcock. Community-based Resource Building and Data Collection, Proceedings of 4th International Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU 2014), 201-206, 2014. Fulltext.

Laxström, Niklas, Kristiina Jokinen, and Graham Wilcock. Situated Interaction in a Multilingual Spoken Information Access Framework, Proceedings of 5th International Workshop on Spoken Dialog Systems, 161–171, 2014.

Videos

Towards SamiTalk: A Sami-speaking robot linked to Sami Wikipedia