Research Seminar in Language Technology - Spring 2019

Course Information

This is the time table of the general research seminar in language technology. Presentations will be given primarily by doctoral students but also other researchers in language technology including external guests

Place and Time:

Registration and Announcements:

  • Please subscribe to our e-mail list lt-research by sending a message to majordomo@helsinki.fi with the following text in the body of your e-mail:

    subscribe lt-research your.email@helsinki.fi

Tentative Schedule:

January, 9: Måns Huldén
Wednesday 9.1.2019 at 14-16
Metsätalo Lecture room 24 (Unioninkatu 40, 5th floor)
Title: Linguistics with Neural Networks
Abstract: Neural networks have led to previously unimaginable advances in computational linguistics. The main criticism against them from a linguistic point of view is that neural models - while fine for "language engineering tasks" - are thought of as being black boxes, and that their parameter opacity prevents us from discovering new facts about the nature of language itself, or specific languages. In this talk I will challenge that assumption to show that there are ways to uncover facts about language, even with a black box learner.
I will discuss specific experiments with neural models and sound embeddings that reveal new information about the organization of sound systems in human languages, give us insight into the limits of complexity of word-formation, give us models of why and when irregular forms - surely an inefficiency in a communication system - can persist over long periods of time, and reveal what the boundaries of pattern learning is (how much information do we minimally need to learn a grammatical aspect of language such as its word inflection or sentence formation).
BIO: Mans Hulden is an assistant professor in Linguistics at the University of Colorado Boulder where he also holds an affiliation with the Institute of Cognitive Science. His research focuses on developing computational methods to infer and model linguistic structure using varying degrees of prior linguistic knowledge, particularly in the domains of phonology and morphology. He has worked extensively with linguistic applications of finite state technology, modeling of linguistic theory, grammatical inference, and the development of language resources.
January, 17: group meeting / reading group
January, 24: Emily Öhman
Title: New developments with Sentimentator & teaching computational methods.
January, 31: Mikko Aulamo and Jörg Tiedemann
Fiskmö and OPUS: Tools and interfaces
OPUS is a large collection of freely available parallel corpora. Fiskmö is a project dedicated to collect Finnish/Swedish translations to build better machine translation for the two national languages of Finland. In this presentation we will introduce the resources and tools included and currently developed within both projects. In particular, we will present tools for finding, accessing and converting data in OPUS, we will present the on-going development of our resource repository software with its frontend and backend as well as the machine translation demonstrator that we created for the fiskmö project.
February, 7: Kimmo Koskenniemi
Title: Discovering morphophonemes mechanically from carefully chosen examples
More about the method: morphophon and the implementation on github
February, 14: reading group
February, 21: -- no seminar this week --
February, 28: Tommi Gröndahl
March, 7: Sami Virpioja
March, 14: reading group
March, 21: Danny Merkx (tbc)
March, 28: Visit by translation studies
April, 4: reading group
April, 11
April, 18
April, 25: reading group?
May, 2
May, 9: reading group
May, 16: Jouna Pyysalo (tbc)
May, 23: Ozan Çağlayan (tbc)
June, 6 (cancel because of NAACL?)
June, 13 - reading group