Group leaders:
When deliberating on policy decisions, lawmakers regularly look beyond their national borders to learn from positive and negative examples set by other states. In light of pressing global challenges, such as climate change or escalating geopolitical tensions, the need to account for the experiences, behaviors, and intentions of foreign nations in national policymaking is greater than ever. In this project, we will explore the global dimensions of national policy debates and investigate how comparisons shape the way decisions are advocated and reached on a national stage. The role that transnational policy comparisons play in the shaping of politics is extensively theorized and studied qualitatively, but quantitative insights on the practical role of references to foreign nations are relatively scarce.
To analyze the role of foreign nations in national policy debates, we will use the ParlaMint dataset, which includes parliamentary speeches from 29 European countries. Leveraging computational methods from natural language processing to network analysis, we will jointly leverage text data, structural data, and metadata to understand where, how, and why parliamentarians reference foreign nations in their speeches. Beyond gaining an overview of how European countries perceive other countries, we will also collaboratively select specific global challenges and research questions to investigate in greater depth.
As a multidisciplinary team, we will iteratively translate domain questions into computational tasks, integrating assumptions and concepts from the humanities and social sciences. This group is ideal for anyone interested in using text mining, network science, and large-scale data to study the global dimension of contemporary policy problems!
Computational tasks can include but are not limited to:
Humanities and social-science tasks include but are not limited to:
Data:
Read More:
Baden, C., Pipal, C., Schoonvelde, M., & van der Velden, M. A. G. (2022). Three gaps in computational text analysis methods for social sciences: A research agenda. Communication Methods and Measures, 16(1), 1-18.
Kukkonen, A., & Ylä-Anttila, T. (2020). The science–policy interface as a discourse network: Finland’s climate change policy 2002–2015. Politics and Governance, 8(2), 200-214.
Skubic J., Bruncrona, A., Angermeier, J., Evkoski, B., & Leiminger, L. Networks of Power – Gender Analysis in European Parliaments. [2022 DHH project]
Steinmetz, W. (2020). Introduction: Concepts and practices of comparison in modern history. In W. Steinmetz (Ed.), The Force of Comparison: A New Perspective on Modern European History and the Contemporary World (pp. 1–32). Berghahn Books.
Theocharis, Y., & Jungherr, A. (2021). Computational social science and the study of political communication. Political Communication, 38(1-2), 1-22.
Group leaders:
Britain of the late 18th and 19th centuries was characterised by rapid urbanisation, political unrest, emergence of modern police, economic inequality and dynamic print culture. In these conditions, newspaper reporting about crimes was on the rise, and public interest in the topic rose: This was the period of Jack the Ripper and Arthur Conan Doyle’s Sherlock Holmes. As the topic was equally capable of inciting fears of unsafety and providing entertainment ('true crime' is not a new thing) it was adaptive and adjustable to different contexts. At the same time, the British legal system created more consistent and systematic records about crime.
This group uses newspaper stories and court records to study how crimes and punishments were discussed and distributed during the Georgian and Victorian eras. Textual data from the Times enables the group to analyse how crimes were discussed and represented in a major newspaper of the era. The Old Bailey Records provide comprehensive information about court cases and those sentenced. Together, these resources can be used to ask various questions related to criminal activities and their representation. How were different kinds of crimes discussed? Was the tone in the newspapers moralizing, sensational, or both? Were some crimes common as court cases but unreported by the press? Did specific locations in the city of London develop associations with certain types of crime, can we see public perception of “good” and “bad” neighbourhoods as crimes are reported in the press?
Many backgrounds and interests can be put to good use in the group. The questions studied by the group should be interesting not only for historians and media researchers, but anyone interested in questions related to media representation and/or crime. Computational methods ranging from natural language processing to spatial and network analysis can be explored based on the interests of the participants.
Further reading:
Group leaders:
Post-WW2 Europe went through considerable changes in corporate and legal frameworks during the 1950s and 1960s. The UN, OECD and other institutions developed new legal frameworks to enable large-scale co-operation and integration of labor and management. This ethos of collective responsibility was reflected in corporate culture in what has been described as “stakeholder capitalism”. This is the idea that companies should benefit society at large, from employees and producers to customers and communities. By the 1970s-80s, market-oriented policies associated with leaders like Margaret Thatcher and Ronald Reagan and economists like Milton Friedman had become mainstream. Our team hypothesises that this change is visible in different linguistic layers of texts: how information was presented and how the meaning of various concepts changed over time. By analysing changes in laws and companies’ annual reports, we aim to understand how corporate and legal language shifted from stakeholder-focused to later profit-oriented discourse.
Further reading:
Group leaders:
This group uses a huge structured dataset on over 200 million books published all across Europe to study large-scale, long-term patterns of knowledge production from the 1400s to the present day.
Potential topics include:
For inspiration, the following show examples of what has already been done with various historical subsets of the data:
Group leaders:
In early 2026, the FILTER group finally managed to (partially) crack one of the roadblocks prohibiting data-centric study of one of the largest transcribed collections of oral poetry in existence in digital form. Through the use of LLMs, we now have English translations and linguistic analyses of all verses and words in the runosongs, which can be used as anchors to transcend the ever-present, multilayered dialectal, linguistic and poetic variation inherent in the data.
What this potentially enables is that, for the first time, it could be possible to, in a large-scale, data-centric manner, analyse the dynamics of the system as a whole – to see what parts are typically stable, what is improvised in each recital, and in general what the building blocks are from which the performers build their performances. This exploration is what this group will focus on.
Further reading: