Work packages

The work done in the project is divided into five working packages which are described below.

Throughout the project there will be monthly seminars of the research group, where the status of the workflow is under discussion. We will also invite external experts to Helsinki to give lectures on their work and discuss and advice ours.

1. Morphosyntactic variation as signal of bilingualism

This question is tackled by carrying out detailed morphosyntactic annotation (treebanking). This work package will be going on throughout the whole project.

2. Morpholocigal tagging of the papyrological corpus

The first months of the project are devoted to the preprocessing of the corpus into two layers. This step has to be performed before we can proceed with morphological parsing. Then the possibilities of morphological parsing are deciphered and Morphus is tested. If using Morpheus with some training does not work as wished, we have to devote more time in developing morphological parser. If the parser is functional quickly, we will continue with the phonological tools.

3. Phonological analyses and establishing search parametres

We will work on search parametres as regards to searching a) the whole corpus, b) regularisations and c) contact-induced phenomena.

4. Morphological and phonological quantitative research

The research relating to this work package is based on the new annotated corpora answering our research questions such as: 1) To what extent did the phonological changes cause the disappearance or transfromation of certain noun and verbal inflections? A related question is: can we show examples where morphosyntactic changes were not dependent on phonological changes?; and 2) Can we prove, with quantitative data, that some phonological changes were solely language internal, or that bilingualism was an important factor behind them?

5. Designing and building the online portal

During the last 2 years of the project, we will be working on with an online portal. Portal will include a) morphologically annotated corpus and a search engine, b) phonological search engine, and c) how they link to the treebanked data, which can also be searched using outside search engines. It is our aim also to build the portal so that it is synchronized with newly published papyri in the Papyrological Navigator.