Hola! I’m Cristina (for some Christine or Christina or Gracia Holdago) and I’m a young research engineer at the University of Poitiers and a PhD student at the University of Corsica, under the supervision of Stella Retali-Medori (University of Corsica) and co-supervision of Marianne Vergez-Couret (University of Poitiers).
My thesis focuses on low-resource languages and linguistic variation in the area of Natural Language Processing. In particular, for Corsican and Poitevin-Saintongeais, two regional languages of France. It is part of the ANR DIVITAL project, a projet aiming to provide linguistic resources and support with language technologies to several regional languages of France, including Alsatian and Occitan.
Earlier, I worked at Lattice (Paris), where I focused on syntactic parsing and lexicons for Old French within the Profiterole project. I joined the project during a master internship at the ATILF laboratory (Nancy) under the supervision of Mathieu Constant (University of Lorraine) and Alexey Lavrentiev (ENS Lyon). During that time, my work focused on the challenges of automatic lemmatization for an historical non-standardized language, which is described in the following article.
My research interests are primarily focused on low-resource and nonstandard languages, both from historical and contemporary perspectives. I am interested in other areas, such as automatic text simplification (the main subject of my master’s thesis), as well as misinformation and political polarization.
Since 2023, I have been teaching in Sciences du langage (Linguistics) at the University of Poitiers. My teaching covers core areas of linguistics (lexicology and pragmatics), as well as corpus-based approaches and methodological tools for linguists.
September 2025 — Co-organiser of the LLcD workshop Developing models for linguistic research, Lille, France
July 2025 — Attendance at the Lisbon Machine Learning School (LxMLS), Lisbon, Portugal
November 2024 — Oral presentation at the ViGRamm seminar, “L’analyse de la variation dialectale à l’ère de l’apprentissage automatique.”, Corte, Corse, France
July 2024 — Poster presentation at the UniDive 2024 Summer School, hosted by the Technical University of Moldova, “Beyond Standardization: Crafting a Comprehensive Annotated Corpus for Corsican and Poitevin-Saintongeais.”, Chișinău, Moldova
April 2024 — Participation in the Autogramm training session, Paris, France
February 2024 — Oral presentation at the AFIA Forum Industriel de l’IA (FIIA 2024): “Can LLMs be used to understand clinical notes better?”, Paris, France