Published December 6, 2019 | Version v1
Conference paper Open

Vir is to Moderatus as Mulier is to Intemperans. Lemma Embeddings for Latin

  • 1. Università Cattolica del Sacro Cuore, Milan, Italy

Description

This paper presents a new set of lemma embeddings for the Latin language. Embeddings are trained on a manually annotated corpus of texts belonging to the Classical era: different models, architectures and dimensions are tested and evaluated using a novel benchmark for the synonym selection task. A qualitative evaluation is also performed on the embeddings of rare lemmas. In addition, we release vectors pre-trained on the “Opera Maiora” by Thomas Aquinas, thus providing a resource to analyze Latin in a diachronic perspective.

Files

2019_Sprugnoli-et-al_Latin-Embeddings_CLiC-it.pdf

Files (262.7 kB)

Name Size Download all
md5:b5f3007008be955fcec1a4b927cbd36c
262.7 kB Preview Download

Additional details

Related works

Is part of
Book: 1613-0073 (ISSN)

Funding

LiLa – Linking Latin. Building a Knowledge Base of Linguistic Resources for Latin 769994
European Commission