Introducing Deep-Littré
A walkthrough of Deep-Littré: a deeply structured edition of Émile Littré's French dictionary, available as TEI Lex-0 XML and SQLite.
Corpus linguistics, literary translation, and the short stories of Guy de Maupassant
Guy de Maupassant wrote more than 300 short stories. English readers have access to good translations of a few dozen of them. Many of the rest are either still untranslated or available only in century-old public domain editions, often abridged or bowdlerized1 See my notes here, for example. .
This site follows my study of the stories in the original French — my findings, the computational tools I've built along the way, and my attempt to translate the stories into English, all of them, beginning with some of the lesser-knowns.
A walkthrough of Deep-Littré: a deeply structured edition of Émile Littré's French dictionary, available as TEI Lex-0 XML and SQLite.
Building a layered annotation pipeline that combines rule-based checks, cross-pipeline comparison, and local LLM classifiers to improve CoNLL-U accuracy on literary French.
A quick introduction to the Isosceles corpus for anyone interested in working with it, and as contextual grounding for later work.