Newer
Older
:last-update-label: Supported by Université de Strasbourg's IdEx program (Attractivité 2020 Call). Last updated on
:title: MeThAL: Towards a macroanalysis of theater in Alsatian
:imagesdir: img
:favicon: ./img/favicon.ico
:stylesdir: styles/
[opts=align-to-page]
//image::test_image_adoc_jumbotron.png["D'r Candidat",width=100%,align="center"]
//image::image_to_test_asciidoc_long_2.png["D'r Candidat",width=100%,align="center"]
image::image_methal_8.png["Methal project banner",width=100%,align="center"]
[discrete]
= MeThAL: Towards a macroanalysis of theater in Alsatian
The Alsatian dialect theater tradition is based predominantly on popular and humour genres. What are the *major trends* in this tradition, regarding dramatic technique and character types? What are its major geographic locations? To what an extent do Alsatian dialect plays document the sociolinguistic situation of the period when they were written?
In order to answer these questions and carry out quantitative *analyses*, a large corpus, representative of the tradition, is required, as well as corpus annotations for the relevant variables: geographical origin of plays and authors, places where the plays take place, their period and genre. Regarding the characters, attributes such as their profession, social status, origin, gender or age must be made available. It is also necessary to formalize the plays' structure, identifying act and scene divisions, characters' speech and stage directions.
Our project's first goal is creating such a *corpus*, encoded in the link:https://tei-c.org/guidelines/[*TEI*,role=external,window=_blank] format (Text Encoding Initiative), whose link:https://www.tei-c.org/release/doc/tei-p5-doc/en/html/DR.html[Performance^] module covers the types of annotations we're interested in. We're working on a representative collection of link:https://www.numistral.fr/services/engine/search/sru?operation=searchRetrieve&exactSearch=false&collapsing=true&version=1.2&query=(colnum%20adj%20%22BNUStr058%22)&suggest=10&keywords=[plays^], which were recently digitized by the Bibliothèque Nationale et Universitaire (Bnu) in Strasbourg. We're currently performing OCR on the plays and their TEI encoding.
The corpus thus created will allow a _distant reading_ or *macroanalysis* approach to Alsatian theater. Such approaches have been applied successfully to the major European dramatic traditions, as shown in a 2017 special issue of the link:https://sht.asso.fr/revue/etudes-theatrales-et-humanites-numeriques/[Revue d’Historiographie du Théâtre^]. However, such analyses are still impossible for Alsatian, given lack of an appropriate digital corpus. The MeThAL projects seeks to make up for this lack of resources.
To that end, we will apply natural language processing and document representation techniques, besides web technologies which will contribute to corpus navigability.
The huge orthographic variety of Alsatian presents specific challenges for Natural Language Processing (*NLP*), as is the case for any *low-resource language*. These challenges highlight needs which are only partially addressed by existing text analysis tools, mainly geared towards majority languages. The project will exploit and contribute to the resources created by the link:http://restaure.unistra.fr/[RESTAURE^] project, on NLP for France's regional languages.
* Pablo Ruiz, Delphine Bernhard, Andrew Briand, Carole Werner. (2024). Computational drama analysis from almost zero electronic text: The case of Alsatian theater. In Melanie Andresen and Nils Reiter. _Computational Drama Analysis: Reflecting On Methods and Interpretations_: 57-85 link:https://doi.org/10.1515/9783111071824[⟨doi.org.10.1515/9783111071824⟩,role=external,window=_blank^]
* Qinyue Liu, Pablo Ruiz, Delphine Bernhard. (2023). Towards emotion analysis for Alsatian theater. Poster presented at _Computational Humanities Research_, Paris, France. _Poster:_ link:https://hal.science/hal-04213017[⟨hal-04213017⟩,role=external,window=_blank^]. __Abstract:__ link:https://zenodo.org/doi/10.5281/zenodo.8404252[⟨zenodo.8404252⟩,role=external,window=_blank^]
* Pablo Ruiz. (2023). The MeThAL Alsatian theater corpus and related resources: Work done and perspectives. _5èmes journées scientifiques du Groupement de Recherche Linguistique Informatique Formelle et de Terrain (LIFT)_{nbsp}: 113-118. Nancy, France. link:https://hal.science/hal-04391970[⟨hal-04391970⟩,role=external,window=_blank^]
* Pablo Ruiz, Helena Bermúdez. (2022). Feature structures for character social variable annotation and an application to Alsatian theater. Poster accepted at _TEI 2022 - Text Encoding Initiative Conference_ (Virtual poster session). September 2022. Newcastle, United Kingdom. link:https://doi.org/10.5281/zenodo.7110069[⟨10.5281/zenodo.7110069⟩^] ⟨hal-036762679⟩
* Delphine Bernhard, Pablo Ruiz. (2022). ELAL: An emotion lexicon for the analysis of Alsatian theatre plays. _LREC 2022_, _Language Resources and Evaluation Conference_. link:https://hal.archives-ouvertes.fr/hal-03655148[⟨hal-03655148⟩^]
* Pablo Ruiz, Carole Werner, Delphine Bernhard. (2022). The benefits of increasing the digital availability of Alsatian theater. _Digital Humanities 2022_{nbsp}: 557-560. link:https://doi.org/10.5281/zenodo.7014965[⟨10.5281/zenodo.7014965⟩^] ⟨hal-03660481⟩
* Pablo Ruiz, Carole Werner. (2022). Théâtre alsacien{nbsp}: Personographie en TEI et navigation du corpus selon les attributs sociaux des personnages. _Humanistica 2022_. link:https://hal.archives-ouvertes.fr/hal-03660506[⟨hal-03660506⟩^]
* Pablo Ruiz, Carole Werner, Delphine Bernhard, Pascale Erhart, Dominique Huck. (2021). MeThAL : Ressources numériques pour une relecture du théâtre en alsacien. Poster presented at _10 ans avec CAHIER : Des corpus d'auteurs pour les humanités numériques à leur exploitation numérique_, June 2021, Bordeaux, France. link:https://doi.org/10.5281/zenodo.4908212[⟨10.5281/zenodo.4908212⟩,role=external,window=_blank]. ⟨hal-03255403⟩
* Pablo Ruiz, Carole Werner. (2021). Exploration du théâtre alsacien à travers ses listes de personnages pendant la période 1870-1940. _Humanistica 2021_{nbsp}: 27-29, Rennes, France. link:https://doi.org/10.5281/zenodo.4762732[⟨10.5281/zenodo.4762732⟩,role=external,window=_blank] ⟨hal-03226579⟩ link:docs/methal_humanistica_2021.pdf[[slides\],role=external,window=_blank^]
* Pablo Ruiz, Delphine Bernhard, Carole Werner. (2020). Création d'un corpus FAIR de théâtre en alsacien et normalisation de variétés non-contemporaines. _2èmes journées scientifiques du Groupement de Recherche Linguistique Informatique Formelle et de Terrain (LIFT)_{nbsp}: 32-43. Montrouge, France. link:https://doi.org/10.5281/zenodo.4323301[⟨10.5281/zenodo.4323301⟩,role=external,window=_blank] ⟨hal-03047152⟩ link:docs/methal_gdr_lift_2020.pdf[[slides\],role=external,window=_blank^]
* Pablo Ruiz, Delphine Bernhard, Pascale Erhart, Dominique Huck, Carole Werner. (2020). MeThAL : Vers une macroanalyse du théâtre en alsacien. _Humanistica 2020_, Bordeaux, France. link:https://dx.doi.org/10.5281/zenodo.3788019[⟨10.5281/zenodo.3788019⟩,role=external,window=_blank]. link:https://hal.archives-ouvertes.fr/hal-02564694[⟨hal-02564694⟩^]
You can **explore the corpus** (read the plays, filter according to plays' and character attributes) at its navigation interface: link:https://methal.eu/ui/[https://methal.eu/ui/^]
* Sources are updated in the link:https://git.unistra.fr/methal/methal-sources[methal-sources,role=external,window=_blank^] repository as encoding progresses
* Permanent (DOI-based) publication takes place through a link:https://nakala.fr/collection/10.34847/nkl.feb4r8j9[collection^] on the Nakala platform
Besides the plays, a TEI link:https://git.unistra.fr/methal/methal-sources/-/tree/master/personography[**personography**^] was published. It describes over 2,350 characters from around 230 plays, using social variables like age, gender, professional activity or social class
// comment out as now point to interface
//=== Read the plays
//* See section _link:./read/en.html[[read]]_ to read already encoded plays (##nbrPieces plays at this point)
//
* link:https://git.unistra.fr/methal/fete[FETE^]: _Fast Encoding of Theater in TEI_. Automatic TEI encoding based on OCR outputs, using sequence labeling methods
* link:https://git.unistra.fr/methal/edytha[EDYTHA^]: _Emotion Dynamics in Theater in Alsatian_. Labeling emotion expressions and their evolution within a play. Based on the link:https://github.com/Priya22/EmotionDynamics[TED] tool and the link:https://nakala.fr/10.34847/nkl.40cex998[ELAL] lexicon
* MeThAL: Towards a macroanalysis of theater in Alsatian. May 2024, Insitute for Digital Humanities, Universität zu Köln. link:https://dhc.hypotheses.org/programm-2024[Colloquim program^]
* Towards the computational analysis of a peripheral literary tradition: The case of Alsatian theater. November 2023, Institute of Contemporary History. Universidade NOVA de Lisboa, FCSH. link:https://cordis.europa.eu/project/id/101090327[REWIND^] project link:https://ihc.fcsh.unl.pt/en/events/towards-computational-analysis/[workshop, role=external,window=_blank]
* Analyze peripheral literatures -- or create the resources trying. June 2023. link:https://web.archive.org/web/20230927200609/https://citius.gal/events/conferencia-analizar-literaturas-perifericas-o-crear-los-recursos-en-el-intento[Seminar^] at link:https://citius.gal[CiTIUS, role=external,window=_blank], Universidade de Santiago de Compostela
* Réutilisation et création de données ouvertes interopérables pour l’étude du théâtre en alsacien dans le cadre du projet MeThAL. link:https://scienceouverte.unistra.fr/agenda/evenement/news/inauguration/[Open Access Month^]. October 2022, Université de Strasbourg. link:https://podv2.unistra.fr/video/49529-open-access-month-inauguration/[[video\],role=external,window=_blank] (starting at 0:50:25)
* De l'OCR à la TEI dans un corpus de théâtre alsacien dans le cadre du projet MeThAL. link:https://estrades.hypotheses.org/460[2èmes rencontres Estrades-Eveille^]. September 2022, MISHA, Université de Strasbourg.
* MeThAL: Ressources numériques pour une relecture du théâtre en alsacien. Workshop link:https://langues.unistra.fr/websites/lge/departements/allemand/JE_20_mai_2022.pdf[« Théâtre dialectal », role=external,window=_blank], May 2022, Department of German Studies, Université de Strasbourg
* MeThAL : Vers une macroanalyse du théâtre en alsacien. link:https://frlc.hypotheses.org/160[FRLC Seminar (Language and Cognition Research Group),role=external,window=_blank], Februrary 2021, Université de Strasbourg : link:./docs/methal_frlc_20210211.pdf[[slides\],role=external,window=_blank]
* MeThAL: Vers une macroanalyse du théâtre en alsacien. link:http://lilpa.unistra.fr/actualites-agenda/agenda/evenement/?tx_ttnews%5Btt_news%5D=20818&cHash=5936425c65df4e4cb35d75dc930ed24c[LiLPa Lab Seminar,role=external,window=_blank], December 2019, Université de Strasbourg : link:./docs/methal_lilpa_sem.pdf[[slides\],role=external,window=_blank]
[discrete]
=== Master's theses
* Yang, H. (2022). Détection de la variation graphique dans une langue non standardisée : le cas des dialectes alsaciens. Master's thesis in Language Sciences (Language Industries option). Université Grenoble Alpes. University supervisor : Claude Ponton (UGA). Internship supervision : Pablo Ruiz Fabo (Unistra), Alice Millour (Paris 8), Delphine Bernhard (Unistra). link:https://dumas.ccsd.cnrs.fr/dumas-03794680[⟨dumas-03794680⟩,role=external,window=_blank^]
[discrete]
=== Events
* 23/06/2023 : Workshop _Amateur theater and digital resources: Stabilization factors in the practice of minority language varieties?_ link:./docs/programme_je_theatre_amateur_20230623.pdf[[program\]]
- Our work on link:https://bnu.hypotheses.org/8392[character distribution^] based on the plays' _dramatis personæ_
- Publication of the link:https://bnu.hypotheses.org/9722[first 25 TEI-encoded^] plays
* Digital Humanities Cologne published a link:https://dhc.hypotheses.org/2958[report^] on the project following our presentation at the department colloquim "Digital Humanities – Aktuelle Forschungsthemen"
* New project related to MeThAL! In 2023-24, we will also be working on the link:https://thealtres.pages.unistra.fr[TheALTReS, role=external,window=_blank] project (__Comparing Theater in ALsacian with the TRaditions at its Source__), supported by the MISHA as part of its link:https://www.misha.fr/recherche[scientific program, role=external,window=_blank].
* The link:https://dracor.org/[DraCor,role=external,window=_blank] platform (Drama Corpora) has accepted to host the encoded plays, making some first analyses possible:
- link:https://www.dracor.org/als[dracor.org/als^]: Digital edition browsing, character networks and character-relation networks
- link:https://shiny.dracor.org/[shiny.dracor.org^]: Character interaction metrics. For instance, the interaction matrix below, for characters in _Der Pfingstmontag_ (Arnold, 1816).
image::pfingstmontag-matrice.png[Pfingstmontag,width=450,align="center"]
Project participants are members of the LiLPa lab: link:https://ruizfabo.link/unistra[Pablo Ruiz,role=external,window=_blank] (lead), link:http://lilpa.unistra.fr/theme-2-langage-parole-et-variation/membres/enseignants-chercheurs/bernhard-delphine/[Delphine Bernhard^], link:https://lilpa.unistra.fr/theme-3-langues-et-societe/membres/enseignants-chercheurs/erhart-pascale/[Pascale Erhart^], link:http://lilpa.unistra.fr/theme-2-langage-parole-et-variation/membres/enseignants-chercheurs/huck-dominique/[Dominique Huck^] and link:https://lilpa.unistra.fr/theme-2-langage-parole-et-variation/membres/doctorants/werner-carole/[Carole Werner^].
We are also in contact with the Bnu's Datalab and the Bnu's special interest group on corpora (SIG Corpus).
Special thanks to the many **interns** that we've been fortunate to work with in the project, from several fields and programs (Language Technologies, Linguistics at Master's level; Modern Languages, Computer Science at Bachelor's level): Nathanaël Beiner, Lena Camillone, Hoda Chouaib, Audrey Deck, Valentine Jung, Salomé Klein, Audrey Li-Thiao-Té, Kévin Michoud and Vedisha Toory among University of Strasbourg students. From other schools, Andrew Briand (University of Washington via IFE Strasbourg), Barbara Hoff (University of Edinburgh) and Qinyue Liu and Heng Yang (Université Grenoble Alpes).
Interested in OCR and TEI encoding, language technology application to Alsatian, digital editing, Alsatian linguistics or literature? Interested in an internship about these topics?
You have questions about the project?
// Disable figure caption to avoid "Figure X" counter (block title still renders as caption)
:!figure-caption:
// Block image title (starts with period) allows links in caption title
.Cover page for play _D'r Candidat_. Source: link:https://archive.org/details/lethtrealsac00schouoft/page/164/mode/2up[Internet Archive^]
image::dr_candidat_r.png["D'r Candidat",width=400,align="center"]
== About this website
- This site is maintained by mailto:ruizfabo@unistra.fr[Pablo Ruiz Fabo] (Université de Strasbourg)
[discrete]
=== Hosting
- The site is hosted at link:https://unistra.fr[Université de Strasbourg^]
[discrete]
=== License
- Content whose URL starts with _\https://methal.pages.unistra.fr_ is licensed under link:https://creativecommons.org/licenses/by/4.0/[CC-BY-4.0^]
- The licenses for content available on the corpus explorer interface (link:https://methal.eu/ui/[https://methal.eu/ui/^]), linked to from the present site via options _Explore the corpus_ and _Interface_ on the menu, are specified at link:https://methal.eu/ui/about[https://methal.eu/ui/about^]