<!DOCTYPE html> <html lang="en"> <head> <meta charset="UTF-8"> <meta http-equiv="X-UA-Compatible" content="IE=edge"> <meta name="viewport" content="width=device-width, initial-scale=1.0"> <meta name="generator" content="Asciidoctor 2.0.10"> <link rel="icon" type="image/x-icon" href="./img/favicon.ico"> <title>Objectives</title> <link rel="stylesheet" href="https://fonts.googleapis.com/css?family=Open+Sans:300,300italic,400,400italic,600,600italic%7CNoto+Serif:400,400italic,700,700italic%7CDroid+Sans+Mono:400,700"> <link rel="stylesheet" href="./asciidoctor.css"> </head> <body class="book toc2 toc-left"> <div id="header"> <div id="toc" class="toc2"> <div id="toctitle">Navigation</div> <ul class="sectlevel1"> <li><a href="#_objectives">Objectives</a></li> <li><a href="#_challenges">Challenges</a></li> <li><a href="#_outputs">Outputs</a></li> <li><a href="#_participants">Participants</a></li> <li><a href="#_web_presence">Web presence</a></li> <li><a href="#_get_in_touch">Get in touch</a></li> </ul> </div> </div> <div id="content"> <div id="preamble"> <div class="sectionbody"> <div class="paragraph text-right"> <p><a href="index.html">fr</a> | <a href="./en.html">en</a></p> </div> <h1 id="_methal_towards_a_macroanalysis_of_theater_in_alsatian" class="discrete">MeThAL: Towards a macroanalysis of theater in Alsatian</h1> </div> </div> <div class="sect1"> <h2 id="_objectives">Objectives</h2> <div class="sectionbody"> <div class="paragraph"> <p>The Alsatian dialect theater tradition is based predominantly on popular and humorous genres. What are the <strong>major trends</strong> in this tradition, regarding dramatic technique and character types? What are its major geographic locations?
To what extent do Alsatian dialect plays document the sociolinguistic situation of the period when they were written?</p> </div> <div class="paragraph"> <p>In order to answer these questions and carry out quantitative <strong>analyses</strong>, a large corpus representative of the tradition is required, as well as corpus annotations for the relevant variables: the geographical origin of plays and authors, the places where the plays are set, their period and genre. Regarding the characters, attributes such as their profession, social status, origin, gender or age must be made available. It is also necessary to formalize the plays' structure, identifying act and scene divisions, characters' speech and stage directions.</p> </div> <div class="paragraph"> <p>Our project’s goal is to create such a <strong>corpus</strong>, encoded in the <a href="https://tei-c.org/guidelines/" class="external" target="_blank" rel="noopener"><strong>TEI</strong></a> format (Text Encoding Initiative), whose <a href="https://www.tei-c.org/release/doc/tei-p5-doc/en/html/DR.html" target="_blank" rel="noopener">Performance Texts</a> module covers the types of annotations we’re interested in. We’re working on a representative collection of <a href="https://www.numistral.fr/services/engine/search/sru?operation=searchRetrieve&exactSearch=false&collapsing=true&version=1.2&query=(colnum%20adj%20%22BNUStr058%22)&suggest=10&keywords=" target="_blank" rel="noopener">plays</a>, which were recently digitized by the Bibliothèque Nationale et Universitaire (Bnu) in Strasbourg. We’re currently running OCR on the plays and encoding them in TEI.</p> </div> <div class="paragraph"> <p>The corpus thus created will allow a <em>distant reading</em> or <strong>macroanalysis</strong> approach to Alsatian theater.
Such approaches have been applied successfully to the major European dramatic traditions, as shown in a 2017 special issue of the <a href="https://sht.asso.fr/revue/etudes-theatrales-et-humanites-numeriques/" target="_blank" rel="noopener">Revue d’Historiographie du Théâtre</a>. However, such analyses are still impossible for Alsatian, given the lack of an appropriate digital corpus. The MeThAL project seeks to fill this gap.</p> </div> <div class="paragraph"> <p>To that end, we will apply natural language processing and document representation techniques, along with web technologies that will make the corpus easier to navigate.</p> </div> </div> </div> <div class="sect1"> <h2 id="_challenges">Challenges</h2> <div class="sectionbody"> <div class="paragraph"> <p>The wide orthographic variation of Alsatian presents specific challenges for Natural Language Processing (<strong>NLP</strong>), as is the case for any <strong>low-resource language</strong>. These challenges highlight needs that are only partially addressed by existing text analysis tools, which are mainly geared towards majority languages. The project will exploit and contribute to the resources created by the <a href="http://restaure.unistra.fr/" target="_blank" rel="noopener">RESTAURE</a> project, on NLP for France’s regional languages.</p> </div> </div> </div> <div class="sect1"> <h2 id="_outputs">Outputs</h2> <div class="sectionbody"> <div class="sect2"> <h3 id="_publications">Publications</h3> <div class="ulist"> <ul> <li> <p>Pablo Ruiz, Delphine Bernhard, Pascale Erhart, Dominique Huck, Carole Werner. (2020). MeThAL : Vers une macroanalyse du théâtre en alsacien. <em>Humanistica 2020</em>, Bordeaux, France. <a href="https://dx.doi.org/10.5281/zenodo.3788019" class="external" target="_blank" rel="noopener">⟨10.5281/zenodo.3788019⟩</a>.
<a href="https://hal.archives-ouvertes.fr/hal-02564694" target="_blank" rel="noopener">⟨hal-02564694⟩</a></p> </li> </ul> </div> </div> <div class="sect2"> <h3 id="_corpus">Corpus</h3> <div class="ulist"> <ul> <li> <p><a href="https://git.unistra.fr/methal/methal-sources">methal-sources</a>: The TEI-encoded plays are publicly available on the University’s Git repositories: <a href="https://git.unistra.fr/methal/methal-sources">https://git.unistra.fr/methal/methal-sources</a></p> </li> </ul> </div> </div> <div class="sect2"> <h3 id="_presentations">Presentations</h3> <div class="ulist"> <ul> <li> <p>Pablo Ruiz at the LiLPa Lab seminar, December 2019: <a href="http://prf1.org/docs/methal.pdf" class="external" target="_blank" rel="noopener">[pdf]</a></p> </li> </ul> </div> </div> </div> </div> <div class="sect1"> <h2 id="_participants">Participants</h2> <div class="sectionbody"> <div class="paragraph"> <p>Project participants are members of the LiLPa lab: <a href="http://lilpa.unistra.fr/fdt/membres/chercheurs/ruiz-fabo-pablo/" class="external" target="_blank" rel="noopener">Pablo Ruiz</a> (lead), <a href="http://lilpa.unistra.fr/fdt/membres/chercheurs/bernhard-delphine/" target="_blank" rel="noopener">Delphine Bernhard</a>, <a href="http://lilpa.unistra.fr/gepe/membres/chercheures/erhart-pascale/" target="_blank" rel="noopener">Pascale Erhart</a>, <a href="http://lilpa.unistra.fr/gepe/membres/chercheures/huck-dominique/" target="_blank" rel="noopener">Dominique Huck</a> and <a href="http://lilpa.unistra.fr/gepe/membres/doctorantes/werner-carole/" target="_blank" rel="noopener">Carole Werner</a>.</p> </div> <div class="paragraph"> <p>We are also in contact with the Bnu’s Datalab and the Bnu’s special interest group on corpora (SIG Corpus).</p> </div> </div> </div> <div class="sect1"> <h2 id="_web_presence">Web presence</h2> <div class="sectionbody"> <div class="ulist"> <ul> <li> <p>The <a href="https://bnu.hypotheses.org/5343" target="_blank" rel="noopener">Bnu’s 
research blog</a> discusses the project</p> </li> <li> <p>The <a href="https://dracor.org/" class="external" target="_blank" rel="noopener">DraCor</a> platform (Drama Corpora) has agreed to host the encoded plays, making some first analyses possible:</p> <div class="ulist"> <ul> <li> <p><a href="https://www.dracor.org/als" target="_blank" rel="noopener">dracor.org/als</a>: Digital edition browsing, character networks and character-relation networks</p> </li> <li> <p><a href="https://shiny.dracor.org/" target="_blank" rel="noopener">shiny.dracor.org</a>: Character interaction metrics, for instance the interaction matrix below for the characters in <em>Der Pfingstmontag</em> (Arnold, 1816).</p> </li> </ul> </div> </li> </ul> </div> <div class="imageblock text-center"> <div class="content"> <img src="img/pfingstmontag-matrice.png" alt="Pfingstmontag" width="450"> </div> </div> </div> </div> <div class="sect1"> <h2 id="_get_in_touch">Get in touch</h2> <div class="sectionbody"> <div class="paragraph"> <p>Interested in an internship on OCR and TEI encoding, applying language technology to Alsatian, digital editing, Alsatian linguistics or literature, or database and web development?</p> </div> <div class="paragraph"> <p>Do you have questions about the project?</p> </div> <div class="paragraph"> <p>Do contact us!</p> </div> <div class="imageblock text-center"> <div class="content"> <img src="img/dr_candidat_r.png" alt="D’r Candidat" width="400"> </div> <div class="title">Cover page of the play <em>D’r Candidat</em>. Source: <a href="https://archive.org/details/lethtrealsac00schouoft/page/164/mode/2up" target="_blank" rel="noopener">Internet Archive</a></div> </div> </div> </div> </div> <div id="footer"> <div id="footer-text"> With the support of Université de Strasbourg's IdEx program (Attractivité 2020). Last updated on 2020-07-29 21:59:15 +0200 </div> </div> </body> </html>