GitLab now enforces expiry dates on tokens that originally had no set expiration date. Those tokens were given an expiration date of one year later. Please review your personal access tokens, project access tokens, and group access tokens to ensure you are aware of upcoming expirations. Administrators of GitLab can find more information on how to identify and mitigate interruption in our documentation.
<h1id="_methal_towards_a_macroanalysis_of_theater_in_alsatian"class="discrete">MeThAL: Towards a macroanalysis of theater in Alsatian</h1>
</div>
</div>
<divclass="sect1">
<h2id="_objectives">Objectives</h2>
<divclass="sectionbody">
<divclass="paragraph">
<p>The Alsatian dialect theater tradition is based predominantly on popular and humour genres. What are the <strong>major trends</strong> in this tradition, regarding dramatic technique and character types? What are its major geographic locations? To what an extent do Alsatian dialect plays document the sociolinguistic situation of the period when they were written?</p>
</div>
<divclass="paragraph">
<p>In order to answer these questions and carry out quantitative <strong>analyses</strong>, a large corpus, representative of the tradition, is required, as well as corpus annotations for the relevant variables: geographical origin of plays and authors, places where the plays take place, their period and genre. Regarding the characters, attributes such as their profession, social status, origin, gender or age must be made available. It is also necessary to formalize the plays' structure, identifying act and scene divisions, characters' speech and stage directions.</p>
</div>
<divclass="paragraph">
<p>Our project’s goal is creating such a <strong>corpus</strong>, encoded in the <ahref="https://tei-c.org/guidelines/"class="external"target="_blank"rel="noopener"><strong>TEI</strong></a> format (Text Encoding Initiative), whose <ahref="https://www.tei-c.org/release/doc/tei-p5-doc/en/html/DR.html"target="_blank"rel="noopener">Performance</a> module covers the types of annotations we’re interested in. We’re working on a representative collection of <ahref="https://www.numistral.fr/services/engine/search/sru?operation=searchRetrieve&exactSearch=false&collapsing=true&version=1.2&query=(colnum%20adj%20%22BNUStr058%22)&suggest=10&keywords="target="_blank"rel="noopener">plays</a>, which were recently digitized by the Bibliothèque Nationale et Universitaire (Bnu) in Strasbourg. We’re currently performing OCR on the plays and their TEI encoding.</p>
</div>
<divclass="paragraph">
<p>The corpus thus created will allow a <em>distant reading</em> or <strong>macroanalysis</strong> approach to Alsatian theater. Such approaches have been applied successfully to the major European dramatic traditions, as shown in a 2017 special issue of the <ahref="https://sht.asso.fr/revue/etudes-theatrales-et-humanites-numeriques/"target="_blank"rel="noopener">Revue d’Historiographie du Théâtre</a>. However, such analyses are still impossible for Alsatian, given lack of an appropriate digital corpus. The MeThAL projects seeks to make up for this lack of resources.</p>
</div>
<divclass="paragraph">
<p>To that end, we will apply natural language processing and document representation techniques, besides web technologies which will contribute to corpus navigability.</p>
</div>
</div>
</div>
<divclass="sect1">
<h2id="_challenges">Challenges</h2>
<divclass="sectionbody">
<divclass="paragraph">
<p>The huge orthographic variety of Alsatian presents specific challenges for Natural Language Processing (<strong>NLP</strong>), as is the case for any <strong>low-resource language</strong>. These challenges highlight needs which are only partially addressed by existing text analysis tools, mainly geared towards majority languages. The project will exploit and contribute to the resources created by the <ahref="http://restaure.unistra.fr/"target="_blank"rel="noopener">RESTAURE</a> project, on NLP for France’s regional languages.</p>
</div>
</div>
</div>
<divclass="sect1">
<h2id="_outputs">Outputs</h2>
<divclass="sectionbody">
<divclass="sect2">
<h3id="_publications">Publications</h3>
<divclass="ulist">
<ul>
<li>
<p>Pablo Ruiz, Delphine Bernhard, Pascale Erhart, Dominique Huck, Carole Werner. (2020). MeThAL : Vers une macroanalyse du théâtre en alsacien. <em>Humanistica 2020</em>, Bordeaux, France. <ahref="https://dx.doi.org/10.5281/zenodo.3788019"class="external"target="_blank"rel="noopener">⟨10.5281/zenodo.3788019⟩</a>. <ahref="https://hal.archives-ouvertes.fr/hal-02564694"target="_blank"rel="noopener">⟨hal-02564694⟩</a></p>
</li>
</ul>
</div>
</div>
<divclass="sect2">
<h3id="_corpus">Corpus</h3>
<divclass="ulist">
<ul>
<li>
<p><ahref="https://git.unistra.fr/methal/methal-sources">methal-sources</a>: The TEI-encoded plays are publicly available on the University’s Git repositories: <ahref="https://git.unistra.fr/methal/methal-sources">https://git.unistra.fr/methal/methal-sources</a></p>
</li>
</ul>
</div>
</div>
<divclass="sect2">
<h3id="_presentations">Presentations</h3>
<divclass="ulist">
<ul>
<li>
<p>Pablo Ruiz at the LiLPa Lab seminar, December 2019: <ahref="http://prf1.org/docs/methal.pdf"class="external"target="_blank"rel="noopener">[pdf]</a></p>
</li>
</ul>
</div>
</div>
</div>
</div>
<divclass="sect1">
<h2id="_participants">Participants</h2>
<divclass="sectionbody">
<divclass="paragraph">
<p>Project participants are members of the LiLPa lab: <ahref="http://lilpa.unistra.fr/fdt/membres/chercheurs/ruiz-fabo-pablo/"class="external"target="_blank"rel="noopener">Pablo Ruiz</a> (lead), <ahref="http://lilpa.unistra.fr/fdt/membres/chercheurs/bernhard-delphine/"target="_blank"rel="noopener">Delphine Bernhard</a>, <ahref="http://lilpa.unistra.fr/gepe/membres/chercheures/erhart-pascale/"target="_blank"rel="noopener">Pascale Erhart</a>, <ahref="http://lilpa.unistra.fr/gepe/membres/chercheures/huck-dominique/"target="_blank"rel="noopener">Dominique Huck</a> and <ahref="http://lilpa.unistra.fr/gepe/membres/doctorantes/werner-carole/"target="_blank"rel="noopener">Carole Werner</a>.</p>
</div>
<divclass="paragraph">
<p>We are also in contact with the Bnu’s Datalab and the Bnu’s special interest group on corpora (SIG Corpus).</p>
</div>
</div>
</div>
<divclass="sect1">
<h2id="_web_presence">Web presence</h2>
<divclass="sectionbody">
<divclass="ulist">
<ul>
<li>
<p>The <ahref="https://bnu.hypotheses.org/5343"target="_blank"rel="noopener">Bnu’s research blog</a> talks about the project</p>
</li>
<li>
<p>The <ahref="https://dracor.org/"class="external"target="_blank"rel="noopener">DraCor</a> platform (Drama Corpora) has accepted to host the encoded plays, making some first analyses possible:</p>
<divclass="ulist">
<ul>
<li>
<p><ahref="https://www.dracor.org/als"target="_blank"rel="noopener">dracor.org/als</a>: Digital edition browsing, character networks and character-relation networks</p>
</li>
<li>
<p><ahref="https://shiny.dracor.org/"target="_blank"rel="noopener">shiny.dracor.org</a>: Character interaction metrics. For instance, the interaction matrix below, for characters in <em>Der Pfingstmontag</em> (Arnold, 1816).</p>
<p>Interested in doing an internship on OCR and TEI encoding, language technology application to Alsatian, digital editing, Alsatian linguistics or literature, or database and web development?</p>
<divclass="title">Cover page for play <em>D’r Candidat</em>. Source: <ahref="https://archive.org/details/lethtrealsac00schouoft/page/164/mode/2up"target="_blank"rel="noopener">Internet Archive</a></div>
</div>
</div>
</div>
</div>
<divid="footer">
<divid="footer-text">
Supported by Université de Strasbourg's IdEx program (Attractivité 2020). Last updated on 2020-07-30 19:10:32 +0200