@@ -22,13 +22,13 @@ In order to answer these questions and carry out quantitative *analyses*, a larg
Our project's goal is to create such a *corpus*, encoded in the link:https://tei-c.org/guidelines/[*TEI*,role=external,window=_blank] format (Text Encoding Initiative), whose link:https://www.tei-c.org/release/doc/tei-p5-doc/en/html/DR.html[Performance^] module covers the types of annotations we're interested in. We're working on a representative collection of link:https://www.numistral.fr/services/engine/search/sru?operation=searchRetrieve&exactSearch=false&collapsing=true&version=1.2&query=(colnum%20adj%20%22BNUStr058%22)&suggest=10&keywords=[plays^], which were recently digitized by the Bibliothèque Nationale et Universitaire (Bnu) in Strasbourg. We're currently performing OCR on the plays and encoding them in TEI.
The corpus thus created will allow a _distant reading_ or *macroanalysis* approach to Alsatian theater. Such approaches have been applied successfully to the major European dramatic traditions, as shown in a 2017 special issue of the link:https://sht.asso.fr/revue/etudes-theatrales-et-humanites-numeriques/[Revue d’Historiographie du Théâtre^]. However, such analyses are still impossible for Alsatian, given the lack of an appropriate digital corpus. The MeThAL project seeks to make up for this lack of resources.
To that end, we will apply natural language processing and document representation techniques, along with web technologies that will make the corpus easier to navigate.
== Challenges
The huge orthographic variety of Alsatian presents specific challenges for Natural Language Processing (*NLP*), as is the case for any *low-resource language*. These challenges highlight needs which are only partially addressed by existing text analysis tools, mainly geared towards majority languages. The project will exploit and contribute to the resources created by the link:http://restaure.unistra.fr/[RESTAURE^] project on NLP for France's regional languages.