Skip to content
Snippets Groups Projects
Commit 7d10b253 authored by pruizf's avatar pruizf
Browse files

[ct] correct typos en

parent c34346d3
No related merge requests found
Pipeline #36497 passed with stages
in 1 minute and 40 seconds
......@@ -22,13 +22,13 @@ In order to answer these questions and carry out quantitative *analyses*, a larg
Our project's goal is creating such a *corpus*, encoded in the link:https://tei-c.org/guidelines/[*TEI*,role=external,window=_blank] format (Text Encoding Initiative), whose link:https://www.tei-c.org/release/doc/tei-p5-doc/en/html/DR.html[Performance^] module covers the types of annotations we're interested in. We're working on a representative collection of link:https://www.numistral.fr/services/engine/search/sru?operation=searchRetrieve&exactSearch=false&collapsing=true&version=1.2&query=(colnum%20adj%20%22BNUStr058%22)&suggest=10&keywords=[plays^], which were recently digitized by the Bibliothèque Nationale et Universitaire (Bnu) in Strasbourg. We're currently performing OCR on the plays and their TEI encoding.
The corpus thus created will allow a _distant reading_ or *macroanalysis* approach to Alsatian theater. Such approaches have been applied succesfully to the major European dramatic traditions, as shown in a 2017 special issue of the link:https://sht.asso.fr/revue/etudes-theatrales-et-humanites-numeriques/[Revue d’Historiographie du Théâtre^]. However, such analyses are still impossible for Alsatian, given lack of an appropriate digital corpus. The MeThAL projects seeks to make up for this lack of resources.
The corpus thus created will allow a _distant reading_ or *macroanalysis* approach to Alsatian theater. Such approaches have been applied successfully to the major European dramatic traditions, as shown in a 2017 special issue of the link:https://sht.asso.fr/revue/etudes-theatrales-et-humanites-numeriques/[Revue d’Historiographie du Théâtre^]. However, such analyses are still impossible for Alsatian, given lack of an appropriate digital corpus. The MeThAL projects seeks to make up for this lack of resources.
To that end, we will apply natural language processing and document representation techiques, besides web technologies which will contribute to corpus navigability.
To that end, we will apply natural language processing and document representation techniques, besides web technologies which will contribute to corpus navigability.
== Challenges
The huge orthographic variety of Alsatian presents specific challenges for Natural Language Processing (*NLP*), as is the case for any *low-resource language*. These challenges highlight needs which are only partially adressed by existing text analysis tools, mainly geared towards majority languages. The project will exploit and contribute to the resources created by the link:http://restaure.unistra.fr/[RESTAURE^] project, on NLP for France's regional languages.
The huge orthographic variety of Alsatian presents specific challenges for Natural Language Processing (*NLP*), as is the case for any *low-resource language*. These challenges highlight needs which are only partially addressed by existing text analysis tools, mainly geared towards majority languages. The project will exploit and contribute to the resources created by the link:http://restaure.unistra.fr/[RESTAURE^] project, on NLP for France's regional languages.
== Outputs
......
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment