Thursday, April 24 (between 11:00 a.m. and 12:00 a.m.) an operation is planned on the database.
which may cause disturbances on Sciencesconf |
|
KeynotesQuentin FeltgenUniversity of Ghent Resampling techniques in corpus data statistical analysisIn this talk, I intend to present a general approach on statistical analysis, which consists in resampling data to build distributions over relevant quantities. An observed empirical value can then be compared to such a distribution in order to assess its significance. Compared to more traditional statistical tests, this approach is extremely versatile and can be tailored to address virtually any research question, without too much of a concern for the underlying distribution of the data (e.g. one does not need to check for normality). From an epistemological point of view, resampling data, insofar as it probes their statistical structure, not only provides a data analysis toolkit, but also a way forward in unravelling the underlying properties of language organization. I will present the main idea behind resampling, highlight one key caveat of these techniques when applied to language data, and detail three applications of resampling techniques: the study of a linguistic pattern’s productivity, the comparison of diachronic dynamics across different types of a given construction, and the automatic detection of semantic change in semi-schematic constructions.
Francesca FrontiniComputational linguistics institute Antonio Zampolli, CNR Pisa Towards FAIR Specialized Corpora: a Bilingual Corpus in the Wastewater and Stormwater DomainIn this presentation, we will explore the challenges related to the creation, annotation, and dissemination of a multilingual corpus dedicated to the field of wastewater and stormwater networks. We will specifically address the stages of corpus creation, alignment, and annotation, with a particular focus on named entity recognition and information extraction methods. A major objective of this work is to ensure compliance with the FAIR principles (Findable, Accessible, Interoperable, Reusable), through the integration of the corpus in the CLARIN infrastructure.
Biagio UrsiUniversity of Orléans Linguistics and interactional corpora: queries, exploitations and comparisonsIn this talk, I will present my current research trajectories in the field of interactional corpus linguistics, focusing on three areas.
Geoffrey WilliamsUniversity of Grenoble Alpes Corpus linguistics: from exploratory origins to a necessary futureSo-called “Large Language Models” have become the flavour of the month, and have superseded Web as Corpus in language engineering. There usability in computing cannot be denied, but are they really added value in Corpus linguistics? |
Online user: 4 | Privacy | Accessibility |
![]() ![]() |