beandeau>

Workshop

Training workshops will be offered on Tuesday, October 21, from 9:30 a.m. to 12:30 p.m., before the start of the conference.

They are free and open to all participants, but registration is required by email at jlc2025@sciencesconf.org,

specifying the workshop you wish to attend. Your registration is binding, as the number of places is limited.

We warmly thank our colleagues who are organizing these workshops.

NB: the workshops will be held in French.

  • AVAA Toolkit: a toolbox to support the analysis of interactions based on multimodal corpora
  • Video collection and processing, which is divided into two parts; you may attend only one of them
    • Collection (9:30 a.m. to 11:00 a.m.) : audiovisual equipment, points of view, which parameters to use, storage, and GDPR
    • Processing (11:00 a.m. to 12:30 p.m.) : import, storage, editing, synchronization, anonymization, automatic transcription
  • CORLI CORPUCIT : A web tool for creating permanent citations of corpus or text excerpts and inserting them into your scientific articles and books in PDF format.
  • TXM beginner written corpus
  • TXM beginner oral corpus

===============================================================================================

AVAA Toolkit: a toolkit to support the analysis of interactions from multimodal corpora

Introduction

The AVAA Toolkit (Audio and Video Annotations Analysis Toolkit) software offers numerous features for interaction analysis. It can be used at different stages of the research process based on annotated audiovisual data corpora: processing, mining, data visualization, but also the annotation process itself with an intercoding procedure enabling the collaborative construction of coding items.

Facilitator

Clotilde George (University of Lorraine, language sciences researcher, associate member of ATILF)

Topic & Objective

Getting started with the AVAA Toolkit software, which is suitable for analyzing annotated audiovisual corpora (particularly with ELAN). Creating combined primary and secondary data collections.

Resources & Prerequisites

Software: www.avaa-toolkit.org

Quick overview: https://avaa-toolkit.org/features/

Documentation: https://avaa-toolkit.org/documentation

Participants must have a corpus of annotations aligned with the signal (eaf, azp, cha, textgrid formats, etc.).

Terms

Duration: 3 hours, 9h30-12h30

Number of participants: 10 maximum

Pre-installation of software: yes (www.avaa-toolkit.org). Please contact dev@avaa-toolkit.org if you have any difficulties installing the software.

=============================================================================================

Audiovisual data collection

Description

The production and processing of corpora involves methodological considerations, technical knowledge, and legal and ethical issues. We will look at the different types of video and audio recording equipment available. What equipment can be used depending on the field and research questions? In addition to traditional camcorders and lapel microphones, we will look at equipment such as 360° cameras, subjective cameras, and action cameras.

Facilitators

Justine Lascar, CNRS research engineer at the ICAR laboratory, head of the Audiovisual Engineering Corpus (CIA) unit

Léa Mouton, CNRS assistant engineer at the ICAR laboratory, member of the Audiovisual Engineering Corpus (CIA) unit

Resources

No software installation required

CIA page https://icar.cnrs.fr/recherche/services/

Terms

Duration: 1.5 hours, 9h30-11h

Number of participants: 10 maximum

==

 Processing audiovisual data

Description

We will address the issue of formats, editing, and exporting audiovisual materials. We will carry out the entire processing chain from import to export of synchronized files. Then, we will perform automatic transcription of audiovisual files using several methods.

Facilitators

Justine Lascar, CNRS research engineer at the ICAR laboratory, head of the Audiovisual Engineering Corpus (CIA) unit

Léa Mouton, CNRS assistant engineer at the ICAR laboratory, member of the Audiovisual Engineering Corpus (CIA) unit

Resources

No software installation required

CIA page https://icar.cnrs.fr/recherche/services/

Terms 

Duration: 1.5 hours, 11h-12h30

Number of participants: 10 maximum

=======================================================================

CORLI CORPUCIT : A web tool for creating permanent citations of corpus or text excerpts and inserting them into your scientific articles and books in PDF format.

Introduction

The development of electronic publishing has greatly enriched the citation of language or corpus excerpts. Multiple formats and presentations can be used, and sound, images, and video can be added. However, unlike paper, these presentations are ephemeral.

CORPUCIT is a tool that allows you to create citations of corpus or text excerpts, which are permanently stored on NAKALA, inserted into your articles or books, and which refer to the original corpus or sources. As the citations are electronic, there is no format limit.

CORPUCIT promotes open science and the reproduction of scientific work by ensuring the longevity of the data described. It also raises awareness of the use of corpora on the Internet.

Facilitator

Christophe Parisse (CORLI & Modyco - CNRS & Université Paris Nanterre)

Driss Sadoun (PostLab & ERTIM/INALCO )

Topic & Objective

Getting started with the CORPUCIT website, inserting citations into electronic documents.

Resources & Prerequisites

A web browser with an internet connection.

A text editing tool (LibreOffice, Latex, or Microsoft Word, for example).

Terms

Duration: 3 hours, 9:30 a.m. to 12:30 p.m.

Number of participants: 20 maximum

=============================================================================================

TXM for beginners (written corpora)

 Introduction

TXM is a software program that allows you to search corpora and extract concordances and statistics.
 
 The training is intended for beginners and will consist of two parts.
 
1. Importing a corpus

 We will see how to organize and import your corpora. Demo corpora will be provided, but you can also bring your own corpus
(plain text, XML − but *not* PDF), and we will see what we can do...
If you bring your own corpus, you will need to send it to me a few days before the training.
 
2. Searching a corpus
 We will see how to explore the corpus and search it using the CQL query language (also used by other
software).
 
Facilitator

Achille Falaise, Laboratoire de Linguistique Formelle  (LLF - UMR7110)

Prerequisites
 You must have a computer with TXM installed. You can download TXM here: https://txm.gitpages.huma-num.fr/textometrie/files/software/TXM/0.8.4/
 . You do not need the latest version. However, please make sure
that TXM starts up properly! I will not be able to provide technical support
during the training.
 We will also use a plain text editor (https://www.sublimetext.com/ is recommended) and a spreadsheet program (https://fr.libreoffice.org/download/telecharger-libreoffice/ is recommended—please note that Excel is also a spreadsheet program, but often causes problems for the purposes for which we will be using it).
 
Terms and conditions
 Duration: 3 hours, 9h30-12h30

Number of participants: 15 maximum

==============================================================================================

Description of the TXM beginners workshop (spoken corpus)  coming soon...

 

Loading... Loading...