Investigating associative, switchable and negatable Winograd items on renewed French data sets - Conférences TALN RECITAL
Conference paper, Year: 2022

Abstract

The Winograd Schema Challenge (WSC) consists of a set of anaphora resolution problems that can be resolved only by reasoning about world knowledge. This article describes the update of the existing French data set and the creation of three subsets that allow a more robust, fine-grained evaluation protocol for the WSC in French (FWSC): an associative subset (items easily resolvable through lexical co-occurrence), a switchable subset (items where inverting two keywords reverses the answer) and a negatable subset (items where negating the verb reverses the answer). Experiments on these data sets with CamemBERT reach state-of-the-art performance. Our evaluation protocol also shows that the higher performance could be explained by the presence of associative items in FWSC. Moreover, increasing the size of the training corpus improves the model’s performance on switchable items, whereas its impact on negatable items remains small.
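The switchable subset can be illustrated with a minimal Python sketch. It assumes a hypothetical `WinogradItem` structure and a hypothetical `resolve(sentence, candidates)` callable standing in for a model. The example sentence is invented for illustration and is not taken from FWSC, and the consistency check is one plausible way to use switchable items, not necessarily the protocol used in the paper.

```python
import re
from dataclasses import dataclass
from typing import Callable, Tuple


@dataclass(frozen=True)
class WinogradItem:
    # Hypothetical container; the field names are assumptions of this sketch.
    sentence: str
    candidates: Tuple[str, str]
    answer: str  # the candidate the ambiguous pronoun refers to


def switch(item: WinogradItem) -> WinogradItem:
    """Swap the two candidates in the sentence and flip the gold answer."""
    a, b = item.candidates
    pattern = re.compile(rf"\b({re.escape(a)}|{re.escape(b)})\b")
    # A single pass avoids swapping a candidate back after replacing it.
    swapped = pattern.sub(lambda m: b if m.group(1) == a else a, item.sentence)
    new_answer = b if item.answer == a else a
    return WinogradItem(swapped, item.candidates, new_answer)


def is_consistent(resolve: Callable[[str, Tuple[str, str]], str],
                  item: WinogradItem) -> bool:
    """A resolver is consistent if its prediction changes after switching."""
    original = resolve(item.sentence, item.candidates)
    switched = resolve(switch(item).sentence, item.candidates)
    return original != switched


def second_mentioned(sentence: str, candidates: Tuple[str, str]) -> str:
    # Toy resolver: picks whichever candidate appears later in the sentence.
    return max(candidates, key=sentence.find)


# Invented example item (not from the FWSC data set).
example = WinogradItem(
    sentence="Anna thanked Lucy because she had helped.",
    candidates=("Anna", "Lucy"),
    answer="Lucy",
)
```

With this sketch, `switch(example)` yields the sentence "Lucy thanked Anna because she had helped." with the answer "Anna". A resolver that always returns the same name regardless of the sentence fails `is_consistent`, while `second_mentioned` passes it, even though neither one reasons about world knowledge.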
Main file
7675.pdf (208.35 KB)
Origin: Publisher files allowed on an open archive

Dates and versions

hal-03701511, version 1 (24-06-2022)

Identifiers

  • HAL Id: hal-03701511, version 1

Cite

Xiaoou Wang, Olga Seminck, Pascal Amsili. Investigating associative, switchable and negatable Winograd items on renewed French data sets. Traitement Automatique des Langues Naturelles (TALN 2022), Jun 2022, Avignon, France. pp. 136-143. ⟨hal-03701511⟩