FrSemCor: Annotating a French corpus with supersenses

L Barque; Pauline Haas; R Huyghe; Delphine Tribout; M Candito; B Crabbé; V Segonne

Communication Dans Un Congrès Année : 2020

FrSemCor: Annotating a French corpus with supersenses

(1, 2) , (3, 1) , (4) , (5, 6) , (7, 2) , (7, 2) , (7, 2)

1
2
3
4
5
6
7

L Barque

Fonction : Auteur

Université Paris 13 - UFR Lettres, langues, sciences humaines et des sociétés

Laboratoire de Linguistique Formelle

Pauline Haas

Fonction : Auteur
PersonId : 10830
IdHAL : pauline-haas
IdRef : 14452774X

Lattice - Langues, Textes, Traitements informatiques, Cognition - UMR 8094

Université Paris 13 - UFR Lettres, langues, sciences humaines et des sociétés

R Huyghe

Fonction : Auteur

Université de Fribourg = University of Fribourg

Delphine Tribout

Fonction : Auteur

Université de Lille - Faculté des Humanités

Savoirs, Textes, Langage (STL) - UMR 8163

M Candito

Fonction : Auteur

Université Paris Diderot - Paris 7

Laboratoire de Linguistique Formelle

B Crabbé

Fonction : Auteur

Université Paris Diderot - Paris 7

Laboratoire de Linguistique Formelle

V Segonne

Fonction : Auteur

Université Paris Diderot - Paris 7

Laboratoire de Linguistique Formelle

Résumé

French, as many languages, lacks semantically annotated corpus data. Our aim is to provide the linguistic and NLP research communities with a gold standard sense-annotated corpus of French, using WordNet Unique Beginners as semantic tags, thus allowing for interoperability. In this paper, we report on the first phase of the project, which focused on the annotation of common nouns. The resulting dataset consists of more than 12,000 French noun tokens which were annotated in double blind and adjudicated according to a carefully redefined set of supersenses. The resource is released online under a Creative Commons Licence.

Mots clés

semantic annotation supersenses corpus

Domaines

Linguistique

Fichier principal

Fr_SemCor_LREC2020.pdf (170.93 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Lucie Barque : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02511929

Soumis le : jeudi 19 mars 2020-11:00:33

Dernière modification le : vendredi 19 avril 2024-16:18:57

Archivage à long terme le : samedi 20 juin 2020-13:45:20

Dates et versions

hal-02511929 , version 1 (19-03-2020)

Identifiants

HAL Id : hal-02511929 , version 1

Citer

L Barque, Pauline Haas, R Huyghe, Delphine Tribout, M Candito, et al.. FrSemCor: Annotating a French corpus with supersenses. LREC-2020, May 2020, Marseille, France. ⟨hal-02511929⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-PARIS13 ENS-PARIS CNRS UNIV-PARIS3 LATTICE LLF STL CAMPUS-AAR AAI PSL USPC UNIV-LILLE SORBONNE-PARIS-NORD UP-SOCIETES-HUMANITES ACT-R

349 Consultations

320 Téléchargements

FrSemCor: Annotating a French corpus with supersenses

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager