Structural generalization in COGS: Supertagging is (almost) all you need

Alban Petit; Caio Corro; François Yvon

doi:10.18653/v1/2023.emnlp-main.69

Conference paper, Year: 2023


Alban Petit

Role: Author
PersonId : 1144131

Traitement du Langage Parlé - LISN

Caio Corro

Role: Author
PersonId : 740403
IdHAL : caiocorro
ORCID : 0000-0001-7443-4109
IdRef : 242971059

Machine Learning and Information Access

François Yvon

Role: Author
PersonId : 5347
IdHAL : francois-yvon
ORCID : 0000-0002-7972-7442
IdRef : 057593531

Machine Learning and Information Access

Abstract

In many Natural Language Processing applications, neural networks have been found to fail to generalize on out-of-distribution examples. In particular, several recent semantic parsing datasets have put forward important limitations of neural networks in cases where compositional generalization is required. In this work, we extend a neural graph-based semantic parsing framework in several ways to alleviate this issue. Notably, we propose: (1) the introduction of a supertagging step with valency constraints, expressed as an integer linear program; (2) a reduction of the graph prediction problem to the maximum matching problem; (3) the design of an incremental early-stopping training strategy to prevent overfitting. Experimentally, our approach significantly improves results on examples that require structural generalization in the COGS dataset, a known challenging benchmark for compositional generalization. Overall, our results confirm that structural constraints are important for generalization in semantic parsing.
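The reduction to matching mentioned in point (2) can be pictured with a toy example. The sketch below is not the authors' implementation. It assumes, for illustration only, that after supertagging some words require an outgoing arc with a given label and an equal number of words expect an incoming arc with that label, so choosing the arcs amounts to a maximum-weight matching between the two groups. The function name `best_matching`, the example words, and the scores are all hypothetical, and the brute-force search is only practical for very small inputs.

```python
from itertools import permutations


def best_matching(scores):
    """Return (total, assignment) for the highest-scoring perfect matching.

    scores[i][j] is the (hypothetical) score of drawing an arc from
    head i to dependent j. assignment[i] is the dependent matched to head i.
    Brute force over all permutations: fine for toy sizes only.
    """
    n = len(scores)
    best_total, best_assign = float("-inf"), None
    for perm in permutations(range(n)):
        total = sum(scores[i][perm[i]] for i in range(n))
        if total > best_total:
            best_total, best_assign = total, perm
    return best_total, best_assign


# Hypothetical setup for one arc label ("agent"):
# two verbs each need one outgoing "agent" arc,
# two nouns each expect one incoming "agent" arc.
heads = ["helped", "slept"]
dependents = ["Emma", "cat"]
scores = [
    [2.0, 0.5],  # helped -> Emma, helped -> cat
    [0.1, 1.5],  # slept  -> Emma, slept  -> cat
]

total, assignment = best_matching(scores)
for i, j in enumerate(assignment):
    print(f"{heads[i]} -agent-> {dependents[j]}")
```

Because every slot must be filled exactly once, the valency declared by the supertags is respected by construction; a practical system would replace the brute-force search with a polynomial-time assignment algorithm.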

Domains

Computation and Language [cs.CL]

Main file

2023.emnlp-main.69.pdf (344.86 KB)

Origin: Files produced by the author(s)


https://hal.science/hal-04382463

Submitted on: Tuesday, 9 January 2024, 14:00:50

Last modified on: Sunday, 10 November 2024, 20:13:52

Dates and versions

hal-04382463 , version 1 (09-01-2024)

Identifiers

HAL Id : hal-04382463 , version 1
DOI : 10.18653/v1/2023.emnlp-main.69

Cite

Alban Petit, Caio Corro, François Yvon. Structural generalization in COGS: Supertagging is (almost) all you need. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Dec 2023, Singapore, Singapore. pp.1089-1101, ⟨10.18653/v1/2023.emnlp-main.69⟩. ⟨hal-04382463⟩


Collections

CNRS INRIA ISIR CENTRALESUPELEC UNIV-PARIS-SACLAY SORBONNE-UNIVERSITE SU-SCIENCES ANR LISN GS-COMPUTER-SCIENCE LISN-TLP ISIR_MLIA

