Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 

Overview of PRESTA at IberLEF 2025: question answering over tabular data in Spanish

Grijalba, Jorge Osés, Ureña-López, Luis Alfonso, Cámara, Eugenio Martínez and Camacho-Collados, Jose ORCID: https://orcid.org/0000-0003-1618-7239 2025. Overview of PRESTA at IberLEF 2025: question answering over tabular data in Spanish. Procesamiento del Lenguaje Natural 75, pp. 475-486. 10.26342/2025-75-35

Full text not available from this repository.

Abstract

We present the findings and results of the PRESTA track at IberLEF 2025, which focused on question answering over tabular data in Spanish. The task challenges participants to build systems capable of interpreting natural language questions and retrieving accurate answers from semi-structured tabular sources in Spanish. In this paper, we describe the task design, dataset construction, evaluation methodology, and participant systems. We analyze a range of submitted approaches and discuss key trends observed across systems. Our results show that methods leveraging large language models (LLMs) clearly outperformed traditional pipelines, with larger multilingual models exhibiting very high accuracy. Notably, small open-source models performed on par with larger proprietary ones when paired with good system design. These findings confirm that the strong performance of LLMs in English carries over to Spanish in the context of tabular question answering, though some linguistic and domain-specific challenges remain.

Item Type: Article
Date Type: Publication
Status: Published
Schools: Schools > Computer Science & Informatics
Publisher: Sociedad Española para el Procesamiento del Lenguaje Natural
ISSN: 1135-5948
Date of Acceptance: 1 August 2025
Last Modified: 11 Dec 2025 12:01
URI: https://orca.cardiff.ac.uk/id/eprint/183135
