Grijalba, Jorge Osés, Ureña-López, Luis Alfonso, Cámara, Eugenio Martínez and Camacho-Collados, Jose ORCID: https://orcid.org/0000-0003-1618-7239
2025.
Overview of PRESTA at IberLEF 2025: question answering over tabular data in Spanish.
Procesamiento del Lenguaje Natural 75, pp. 475-486. 10.26342/2025-75-35
Abstract
We present the findings and results of the PRESTA track at IberLEF 2025, focused on question answering over tabular data in Spanish. The task challenges participants to build systems capable of interpreting natural language questions and retrieving accurate answers from semi-structured tabular sources in Spanish. In this paper, we describe the task design, dataset construction, evaluation methodology, and participant systems. We analyze a range of submitted approaches and discuss key trends observed across systems. Our results show that methods leveraging large language models (LLMs) clearly outperformed traditional pipelines, with larger multilingual models exhibiting very high accuracy. It is of note that the performance of small open-source models is up to par with the bigger proprietary ones when paired with good system designs. These findings confirm that the strong performance of LLMs in English carries over to Spanish in the context of tabular question answering, though some linguistic and domain-specific challenges remain.
| Item Type: | Article |
|---|---|
| Date Type: | Publication |
| Status: | Published |
| Schools: | Schools > Computer Science & Informatics |
| Publisher: | Sociedad Española para el Procesamiento del Lenguaje Natural |
| ISSN: | 1135-5948 |
| Date of Acceptance: | 1 August 2025 |
| Last Modified: | 11 Dec 2025 12:01 |
| URI: | https://orca.cardiff.ac.uk/id/eprint/183135 |