Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Extracting conceptual spaces from LLMs using prototype embeddings

Kumar, Nitesh ORCID: https://orcid.org/0000-0002-9301-3876, Chatterjee, Usashi and Schockaert, Steven ORCID: https://orcid.org/0000-0002-9256-2881 2025. Extracting conceptual spaces from LLMs using prototype embeddings. Presented at: Findings of EMNLP, Suzhou, China, 4-9 November 2025. Published in: Christodoulopoulos, Christos, Chakraborty, Tanmoy, Rose, Carolyn and Peng, Violet eds. Findings of the Association for Computational Linguistics: EMNLP 2025. Suzhou, China: Association for Computational Linguistics, pp. 9275-9298. 10.18653/v1/2025.findings-emnlp.493

[thumbnail of 2025.findings-emnlp.493.pdf]
Preview
PDF - Published Version
Available under License Creative Commons Attribution.

Download (3MB) | Preview

Abstract

Conceptual spaces represent entities and concepts using cognitively meaningful dimensions, typically referring to perceptual features. Such representations are widely used in cognitive science and have the potential to serve as a cornerstone for explainable AI. Unfortunately, they have proven notoriously difficult to learn, although recent LLMs appear to capture the required perceptual features to a remarkable extent. Nonetheless, practical methods for extracting the corresponding conceptual spaces are currently still lacking. While various methods exist for extracting embeddings from LLMs, extracting conceptual spaces also requires us to encode the underlying features. In this paper, we propose a strategy in which features (e.g. sweetness) are encoded by embedding the description of a corresponding prototype (e.g. a very sweet food). To improve this strategy, we fine-tune the LLM to align the prototype embeddings with the corresponding conceptual space dimensions. Our empirical analysis finds this approach to be highly effective.

Item Type: Conference or Workshop Item - published (Paper)
Date Type: Publication
Status: Published
Schools: Schools > Computer Science & Informatics
Publisher: Association for Computational Linguistics
ISBN: 979-8-89176-332-6
Date of First Compliant Deposit: 21 October 2025
Date of Acceptance: 20 August 2025
Last Modified: 30 Jan 2026 16:52
URI: https://orca.cardiff.ac.uk/id/eprint/181776

Actions (repository staff only)

Edit Item Edit Item

Downloads

Downloads per month over past year

View more statistics