Khallaf, Nouran, Ezeani, Ignatius, Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502, Rayson, Paul, El-Haj, Mo, Vidler, John, Davies, James and Alva-Manchego, Fernando
2025.
FreeTxt: Analyse and visualise multilingual qualitative survey data for cultural heritage sites.
Presented at: Recent Advances in Natural Language Processing (RANLP) 2025,
Varna, Bulgaria,
8 -10 September 2025.
Proceedings of the 15th International Conference on Recent Advances in Natural Language Processing - Natural Language Processing in the Generative AI Era.
pp. 541-545.
|
Preview |
PDF
- Published Version
Available under License Creative Commons Attribution. Download (162kB) | Preview |
Abstract
We introduce FreeTxt, a free and open-source web-based tool designed to support the analysis and visualisation of multilingual qualitative survey data, with a focus on low-resource languages. Developed in collaboration with stakeholders, FreeTxt integrates established techniques from corpus linguistics with modern natural language processing methods in an intuitive interface accessible to non-specialists. The tool currently supports bilingual processing and visualisation of English and Welsh responses, with ongoing extensions to other languages such as Vietnamese. Key functionalities include semantic tagging via PyMUSAS, multilingual sentiment analysis, keyword and collocation visualisation, and extractive summarisation. User evaluations with cultural heritage institutions demonstrate the system’s utility and potential for broader impact.
| Item Type: | Conference or Workshop Item - published (Paper) |
|---|---|
| Date Type: | Published Online |
| Status: | Published |
| Schools: | Schools > English, Communication and Philosophy Schools > Computer Science & Informatics |
| Date of First Compliant Deposit: | 10 September 2025 |
| Last Modified: | 06 Feb 2026 15:11 |
| URI: | https://orca.cardiff.ac.uk/id/eprint/180930 |
Actions (repository staff only)
![]() |
Edit Item |





Download Statistics
Download Statistics