Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Browse by Current Cardiff authors

Number of items: 51.

McClaughlin, Emma, Vilar-Lluch, Sara, Parnell, Tamsin, Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502, Nichele, Elena, Adolphs, Svenja, Clos, Jeremie and Schiazza, Giovanni 2022. The reception of public health messages during the COVID-19 pandemic. Applied Corpus Linguistics 3 (1) , 100037. 10.1016/j.acorp.2022.100037
Item availability restricted.
filefile

Ezeani, Ignatius, El-Haj, Mahmoud, Morris, Jonathan ORCID: https://orcid.org/0000-0003-3463-5277 and Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 2022. Introducing the Welsh text summarisation dataset and baseline systems. Presented at: 13th ELRA Language Resources and Evaluation Conference (LREC 2022), Marseille, France, 20-25 June 2022.
file

El-Haj, Mahmoud, Ezeani, Ignatius, Morris, Jonathan ORCID: https://orcid.org/0000-0003-3463-5277 and Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 2022. Creation of an evaluation corpus and baseline evaluation scores for Welsh text summarisation. Presented at: 4th Celtic Language Technology Workshop (CLTW 2022), Marseille, France, 20 June 2022.
file

Clos, Jeremie, McClaughlin, Emma, Barnard, Pepita, Nichele, Elena, Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502, McAuley, Derek and Adolphs, Svenja 2022. PriPA: a tool for privacy-preserving analytics of linguistic data. Presented at: Legal and Ethical Issues in Human Language Technologies 2022, Marseille, France, 24 June 2022.
file

Morris, Jonathan ORCID: https://orcid.org/0000-0003-3463-5277, Ezeani, Ignatius, Gruffydd, Ianto, Young, Katharine, Davies, Lynne, El-Haj, Mahmoud and Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 2022. Welsh automatic text summarisation. Presented at: Wales Academic Symposium on Language Technologies 2022, Bangor, Wales, 28/01/2022. Language and Technology in Wales. Bangor: Banolfan Bedwyr,
file

McClaughlin, Emma, Nichele, Elena, Adolphs, Svenja, Barnard, Pepita, Clos, Jeremie, Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502, McAuley, Derek, Aydt, Miriam, Tom, Tino and Lang, Alexandra 2021. Privacy preserving corpus linguistics: investigating the trajectories of public health messaging online. University of Nottingham.

Muralidaran, Vignesh, Spasic, Irena ORCID: https://orcid.org/0000-0002-8132-3885 and Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 2021. A systematic review of unsupervised approaches to grammar induction. Natural Language Engineering 27 (6) , pp. 647-689. 10.1017/S1351324920000327

Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502, Morris, Steve, Arman, Laura ORCID: https://orcid.org/0000-0002-2517-845X, Needs, Jennifer and Rees, Mair 2021. Building a national corpus: a Welsh language case study. Basingstoke: Palgrave Macmillan.

Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502, Loizides, Fernando ORCID: https://orcid.org/0000-0003-0531-6760, Neale, Steven, Anthony, Laurence and Spasic, Irena ORCID: https://orcid.org/0000-0002-8132-3885 2021. Developing computational infrastructure for the CorCenCC corpus - the National Corpus of Contemporary Welsh. Language Resources and Evaluation 55 , pp. 789-816. 10.1007/s10579-020-09501-9
file

McClaughlin, Emma, Nichele, Elena, Adolphs, Svenja, Barnard, Pepita, Clos, Jeremie, Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502, McAuley, Derek and Lang, Alexandra 2021. Public health messaging by political leaders: a corpus linguistic analysis of COVID-19 speeches delivered by Boris Johnson. University of Nottingham. Available at: https://doi.org/10.17639/3fgb-fn44
file

Corcoran, Padraig ORCID: https://orcid.org/0000-0001-9731-3385, Palmer, Geraint ORCID: https://orcid.org/0000-0001-7865-6964, Arman, Laura ORCID: https://orcid.org/0000-0002-2517-845X, Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 and Spasic, Irena ORCID: https://orcid.org/0000-0002-8132-3885 2021. Creating Welsh language word embeddings. Applied Sciences 11 (15) , 6896. 10.3390/app11156896
file

Espinosa-Anke, Luis ORCID: https://orcid.org/0000-0001-6830-9176, Palmer, Geraint ORCID: https://orcid.org/0000-0001-7865-6964, Filimonov, Maxim, Corcoran, Padraig ORCID: https://orcid.org/0000-0001-9731-3385, Spasic, Irena ORCID: https://orcid.org/0000-0002-8132-3885 and Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 2021. English–Welsh cross-lingual embeddings. Applied Sciences 11 (14) , 6541. 10.3390/app11146541
file

Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502, Morris, Steve and Fitzpatrick, Tess 2021. Corpus design and construction in minoritised language contexts - Cynllunio a chreu corpws mewn cyd-destunau Ieithoedd lleiafrifoledig: The National Corpus of Contemporary Welsh - Corpws Cenedlaethol Cymraeg Cyfoes. Basingstoke: Palgrave Macmillan.

McClaughlin, Emma, Nichele, Elena, Adolphs, Svenja, Barnard, Pepita, Clos, Jeremy, Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502, MacAuley, Derek and Lang, Alexandra 2021. Using online news comments to gather fast feedback on issues with public health messaging: The Guardian as a case study. [Project Report]. University of Nottingham. Available at: https://nottingham-repository.worktribe.com/output...
file

Palmer, Geraint ORCID: https://orcid.org/0000-0001-7865-6964, Corcoran, Padraig ORCID: https://orcid.org/0000-0001-9731-3385, Arman, Laura ORCID: https://orcid.org/0000-0002-2517-845X, Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 and Spasic, Irena ORCID: https://orcid.org/0000-0002-8132-3885 2021. A closer look at Welsh word embeddings. Prys, Delyth, ed. Language and Technology in Wales: Volume 1, Bangor: Bangor University, pp. 21-29.
file

Muralidaran, Vigneshwaran, Palmer, Geraint ORCID: https://orcid.org/0000-0001-7865-6964, Arman, Laura ORCID: https://orcid.org/0000-0002-2517-845X, O'Hare, Keziah, Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 and Spasic, Irena ORCID: https://orcid.org/0000-0002-8132-3885 2021. A practical implementation of a porter stemmer for Welsh. Prys, Delyth, ed. Language and Technology in Wales: Volume 1, Bangor: Bangor University, pp. 30-43.
file

Chen, Yaoyao, Adolphs, Svenja and Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 2020. Multimodal discourse analysis. Friginal, Eric and Hardy, Jack, eds. The Routledge Handbook of Corpus Approaches to Discourse Analysis, London: Routledge,

Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 and Adolphs, Svenja 2020. Multimodal corpora. Paquot, Magali and Gries, Stefan Th, eds. A Practical Handbook of Corpus Linguistics, Springer International Publishing, pp. 351-369.

Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502, Morris, Steve, Fitzpatrick, Tess, Rayson, Paul, Spasić, Irena and Môn Thomas, Enlli 2020. The national corpus of contemporary Welsh: project report | Y corpws cenedlaethol Cymraeg cyfoes: adroddiad y prosiect. [Project Report]. CorCenCC.
file

Muralidaran, Vigneshwaran, Spasic, Irena ORCID: https://orcid.org/0000-0002-8132-3885 and Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 2020. A cognitive approach to parsing with neural networks. Presented at: International Conference on Statistical Language and Speech Processing (SLSP), Cardiff, UK, 14–16 Oct 2020. Statistical Language and Speech Processing. Lecture Notes in Computer Science. Springer Verlag, pp. 71-84. 10.1007/978-3-030-59430-5_6

Adolphs, Svenja, Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502, Smith, Catherine and Price, Dominic 2020. Crowdsourcing formulaic phrases: towards a new type of spoken corpus. Corpora 15 (2) , pp. 141-168. 10.3366/COR.2020.0192
file

Adolphs, Svenja and Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502, eds. 2020. The Routledge handbook of English language and digital humanities. Routledge Handbooks in English Language Studies, Abingdon: Routledge.

Ezeani, I, Piao, S, Neale, Steven, Rayson, P and Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 2019. Leveraging pre-trained embeddings for Welsh Taggers. Presented at: 4th Workshop on Representation Learning for NLP, Florence, Italy, July 2019. ACL Anthology: Proceedings of the 4th Workshop on Representation Learning for NLP. Association for Computational Linguistics, -. 10.18653/v1/W19-4332

Spasic, Irena ORCID: https://orcid.org/0000-0002-8132-3885, Owen, David ORCID: https://orcid.org/0000-0002-4028-0591, Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 and Artemiou, Andreas ORCID: https://orcid.org/0000-0002-7501-4090 2019. Unsupervised multi-word term recognition in Welsh. Presented at: Celtic Language Technology Workshop 2019, Dublin, Ireland, 19 August 2019. Published in: Lynn, Teresa, Prys, Delyth, Batchelor, Colin and Tyers, Francis eds. Proceedings of the Celtic Language Technology Workshop. European Association for Machine Translation,
file

Piao, S., Rayson, P., Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 and Watkins, Gareth 2018. Towards a Welsh semantic annotation system. Presented at: LREC (Language Resources Evaluation) 2018 Conference, Miyazaki, Japan., 7 - 12 May 2018.

Neale, S., Donnelly, K., Watkins, Gareth and Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 2018. Leveraging lexical resources and constraint grammar for rule-based part-of-speech tagging in Welsh. Presented at: LREC (Language Resources Evaluation) 2018 Conference, Miyazaki, Japan, 7 - 12 May 2018.

Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502, Walsh, Steve and Papagiannidis, Savvas 2017. I’m having a spring clear out: a corpus-based analysis of e-transactional discourse. Applied Linguistics 38 (2) , pp. 234-257. 10.1093/applin/amv019
file

Neale, Steven, Spasic, Irena ORCID: https://orcid.org/0000-0002-8132-3885, Needs, Jennifer, Watkins, Gareth, Morris, Steve, Fitzpatrick, Teresa, Marshall, L and Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 2017. The CorCenCC crowdsourcing app: a bespoke tool for the user-driven creation of the national corpus of contemporary Welsh. Presented at: The 9th International Corpus Linguistics Conference, Birmingham, UK, 24-28 July 2017.

Walsh, Steve and Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 2016. Analysing spoken discourse in University small group teaching. Corrigan, Karen P. and Mearns, Adam, eds. Creating and Digitizing Language Corpora: Volume 3: Databases for Public Engagement, Vol. 3. Basingstoke: Palgrave Macmillan, pp. 291-319.

Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502, Piao, Scott, Rayson, Paul, Archer, Dawn, Bianchi, Francesca, Dayrell, Carmen, El-Haj, Mahmoud, Jiminez, Ricardo-Maria, Kren, Michal, Lofberg, Laura, Nawab, Rao Muhammad Adeel, Shafi, Jawad, Teh, Phoey Lee and Mudraya, Olga 2016. Lexical coverage evaluation of large-scale multilingual semantic lexicons for twelve languages. Presented at: LREC 2016, Tenth International Conference on Language Resources and Evaluation. European Language Resources Association (ELRA), Portoro, Slovenia, 23-28 May 2016.
file

Seedhouse, Paul and Dawn, Knight ORCID: https://orcid.org/0000-0002-4745-6502 2016. Applying digital sensor technology: A problem-solving approach. Applied Linguistics 37 (1) , pp. 7-32. 10.1093/applin/amv065
file

Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 2015. e-Language: communication in the digital age. Baker, Paul and McEnery, Tony, eds. Corpora and Discourse Studies: Integrating Discourse and Corpora, Palgrave Advances in Language and Linguistics, Basingstoke: Palgrave Macmillan, London, pp. 20-40. (10.1057/9781137431738_2)
file

Crabtree, Andy, Tennent, Paul, Brundell, Pat and Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 2015. Digital records and the digital replay system. Halfpenny, Peter J. and Proctor, Rob, eds. Innovations in Digital Research Methods, London: Sage,

Dörk, Marian and Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 2015. WordWanderer: A navigational approach to text visualisation. Corpora 10 (1) , pp. 83-94. 10.3366/cor.2015.0067
file

Adolphs, Svenja and Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 2015. Beyond monomodal spoken corpora. Baker, Paul and McEnery, Tony, eds. Corpora and Discourse Studies: Integrating Discourse and Corpora, Palgrave Advances in Language and Linguistics, Houndsmill, Basingstoke: Palgrave Macmillan, pp. 41-62.
file

Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502, Adolphs, Svenja and Ronald, Carter 2014. CANELC – constructing an e-language corpus. Corpora 9 (1) , pp. 29-56. 10.3366/cor.2014.0050
file

Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502, Adolphs, Svenja and Carter, Ronald 2013. Formality in digital discourse: a study of hedging in CANELC. Romero-Trillo, Jesus, ed. Yearbook of corpus linguistics and pragmatics 2013: new domains and methodologies, Yearbook of corpus linguistics and pragmatics, vol. 1. Springer Netherlands, pp. 131-152. (10.1007/978-94-007-6250-3_7)

Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 2013. Corpus linguistics: methods, theory and practice by Tony McEnery and Andrew Hardie [Book Review]. Romero-Trillo, Jesus, ed. Yearbook of corpus linguistics and pragmatics 2013: new domains and methodologies, Yearbook of corpus linguistics and pragmatics, vol. 1. Springer Netherlands, pp. 275-277. (10.1007/978-94-007-6250-3_13)

Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 2011. Multimodality and active listenership: a corpus approach. Corpus and discourse, London: Bloomsbury.

Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 2011. The future of multimodal corpora. Revista Brasileira de Linguística Aplicada 11 (2) , pp. 391-415. 10.1590/S1984-63982011000200006

Adolphs, Svenja, Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 and Carter, Ronald 2011. Capturing context for heterogeneous corpus analysis: some first steps. International journal of corpus linguistics 16 (3) , pp. 305-324. 10.1075/ijcl.16.3.02ado

Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502, Tennent, P., Adolphs, S. and Carter, R. 2010. Developing heterogeneous corpora using the Digital Replay System (DRS). Presented at: Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality, Malta, 18 May 2010. Published in: Kipp, M., Martin, J. - C., Paggio, P. and Heylen, D. eds. Proceedings of the LREC 2010 (Language Resources Evaluation Conference) Workshop on Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality, May 2010, Malta. European Language Resources Association, pp. 16-21.

Adolphs, Svenja and Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 2010. Building a spoken corpus: What are the basics? O’Keeffe, Anne and McCarthy, Michael, eds. The Routledge handbook of corpus linguistics, Routledge handbooks in applied linguistics, Oxford: Routledge,

Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502, Evans, David, Carter, Ronald and Adolphs, Svenja 2009. HeadTalk, HandTalk and the corpus: towards a framework for multi-modal, multi-media corpus development. Corpora 4 (1) , pp. 1-32. 10.3366/E1749503209000203

Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 2009. A multi-modal corpus approach to the analysis of backchanneling behaviour. PhD Thesis, University of Nottingham.

Brundell, P., Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502, Tennent, P., Naeem, A., Adolphs, S., Ainsworth, S., Carter, R., Crabtree, A., Greenhalgh, C., O'Malley, C., Pridmore, T. and Rodden, T. 2008. The experience of using Digital Replay System for social science research. Presented at: 4th International Conference on e-Social Science (ICeSS), Manchester, UK, 18-20 June 2008. Proceedings of the 4th International Conference on e-Social Science (ICeSS), Manchester, 18-20 June 2008. ICeSS, pp. 1-10.
file

Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 and Tennent, P. 2008. Introducing DRS (The Digital Replay System): A tool for the future of corpus linguistic research and analysis. Presented at: Sixth International Conference on Language Resources and Evaluation (LREC'08, Marrakesh, Morocco, 26 May -1 June 2008. Published in: Calzolari, Nicoletta, Choukri, Khalid, Maegaard, Beate, Mariani, Joseph, Odijk, Jan, Piperidis, Stelios and Tapias, Daniel eds. Proceedings of the 6th Language Resources and Evaluation Conference (LREC), Palais des Congrés, Marrakech, Morocco, 28-30th May 2008. European Language Resources Association, pp. 26-31.

Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502, Adolphs, S., Tennent, P. and Carter, R. 2008. The Nottingham Multi-Modal Corpus: a demonstration. Presented at: 6th Language Resources and Evaluation Conference (LREC), Marrakesh, Morocco, 28-30 May 2008. Proceedings of the 6th Language Resources and Evaluation Conference (LREC), Palais des Congrés, Marrakech, Morocco, 28-30th May 2008. European Language Resources Association, pp. 1-7.
file

Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502 and Adolphs, Svenja 2008. Multi-modal corpus pragmatics: the case of active listenership. Romero-Trillo, Jesus, ed. Pragmatics and corpus linguistics: a mutualistic entente, Mouton series in pragmatics, vol. 2. Mouton de Gruyter, pp. 175-190.

Brundell, P., Tennent, P., Greenhalgh, C., Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502, Crabtree, A., O'Malley, C., Ainsworth, S., Clarke, D., Carter, R. and Adolphs, S. 2008. Digital Replay System (DRS): a tool for interaction analysis. Presented at: ICLS2008: International Perspectives in the Learning Sciences Cre8ing a learning world, Utrecht, The Netherlands, 23-28 June 2008.

Knight, Dawn ORCID: https://orcid.org/0000-0002-4745-6502, Bayoumi, S., Mills, S., Crabtree, A., Adolphs, S., Pridmore, T. and Carter, R. 2006. Beyond the text: construction and analysis of multi-modal linguistic corpora. Presented at: 2nd International Conference on e-Social Science, Manchester, UK, 28-30 June 2006. Proceedings of the 2nd International Conference on e-Social Science, Manchester, 28 - 30 June 2006. ICeSS, n/a.

This list was generated on Sat Dec 3 03:58:45 2022 GMT.