Buerki, Andreas ORCID: https://orcid.org/0000-0003-2151-3246 2011. SubString. GitHub. |
Official URL: http://buerki.github.io/SubString/
Abstract
The SubString package is an open-source set of Unix Shell scripts used for substring reduction and frequency consolidation of word n-grams of different length. In the process, the frequencies of substrings are reduced by the frequencies of their superstrings and a consolidated list with n-grams of different lengths is produced without an inflation of the overall word count. The functions performed by SubString will primarily be of interest to linguists working on formulaic language, multi-word sequences and similar phraseological phenomena.
Item Type: | Other |
---|---|
Date Type: | Publication |
Status: | Published |
Schools: | English, Communication and Philosophy |
Subjects: | P Language and Literature > P Philology. Linguistics |
Publisher: | GitHub |
Last Modified: | 28 Oct 2022 10:23 |
URI: | https://orca.cardiff.ac.uk/id/eprint/78129 |
Actions (repository staff only)
Edit Item |