Tokenize and count

textTokenizeAndCount(data, n_remove_threshold = 3)

Arguments

data

(string) Language to tokenise and count.

n_remove_threshold

(numeric) Threshold deciding which words to remove

Value

A word-frequency data frame (can be saved to a model object or compared in textDomainCompare).

See also

Examples

if (FALSE) { # \dontrun{
textTokenizeAndCount(Language_based_assessment_data_8["harmonytexts"])
} # }