Installation |
|
---|---|
Install text required python packages in conda or virtualenv environment |
|
Uninstall textrpp conda environment |
|
Initialize text required python packages |
|
Transform text to word embeddings |
|
Embed text |
|
Change dimension names |
|
Extract layers of hidden states |
|
Aggregate layers |
|
Pre-trained dimension reduction (experimental) |
|
Apply static word embeddings |
|
Fine-tuning models |
|
Task Adapted Pre-Training (EXPERIMENTAL - under development) |
|
Domain Adapted Pre-Training (EXPERIMENTAL - under development) |
|
Text language analysis tasks |
|
Text generation |
|
Named Entity Recognition. (experimental) |
|
Summarize texts. (experimental) |
|
Question Answering. (experimental) |
|
Translation. (experimental) |
|
Zero Shot Classification (Experimental) |
|
The text-train functions |
|
Trains word embeddings |
|
Train lists of word embeddings |
|
Train word embeddings to a numeric variable. |
|
Trains word embeddings usig random forest |
|
Cross-validated accuracies across sample-sizes |
|
Plot cross-validated accuracies across sample sizes |
|
The text-predict functions |
|
textPredict, textAssess and textClassify |
|
Significance testing correlations If only y1 is provided a t-test is computed, between the absolute error from yhat1-y1 and yhat2-y1. |
|
Predict from several models, selecting the correct input |
|
The LBAM library |
|
Semantic similarities and distances functions |
|
Semantic Similarity |
|
Semantic distance |
|
Semantic similarity across multiple word embeddings |
|
Semantic distance across multiple word embeddings |
|
Semantic similarity between a text variable and a word norm |
|
Semantic distance between a text variable and a word norm |
|
Visualise words in the word embedding space |
|
Supervised Dimension Projection |
|
Plot words |
|
Plot Supervised Dimension Projection |
|
Semantic similarity score between single words' and an aggregated word embeddings |
|
Plots words from textCentrality() |
|
textPCA() |
|
textPCAPlot |
|
BERTopics |
|
BERTopics |
|
This function tests the relationship between a single topic or all topics and a variable of interest. Available tests include correlation, t-test, linear regression, binary regression, and ridge regression. (EXPERIMENTAL - under development) |
|
Plots wordcloud (experimental) |
|
textTopicsReduce (EXPERIMENTAL) |
|
textTopicsTest (EXPERIMENTAL) to get the hierarchical topic tree |
|
View or delete downloaded HuggingFace models |
|
Check downloaded, available models. |
|
Number of layers |
|
Delete a specified model |
|
Miscellaneous |
|
Compute descriptive statistics of character variables. |
|
Tokenize text-variables |
|
Tokenize and count |
|
Compare two language domains |
|
Detect non-ASCII characters |
|
Clean non-ASCII characters |
|
Example Data |
|
Text and numeric data for 10 participants. |
|
Word embeddings for 4 text variables for 40 participants |
|
Word embeddings from textEmbedRawLayers function |
|
Example text and numeric data. |
|
Data for plotting a Dot Product Projection Plot. |
|
Example data for plotting a Semantic Centrality Plot. |
|
Example data for plotting a Principle Component Projection Plot. |