textTinyR

Text Processing for Small or Big Data Files

It offers functions for splitting, parsing, tokenizing and creating a vocabulary for big text data files. Moreover, it includes functions for building a document-term matrix and extracting information from those (term-associations, most frequent terms). It also embodies functions for calculating token statistics (collocations, look-up tables, string dissimilarities) and functions to work with sparse matrices. Lastly, it includes functions for Word Vector Representations (i.e. 'GloVe', 'fasttext') and incorporates functions for the calculation of (pairwise) text document dissimilarities. The source code is based on 'C++11' and exported in R through the 'Rcpp', 'RcppArmadillo' and 'BH' packages.

Total

9,425

Last month

1,213

Last week

168

Average per day

40

Daily downloads

Total downloads

Description file content

Package
textTinyR
Type
Package
Title
Text Processing for Small or Big Data Files
Version
1.1.3
Date
2019-04-14
Author
Lampros Mouselimis
Maintainer
Lampros Mouselimis
BugReports
https://github.com/mlampros/textTinyR/issues
URL
Description
It offers functions for splitting, parsing, tokenizing and creating a vocabulary for big text data files. Moreover, it includes functions for building a document-term matrix and extracting information from those (term-associations, most frequent terms). It also embodies functions for calculating token statistics (collocations, look-up tables, string dissimilarities) and functions to work with sparse matrices. Lastly, it includes functions for Word Vector Representations (i.e. 'GloVe', 'fasttext') and incorporates functions for the calculation of (pairwise) text document dissimilarities. The source code is based on 'C++11' and exported in R through the 'Rcpp', 'RcppArmadillo' and 'BH' packages.
License
GPL-3
Copyright
inst/COPYRIGHTS
SystemRequirements
The package requires a C++11 compiler
Encoding
UTF-8
LazyData
TRUE
Depends
R(>= 3.2.3), Matrix
Imports
Rcpp (>= 0.12.10), R6, data.table, utils
LinkingTo
Rcpp, RcppArmadillo (>= 0.7.8), BH
Suggests
testthat, covr, knitr, rmarkdown
VignetteBuilder
knitr
RoxygenNote
6.1.0
NeedsCompilation
yes
Packaged
2019-04-14 10:24:03 UTC; lampros
Repository
CRAN
Date/Publication
2019-04-14 11:42:42 UTC

install.packages('textTinyR')

1.1.3

3 months ago

https://github.com/mlampros/textTinyR

Lampros Mouselimis

GPL-3

Depends on

R(>= 3.2.3), Matrix

Imports

Rcpp (>= 0.12.10), R6, data.table, utils

Suggests

testthat, covr, knitr, rmarkdown

Discussions