corpus

Text Corpus Analysis

Text corpus data analysis, with full support for international text (Unicode). Functions for reading data from newline-delimited 'JSON' files, for normalizing and tokenizing text, for searching for term occurrences, and for computing term occurrence frequencies, including n-grams.

Total

19,701

Last month

690

Last week

181

Average per day

23

Daily downloads

Total downloads

Description file content

Package
corpus
Version
0.10.0
Title
Text Corpus Analysis
Depends
R (>= 3.3),
Imports
stats, utf8 (>= 1.1.0)
Suggests
knitr, Matrix, testthat
Enhances
quanteda, tm
Description
Text corpus data analysis, with full support for international text (Unicode). Functions for reading data from newline-delimited 'JSON' files, for normalizing and tokenizing text, for searching for term occurrences, and for computing term occurrence frequencies, including n-grams.
License
Apache License (== 2.0) | file LICENSE
URL
BugReports
https://github.com/patperry/r-corpus/issues
LazyData
Yes
Encoding
UTF-8
VignetteBuilder
knitr
NeedsCompilation
yes
Packaged
2017-12-12 20:42:29 UTC; ptrck
Author
Patrick O. Perry [aut, cph, cre], Finn Årup Nielsen [cph, dtc] (AFINN Sentiment Lexicon), Martin Porter and Richard Boulton [ctb, cph, dtc] (Snowball Stemmer and Stopword Lists), The Regents of the University of California [ctb, cph] (Strtod Library Procedure), Carlo Strapparava and Alessandro Valitutti [cph, dtc] (WordNet-Affect Lexicon), Unicode, Inc. [cph, dtc] (Unicode Character Database)
Maintainer
Patrick O. Perry
Repository
CRAN
Date/Publication
2017-12-12 22:10:07 UTC

install.packages('corpus')

0.10.0

a month ago

http://corpustext.com

Patrick O. Perry

Apache License (== 2.0) | file LICENSE

Depends on

R (>= 3.3),

Imports

stats, utf8 (>= 1.1.0)

Suggests

knitr, Matrix, testthat

Discussions