quanteda

Quantitative Analysis of Textual Data

A fast, flexible, and comprehensive framework for quantitative text analysis in R. Provides functionality for corpus management, creating and manipulating tokens and ngrams, exploring keywords in context, forming and manipulating sparse matrices of documents by features and feature co-occurrences, analyzing keywords, computing feature similarities and distances, applying content dictionaries, applying supervised and unsupervised machine learning, visually representing text and text analyses, and more.

Total

122,450

Last month

10,918

Last week

4,097

Average per day

364

Daily downloads

Total downloads

Description file content

Package
quanteda
Version
1.2.0
Title
Quantitative Analysis of Textual Data
Description
A fast, flexible, and comprehensive framework for quantitative text analysis in R. Provides functionality for corpus management, creating and manipulating tokens and ngrams, exploring keywords in context, forming and manipulating sparse matrices of documents by features and feature co-occurrences, analyzing keywords, computing feature similarities and distances, applying content dictionaries, applying supervised and unsupervised machine learning, visually representing text and text analyses, and more.
License
GPL-3
Depends
R (>= 3.1.0), methods
Imports
extrafont, Matrix (>= 1.2), data.table (>= 1.9.6), SnowballC, sna, ggrepel, network, Rcpp (>= 0.12.12), RcppParallel, RSpectra, stringi, fastmatch, ggplot2 (>= 2.2.0), XML, yaml, lubridate, magrittr, spacyr, stopwords
LinkingTo
Rcpp, RcppParallel, RcppArmadillo (>= 0.7.600.1.0)
Suggests
knitr, rmarkdown, lda, proxy, topicmodels, tm (>= 0.6), slam, testthat, RColorBrewer, xtable, DT, ca, purrr
URL
Encoding
UTF-8
BugReports
https://github.com/quanteda/quanteda/issues
LazyData
TRUE
VignetteBuilder
knitr
Collate
'RcppExports.R' 'View.R' 'bootstrap_dfm.R' 'casechange-functions.R' 'character-methods.R' 'convert.R' 'corpus-methods-base.R' 'corpus-methods-quanteda.R' 'corpus-methods-tm.R' 'corpus.R' 'corpus_reshape.R' 'corpus_sample.R' 'corpus_segment.R' 'corpus_subset.R' 'corpus_trim.R' 'corpuszip.R' 'data-deprecated.R' 'data-documentation.R' 'defunct-functions.R' 'dfm-classes.R' 'dfm-methods.R' 'dfm-print.R' 'dfm-subsetting.R' 'dfm.R' 'dfm_compress.R' 'dfm_group.R' 'dfm_lookup.R' 'dfm_replace.R' 'dfm_sample.R' 'dfm_select.R' 'dfm_sort.R' 'dfm_subset.R' 'dfm_trim.R' 'dfm_weight.R' 'dictionaries.R' 'docnames.R' 'docvars.R' 'fcm-classes.R' 'fcm-methods.R' 'fcm-subsetting.R' 'fcm.R' 'kwic.R' 'nfunctions.R' 'nscrabble.R' 'nsyllable.R' 'pattern2fixed.R' 'phrases.R' 'quanteda-documentation.R' 'quanteda_options.R' 'readtext-methods.R' 'settings.R' 'spacyr-methods.R' 'stopwords.R' 'textmodel-methods.R' 'textmodel_affinity.R' 'textmodel_ca.R' 'textmodel_lsa.R' 'textmodel_nb.R' 'textmodel_wordfish.R' 'textmodel_wordscores.R' 'textplot_influence.R' 'textplot_keyness.R' 'textplot_network.R' 'textplot_scale1d.R' 'textplot_wordcloud.R' 'textplot_xray.R' 'textstat-methods.R' 'textstat_collocations.R' 'textstat_dist.R' 'textstat_frequency.R' 'textstat_keyness.R' 'textstat_lexdiv.R' 'textstat_readability.R' 'textstat_simil.R' 'tokens.R' 'tokens_compound.R' 'tokens_group.R' 'tokens_lookup.R' 'tokens_ngrams.R' 'tokens_replace.R' 'tokens_segment.R' 'tokens_select.R' 'tokens_subset.R' 'utils.R' 'wordstem.R' 'zzz.R'
RoxygenNote
6.0.1
SystemRequirements
C++11
NeedsCompilation
yes
Packaged
2018-04-15 17:51:45 UTC; kbenoit
Author
Kenneth Benoit [aut, cre, cph], Kohei Watanabe [ctb], Paul Nulty [ctb], Adam Obeng [ctb], Haiyan Wang [ctb], Stefan Müller [ctb], Benjamin Lauderdale [ctb], Will Lowe [ctb]
Maintainer
Kenneth Benoit
Repository
CRAN
Date/Publication
2018-04-15 19:13:13 UTC

install.packages('quanteda')

1.2.0

4 days ago

http://quanteda.io

Kenneth Benoit

GPL-3

Depends on

R (>= 3.1.0), methods

Imports

extrafont, Matrix (>= 1.2), data.table (>= 1.9.6), SnowballC, sna, ggrepel, network, Rcpp (>= 0.12.12), RcppParallel, RSpectra, stringi, fastmatch, ggplot2 (>= 2.2.0), XML, yaml, lubridate, magrittr, spacyr, stopwords

Suggests

knitr, rmarkdown, lda, proxy, topicmodels, tm (>= 0.6), slam, testthat, RColorBrewer, xtable, DT, ca, purrr

Discussions