quanteda

Quantitative Analysis of Textual Data

A fast, flexible, and comprehensive framework for quantitative text analysis in R. Provides functionality for corpus management, creating and manipulating tokens and ngrams, exploring keywords in context, forming and manipulating sparse matrices of documents by features and feature co-occurrences, analyzing keywords, computing feature similarities and distances, applying content dictionaries, applying supervised and unsupervised machine learning, visually representing text and text analyses, and more.

Total

211,923

Last month

16,935

Last week

3,807

Average per day

565

Daily downloads

Total downloads

Description file content

Package
quanteda
Version
1.3.14
Title
Quantitative Analysis of Textual Data
Description
A fast, flexible, and comprehensive framework for quantitative text analysis in R. Provides functionality for corpus management, creating and manipulating tokens and ngrams, exploring keywords in context, forming and manipulating sparse matrices of documents by features and feature co-occurrences, analyzing keywords, computing feature similarities and distances, applying content dictionaries, applying supervised and unsupervised machine learning, visually representing text and text analyses, and more.
License
GPL-3
Depends
R (>= 3.1.0), methods
Imports
data.table (>= 1.9.6), extrafont, fastmatch, ggplot2 (>= 2.2.0), ggrepel, lubridate, magrittr, Matrix (>= 1.2), network, RSpectra, Rcpp (>= 0.12.12), RcppParallel, sna, SnowballC, spacyr, stopwords, stringi, xml2, yaml
LinkingTo
Rcpp, RcppParallel, RcppArmadillo (>= 0.7.600.1.0)
Suggests
spelling, ca, dplyr, DT, e1071, ExPosition, lda, lsa, proxy, purrr, RColorBrewer, rmarkdown, slam, stm, svs, testthat, text2vec, tibble, tidytext, tm (>= 0.6), topicmodels, xtable, knitr, igraph, wordcloud
URL
Encoding
UTF-8
BugReports
https://github.com/quanteda/quanteda/issues
LazyData
TRUE
VignetteBuilder
knitr
Language
en-UK
Collate
'RcppExports.R' 'View.R' 'bootstrap_dfm.R' 'casechange-functions.R' 'directionchange-functions.R' 'character-methods.R' 'convert.R' 'corpus-methods-base.R' 'corpus-methods-quanteda.R' 'corpus-methods-tm.R' 'corpus.R' 'corpus_reshape.R' 'corpus_sample.R' 'corpus_segment.R' 'corpus_subset.R' 'corpus_trim.R' 'corpuszip.R' 'data-deprecated.R' 'data-documentation.R' 'defunct-functions.R' 'dfm-classes.R' 'dfm-methods.R' 'dfm-print.R' 'dfm-subsetting.R' 'dfm.R' 'dfm_compress.R' 'dfm_group.R' 'dfm_lookup.R' 'dfm_replace.R' 'dfm_sample.R' 'dfm_select.R' 'dfm_sort.R' 'dfm_subset.R' 'dfm_trim.R' 'dfm_weight.R' 'dictionaries.R' 'docnames.R' 'docvars.R' 'fcm-classes.R' 'fcm-methods.R' 'fcm-subsetting.R' 'fcm.R' 'kwic.R' 'nfunctions.R' 'nscrabble.R' 'nsyllable.R' 'pattern2fixed.R' 'phrases.R' 'quanteda-documentation.R' 'quanteda_options.R' 'readtext-methods.R' 'settings.R' 'spacyr-methods.R' 'stopwords.R' 'textmodel-methods.R' 'textmodel_affinity.R' 'textmodel_ca.R' 'textmodel_lsa.R' 'textmodel_nb.R' 'textmodel_wordfish.R' 'textmodel_wordscores.R' 'textplot_influence.R' 'textplot_keyness.R' 'textplot_network.R' 'textplot_scale1d.R' 'textplot_wordcloud.R' 'textplot_xray.R' 'textstat-methods.R' 'textstat_collocations.R' 'textstat_dist_old.R' 'textstat_frequency.R' 'textstat_keyness.R' 'textstat_lexdiv.R' 'textstat_readability.R' 'textstat_simil.R' 'textstat_simil_old.R' 'tokens.R' 'tokens_compound.R' 'tokens_group.R' 'tokens_lookup.R' 'tokens_ngrams.R' 'tokens_replace.R' 'tokens_segment.R' 'tokens_select.R' 'tokens_subset.R' 'tokens_sample.R' 'utils.R' 'wordstem.R' 'zzz.R'
RoxygenNote
6.1.1
SystemRequirements
C++11
NeedsCompilation
yes
Packaged
2018-11-19 18:22:44 UTC; kbenoit
Author
Kenneth Benoit [cre, aut, cph] (), Kohei Watanabe [aut] (), Haiyan Wang [aut] (), Paul Nulty [aut] (), Adam Obeng [aut] (), Stefan Müller [aut] (), Akitaka Matsuo [aut] (), Patrick O. Perry [aut] (), Jouni Kuha [aut] (), Benjamin Lauderdale [aut] (), William Lowe [aut] (), Christian Müller [ctb], Lori Young [dtc] (Lexicoder Sentiment Dictionary 2015), Stuart Soroka [dtc] (Lexicoder Sentiment Dictionary 2015), Ian Fellows [cph] (authored wordcloud C source code (modified)), European Research Council [fnd] (ERC-2011-StG 283794-QUANTESS)
Maintainer
Kenneth Benoit
Repository
CRAN
Date/Publication
2018-11-19 19:50:03 UTC

install.packages('quanteda')

1.3.14

20 days ago

https://quanteda.io

Kenneth Benoit

GPL-3

Depends on

R (>= 3.1.0), methods

Imports

data.table (>= 1.9.6), extrafont, fastmatch, ggplot2 (>= 2.2.0), ggrepel, lubridate, magrittr, Matrix (>= 1.2), network, RSpectra, Rcpp (>= 0.12.12), RcppParallel, sna, SnowballC, spacyr, stopwords, stringi, xml2, yaml

Suggests

spelling, ca, dplyr, DT, e1071, ExPosition, lda, lsa, proxy, purrr, RColorBrewer, rmarkdown, slam, stm, svs, testthat, text2vec, tibble, tidytext, tm (>= 0.6), topicmodels, xtable, knitr, igraph, wordcloud

Discussions