koRpus

An R Package for Text Analysis

A set of tools to analyze texts. Includes, amongst others, functions for automatic language detection, hyphenation, several indices of lexical diversity (e.g., type token ratio, HD-D/vocd-D, MTLD) and readability (e.g., Flesch, SMOG, LIX, Dale-Chall). Basic import functions for language corpora are also provided, to enable frequency analyses (supports Celex and Leipzig Corpora Collection file formats) and measures like tf-idf. Support for additional languages can be added on-the-fly or by plugin packages. Note: For full functionality a local installation of TreeTagger is recommended. 'koRpus' also includes a plugin for the R GUI and IDE RKWard, providing graphical dialogs for its basic features. The respective R package 'rkward' cannot be installed directly from a repository, as it is a part of RKWard. To make full use of this feature, please install RKWard from <https://rkward.kde.org> (plugins are detected automatically). Due to some restrictions on CRAN, the full package sources are only available from the project homepage. To ask for help, report bugs, request features, or discuss the development of the package, please subscribe to the koRpus-dev mailing list (<http://korpusml.reaktanz.de>).

Total downloads: 85,134
Downloads last month: 2,678
Downloads last week: 585
Average downloads per day: 89

Description file content

Package
koRpus
Type
Package
Title
An R Package for Text Analysis
Depends
R (>= 2.10.0),methods,data.table
Enhances
rkward
Suggests
testthat,tm,SnowballC,shiny
Description
A set of tools to analyze texts. Includes, amongst others, functions for automatic language detection, hyphenation, several indices of lexical diversity (e.g., type token ratio, HD-D/vocd-D, MTLD) and readability (e.g., Flesch, SMOG, LIX, Dale-Chall). Basic import functions for language corpora are also provided, to enable frequency analyses (supports Celex and Leipzig Corpora Collection file formats) and measures like tf-idf. Support for additional languages can be added on-the-fly or by plugin packages. Note: For full functionality a local installation of TreeTagger is recommended. 'koRpus' also includes a plugin for the R GUI and IDE RKWard, providing graphical dialogs for its basic features. The respective R package 'rkward' cannot be installed directly from a repository, as it is a part of RKWard. To make full use of this feature, please install RKWard from <https://rkward.kde.org> (plugins are detected automatically). Due to some restrictions on CRAN, the full package sources are only available from the project homepage. To ask for help, report bugs, request features, or discuss the development of the package, please subscribe to the koRpus-dev mailing list (<http://korpusml.reaktanz.de>).
License
GPL (>= 3)
Encoding
UTF-8
LazyLoad
yes
URL
Version
0.10-2
Date
2017-04-04
Collate
'00_environment.R' '01_class_01_kRp.tagged.R' '01_class_02_kRp.TTR.R' '01_class_03_kRp.txt.freq.R' '01_class_04_kRp.txt.trans.R' '01_class_05_kRp.analysis.R' '01_class_06_kRp.corp.freq.R' '01_class_07_kRp.hyph.pat.R' '01_class_08_kRp.hyphen.R' '01_class_09_kRp.lang.R' '01_class_10_kRp.readability.R' '02_method_cTest.R' 'kRp.filter.wclass.R' 'koRpus-internal.R' '02_method_clozeDelete.R' '02_method_correct.R' '02_method_freq.analysis.R' '02_method_hyphen.R' '02_method_kRp.taggedText.R' '02_method_lex.div.R' '02_method_plot.kRp.tagged.R' '02_method_query.R' '02_method_read.corp.custom.R' '02_method_readability.R' '02_method_show.kRp.lang.R' '02_method_show.kRp.TTR.R' '02_method_show.kRp.corp.freq.R' '02_method_show.kRp.hyphen.R' '02_method_show.kRp.readability.R' '02_method_show.kRp.taggedText.R' '02_method_summary.kRp.lang.R' '02_method_summary.kRp.TTR.R' '02_method_summary.kRp.hyphen.R' '02_method_summary.kRp.readability.R' '02_method_summary.kRp.tagged.R' '02_method_summary.kRp.txt.freq.R' '02_method_types_tokens.R' 'get.kRp.env.R' 'guess.lang.R' 'hyph.XX-data.R' 'jumbleWords.R' 'kRp.POS.tags.R' 'kRp.cluster.R' 'kRp.text.analysis.R' 'kRp.text.paste.R' 'kRp.text.transform.R' 'koRpus-internal.freq.analysis.R' 'koRpus-internal.hyphen.R' 'koRpus-internal.import.R' 'koRpus-internal.lexdiv.formulae.R' 'koRpus-internal.rdb.formulae.R' 'koRpus-internal.rdb.params.grades.R' 'koRpus-internal.read.corp.custom.R' 'koRpus-internal.roxy.all.R' 'koRpus-package.R' 'set.lang.support.R' 'lang.support-de.R' 'lang.support-en.R' 'lang.support-es.R' 'lang.support-fr.R' 'lang.support-it.R' 'lang.support-ru.R' 'lex.div.num.R' 'manage.hyph.pat.R' 'read.BAWL.R' 'read.corp.LCC.R' 'read.corp.celex.R' 'read.hyph.pat.R' 'read.tagged.R' 'readability.num.R' 'segment.optimizer.R' 'set.kRp.env.R' 'textFeatures.R' 'tokenize.R' 'treetag.R' 'wrapper_functions_lex.div.R' 'wrapper_functions_readability.R'
RoxygenNote
6.0.1
NeedsCompilation
no
Packaged
2017-04-04 17:22:37 UTC; m
Author
m.eik michalke [aut, cre], Earl Brown [ctb], Alberto Mirisola [ctb], Alexandre Brulet [ctb], Laura Hauser [ctb]
Maintainer
m.eik michalke
Repository
CRAN
Date/Publication
2017-04-04 22:04:32 UTC

install.packages('koRpus')

https://reaktanz.de/?c=hacking&s=koRpus
