udpipe

Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing with the 'UDPipe' 'NLP' Toolkit

This natural language processing toolkit provides language-agnostic 'tokenization', 'parts of speech tagging', 'lemmatization' and 'dependency parsing' of raw text. Next to text parsing, the package also allows you to train annotation models based on data of 'treebanks' in 'CoNLL-U' format as provided at <http://universaldependencies.org/format.html>. The techniques are explained in detail in the paper: 'Tokenizing, POS Tagging, Lemmatizing and Parsing UD 2.0 with UDPipe', available at <doi:10.18653/v1/K17-3009>.

Total

1,429

Last month

406

Last week

111

Average per day

14

Daily downloads

Total downloads

Description file content

Package
udpipe
Type
Package
Title
Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing with the 'UDPipe' 'NLP' Toolkit
Version
0.2.2
Maintainer
Jan Wijffels
Description
This natural language processing toolkit provides language-agnostic 'tokenization', 'parts of speech tagging', 'lemmatization' and 'dependency parsing' of raw text. Next to text parsing, the package also allows you to train annotation models based on data of 'treebanks' in 'CoNLL-U' format as provided at . The techniques are explained in detail in the paper: 'Tokenizing, POS Tagging, Lemmatizing and Parsing UD 2.0 with UDPipe', available at .
License
MPL-2.0
URL
Encoding
UTF-8
Depends
R (>= 2.10)
Imports
Rcpp (>= 0.11.5), data.table (>= 1.9.6), Matrix
LinkingTo
Rcpp
Suggests
knitr, topicmodels
SystemRequirements
C++11
RoxygenNote
6.0.1
VignetteBuilder
knitr
NeedsCompilation
yes
Packaged
2017-12-07 09:17:33 UTC; Jan
Author
Jan Wijffels [aut, cre, cph], BNOSAC [cph], Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University in Prague, Czech Republic [cph], Milan Straka [cph], Jana Straková [cph]
Repository
CRAN
Date/Publication
2017-12-07 13:08:12 UTC

install.packages('udpipe')

0.2.2

6 days ago

https://github.com/bnosac/udpipe

Jan Wijffels

MPL-2.0

Depends on

R (>= 2.10)

Imports

Rcpp (>= 0.11.5), data.table (>= 1.9.6), Matrix

Suggests

knitr, topicmodels

Discussions