udpipe

Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing with the 'UDPipe' 'NLP' Toolkit

This natural language processing toolkit provides language-agnostic 'tokenization', 'parts of speech tagging', 'lemmatization' and 'dependency parsing' of raw text. Next to text parsing, the package also allows you to train annotation models based on data of 'treebanks' in 'CoNLL-U' format as provided at <http://universaldependencies.org/format.html>. The techniques are explained in detail in the paper: 'Tokenizing, POS Tagging, Lemmatizing and Parsing UD 2.0 with UDPipe', available at <doi:10.18653/v1/K17-3009>.

Downloads

Total: 2,632
Last month: 677
Last week: 141
Average per day: 23

Description file content

Package: udpipe
Type: Package
Title: Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing with the 'UDPipe' 'NLP' Toolkit
Version: 0.4
Maintainer: Jan Wijffels
Description: This natural language processing toolkit provides language-agnostic 'tokenization', 'parts of speech tagging', 'lemmatization' and 'dependency parsing' of raw text. Next to text parsing, the package also allows you to train annotation models based on data of 'treebanks' in 'CoNLL-U' format as provided at <http://universaldependencies.org/format.html>. The techniques are explained in detail in the paper: 'Tokenizing, POS Tagging, Lemmatizing and Parsing UD 2.0 with UDPipe', available at <doi:10.18653/v1/K17-3009>.
License: MPL-2.0
URL: https://bnosac.github.io/udpipe/en/index.html
Encoding: UTF-8
Depends: R (>= 2.10)
Imports: Rcpp (>= 0.11.5), data.table (>= 1.9.6), Matrix, methods
LinkingTo: Rcpp
Suggests: knitr, topicmodels, lattice
SystemRequirements: C++11
RoxygenNote: 6.0.1
VignetteBuilder: knitr
NeedsCompilation: yes
Packaged: 2018-02-06 17:43:30 UTC; Jan
Author: Jan Wijffels [aut, cre, cph], BNOSAC [cph], Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University in Prague, Czech Republic [cph], Milan Straka [cph], Jana Straková [cph]
Repository: CRAN
Date/Publication: 2018-02-07 13:28:42 UTC

install.packages('udpipe')

https://bnosac.github.io/udpipe/en/index.html
