koRpus

Text Analysis with Emphasis on POS Tagging, Readability, and Lexical Diversity

About

A set of tools to analyze texts. Includes, among others, functions for automatic language detection, hyphenation, several indices of lexical diversity (e.g., type-token ratio, HD-D/vocd-D, MTLD) and readability (e.g., Flesch, SMOG, LIX, Dale-Chall). Basic import functions for language corpora are also provided to enable frequency analyses (the Celex and Leipzig Corpora Collection file formats are supported) and measures like tf-idf.

Note: For full functionality, a local installation of TreeTagger is recommended. It is also recommended not to load this package directly, but to load one of the available language support packages from the 'l10n' repository (undocumeantit.github.io/repos/l10n) instead. 'koRpus' also includes a plugin for the R GUI and IDE RKWard, providing graphical dialogs for its basic features. The respective R package 'rkward' cannot be installed directly from a repository, as it is a part of RKWard. To make full use of this feature, please install RKWard (plugins are detected automatically). Due to some restrictions on CRAN, the full package sources are only available from the project homepage. To ask for help, report bugs, request features, or discuss the development of the package, please subscribe to the koRpus-dev mailing list.
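Two of the indices named above can be illustrated with a short, language-neutral sketch. The Python code below is not koRpus code and does not reflect its API. It assumes a naive regular-expression tokenizer and the standard English Flesch Reading Ease constants, and the function names are hypothetical. koRpus itself relies on proper tokenization, POS tagging, and hyphenation, so its results will generally differ.

```python
import re


def type_token_ratio(text: str) -> float:
    """Number of distinct word forms (types) divided by total words (tokens)."""
    # Assumption: a naive tokenizer that lowercases the text and keeps
    # runs of ASCII letters and apostrophes; real taggers do much more.
    tokens = re.findall(r"[a-z']+", text.lower())
    if not tokens:
        raise ValueError("text contains no word tokens")
    return len(set(tokens)) / len(tokens)


def flesch_reading_ease(words: int, sentences: int, syllables: int) -> float:
    """Flesch Reading Ease (English constants) from pre-counted totals."""
    if words <= 0 or sentences <= 0:
        raise ValueError("words and sentences must be positive")
    # Longer sentences and more syllables per word both lower the score.
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)


if __name__ == "__main__":
    # Six tokens, five distinct types ("the" occurs twice).
    print(type_token_ratio("The cat sat on the mat"))
    # 100 words in 5 sentences with 150 syllables in total.
    print(flesch_reading_ease(words=100, sentences=5, syllables=150))
```

The Flesch function takes counts rather than raw text because syllable counting is the hard, language-dependent part, which is where hyphenation support comes in.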

Citation koRpus citation info
reaktanz.de/?c=hacking&s=koRpus
Bug report File report

Key Metrics

Version 0.13-8
R ≥ 3.0.0
Published 2021-05-17 (1073 days ago)
Needs compilation? no
License GPL (≥ 3)
CRAN checks koRpus results

Downloads

Yesterday 199 -10%
Last 7 days 1,134 -10%
Last 30 days 4,173 +24%
Last 90 days 10,317 +18%
Last 365 days 44,049 +6%

Maintainer

Meik Michalke

meik.michalke@hhu.de

Authors

Meik Michalke (aut / cre)
Earl Brown (ctb)
Alberto Mirisola (ctb)
Alexandre Brulet (ctb)
Laura Hauser (ctb)

Material

README
NEWS
ChangeLog
Reference manual
Package source

In Views

NaturalLanguageProcessing

Additional repos

undocumeantit.github.io/repos/l10n

Vignettes

Using the koRpus Package for Text Analysis

macOS

r-release arm64
r-oldrel arm64
r-release x86_64
r-oldrel x86_64

Windows

r-devel x86_64
r-release x86_64
r-oldrel x86_64

Old Sources

koRpus archive

Depends

R ≥ 3.0.0
sylly ≥ 0.1-6

Imports

data.table
methods
Matrix

Suggests

testthat
tm
SnowballC
shiny
knitr
rmarkdown
koRpus.lang.de
koRpus.lang.en
koRpus.lang.es
koRpus.lang.fr
koRpus.lang.it
koRpus.lang.nl
koRpus.lang.pt
koRpus.lang.ru

Enhances

rkward

Reverse Depends

koRpus.lang.en
tm.plugin.koRpus

Reverse Imports

textstem

Reverse Suggests

qdap