koRpus: An R Package for Text Analysis

A set of tools to analyze texts. Includes, among other things, functions for automatic language detection, hyphenation, several indices of lexical diversity (e.g., type token ratio, HD-D/vocd-D, MTLD) and readability (e.g., Flesch, SMOG, LIX, Dale-Chall). Basic import functions for language corpora are also provided to enable frequency analyses (the Celex and Leipzig Corpora Collection file formats are supported) and measures like tf-idf. Support for additional languages can be added on the fly or by plugin packages. Note: For full functionality, a local installation of TreeTagger is recommended. After installation, additional language support needs to be fetched from the 'l10n' repository <https://undocumeantit.github.io/repos/l10n>. It is recommended to add it to your list of package repositories permanently, so that you receive updates for these packages and can install support for further languages. 'koRpus' also includes a plugin for the R GUI and IDE RKWard, providing graphical dialogs for its basic features. The respective R package 'rkward' cannot be installed directly from a repository, as it is part of RKWard. To make full use of this feature, please install RKWard from <https://rkward.kde.org> (plugins are detected automatically). Due to some restrictions on CRAN, the full package sources are only available from the project homepage. To ask for help, report bugs, request features, or discuss the development of the package, please subscribe to the koRpus-dev mailing list (<http://korpusml.reaktanz.de>).
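As a minimal sketch of the repository setup described above, the 'l10n' repository can be appended to the session's repository list using base R only. The choice of 'koRpus.lang.en' as the package to install is just an illustration taken from the Suggests field, and making the setting permanent by placing the `options()` call in a startup file such as `~/.Rprofile` is an assumption about a typical setup, not something this page prescribes.

```r
# Sketch only: register the 'l10n' repository alongside the existing ones.
# The URL is the one given in the package description.
l10n_repo <- "https://undocumeantit.github.io/repos/l10n"

# Keep the currently configured repositories (e.g. CRAN) and add 'l10n'.
repos <- c(getOption("repos"), l10n = l10n_repo)
options(repos = repos)

# With the repository registered, a language support package can be
# installed like any other package (English chosen here as an example).
install.packages("koRpus.lang.en")
```

The setting only lasts for the current session. Adding the same `options(repos = ...)` call to a startup file would be one way to keep receiving updates for the language packages.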

Version: 0.11-2
Depends: R (>= 2.10.0), sylly (>= 0.1-4)
Imports: data.table, methods
Suggests: testthat, tm, SnowballC, shiny, knitr, rmarkdown, koRpus.lang.de, koRpus.lang.en, koRpus.lang.es, koRpus.lang.fr, koRpus.lang.it, koRpus.lang.ru
Enhances: rkward
Additional repositories: https://undocumeantit.github.io/repos/l10n
Published: 2018-01-07
Author: Meik Michalke [aut, cre], Earl Brown [ctb], Alberto Mirisola [ctb], Alexandre Brulet [ctb], Laura Hauser [ctb]
Maintainer: Meik Michalke <meik.michalke at hhu.de>
BugReports: https://github.com/unDocUMeantIt/koRpus/issues
License: GPL (>= 3)
URL: https://reaktanz.de/?c=hacking&s=koRpus
NeedsCompilation: no
Citation: koRpus citation info

Downloads:

Package source: koRpus_0.11-2.tar.gz
MacOS X binaries: R 3.4: koRpus_0.11-2.tgz, R 3.3: koRpus_0.11-2.tgz, R 3.2: koRpus_0.11-2.tgz
Windows binaries: R 3.4: koRpus_0.11-2.zip, R 3.3: koRpus_0.11-2.zip, R 3.2: koRpus_0.11-2.zip
Debian binary package: Learn how to install Debian packages from this repository
Reference manual: koRpus.pdf
Vignettes: Using the koRpus Package for Text Analysis
News/ChangeLog: NEWS RSS feed for R package koRpus