tesseract

Open Source OCR Engine

About

Bindings to 'Tesseract': a powerful optical character recognition (OCR) engine that supports over 100 languages. The engine is highly configurable in order to tune the detection algorithms and obtain the best possible results.

docs.ropensci.org/tesseract/ (website) https://github.com/ropensci/tesseract (devel)
System requirements Tesseract >= 3.03 (libtesseract-dev / tesseract-devel) and Leptonica (libleptonica-dev / leptonica-devel). On Debian, the English training data must be installed separately (tesseract-ocr-eng).
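
The system requirements above could be installed along the following lines. This is a minimal sketch, not an official install procedure: the helper name `tesseract_deps` and the family labels `debian` and `fedora` are hypothetical, and the use of `apt-get` and `dnf` assumes the "-dev" names are Debian-style packages and the "-devel" names are RPM-style packages. Only the package names themselves come from the listing. The script prints the commands rather than running them, so it has no side effects.

```shell
#!/bin/sh
# Hypothetical helper: print the system packages named in the requirements
# for a given distro family. The labels "debian" and "fedora" are
# illustrative, not values defined by the package.
tesseract_deps() {
  case "$1" in
    # Debian-style names; the English training data is a separate package.
    debian) echo "libtesseract-dev libleptonica-dev tesseract-ocr-eng" ;;
    # RPM-style names as given in the requirements line.
    fedora) echo "tesseract-devel leptonica-devel" ;;
    *) echo "unknown family: $1" >&2; return 1 ;;
  esac
}

# Print the install commands instead of executing them.
# The package managers (apt-get, dnf) are assumptions.
echo "sudo apt-get install -y $(tesseract_deps debian)"
echo "sudo dnf install -y $(tesseract_deps fedora)"
```

Because the package needs compilation, these libraries would have to be present before the R package is built from source.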
Bug report File report

Key Metrics

Version 5.2.1
Published 2023-11-20 161 days ago
Needs compilation? yes
License Apache License 2.0
CRAN checks tesseract results
Language en-US

Downloads

Yesterday 142 0%
Last 7 days 1,445 +5%
Last 30 days 5,384 -11%
Last 90 days 17,248 -21%
Last 365 days 76,177 -4%

Maintainer

Jeroen Ooms

jeroen@berkeley.edu

Authors

Jeroen Ooms

aut / cre

Material

NEWS
Reference manual
Package source

In Views

NaturalLanguageProcessing

Vignettes

Using the Tesseract OCR engine in R

macOS

r-release arm64
r-oldrel arm64
r-release x86_64
r-oldrel x86_64

Windows

r-devel x86_64
r-release x86_64
r-oldrel x86_64

Old Sources

tesseract archive

Imports

Rcpp ≥ 0.12.12
pdftools ≥ 1.5
curl
rappdirs
digest

Suggests

magick ≥ 1.7
spelling
knitr
tibble
rmarkdown

LinkingTo

Rcpp

Reverse Suggests

camtrapR
imagerExtra
magick
pdftools