CRAN/E | archiveRetriever

archiveRetriever

Retrieve Archived Web Pages from the 'Internet Archive'

Installation

About

Scraping content from archived web pages stored in the 'Internet Archive' () using a systematic workflow. Get an overview of the mementos available from the respective homepage, retrieve the Urls and links of the page and finally scrape the content. The final output is stored in tibbles, which can be then easily used for further analysis.

github.com/liserman/archiveRetriever/

Key Metrics

Version 0.3.1
Published 2022-12-23 489 days ago
Needs compilation? no
License Apache License (≥ 2.0)
CRAN checks archiveRetriever results

Downloads

Yesterday 9 +50%
Last 7 days 64 -14%
Last 30 days 319 -29%
Last 90 days 1.068 -6%
Last 365 days 4.167 -13%

Maintainer

Maintainer

Lukas Isermann

lukas.isermann@uni-mannheim.de

Authors

Konstantin Gavras

aut

Lukas Isermann

aut / cre

Material

README
NEWS
Reference manual
Package source

macOS

r-release

arm64

r-oldrel

arm64

r-release

x86_64

r-oldrel

x86_64

Windows

r-devel

x86_64

r-release

x86_64

r-oldrel

x86_64

Old Sources

archiveRetriever archive

Imports

anytime
dplyr
ggplot2
gridExtra
httr
jsonlite
lubridate
rvest
stringr
tibble
tidyr
utils
xml2

Suggests

vcr ≥ 1.0.0
testthat
webmockr