CRAN/E | analysisPipelines

analysisPipelines

Compose Interoperable Analysis Pipelines & Put Them in Production

Installation

About

Enables data scientists to compose pipelines of analysis which consist of data manipulation, exploratory analysis & reporting, as well as modeling steps. Data scientists can use tools of their choice through an R interface, and compose interoperable pipelines between R, Spark, and Python. Credits to Mu Sigma for supporting the development of the package. Note - To enable pipelines involving Spark tasks, the package uses the 'SparkR' package. The SparkR package needs to be installed to use Spark as an engine within a pipeline. SparkR is distributed natively with Apache Spark and is not distributed on CRAN. The SparkR version needs to directly map to the Spark version (hence the native distribution), and care needs to be taken to ensure that this is configured properly. To install SparkR from Github, run the following command if you know the Spark version: 'devtools::install_github('apache/spark@v2.x.x', subdir='R/pkg')'. The other option is to install SparkR by running the following terminal commands if Spark has already been installed: '$ export SPARK_HOME=/path/to/spark/directory && cd $SPARK_HOME/R/lib/SparkR/ && R -e "devtools::install('.')"'.

github.com/Mu-Sigma/analysis-pipelines
Bug report File report

Key Metrics

Version 1.0.2
R ≥ 3.4.0
Published 2020-06-12 1414 days ago
Needs compilation? no
License Apache License 2.0
CRAN checks analysisPipelines results

Downloads

Yesterday 3 0%
Last 7 days 6 -14%
Last 30 days 32 -44%
Last 90 days 123 -22%
Last 365 days 518 -88%

Maintainer

Maintainer

"Mu Sigma, Inc."

ird.experiencelab@mu-sigma.com

Authors

Naren Srinivasan

aut

Zubin Dowlaty

aut

Sanjay

ctb

Neeratyoy Mallik

ctb

Anoop S

ctb

Mu Sigma
Inc.

cre

Material

README
Reference manual
Package source

Vignettes

Analysis pipelines for working with Python functions
Analysis pipelines for working with R data frames
Analysis pipelines for working with Spark DataFrames for batch analyses
Interoperable analysis pipelines
Meta-pipelines
Streaming Analysis Pipelines for working with Apache Spark Structured Streaming
Using pipelines inside Shiny widgets or apps

macOS

r-release

arm64

r-oldrel

arm64

r-release

x86_64

r-oldrel

x86_64

Windows

r-devel

x86_64

r-release

x86_64

r-oldrel

x86_64

Old Sources

analysisPipelines archive

Depends

R ≥ 3.4.0
magrittr
pipeR
methods

Imports

ggplot2
dplyr
futile.logger
RCurl
rlang ≥ 0.3.0
proto
purrr

Suggests

plotly
knitr
rmarkdown
parallel
visNetwork
rjson
DT
shiny
R.devices
corrplot
car
foreign

Enhances

SparkR
reticulate