What does the R-package 'analysisPipelines' do?

Compose Interoperable Analysis Pipelines & Put Them in Production

Who authored analysisPipelines?

Naren Srinivasan, Zubin Dowlaty, Sanjay, Neeratyoy Mallik, Anoop S, Mu Sigma, Inc.

CRAN/E | analysisPipelines

Installation

About

Enables data scientists to compose pipelines of analysis which consist of data manipulation, exploratory analysis & reporting, as well as modeling steps. Data scientists can use tools of their choice through an R interface, and compose interoperable pipelines between R, Spark, and Python. Credits to Mu Sigma for supporting the development of the package. Note - To enable pipelines involving Spark tasks, the package uses the 'SparkR' package. The SparkR package needs to be installed to use Spark as an engine within a pipeline. SparkR is distributed natively with Apache Spark and is not distributed on CRAN. The SparkR version needs to directly map to the Spark version (hence the native distribution), and care needs to be taken to ensure that this is configured properly. To install SparkR from Github, run the following command if you know the Spark version: 'devtools::install_github('apache/spark@v2.x.x', subdir='R/pkg')'. The other option is to install SparkR by running the following terminal commands if Spark has already been installed: '$ export SPARK_HOME=/path/to/spark/directory && cd $SPARK_HOME/R/lib/SparkR/ && R -e "devtools::install('.')"'.

github.com/Mu-Sigma/analysis-pipelines
Bug report	File report

Key Metrics

Version	1.0.2
R	≥ 3.4.0
Published	2020-06-12 1414 days ago
Needs compilation?	no
License	Apache License 2.0
CRAN checks	analysisPipelines results

Downloads

Yesterday	3 0%
Last 7 days	6 -14%
Last 30 days	32 -44%
Last 90 days	123 -22%
Last 365 days	518 -88%

Maintainer

"Mu Sigma, Inc."

ird.experiencelab@mu-sigma.com

Authors

Naren Srinivasan	aut
Zubin Dowlaty	aut
Sanjay	ctb
Neeratyoy Mallik	ctb
Anoop S	ctb
Mu Sigma
Inc.	cre

Material

README
Reference manual
Package source

Vignettes

Analysis pipelines for working with Python functions
Analysis pipelines for working with R data frames
Analysis pipelines for working with Spark DataFrames for batch analyses
Interoperable analysis pipelines
Meta-pipelines
Streaming Analysis Pipelines for working with Apache Spark Structured Streaming
Using pipelines inside Shiny widgets or apps

macOS

r-release	arm64
r-oldrel	arm64
r-release	x86_64
r-oldrel	x86_64

Windows

r-devel	x86_64
r-release	x86_64
r-oldrel	x86_64

Old Sources

analysisPipelines archive

Depends

R	≥ 3.4.0
magrittr
pipeR
methods

Imports

ggplot2
dplyr
futile.logger
RCurl
rlang	≥ 0.3.0
proto
purrr

Suggests

plotly
knitr
rmarkdown
parallel
visNetwork
rjson
DT
shiny
R.devices
corrplot
car
foreign

Enhances

SparkR
reticulate