CRAN/E | feamiR

feamiR

Classification and Feature Selection for microRNA/mRNA Interactions

Installation

About

Comprises a pipeline for predicting microRNA/mRNA interactions, as detailed in Williams, Calinescu, Mohorianu (2020) doi:10.1101/2020.12.23.424130. Its input consists of [a] a messenger RNA (mRNA) dataset (either in fasta format, focused on 3' UTRs or in gtf format; for the latter, the sequences of the 3’ UTRs are generated using the genomic coordinates), [b] a microRNA dataset (in fasta format, retrieved from miRBase, ) and [c] an interaction dataset (in csv format, from miRTarBase ). To characterise and predict microRNA/mRNA interactions, we use [a] statistical analyses based on Chi-squared and Fisher exact tests and [b] Machine Learning classifiers (decision trees, random forests and support vector machines). To enhance the accuracy of the classifiers we also employ feature selection approaches used in on conjunction with the classifiers. The feature selection approaches include a voting scheme for decision trees, a measure based on Gini index for random forests, forward feature selection and Genetic Algorithms on SVMs. The pipeline also includes a novel approach based on embryonic Genetic Algorithms which combines and optimises the forward feature selection and Genetic Algorithms. All analyses, including the classification and feature selection, are applicable on the microRNA seed features (default), on the full microRNA features and/or flanking features on the mRNA. The sets of features can be combined.

github.com/Core-Bioinformatics/feamiR
System requirements Python (>=3.6) sreformat patman
Bug report File report

Key Metrics

Version 0.1.0
R ≥ 3.1.2
Published 2021-01-19 1192 days ago
Needs compilation? no
License GPL-2
CRAN checks feamiR results

Downloads

Yesterday 3 0%
Last 7 days 8 -27%
Last 30 days 41 -7%
Last 90 days 135 -21%
Last 365 days 570 -74%

Maintainer

Maintainer

Eleanor Williams

ecw63@cam.ac.uk

Authors

Eleanor Williams

aut / cre

Irina Mohorianu

aut

Material

Reference manual
Package source

macOS

r-release

arm64

r-oldrel

arm64

r-release

x86_64

r-oldrel

x86_64

Windows

r-devel

x86_64

r-release

x86_64

r-oldrel

x86_64

Depends

R ≥ 3.1.2

Imports

stringr
randomForest
rpart
rpart.plot
GA
e1071
ggplot2
magrittr
tibble
dplyr
reticulate

Suggests

parallel
doParallel