CRAN/E | MantaID

MantaID

A Machine-Learning Based Tool to Automate the Identification of Biological Database IDs

Installation

About

The number of biological databases is growing rapidly, but different databases use different IDs to refer to the same biological entity. The inconsistency in IDs impedes the integration of various types of biological data. To resolve the problem, we developed 'MantaID', a data-driven, machine-learning based approach that automates identifying IDs on a large scale. The 'MantaID' model's prediction accuracy was proven to be 99%, and it correctly and effectively predicted 100,000 ID entries within two minutes. 'MantaID' supports the discovery and exploitation of ID patterns from large quantities of databases. (e.g., up to 542 biological databases). An easy-to-use freely available open-source software R package, a user-friendly web application, and APIs were also developed for 'MantaID' to improve applicability. To our knowledge, 'MantaID' is the first tool that enables an automatic, quick, accurate, and comprehensive identification of large quantities of IDs, and can therefore be used as a starting point to facilitate the complex assimilation and aggregation of biological data across diverse databases.

Key Metrics

Version 1.0.2
R ≥ 4.2.0
Published 2022-10-18 559 days ago
Needs compilation? no
License GPL (≥ 3)
CRAN checks MantaID results

Downloads

Yesterday 5 0%
Last 7 days 41 -16%
Last 30 days 154 +18%
Last 90 days 410 -20%
Last 365 days 1.610 +106%

Maintainer

Maintainer

Zeng Zhengpeng

molaison@foxmail.com

Authors

Zeng Zhengpeng

aut / cre / ctb

Mao Longfei

aut

Yu Feng

aut

Material

Reference manual
Package source

macOS

r-release

arm64

r-oldrel

arm64

r-release

x86_64

r-oldrel

x86_64

Windows

r-develnot available

x86_64

r-releasenot available

x86_64

r-oldrelnot available

x86_64

Old Sources

MantaID archive

Depends

R ≥ 4.2.0

Imports

biomaRt
caret
data.table
dplyr
ggplot2
keras
magrittr
mlr3
purrr
reshape2
scutr
stringr
tibble
tidyr
tidyselect
mlr3tuning
paradox
RColorBrewer
ggcorrplot