CRAN/E | Semblance

Semblance

A Data-Driven Similarity Kernel on Probability Spaces

Installation

About

We present a rank-based Mercer kernel to compute a pair-wise similarity metric corresponding to informative representation of data. We tailor the development of a kernel to encode our prior knowledge about the data distribution over a probability space. The philosophical concept behind our construction is that objects whose feature values fall on the extreme of that feature’s probability mass distribution are more similar to each other, than objects whose feature values lie closer to the mean. Semblance emphasizes features whose values lie far away from the mean of their probability distribution. The kernel relies on properties empirically determined from the data and does not assume an underlying distribution. The use of feature ranks on a probability space ensures that Semblance is computational efficacious, robust to outliers, and statistically stable, thus making it widely applicable algorithm for pattern analysis. The output from the kernel is a square, symmetric matrix that gives proximity values between pairs of observations.

Key Metrics

Version 1.1.0
Published 2019-01-25 1908 days ago
Needs compilation? no
License GPL-2
CRAN checks Semblance results

Downloads

Yesterday 3 0%
Last 7 days 12 -54%
Last 30 days 108 -11%
Last 90 days 576 +41%
Last 365 days 1.779 -18%

Maintainer

Maintainer

Divyansh Agarwal

divyansh@upenn.edu

Authors

Divyansh Agarwal Nancy R. Zhang

Material

Reference manual
Package source

macOS

r-release

arm64

r-oldrel

arm64

r-release

x86_64

r-oldrel

x86_64

Windows

r-devel

x86_64

r-release

x86_64

r-oldrel

x86_64

Old Sources

Semblance archive

Imports

fields ≥ 9.6
PerformanceAnalytics ≥ 1.5.2
DescTools ≥ 0.99.26
msos ≥ 1.1.0

Suggests

kernlab