Content area

Abstract

Copulas enable flexible parameterization of multivariate distributions in terms of constituent marginals and dependence families. Vine copulas, hierarchical collections of bivariate copulas, can model a wide variety of dependencies in multivariate data including asymmetric and tail dependencies which the more widely used Gaussian copulas, used in Meta-Gaussian distributions, cannot. However, current inference algorithms for vines cannot fit data with mixed--a combination of continuous, binary and ordinal--features that are common in many domains. We design a new inference algorithm to fit vines on mixed data thereby extending their use to several applications. We illustrate our algorithm by developing a dependency-seeking multi-view clustering model based on Dirichlet Process mixture of vines that generalizes previous models to arbitrary dependencies as well as to mixed marginals. Empirical results on synthetic and real datasets demonstrate the performance on clustering single-view and multi-view data with asymmetric and tail dependencies and with mixed marginals.

Details

Title
Vine copulas for mixed data : multi-view clustering for mixed data beyond meta-Gaussian dependencies
Author
Tekumalla, Lavanya Sita 1 ; Rajan, Vaibhav 2 ; Bhattacharyya, Chiranjib 3 

 Indian Institute of Science, Bengaluru, India; Amazon Development Center, Bengaluru, India 
 Yen4Ken Software Pvt. Ltd, Bengaluru, India 
 Indian Institute of Science, Bengaluru, India 
Pages
1331-1357
Section
Special Issue of the ECML PKDD 2017 Journal Track
Publication year
2017
Publication date
Oct 2017
Publisher
Springer Nature B.V.
ISSN
08856125
e-ISSN
15730565
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
1936128113
Copyright
Machine Learning is a copyright of Springer, 2017.