Hierarchical Maximum Likelihood Clustering Approach

Alok Sharma, Keith A. Boroevich, Daichi Shigemizu, Yoichiro Kamatani, Michiaki Kubo, Tatsuhiko Tsunoda

Research output: Contribution to journalArticle

14 Citations (Scopus)

Abstract

Objective: In this paper, we focused on developing a clustering approach for biological data. In many biological analyses, such as multiomics data analysis and genome-wide association studies analysis, it is crucial to find groups of data belonging to subtypes of diseases or tumors. Methods: Conventionally, the k-means clustering algorithm is overwhelmingly applied in many areas including biological sciences. There are, however, several alternative clustering algorithms that can be applied, including support vector clustering. In this paper, taking into consideration the nature of biological data, we propose a maximum likelihood clustering scheme based on a hierarchical framework. Results: This method can perform clustering even when the data belonging to different groups overlap. It can also perform clustering when the number of samples is lower than the data dimensionality. Conclusion: The proposed scheme is free from selecting initial settings to begin the search process. In addition, it does not require the computation of the first and second derivative of likelihood functions, as is required by many other maximum likelihood-based methods. Significance: This algorithm uses distribution and centroid information to cluster a sample and was applied to biological data. A MATLAB implementation of this method can be downloaded from the web link http://www.riken.jp/en/research/labs/ims/med-sci-math/.

Original languageEnglish
Article number7440832
Pages (from-to)112-122
Number of pages11
JournalIEEE Transactions on Biomedical Engineering
Volume64
Issue number1
DOIs
Publication statusPublished - 01-01-2017

Fingerprint

Clustering algorithms
Maximum likelihood
MATLAB
Tumors
Genes
Derivatives

All Science Journal Classification (ASJC) codes

  • Biomedical Engineering

Cite this

Sharma, A., Boroevich, K. A., Shigemizu, D., Kamatani, Y., Kubo, M., & Tsunoda, T. (2017). Hierarchical Maximum Likelihood Clustering Approach. IEEE Transactions on Biomedical Engineering, 64(1), 112-122. [7440832]. https://doi.org/10.1109/TBME.2016.2542212
Sharma, Alok ; Boroevich, Keith A. ; Shigemizu, Daichi ; Kamatani, Yoichiro ; Kubo, Michiaki ; Tsunoda, Tatsuhiko. / Hierarchical Maximum Likelihood Clustering Approach. In: IEEE Transactions on Biomedical Engineering. 2017 ; Vol. 64, No. 1. pp. 112-122.
@article{ff5e467f3efe4015bcc3d5aa761a00d5,
title = "Hierarchical Maximum Likelihood Clustering Approach",
abstract = "Objective: In this paper, we focused on developing a clustering approach for biological data. In many biological analyses, such as multiomics data analysis and genome-wide association studies analysis, it is crucial to find groups of data belonging to subtypes of diseases or tumors. Methods: Conventionally, the k-means clustering algorithm is overwhelmingly applied in many areas including biological sciences. There are, however, several alternative clustering algorithms that can be applied, including support vector clustering. In this paper, taking into consideration the nature of biological data, we propose a maximum likelihood clustering scheme based on a hierarchical framework. Results: This method can perform clustering even when the data belonging to different groups overlap. It can also perform clustering when the number of samples is lower than the data dimensionality. Conclusion: The proposed scheme is free from selecting initial settings to begin the search process. In addition, it does not require the computation of the first and second derivative of likelihood functions, as is required by many other maximum likelihood-based methods. Significance: This algorithm uses distribution and centroid information to cluster a sample and was applied to biological data. A MATLAB implementation of this method can be downloaded from the web link http://www.riken.jp/en/research/labs/ims/med-sci-math/.",
author = "Alok Sharma and Boroevich, {Keith A.} and Daichi Shigemizu and Yoichiro Kamatani and Michiaki Kubo and Tatsuhiko Tsunoda",
year = "2017",
month = "1",
day = "1",
doi = "10.1109/TBME.2016.2542212",
language = "English",
volume = "64",
pages = "112--122",
journal = "IEEE Transactions on Biomedical Engineering",
issn = "0018-9294",
publisher = "IEEE Computer Society",
number = "1",

}

Sharma, A, Boroevich, KA, Shigemizu, D, Kamatani, Y, Kubo, M & Tsunoda, T 2017, 'Hierarchical Maximum Likelihood Clustering Approach', IEEE Transactions on Biomedical Engineering, vol. 64, no. 1, 7440832, pp. 112-122. https://doi.org/10.1109/TBME.2016.2542212

Hierarchical Maximum Likelihood Clustering Approach. / Sharma, Alok; Boroevich, Keith A.; Shigemizu, Daichi; Kamatani, Yoichiro; Kubo, Michiaki; Tsunoda, Tatsuhiko.

In: IEEE Transactions on Biomedical Engineering, Vol. 64, No. 1, 7440832, 01.01.2017, p. 112-122.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Hierarchical Maximum Likelihood Clustering Approach

AU - Sharma, Alok

AU - Boroevich, Keith A.

AU - Shigemizu, Daichi

AU - Kamatani, Yoichiro

AU - Kubo, Michiaki

AU - Tsunoda, Tatsuhiko

PY - 2017/1/1

Y1 - 2017/1/1

N2 - Objective: In this paper, we focused on developing a clustering approach for biological data. In many biological analyses, such as multiomics data analysis and genome-wide association studies analysis, it is crucial to find groups of data belonging to subtypes of diseases or tumors. Methods: Conventionally, the k-means clustering algorithm is overwhelmingly applied in many areas including biological sciences. There are, however, several alternative clustering algorithms that can be applied, including support vector clustering. In this paper, taking into consideration the nature of biological data, we propose a maximum likelihood clustering scheme based on a hierarchical framework. Results: This method can perform clustering even when the data belonging to different groups overlap. It can also perform clustering when the number of samples is lower than the data dimensionality. Conclusion: The proposed scheme is free from selecting initial settings to begin the search process. In addition, it does not require the computation of the first and second derivative of likelihood functions, as is required by many other maximum likelihood-based methods. Significance: This algorithm uses distribution and centroid information to cluster a sample and was applied to biological data. A MATLAB implementation of this method can be downloaded from the web link http://www.riken.jp/en/research/labs/ims/med-sci-math/.

AB - Objective: In this paper, we focused on developing a clustering approach for biological data. In many biological analyses, such as multiomics data analysis and genome-wide association studies analysis, it is crucial to find groups of data belonging to subtypes of diseases or tumors. Methods: Conventionally, the k-means clustering algorithm is overwhelmingly applied in many areas including biological sciences. There are, however, several alternative clustering algorithms that can be applied, including support vector clustering. In this paper, taking into consideration the nature of biological data, we propose a maximum likelihood clustering scheme based on a hierarchical framework. Results: This method can perform clustering even when the data belonging to different groups overlap. It can also perform clustering when the number of samples is lower than the data dimensionality. Conclusion: The proposed scheme is free from selecting initial settings to begin the search process. In addition, it does not require the computation of the first and second derivative of likelihood functions, as is required by many other maximum likelihood-based methods. Significance: This algorithm uses distribution and centroid information to cluster a sample and was applied to biological data. A MATLAB implementation of this method can be downloaded from the web link http://www.riken.jp/en/research/labs/ims/med-sci-math/.

UR - http://www.scopus.com/inward/record.url?scp=85008481824&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85008481824&partnerID=8YFLogxK

U2 - 10.1109/TBME.2016.2542212

DO - 10.1109/TBME.2016.2542212

M3 - Article

VL - 64

SP - 112

EP - 122

JO - IEEE Transactions on Biomedical Engineering

JF - IEEE Transactions on Biomedical Engineering

SN - 0018-9294

IS - 1

M1 - 7440832

ER -

Sharma A, Boroevich KA, Shigemizu D, Kamatani Y, Kubo M, Tsunoda T. Hierarchical Maximum Likelihood Clustering Approach. IEEE Transactions on Biomedical Engineering. 2017 Jan 1;64(1):112-122. 7440832. https://doi.org/10.1109/TBME.2016.2542212