ESSEC METALAB

RESEARCH

SHARP OPTIMAL RECOVERY IN THE TWO COMPONENT GAUSSIAN MIXTURE MODEL

[ARTICLE] This paper improves the nonasymptotic lower bound for minimax Hamming risk in Two-Component Gaussian Mixture Model clustering and proposes an optimal, adaptive procedure, revealing a sharp phase transition for exact recovery.

by Mohamed Ndaoud (ESSEC Business School)

In this paper, we study the problem of clustering in the Two component Gaussian mixture model when the centers are separated by some Δ>0. We present a nonasymptotic lower bound for the corresponding minimax Hamming risk improving on existing results. We also propose an optimal, efficient and adaptive procedure that is minimax rate optimal. The rate optimality is moreover sharp in the asymptotics when the sample size goes to infinity. Our procedure is based on a variant of Lloyd’s iterations initialized by a spectral method. As a consequence of nonasymptotic results, we find a sharp phase transition for the problem of exact recovery in the Gaussian mixture model.

[Please read the research paper here]

Research list
arrow-right