Shared-component Gaussian Mixture Models

In some applications we want to model multiple datasets simultaneously using the same Gaussian mixture model (GMM) components (mu_k and Sigma_k) across all of them. That is, all datasets share the same mu_k and Sigma_k, but each dataset s has its own set of mixing proportions pi_{s,k}.
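Concretely, an example x drawn from dataset s then has density

p(x | s) = sum_{k=1}^K pi_{s,k} N(x | mu_k, Sigma_k),    with sum_{k=1}^K pi_{s,k} = 1 for every dataset s,

so the component parameters are tied across datasets while the mixing weights are not.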

The derivation note can be found here (apologies in advance for the mess in the note).
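For reference, standard EM for this model gives the following updates, where r_{s,n,k} is the responsibility of component k for example x_{s,n} (example n in dataset s) and N_s is the size of dataset s:

E-step:  r_{s,n,k} = pi_{s,k} N(x_{s,n} | mu_k, Sigma_k) / sum_j pi_{s,j} N(x_{s,n} | mu_j, Sigma_j)

M-step:  pi_{s,k} = (1/N_s) sum_n r_{s,n,k}
         mu_k = sum_s sum_n r_{s,n,k} x_{s,n} / sum_s sum_n r_{s,n,k}
         Sigma_k = sum_s sum_n r_{s,n,k} (x_{s,n} - mu_k)(x_{s,n} - mu_k)^T / sum_s sum_n r_{s,n,k}

The only difference from ordinary GMM EM is that the mean and covariance updates pool responsibilities over all datasets, while the mixing proportions are estimated separately for each dataset.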

The MATLAB code is made available here.
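For readers who just want the shape of the algorithm, below is a minimal MATLAB sketch of the EM loop described above. It is not the linked code: it assumes the datasets are stored in a cell array X where X{s} is an N_s-by-D matrix, it uses mvnpdf from the Statistics Toolbox, and it relies on implicit expansion (R2016b or later).

```matlab
function [mu, Sigma, pis] = shared_gmm_em(X, K, maxIter)
% Minimal EM for a shared-component GMM (illustrative sketch, not the linked code).
% X   : cell array; X{s} is an N_s-by-D matrix of examples from dataset s
% K   : number of shared components
% mu  : K-by-D shared means;  Sigma : D-by-D-by-K shared covariances
% pis : S-by-K matrix of dataset-specific mixing proportions
S = numel(X);
D = size(X{1}, 2);
allX = cat(1, X{:});
mu = allX(randperm(size(allX, 1), K), :);   % initialise means from pooled data
Sigma = repmat(eye(D), [1 1 K]);
pis = ones(S, K) / K;

for it = 1:maxIter
    Nk = zeros(1, K);                       % pooled responsibility mass per component
    sumX = zeros(K, D);                     % pooled responsibility-weighted sums
    R = cell(S, 1);
    for s = 1:S                             % E-step, dataset by dataset
        Ns = size(X{s}, 1);
        logp = zeros(Ns, K);
        for k = 1:K
            logp(:, k) = log(pis(s, k)) + log(mvnpdf(X{s}, mu(k, :), Sigma(:, :, k)));
        end
        logp = logp - max(logp, [], 2);     % stabilise before normalising
        r = exp(logp);
        r = r ./ sum(r, 2);                 % responsibilities for dataset s
        R{s} = r;
        pis(s, :) = sum(r, 1) / Ns;         % per-dataset mixing proportions
        Nk = Nk + sum(r, 1);                % pool statistics for the shared M-step
        sumX = sumX + r' * X{s};
    end
    mu = sumX ./ Nk';                       % shared means
    Sigma = zeros(D, D, K);                 % shared covariances (second pass)
    for s = 1:S
        for k = 1:K
            Xc = X{s} - mu(k, :);
            Sigma(:, :, k) = Sigma(:, :, k) + (R{s}(:, k) .* Xc)' * Xc;
        end
    end
    for k = 1:K
        Sigma(:, :, k) = Sigma(:, :, k) / Nk(k) + 1e-6 * eye(D);  % regularise
    end
end
end
```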

The toy dataset

Datasets: The experiment uses 3 datasets, each containing 1000 examples, so 3000 examples in total. All three datasets are drawn from the same set of Gaussian components but with different mixing proportions (a hypothetical generator is sketched below).
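The actual means, covariances and mixing weights used here are not reproduced in this sketch; the values below, including K = 5 components in 2 dimensions, are placeholders.

```matlab
% Hypothetical toy-data generator: shared components, per-dataset weights.
% All numeric values here are placeholders, not the ones used in the post.
S = 3;  Ns = 1000;  K = 5;  D = 2;
mu_true = 4 * randn(K, D);                    % shared component means
Sigma_true = repmat(0.5 * eye(D), [1 1 K]);   % shared component covariances
X = cell(S, 1);
for s = 1:S
    pi_s = rand(1, K);
    pi_s = pi_s / sum(pi_s);                  % dataset-specific mixing weights
    z = randsample(K, Ns, true, pi_s);        % component assignment per example
    X{s} = zeros(Ns, D);
    for n = 1:Ns
        X{s}(n, :) = mvnrnd(mu_true(z(n), :), Sigma_true(:, :, z(n)));
    end
end
```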

The datasets are summarized in the figure below.

Experiment 1: shared-component GMM with K = 5

Here are some preliminary results on the shared-component GMM:

first column: the data examples in each dataset

second column: the resulting GMM in each dataset

third column: the true mu_k and pi_k for each dataset (blue) vs. those obtained from the shared-component GMM (red).

The estimated centers stay close to the true ones; where true centers from different datasets lie near each other, the fitted center looks roughly like an average of those neighbors.

Experiment 2: shared-component GMM with K = 3, 6, 7, 8 and 10

Here we want to see how the algorithm behaves when the number of clusters K does not match the true number of components.

For each K in {3, 6, 7, 8, 10}, the figures show the clustering result and the corresponding log-likelihood.
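A sketch of how such a sweep could be run, reusing the hypothetical shared_gmm_em function and the cell array X from the sketches above (again, not the actual code behind these figures):

```matlab
% Fit the shared-component GMM for several K and record the final
% total log-likelihood over all datasets.
Ks = [3 6 7 8 10];
loglik = zeros(size(Ks));
for i = 1:numel(Ks)
    [mu, Sigma, pis] = shared_gmm_em(X, Ks(i), 200);
    ll = 0;
    for s = 1:numel(X)
        p = zeros(size(X{s}, 1), 1);
        for k = 1:Ks(i)
            p = p + pis(s, k) * mvnpdf(X{s}, mu(k, :), Sigma(:, :, k));
        end
        ll = ll + sum(log(p));
    end
    loglik(i) = ll;
end
plot(Ks, loglik, '-o');  xlabel('K');  ylabel('log-likelihood');
```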

Now, we show all the results together (click for larger figures).