GitHub - qlero/EMq: Implementation of a bivariate NA-robust Expectation-Maximization algorithm.

Content

This repository contains:

A from-scratch implementation of an example bivariate EM algorithm that is robust to missing values (NAs) in both variables.
A few 'tutorial' examples from Q for Mortals and an awesome lecture by Tim Thornton.

Note

To keep the algorithm in the EMq.q file easy to read, we only fit a single bivariate distribution, starting from the sample mean and covariance matrix of the whole Wine dataset (restricted here to the features Alcohol and Malic.Acid).

In a more realistic setting, we would compute a sample mean and covariance matrix for each of the 3 types of wine present in the dataset and run the EM algorithm on each of them separately. Doing so would approximate, up to a local optimum, the distribution of each population as a Gaussian.
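
The procedure described in this note can be sketched in Python (the repository itself is written in q, so this is an illustration and not a translation of EMq.q). The function names `em_bivariate` and `em_per_class`, the use of `None` for NAs, and the starting point (per-variable mean and variance of the observed values, zero covariance) are all assumptions made for this sketch. It also assumes each variable has at least one observed value and a non-zero variance.

```python
def em_bivariate(rows, n_iter=100):
    """Fit one bivariate Gaussian to rows of (x1, x2); None marks a missing value.

    Returns ((mu1, mu2), (s11, s12, s22)), i.e. the mean and the distinct
    entries of the covariance matrix.
    """
    n = len(rows)
    # Starting point (an assumption of this sketch): per-variable mean and
    # variance of the observed values, with zero covariance.
    obs1 = [x1 for x1, _ in rows if x1 is not None]
    obs2 = [x2 for _, x2 in rows if x2 is not None]
    mu1, mu2 = sum(obs1) / len(obs1), sum(obs2) / len(obs2)
    s11 = sum((v - mu1) ** 2 for v in obs1) / len(obs1)
    s22 = sum((v - mu2) ** 2 for v in obs2) / len(obs2)
    s12 = 0.0

    for _ in range(n_iter):
        # E-step: accumulate expected sufficient statistics.
        t1 = t2 = t11 = t22 = t12 = 0.0
        for x1, x2 in rows:
            c11 = c22 = c12 = 0.0  # conditional (co)variance of the missing part
            if x1 is not None and x2 is not None:
                e1, e2 = x1, x2
            elif x1 is not None:  # x2 missing: condition on x1
                e1 = x1
                e2 = mu2 + s12 / s11 * (x1 - mu1)
                c22 = s22 - s12 ** 2 / s11
            elif x2 is not None:  # x1 missing: condition on x2
                e2 = x2
                e1 = mu1 + s12 / s22 * (x2 - mu2)
                c11 = s11 - s12 ** 2 / s22
            else:  # both missing: fall back to the current parameters
                e1, e2 = mu1, mu2
                c11, c22, c12 = s11, s22, s12
            t1 += e1
            t2 += e2
            t11 += e1 * e1 + c11
            t22 += e2 * e2 + c22
            t12 += e1 * e2 + c12
        # M-step: maximum-likelihood update from the expected statistics.
        mu1, mu2 = t1 / n, t2 / n
        s11 = t11 / n - mu1 ** 2
        s22 = t22 / n - mu2 ** 2
        s12 = t12 / n - mu1 * mu2
    return (mu1, mu2), (s11, s12, s22)


def em_per_class(labelled_rows, n_iter=100):
    """Run em_bivariate separately on each class of (label, x1, x2) rows."""
    groups = {}
    for label, x1, x2 in labelled_rows:
        groups.setdefault(label, []).append((x1, x2))
    return {label: em_bivariate(rows, n_iter) for label, rows in groups.items()}
```

A fixed number of iterations keeps the sketch short; a practical version would more likely stop once the parameters or the log-likelihood change by less than some tolerance.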

Formulas used to compute the Expectation-Maximization Algorithm

Expectation step

Out of n elements, m have missing data.
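
For reference, the standard conditional expectations for a bivariate Gaussian with mean $\mu = (\mu_1, \mu_2)$ and covariance entries $\sigma_{11}, \sigma_{12}, \sigma_{22}$ are given below for an element $i$ whose second variable is missing. The notation is assumed here and may differ from the one used in the repository; the case where the first variable is missing follows by swapping the indices 1 and 2.

```latex
\begin{align}
\hat{x}_{i2} = \mathbb{E}[x_{i2} \mid x_{i1}]
  &= \mu_2 + \frac{\sigma_{12}}{\sigma_{11}} \, (x_{i1} - \mu_1) \\
\mathbb{E}[x_{i2}^2 \mid x_{i1}]
  &= \hat{x}_{i2}^2 + \sigma_{22} - \frac{\sigma_{12}^2}{\sigma_{11}} \\
\mathbb{E}[x_{i1} x_{i2} \mid x_{i1}]
  &= x_{i1} \, \hat{x}_{i2}
\end{align}
```

For the n - m elements without missing data, these expectations are simply the observed values and their products.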

Maximization step

We update the parameters such that:
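
A standard form of these updates for a bivariate Gaussian is sketched below, as an assumption and not necessarily in the repository's exact notation. Here $x_{ij}$ is variable $j$ of element $i$, and each expectation is taken under the current parameters $\mu^{(t)}, \Sigma^{(t)}$, conditional on the observed part of element $i$; for fully observed elements it reduces to the observed value or product.

```latex
\begin{align}
\mu_j^{(t+1)} &= \frac{1}{n} \sum_{i=1}^{n} \mathbb{E}[x_{ij}],
  \qquad j \in \{1, 2\} \\
\sigma_{jk}^{(t+1)} &= \frac{1}{n} \sum_{i=1}^{n} \mathbb{E}[x_{ij} x_{ik}]
  - \mu_j^{(t+1)} \mu_k^{(t+1)},
  \qquad j, k \in \{1, 2\}
\end{align}
```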

