Material Detail

An integrated generative and discriminative Bayesian model for binary classification

An integrated generative and discriminative Bayesian model for binary classification

This video was recorded at 4th International Workshop on Machine Learning in Systems Biology (MLSB), Edinburgh 2010. much smaller number of samples. Analysing such data is statistically challenging, as the covariates are highly correlated, which results in unstable parameter estimates and inaccurate prediction. To alleviate this problem, we have developed a statistical model which uses a small number of meta-covariates inferred from the data through a Gaussian mixture model, rather than all the original covariates, to classify samples via a probit regression model. A graphical overview of our model is presented in Figure 1 below. The novelty of our approach is that our meta-covariates are formed considering predictor-outcome correlations as well as inter-predictor correlations. This idea was partly inspired by recent empirical research that has shown that optimum predictive performance often corresponds to an intermediate trade-off between the purely generative and purely discriminative approaches to classification [2]. The main advantage over using a sparse classification model [1] is that we can extract a much larger subset of covariates with essential predictive power and partition this subset into groups, within which the covariates are similar.

Quality

  • User Rating
  • Comments
  • Learning Exercises
  • Bookmark Collections
  • Course ePortfolios
  • Accessibility Info

More about this material

Comments

Log in to participate in the discussions or sign up if you are not already a MERLOT member.