Material Detail

Inferring gene regulatory networks using ensembles of feature selection techniques

Inferring gene regulatory networks using ensembles of feature selection techniques

This video was recorded at 6th International Workshop on Machine Learning in Systems Biology (MLSB), Basel 2012. Motivation: One of the long-standing open challenges in computational systems biology is the inference of gene regulatory networks from expression data. Recently, two community-wide efforts have been established to benchmark network inference techniques (DREAM4 and DREAM5), where it was shown that a feature selection method us- ing variable importance scores obtained from tree-based ensemble methods (GENIE3) achieved top performance. Despite the success of this algorithm, little research has been carried out to understand why this approach works so well, and if equally good or better results could be obtained using other types of feature selection techniques. Results: In this work, we present a large scale analysis of feature selection approaches to the network inference problem. We show that, using the recent concept of ensemble feature selection techniques, equally good or better results than GENIE3 can be achieved, demonstrating that the ensemble setting is a necessary requirement for feature selection techniques to achieve good performance on the network inference task. Furthermore, we show that by combining several ensemble feature selection techniques the performance can be made more robust and slightly improved. This analysis opens up new avenues for the development of novel types of ensemble based feature selection techniques in this setting. Availability: R source code of all methods can be downloaded at


  • Editor Reviews
  • User Rating
  • Comments
  • Learning Exercises
  • Bookmark Collections
  • Course ePortfolios
  • Accessibility Info

More about this material


Log in to participate in the discussions or sign up if you are not already a MERLOT member.