English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Journal Article

A guided hybrid genetic algorithm for feature selection with expensive cost functions

MPS-Authors
/persons/resource/persons62425

Jung,  Martin
Global Diagnostic Modelling, Dr. Martin Jung, Department Biogeochemical Integration, Dr. M. Reichstein, Max Planck Institute for Biogeochemistry, Max Planck Society;

/persons/resource/persons76340

Zscheischler,  Jakob
Department Biogeochemical Integration, Dr. M. Reichstein, Max Planck Institute for Biogeochemistry, Max Planck Society;
IMPRS International Max Planck Research School for Global Biogeochemical Cycles, Max Planck Institute for Biogeochemistry , Max Planck Society;

External Resource
Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)

BGC1822.pdf
(Publisher version), 168KB

Supplementary Material (public)
There is no public supplementary material available
Citation

Jung, M., & Zscheischler, J. (2013). A guided hybrid genetic algorithm for feature selection with expensive cost functions. Procedia Computer Science, 18, 2337-2346. doi:10.1016/j.procs.2013.05.405.


Cite as: https://hdl.handle.net/11858/00-001M-0000-0013-AE18-0
Abstract
We present a guided hybrid genetic algorithm for feature selection which is tailored to minimize the number of cost function evaluations. Guided variable elimination is used to make the stochastic backward search of the genetic algorithm much more efficient. Guiding means that a promising feature set is selected from a population and suggestions (for example by a trained Random Forest) are made which variable could be removed. It uses implicit diversity management and is able to return multiple optimal solutions if present, which might be important for interpreting the results. It uses a dynamic cost function that avoids prescribing an expected upper limit of performance or the number of features of the optimal solution. We illustrate the performance of the algorithm on artificial data, and show that the algorithm provides accurate results and is very efficient in minimizing the number of cost function evaluations.We present a guided hybrid genetic algorithm for feature selection which is tailored to minimize the number of cost function evaluations. Guided variable elimination is used to make the stochastic backward search of the genetic algorithm much more efficient. Guiding means that a promising feature set is selected from a population and suggestions (for example by a trained Random Forest) are made which variable could be removed. It uses implicit diversity management and is able to return multiple optimal solutions if present, which might be important for interpreting the results. It uses a dynamic cost function that avoids prescribing an expected upper limit of performance or the number of features of the optimal solution. We illustrate the performance of the algorithm on artificial data, and show that the algorithm provides accurate results and is very efficient in minimizing the number of cost function evaluations.