Evolving Deep Neural Networks (2023)
Risto Miikkulainen, Jason Liang, Elliot Meyerson, Aditya Rawal, Dan Fink, Olivier Francon, Bala Raju, Hormoz Shahrzad, Arshak Navruzyan, Nigel Duffy, Babak Hodjat
The success of deep learning depends on finding an architecture to fit the task. As deep learning has scaled up to more challenging tasks, the architectures have become difficult to design by hand. This paper proposes an automated method, CoDeepNEAT, for optimizing deep learning architectures through evolution. By extending existing neuroevolution methods to topology, components, and hyperparameters, this method achieves results comparable to the best human designs in standard benchmarks in object recognition and language modeling. It also supports building a real-world application of automated image captioning on a magazine website. Given the anticipated increases in available computing power, evolution of deep networks is a promising approach to constructing deep learning applications in the future.
View:
PDF
Citation:
To Appear In R. Kozma, C. Alippi, Y. Choe, and F. Morabito, editors, Artificial Intelligence in the Age of Neural Networks and Brain Computing (second edition), New York, 2023. Elsevier.
Bibtex:

Jason Zhi Liang Ph.D. Alumni jasonzliang [at] utexas edu
Elliot Meyerson Ph.D. Alumni ekm [at] cs utexas edu
Risto Miikkulainen Faculty risto [at] cs utexas edu
Aditya Rawal Ph.D. Alumni aditya [at] cs utexas edu
Hormoz Shahrzad Masters Student hormoz [at] cognizant com