Analysis of Q-learning with random exploration for selection of auxiliary objectives in random local search

Research output: Chapter in Book/Report/Conference proceeding > Conference Proceeding (Non-Journal item)

3 Citations (Scopus)

Abstract

We present a theoretical analysis of a previously proposed method for enhancing the performance of an evolutionary algorithm with reinforcement learning. The method uses reinforcement learning to choose adaptively between auxiliary objectives in a single-objective evolutionary algorithm. We consider the Q-learning algorithm with the ϵ-greedy strategy (ϵ > 0) on a benchmark problem based on ONEMAX. As the evolutionary algorithm, we consider Random Local Search. In our setting, the ONEMAX problem must be solved in the presence of the obstructive ZEROMAX objective. This benchmark tests the ability of the reinforcement learning algorithm to ignore such an inefficient objective. It was previously shown that with the greedy strategy (ϵ = 0), the considered algorithm solves the described benchmark problem in the best possible time for a conventional evolutionary algorithm. However, the ϵ-greedy strategy appears to require exponential time. Furthermore, every selection algorithm that selects an inefficient auxiliary objective with probability at least δ is shown to be asymptotically inefficient when δ > 0 is a constant.
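The setting described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the state definition (the current ONEMAX value), the reward (the change in ONEMAX), the all-zeros starting string, random tie-breaking, and the parameters `alpha`, `gamma`, `max_iters` and `seed` are all assumptions made here for illustration, since the abstract does not specify them.

```python
import random


def onemax(bits):
    """Target objective: number of ones."""
    return sum(bits)


def zeromax(bits):
    """Obstructive auxiliary objective: number of zeros."""
    return len(bits) - sum(bits)


def rls_with_q_learning(n, epsilon, alpha=0.5, gamma=0.5,
                        max_iters=100_000, seed=0):
    """Return the iteration at which the all-ones string is reached,
    or None if max_iters is exhausted first (hypothetical helper)."""
    rng = random.Random(seed)
    objectives = (onemax, zeromax)
    # Q[state][action]; ASSUMPTION: the state is the current ONEMAX value.
    q = [[0.0, 0.0] for _ in range(n + 1)]
    parent = [0] * n  # ASSUMPTION: start from the all-zeros string
    for iteration in range(max_iters):
        state = onemax(parent)
        if state == n:
            return iteration
        # Epsilon-greedy choice: explore with probability epsilon,
        # otherwise take a highest-valued action (ties broken at random).
        if rng.random() < epsilon:
            action = rng.randrange(2)
        else:
            best = max(q[state])
            action = rng.choice([a for a in range(2) if q[state][a] == best])
        # Random Local Search step: flip exactly one uniformly chosen bit.
        child = parent[:]
        child[rng.randrange(n)] ^= 1
        # Accept the child if it is not worse under the selected objective.
        chosen = objectives[action]
        if chosen(child) >= chosen(parent):
            parent = child
        # ASSUMPTION: the reward is the change in the target (ONEMAX) fitness.
        new_state = onemax(parent)
        reward = new_state - state
        q[state][action] += alpha * (
            reward + gamma * max(q[new_state]) - q[state][action]
        )
    return None
```

Calling `rls_with_q_learning(n=20, epsilon=0.0)` runs the greedy variant, while a positive `epsilon` keeps selecting ZEROMAX with constant probability. Comparing iteration counts for growing `n` gives only an informal feel for the slowdown the abstract describes and is no substitute for the analysis itself.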

Original language: English
Title: 2015 IEEE Congress on Evolutionary Computation, CEC 2015
Subtitle: Proceedings
Publisher: IEEE Press
Pages: 1776-1783
Number of pages: 8
ISBN (Electronic): 9781479974924
Status: Published - 14 September 2015
Externally published: Yes
Event: IEEE Congress on Evolutionary Computation, CEC 2015 - Sendai, Japan
Duration: 25 May 2015 to 28 May 2015

Conference

Conference: IEEE Congress on Evolutionary Computation, CEC 2015
Country/Territory: Japan
City: Sendai
Period: 25 May 2015 to 28 May 2015
