Runtime analysis of (1 + 1) evolutionary algorithm controlled with Q-learning using greedy exploration strategy on OneMAX+ZEROMAX problem

Denis Antipov, Maxim Buzdalov, Benjamin Doerr

Allbwn ymchwil: Pennod mewn Llyfr/Adroddiad/Trafodion CynhadleddTrafodion Cynhadledd (Nid-Cyfnodolyn fathau)

1 Dyfyniad (Scopus)

Crynodeb

There exist optimization problems with the target objective, which is to be optimized, and several extra objectives. The extra objectives may or may not be helpful in optimization process in terms of the number of objective evaluations necessary to reach an optimum of the target objective. OneMax+ZeroMax is a previously proposed benchmark optimization problem where the target objective is OneMax and a single extra objective is ZeroMax, which is equal to the number of zero bits in the bit vector. This is an example of a problem where extra objectives are not good, and objective selection methods should ignore the extra objectives. The EA+RL method is a method which selects objectives to be optimized by evolutionary algorithms (EA) using reinforcement learning (RL). Previously it was shown that it runs in Θ(N logN) on OneMax+ZeroMax when configured to use the randomized local search algorithm and the Q-learning algorithm with the greedy exploration strategy. We present the runtime analysis for the case when the (1 + 1)-EA algorithm is used. It is shown that the expected running time is at most 3.12eN log N.
Iaith wreiddiolSaesneg
TeitlEvolutionary Computation in Combinatorial Optimization
Is-deitl15th European Conference, EvoCOP 2015, Copenhagen, Denmark, April 8-10, 2015, Proceedings
CyhoeddwrSpringer Nature
Tudalennau160-172
Nifer y tudalennau13
ISBN (Electronig)978-3-319-16468-7
ISBN (Argraffiad)978-3-319-16467-0
Dynodwyr Gwrthrych Digidol (DOIs)
StatwsCyhoeddwyd - 15 Maw 2015
Cyhoeddwyd yn allanolIe

Cyfres gyhoeddiadau

EnwLecture Notes in Computer Science
CyhoeddwrSpringer Nature
Cyfrol9026
ISSN (Argraffiad)0302-9743
ISSN (Electronig)1611-3349

Ôl bys

Gweld gwybodaeth am bynciau ymchwil 'Runtime analysis of (1 + 1) evolutionary algorithm controlled with Q-learning using greedy exploration strategy on OneMAX+ZEROMAX problem'. Gyda’i gilydd, maen nhw’n ffurfio ôl bys unigryw.

Dyfynnu hyn