TY - JOUR
T1 - The Augmented Intelligence Perspective on Human-in-the-Loop Reinforcement Learning
T2 - Review, Concept Designs, and Future Directions
AU - Yau, Kok Lim Alvin
AU - Saleem, Yasir
AU - Chong, Yung Wey
AU - Fan, Xiumei
AU - Eyu, Jer Min
AU - Chieng, David
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2024/10/18
Y1 - 2024/10/18
N2 - Augmented intelligence (AuI) is a concept that combines human intelligence (HI) and artificial intelligence (AI) to leverage their respective strengths. While AI typically aims to replace humans, AuI integrates humans into machines, recognizing their irreplaceable role. Meanwhile, human-in-the-loop reinforcement learning (HITL-RL) is a semisupervised algorithm that integrates humans into the traditional reinforcement learning (RL) algorithm, enabling autonomous agents to gather inputs from both humans and environments, learn, and select optimal actions across various environments. Both AuI and HITL-RL are still in their infancy. Based on AuI, we propose and investigate three separate concept designs for HITL-RL: HI-AI, AI-HI, and parallel-HI-and-AI approaches, each differing in the order of HI and AI involvement in decision making. The literature on AuI and HITL-RL offers insights into integrating HI into existing concept designs. A preliminary study in an Atari game offers insights for future research directions. Simulation results show that human involvement maintains RL convergence and improves system stability, while achieving approximately similar average scores to traditional Q-learning in the game. Future research directions are proposed to encourage further investigation in this area.
AB - Augmented intelligence (AuI) is a concept that combines human intelligence (HI) and artificial intelligence (AI) to leverage their respective strengths. While AI typically aims to replace humans, AuI integrates humans into machines, recognizing their irreplaceable role. Meanwhile, human-in-the-loop reinforcement learning (HITL-RL) is a semisupervised algorithm that integrates humans into the traditional reinforcement learning (RL) algorithm, enabling autonomous agents to gather inputs from both humans and environments, learn, and select optimal actions across various environments. Both AuI and HITL-RL are still in their infancy. Based on AuI, we propose and investigate three separate concept designs for HITL-RL: HI-AI, AI-HI, and parallel-HI-and-AI approaches, each differing in the order of HI and AI involvement in decision making. The literature on AuI and HITL-RL offers insights into integrating HI into existing concept designs. A preliminary study in an Atari game offers insights for future research directions. Simulation results show that human involvement maintains RL convergence and improves system stability, while achieving approximately similar average scores to traditional Q-learning in the game. Future research directions are proposed to encourage further investigation in this area.
KW - Artificial intelligence (AI)
KW - augmented intelligence (AuI)
KW - human in the loop
KW - reinforcement learning (RL)
UR - http://www.scopus.com/inward/record.url?scp=85207715882&partnerID=8YFLogxK
U2 - 10.1109/THMS.2024.3467370
DO - 10.1109/THMS.2024.3467370
M3 - Article
SN - 2168-2291
JO - IEEE Transactions on Human-Machine Systems
JF - IEEE Transactions on Human-Machine Systems
ER -