The Augmented Intelligence Perspective on Human-in-the-Loop Reinforcement Learning: Review, Concept Designs, and Future Directions

Kok Lim Alvin Yau, Yasir Saleem, Yung Wey Chong*, Xiumei Fan, Jer Min Eyu, David Chieng

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Augmented intelligence (AuI) is a concept that combines human intelligence (HI) and artificial intelligence (AI) to leverage their respective strengths. While AI typically aims to replace humans, AuI integrates humans into machines, recognizing their irreplaceable role. Meanwhile, human-in-the-loop reinforcement learning (HITL-RL) is a semisupervised algorithm that integrates humans into the traditional reinforcement learning (RL) algorithm, enabling autonomous agents to gather inputs from both humans and environments, learn, and select optimal actions across various environments. Both AuI and HITL-RL are still in their infancy. Based on AuI, we propose and investigate three separate concept designs for HITL-RL: HI-AI, AI-HI, and parallel-HI-and-AI approaches, each differing in the order of HI and AI involvement in decision making. The literature on AuI and HITL-RL offers insights into integrating HI into existing concept designs. A preliminary study in an Atari game offers insights for future research directions. Simulation results show that human involvement maintains RL convergence and improves system stability, while achieving approximately similar average scores to traditional Q-learning in the game. Future research directions are proposed to encourage further investigation in this area.

Original languageEnglish
Number of pages16
JournalIEEE Transactions on Human-Machine Systems
Early online date18 Oct 2024
DOIs
Publication statusE-pub ahead of print - 18 Oct 2024

Keywords

  • Artificial intelligence (AI)
  • augmented intelligence (AuI)
  • human in the loop
  • reinforcement learning (RL)

Fingerprint

Dive into the research topics of 'The Augmented Intelligence Perspective on Human-in-the-Loop Reinforcement Learning: Review, Concept Designs, and Future Directions'. Together they form a unique fingerprint.

Cite this