ORCA
Online Research @ Cardiff

Clear Cookie - decide language by browser settings

Reinforcement learning produces dominant strategies for the Iterated Prisoner's Dilemma

Harper, Marc, Knight, Vincent

, Jones, Martin, Koutsovoulos, Georgios, Glynatsi, Nikoleta and Campbell, Owen 2017. Reinforcement learning produces dominant strategies for the Iterated Prisoner's Dilemma. Plos One 12 (12) , e0188046. 10.1371/journal.pone.0188046

Preview

PDF - Published Version
Available under License Creative Commons Attribution.
Download (23MB) | Preview

Official URL: https://doi.org/10.1371/journal.pone.0188046

Abstract

We present tournament results and several powerful strategies for the Iterated Prisoner's Dilemma created using reinforcement learning techniques (evolutionary and particle swarm algorithms). These strategies are trained to perform well against a corpus of over 170 distinct opponents, including many well-known and classic strategies. All the trained strategies win standard tournaments against the total collection of other opponents. The trained strategies and one particular human made designed strategy are the top performers in noisy tournaments also.

Item Type:	Article
Date Type:	Publication
Status:	Published
Schools:	Professional Services > Advanced Research Computing @ Cardiff (ARCCA) Schools > Mathematics
Publisher:	Public Library of Science
ISSN:	1932-6203
Date of First Compliant Deposit:	12 December 2017
Date of Acceptance:	27 October 2017
Last Modified:	07 May 2023 10:44
URI:	https://orca.cardiff.ac.uk/id/eprint/107524

Citation Data

Cited 15 times in Scopus. View in Scopus. Powered By Scopus® Data

Actions (repository staff only)

Edit Item

Dimensions

Altmetric

Download Statistics

Downloads

Downloads per month over past year

View more statistics

CORE (COnnecting REpositories)