1887

n South African Computer Journal - Upper bounds on the performance of discretisation in reinforcement learning : research article

USD

 

Abstract

Reinforcement learning is a machine learning framework whereby an agent learns to perform a task by maximising its total reward received for selecting actions in each state. The policy mapping states to actions that the agent learns is either represented explicitly, or implicitly through a value function. It is common in reinforcement learning to discretise a continuous state space using tile coding or binary features. We prove an upper bound on the performance of discretisation for direct policy representation or value function approximation.

Loading

Article metrics loading...

/content/comp/57/1/EJC181697
2015-12-01
2016-12-07
This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error