中国教育图书进出口有限公司

[期刊论文][Article]

Sarsa(Λ)-Based Logistics Planning Approximated by Value Function with Policy Iteration

作者：

Yu Tang;

出版年：2015

页码：449 - 466

出版社：SAGE Publications

摘要：

The logistics planning problem has been extensively investigated for a long time. However, with the increasing number of stochastic events occurred in road, increasing number of stochastic factors should be taken into consideration. A dynamic approach is used in this paper to solve the logistics planning problem in the common form of stochastic demand with the reinforcement learning framework which is able to optimize policy in unknown environments and uncertain cases. We take advantage of clustering method to extract states as main features for basis function so as to solve the dimensionality curse problems caused by stochastic settings. We also propose an approximation approach with the policy iteration restricted by the goal of minimal time differential error to approximate the stochastic cases of the real world, and then use the attained approximation parameters as input for the proposed Sarsa(λ)-based logistics planning algorithm to determine the policy and action in accordance with the real world stochastic events. The benchmarking experimental results showed that the proposed algorithm has achieved improvements in almost all the test cases.

关键字：

Logistics Planning ; Reinforcement Learning ; Sarsa(Λ) ; Value Function Approximation ; Policy Iteration

全文

所属期刊

Journal of Algorithms & Computational Technology

ISSN: 1748-3018

来自：SAGE Publications