An improved reinforcement learning algorithm is proposed in this paper. This algorithm is based on linear programming method for finding the best-response policy. A pursuit example is tested and the results show that this algorithm has some properties, such as easy computation, simple operation procedure and can guarantee an good learning convergence.