Geoffrey Pritchard, David J. Scott, The Eigenvalues of the Empirical Transition Matrix of a Markov Chain, Journal of Applied Probability, Vol. 41, Stochastic Methods and Their Applications (2004), pp.
This paper describes sufficient conditions for the existence of optimal policies for partially observable Markov decision processes (POMDPs) with Borel state, observation, and action sets, when the ...