Hunor, Jakab: Guided exploration in policy gradient algorithms with Gaussian process function approximation. In: Conference of PhD Students in Computer Science, (7). p. 41. (2010)