Statistics for Reinforcement Learning with Parameterized Actions