Some rough paths techniques in reinforcement learning