Abstract: This study focuses on inferring cost functions of obtained movement data using reward parameter search and pol-icy gradient based Reinforcement Learning (RL). The behavior data for this task ...