Inferring the trial-by-trial structure of pitch reinforcement learning in songbirds