A geometric interpretation of stochastic gradient descent using diffusion metrics