The standard Galerkin finit element method for spatial discretization of an integro-differential equation is studied. The model problem arises in the theory of fractional order viscoelasticity, which has a memory term with a kernel of positive type. Optimal order $L^\infty(L^2)$ and $L^\infty(H^1)$ a priori error estimates for the finite element approximation of the solution are obtained.