We should emulate the Blas interface for the smallmatvec and smallmatmat functions. Currently we don't have the alpha and beta coefficients. This is a bit annoying for reverse mode, where we often want to update, not overwrite, the output vector.
Source code here.