BLASFEO provides a set of basic linear algebra routines, performance-optimized
for matrices that fit in cache (i.e. generally up to a couple hundred size in
each dimension), as typically encountered in embedded optimization
applications.

Homepage:
https://github.com/blasfeo/
