Matrix Multiply Optimization
Project for Machine Structures class at UC Berkeley. Optimized Matrix Multiply Program using OpenMp, SSE Intructions, loop unrolling and cache blocking. Project specifications can be found at:
http://www-inst.eecs.berkeley.edu/~cs61c/sp13/projs/03/
To run program, run make and then ./bench-openmp
*Note: This program was ran on UC Berkeley's lab computers and may need some auxillary files to be compiled.