Not so impressive because, AFAIK, these hardware boards support only single-precision floats (useless for most numerical applications) and SIMD-style processing (applicable only to some kinds of problems).
A rack full of low-cost pizza-box processing nodes running Linux is much more impressive and more generally usable.
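
To make the single-precision concern concrete, here is a rough sketch (not a benchmark of any particular board) that emulates 32-bit float accumulation in plain Python and compares it with the usual 64-bit result. The helper names `to_f32`, `sum_f32` and `sum_f64` are made up for this illustration, and the emulation assumes IEEE 754 round-to-nearest, which is what `struct` uses on common platforms.

```python
import struct


def to_f32(x: float) -> float:
    """Round a Python float (a 64-bit double) to the nearest 32-bit float value."""
    return struct.unpack("f", struct.pack("f", x))[0]


def sum_f32(value: float, n: int) -> float:
    """Add `value` to an accumulator n times, rounding to single precision each step."""
    v = to_f32(value)
    acc = 0.0
    for _ in range(n):
        # Rounding the double sum back to 32 bits mimics a float32 add.
        acc = to_f32(acc + v)
    return acc


def sum_f64(value: float, n: int) -> float:
    """Same accumulation in ordinary double precision."""
    acc = 0.0
    for _ in range(n):
        acc += value
    return acc


n = 1_000_000
exact = 0.1 * n  # nominal result: 100000

err32 = abs(sum_f32(0.1, n) - exact)
err64 = abs(sum_f64(0.1, n) - exact)

print("single-precision error:", err32)
print("double-precision error:", err64)

# A 32-bit float has a 24-bit significand, so 2**24 + 1 cannot be represented.
print("2**24 + 1 survives in float32:", to_f32(2.0**24 + 1.0) == 2.0**24 + 1.0)
```

The single-precision error comes out many orders of magnitude larger than the double-precision one, which is the kind of drift that hurts long-running sums and iterative solvers. Whether that is acceptable depends on the problem, so treat this as an illustration of the worry rather than a verdict.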