[D] Why choose an H100 over an A100 for LLM inference?

faschu · 2 years ago

[D] Why choose an H100 over an A100 for LLM inference?

3DHydroPrints · 2 years ago

H100 was additionally specialized to have higher performance for transformer models. I think it is about 8x faster than a A100 for transformers, but don’t nail me down on it

norcalnatv · 2 years ago

There was quite a detailed technical blog published when H100 was announced with plenty of of comparison to A100.

I_will_delete_myself · 2 years ago

A100 is like a 3070ti with 80gb Vram. H100 is like a 4090 with 80gb of ram and optimized hardware for transformers.