rihard7854B to LocalLLaMA@poweruser.forumEnglish · 10 months agoNVidia H200 achieves nearly 12,000 tokens/sec on Llama2-13B with TensorRT-LLMgithub.comexternal-linkmessage-square24fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkNVidia H200 achieves nearly 12,000 tokens/sec on Llama2-13B with TensorRT-LLMgithub.comrihard7854B to LocalLLaMA@poweruser.forumEnglish · 10 months agomessage-square24fedilink
minus-squarejun2sanBlinkfedilinkEnglisharrow-up1·10 months agoHow much you want for your old H100? - me to ai devs
How much you want for your old H100? - me to ai devs