Incoming: TensorRT-LLM version 0.6 with support for MoE, new models and more quantization (github.com)
Posted by Balance-B to LocalLLaMA@poweruser.forum · English · 11 months ago