Hey, I have a question please. Are you using the original models or quantized versions of them?
And is there a plan to provide paid APIs for the available models that we can use programmatically, like the OpenAI API?
Hey, thank you for replying! 400 words per message is enough for most uses, but sometimes I need longer messages, for example 600 words. It’s great that chat.lmsys.org has many great models and that they get updated all the time, so it would be great to be able to use these models with longer messages. Thanks!
Perfect!