Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
AdinaY 
posted an update 4 days ago
Post
1376
Wechat AI is shipping!

WeDLM 🔥 A new language model that generates tokens in parallel, making it faster than standard LLMs , with the same Transformer setup!
https://huggingface.co/collections/tencent/wedlm

✨ 7B/8B - Base & Instruct
✨ Apache 2.0

Is it going to work as GGUF file?

·

Apparently, quantized versions are yet to arrive

Does vllm support WeDLM architecture?