ibm-granite/granite-embedding-97m-multilingual-r2 Feature Extraction • 97.4M • Updated 20 days ago • 209k • 115
Article Performant local mixture-of-experts CPU inference with GPU acceleration in llama.cpp Doctor-Shotgun • Jan 30 • 28