AI & ML interests
Quantization
Organizations
published an article about 1 month ago: Follow the White Rabbit: Using Embeddings So You Never Get Lost in Translation
published an article about 1 year ago: How to deploy and fine-tune DeepSeek models on AWS


published an article over 1 year ago: Memory-efficient Diffusion Transformers with Quanto and Diffusers
published an article about 2 years ago: Quanto: a PyTorch quantization backend for Optimum


published an article about 2 years ago: Hugging Face Text Generation Inference available for AWS Inferentia2
published an article over 2 years ago: Make your llama generation time fly with AWS Inferentia2