RAMP: Reinforcement Adaptive Mixed Precision Quantization for Efficient On Device LLM Inference Paper • 2603.17891 • Published 4 days ago • 4
Embarrassingly Simple Performance Prediction for Abductive Natural Language Inference Paper • 2202.10408 • Published Feb 21, 2022 • 5