RAMP: Reinforcement Adaptive Mixed Precision Quantization for Efficient On Device LLM Inference Paper β’ 2603.17891 β’ Published 4 days ago β’ 6
Embarrassingly Simple Performance Prediction for Abductive Natural Language Inference Paper β’ 2202.10408 β’ Published Feb 21, 2022 β’ 5