NVIDIA Jetson Orin Nano Collection Ultra-efficient model variants optimized for Jetson Orin Nano. Designed for constrained edge environments requiring low memory footprint. โข 3 items โข Updated 16 days ago โข 2
NVIDIA Jetson AGX Orin Collection Models optimized and bench-marked for NVIDIA Jetson AGX Orin. Memory-efficient and latency-optimized variants designed for real-time edge inference. โข 3 items โข Updated 17 days ago โข 2
EdgeN Collection Quantization strategy where most weights are converted to INT4, activations remain in FP16, and sensitive layers are preserved in FP16. โข 5 items โข Updated 2 days ago โข 1
FlashHead Collection Efficient Drop-In Replacement for the Classification Head in Language Model Inference. โข 19 items โข Updated 2 days ago โข 1