Revisiting the Primacy of English in Zero-shot Cross-lingual Transfer Paper • 2106.16171 • Published Jun 30, 2021
CANINE: Pre-training an Efficient Tokenization-Free Encoder for Language Representation Paper • 2103.06874 • Published Mar 11, 2021 • 3
The MultiBERTs: BERT Reproductions for Robustness Analysis Paper • 2106.16163 • Published Jun 30, 2021 • 1
Measuring Attribution in Natural Language Generation Models Paper • 2112.12870 • Published Dec 23, 2021
Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding Paper • 2210.03347 • Published Oct 7, 2022 • 3
juliaturc/llama-3.2-1b-instruct-gms8k-sft-2500steps-notquantized Text Generation • 1B • Updated Feb 12, 2025 • 3
juliaturc/llama-3.2-1b-instruct-gms8k-sft-2500steps-notquantized Text Generation • 1B • Updated Feb 12, 2025 • 3