tau/scrolls
Updated • 2.21k • 28
How to use learn3r/longt5_xl_govreport_4096 with Transformers:
# Load model directly
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
tokenizer = AutoTokenizer.from_pretrained("learn3r/longt5_xl_govreport_4096")
model = AutoModelForSeq2SeqLM.from_pretrained("learn3r/longt5_xl_govreport_4096")This model is a fine-tuned version of google/long-t5-tglobal-xl on the tau/scrolls gov_report dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 1.6043 | 1.0 | 68 | 1.4757 |
| 1.4471 | 1.99 | 136 | 1.4020 |
| 1.315 | 2.99 | 204 | 1.3935 |
| 1.2098 | 3.99 | 272 | 1.3889 |
| 1.0888 | 4.99 | 340 | 1.4389 |
Base model
google/long-t5-tglobal-xl