Translation not working outputing garbage tokens

#21

by athaze - opened Mar 31

Mar 31

Used the usage code provided in this Repo
from transformers import T5ForConditionalGeneration, T5Tokenizer

model_name = 'jbochi/madlad400-3b-mt'
model = T5ForConditionalGeneration.from_pretrained(model_name, device_map="auto")
tokenizer = T5Tokenizer.from_pretrained(model_name)

text = "<2pt> I love pizza!"
input_ids = tokenizer(text, return_tensors="pt").input_ids.to(model.device)
outputs = model.generate(input_ids=input_ids)

tokenizer.decode(outputs[0], skip_special_tokens=True)

and i got this output

'1000000000000000000'

print (input_ids) got this tensor([[ 805, 116, 908, 10108, 88792, 918, 2]], device='cuda:0')
and here is the outputs tensors before decoding = tensor([[ 0, 805, 808, 813, 813, 813, 813, 813, 813, 813, 813, 813, 813, 813,
813, 813, 813, 813, 813, 813, 813]], device='cuda:0')

am i missing something here ??

Hunter878o

Apr 9

This comment has been hidden

Hunter878o

Apr 9

This comment has been hidden (marked as Off-Topic)

Hunter878o

Apr 9

I'm the same—I thought it was just me or that my hardware was the problem :)

athaze

Apr 9

I'm the same—I thought it was just me or that my hardware was the problem :)
yes i thought the same it must be a hardware issue but its not. something is broken here but i haven't figured it out yet. let me know if you figured out a solution or even an alternative Model or something

cointegrated

9 days ago

•

edited 9 days ago

Same for me. Curiously, the 10B model still works.

athaze

9 days ago

Same for me. Curiously, the 10B model still works.

hey the issue is with the tokenizer you can use a a sentence piece tokenizer model to resolve this here is a repo i used Heng666/madlad400-3b-mt-ct2-int8

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment