Hello,
A tensor assertion error is raised when you try to train the model. The output starts with the following:
0%| | 0/10 [00:00<?, ?it/s]../aten/src/ATen/native/cuda/Indexing.cu:1289: indexSelectLargeIndex: block: [56,0,0], thread: [64,0,0] Assertion `srcIndex < srcSelectDimSize` failed.
It's possible to avoid this error by resizing embeddings:
model.resize_token_embeddings(len(tokenizer))
Unfortunately, resizing the embeddings might affect the model's performance. Could you please review the finetune.py script and fix this problem?
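For context, this assertion usually means that an index passed to a lookup (typically the embedding layer) is greater than or equal to the number of rows in the table, which would happen if the tokenizer produces IDs beyond the model's embedding size. Below is a minimal sketch of that check in plain Python. The helper name `out_of_range_ids` and the numbers are hypothetical, chosen only for illustration; they are not taken from finetune.py.

```python
def out_of_range_ids(token_ids, num_embeddings):
    """Return the sorted, unique token IDs that fall outside an
    embedding table with `num_embeddings` rows (valid: 0..num_embeddings-1)."""
    return sorted({i for i in token_ids if i < 0 or i >= num_embeddings})


# Hypothetical values: an embedding table with 32000 rows and a batch
# that contains two IDs produced by tokens added after pretraining.
vocab_rows = 32000
batch_ids = [1, 15, 31999, 32000, 32001]

bad = out_of_range_ids(batch_ids, vocab_rows)
print(bad)  # any ID listed here would trigger the CUDA assertion
```

With the real objects, I assume the row count could be read from `model.get_input_embeddings().num_embeddings` and compared with `len(tokenizer)`. If the tokenizer is larger, that would explain why `resize_token_embeddings` makes the error go away.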
Thank you in advance.
BR,
Mehti