OverflowError: out of range integral type conversion attempted #35540

test3211234 · 2025-01-07T01:34:49Z

System Info

transformers version: 4.47.1
Platform: Windows-10-10.0.22631-SP0
Python version: 3.11.8
Huggingface_hub version: 0.27.0
Safetensors version: 0.4.5
Accelerate version: 1.2.1
Accelerate config: not found
PyTorch version (GPU?): 2.3.1+cu118 (True)
Tensorflow version (GPU?): not installed (NA)
Flax version (CPU?/GPU?/TPU?): not installed (NA)
Jax version: not installed
JaxLib version: not installed
Using distributed or parallel set-up in script?: No
Using GPU in script?: Yes
GPU type: AMD Radeon RX 6600 [ZLUDA]

Who can help?

@amyeroberts @qubvel

Information

The official example scripts
My own modified scripts

Tasks

An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
My own task or dataset (give details below)

Reproduction

When trying to run this official example code from the GOT-OCR2_0 HuggingFace page:

from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained('ucaslcl/GOT-OCR2_0', trust_remote_code=True)
model = AutoModel.from_pretrained('ucaslcl/GOT-OCR2_0', trust_remote_code=True, low_cpu_mem_usage=True, device_map='cuda', use_safetensors=True, pad_token_id=tokenizer.eos_token_id)
model = model.eval().cuda()


# input your test image
image_file = 'xxx.jpg'

# plain texts OCR
res = model.chat(tokenizer, image_file, ocr_type='ocr')

# format texts OCR:
# res = model.chat(tokenizer, image_file, ocr_type='format')

# fine-grained OCR:
# res = model.chat(tokenizer, image_file, ocr_type='ocr', ocr_box='')
# res = model.chat(tokenizer, image_file, ocr_type='format', ocr_box='')
# res = model.chat(tokenizer, image_file, ocr_type='ocr', ocr_color='')
# res = model.chat(tokenizer, image_file, ocr_type='format', ocr_color='')

# multi-crop OCR:
# res = model.chat_crop(tokenizer, image_file, ocr_type='ocr')
# res = model.chat_crop(tokenizer, image_file, ocr_type='format')

# render the formatted OCR results:
# res = model.chat(tokenizer, image_file, ocr_type='format', render=True, save_render_file = './demo.html')

print(res)

Expected behavior

Should print result.

The text was updated successfully, but these errors were encountered:

qubvel · 2025-01-07T09:59:54Z

Hi @test3211234! You are using 3rd party implementation, it's better to open discussion directly on the hub
https://huggingface.co/stepfun-ai/GOT-OCR2_0/discussions

We also have transformers native implementation ongoing, you can try this out by installing it from the PR
#34721

test3211234 added the bug label Jan 7, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OverflowError: out of range integral type conversion attempted #35540

OverflowError: out of range integral type conversion attempted #35540

test3211234 commented Jan 7, 2025

qubvel commented Jan 7, 2025

OverflowError: out of range integral type conversion attempted #35540

OverflowError: out of range integral type conversion attempted #35540

Comments

test3211234 commented Jan 7, 2025

System Info

Who can help?

Information

Tasks

Reproduction

Expected behavior

qubvel commented Jan 7, 2025