[Help] torch.compile(model.forward, mode="reduce-overhead", fullgraph=True) 出错 #1496

lilxmx · 2024-11-07T02:30:13Z

model.generation_config.cache_implementation = "static"
model.forward = torch.compile(model.forward, mode="reduce-overhead", fullgraph=True)

No response

with torch.no_grad():
vector_outputs = model(
**seq, output_hidden_states=True, return_dict=True
)
在model()运行时报错，没有进入下一层，直接报错

- OS:
- Python:3.8.20
- Transformers: 4.30.2
- PyTorch:2.0.1 
- CUDA Support:True

No response

Provide feedback