Skip to content

Commit

Permalink
Fix lora inference
Browse files Browse the repository at this point in the history
  • Loading branch information
leng-yue committed Dec 29, 2023
1 parent cdecc2a commit 982f787
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion tools/llama/generate.py
Original file line number Diff line number Diff line change
Expand Up @@ -323,7 +323,7 @@ def load_model(config_name, checkpoint_path, device, precision):
cfg = compose(config_name=config_name)

with torch.device("meta"):
model: Transformer = instantiate(cfg.model.model)
model: Transformer = instantiate(cfg.model).model

if "int8" in str(checkpoint_path):
logger.info("Using int8 weight-only quantization!")
Expand Down

0 comments on commit 982f787

Please sign in to comment.