What's the difference between the synthetic/crl/energy and synthetic/crl/envelope #24

JQuan1999 · 2022-06-28T08:17:21Z

Sorry, I have a question: why doesn't learn() in synthetic/crl/envelope/meta.py take HQ directly from the model output? There, the HQ line is commented out and DQ is computed instead:

__, Q = self.model_(Variable(torch.cat(state_batch, dim=0)),
                    Variable(w_batch), w_num=self.weight_num)
# detach since we don't want gradients to propagate
# HQ, _ = self.model_(Variable(torch.cat(next_state_batch, dim=0), volatile=True),
#                     Variable(w_batch, volatile=True), w_num=self.weight_num)
_, DQ = self.model(Variable(torch.cat(next_state_batch, dim=0), requires_grad=False),
                   Variable(w_batch, requires_grad=False))

But learn() in synthetic/crl/energy/meta.py takes HQ directly from the model:

__, Q = self.model(Variable(torch.cat(state_batch, dim=0)),
                   Variable(preference_batch), w_num=self.weight_num)
# detach since we don't want gradients to propagate
HQ, _ = self.model(Variable(torch.cat(next_state_batch, dim=0)),
                   Variable(preference_batch), w_num=self.weight_num)

Why do the two files obtain HQ in different ways, and what is the difference between them?
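To make the question concrete, here is a minimal sketch of the two patterns in current PyTorch, where `Variable` is merged into `Tensor`, `volatile=True` no longer has any effect, and `torch.no_grad()` is the usual way to keep gradients from propagating. `TinyQ` is a hypothetical stand-in that only mimics the `(HQ, Q)` return tuple seen in the snippets; it is not the repository's network, and the sizes are made up. Under those assumptions, taking HQ directly and selecting it afterwards from the full Q table give the same result when one network does both steps, and can differ only when the action is chosen by a different network. Whether that is the reason for the difference between the two files is not something the snippets confirm.

```python
import torch
import torch.nn as nn


class TinyQ(nn.Module):
    """Hypothetical stand-in model returning (hq, q), like the snippets above."""

    def __init__(self, state_size=4, reward_size=2, action_size=3):
        super().__init__()
        self.action_size = action_size
        self.reward_size = reward_size
        self.net = nn.Linear(state_size + reward_size, action_size * reward_size)

    def forward(self, state, preference):
        x = torch.cat([state, preference], dim=1)
        # q has shape (batch, actions, rewards): one value vector per action
        q = self.net(x).view(-1, self.action_size, self.reward_size)
        hq = select_by_preference(q, q, preference)
        return hq, q


def select_by_preference(q_select, q_eval, preference):
    """Pick the action maximizing preference . q_select, return q_eval's vector for it."""
    scalar = torch.bmm(q_select, preference.unsqueeze(2)).squeeze(2)  # (batch, actions)
    best = scalar.max(dim=1)[1]                                       # (batch,)
    return q_eval[torch.arange(q_eval.size(0)), best]                 # (batch, rewards)


torch.manual_seed(0)
online, target = TinyQ(), TinyQ()  # two independently initialized networks

state = torch.randn(5, 4)
next_state = torch.randn(5, 4)
w = torch.softmax(torch.randn(5, 2), dim=1)  # made-up preference vectors

# Prediction on the current states: gradients are tracked here.
_, q_pred = online(state, w)

with torch.no_grad():  # targets: no gradients should flow through these
    # Pattern 1: take HQ directly from a single forward pass.
    hq_direct, dq = target(next_state, w)

    # Pattern 2a: take the full table DQ and select afterwards with the same network.
    hq_same = select_by_preference(dq, dq, w)

    # Pattern 2b: select the action with another network, evaluate it with DQ.
    _, q_online_next = online(next_state, w)
    hq_cross = select_by_preference(q_online_next, dq, w)
```

In this sketch `hq_same` matches `hq_direct`, while `hq_cross` may differ because the action is selected by `online` rather than `target`.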
