The representation for learnable action tokens. #20

wyddmw · 2025-02-13T13:38:44Z

Hi, thanks for this awesome work! I have a simple question: why did you choose to use a single learnable token and repeat it multiple times to represent the action, instead of using different tokens? Suppose we want to represent an action with 4 tokens. What is the difference between `tokens = nn.Parameter(torch.zeros(token_num, self.hidden_size))` and `tokens = nn.Parameter(torch.zeros(1, self.hidden_size)).repeat(token_num, 1)`, and what is the advantage of the latter?
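To make the comparison concrete, here is a minimal sketch of the two options side by side. The sizes (`token_num = 4`, `hidden_size = 8`), the variable names, and the toy loss are assumptions for illustration only and are not taken from the repository.

```python
import torch
import torch.nn as nn

token_num, hidden_size = 4, 8  # illustrative sizes, not taken from the repository

# Option A: token_num independent learnable rows.
independent = nn.Parameter(torch.zeros(token_num, hidden_size))

# Option B: one learnable row, copied into token_num identical rows.
shared = nn.Parameter(torch.zeros(1, hidden_size))
repeated = shared.repeat(token_num, 1)  # a plain non-leaf tensor, not an nn.Parameter

# Toy loss that weights each token position differently (hypothetical, for illustration).
weights = torch.arange(1.0, token_num + 1).unsqueeze(1)  # shape (token_num, 1)
(independent * weights).sum().backward()
(repeated * weights).sum().backward()

num_independent = independent.numel()  # token_num * hidden_size learnable values
num_shared = shared.numel()            # hidden_size learnable values
repeated_is_parameter = isinstance(repeated, nn.Parameter)

print(num_independent, num_shared, repeated_is_parameter)
print(independent.grad[:, 0])  # one gradient per row
print(shared.grad[:, 0])       # gradients of all rows summed into the single row
```

In this sketch, option A lets every row receive its own gradient, so the rows can diverge during training. In option B all rows stay identical, and the single underlying row accumulates the sum of the per-position gradients. One thing I noticed while writing this: the result of `.repeat(...)` is an ordinary tensor, so if it is the only thing stored on the module, the underlying parameter would presumably not show up in `module.parameters()`. Keeping the `(1, hidden_size)` parameter as the attribute and repeating it inside `forward` avoids that.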
