The attention layer works directly on the GRU embeddings (denoted h_it in the HAN paper) in the call function of the AttentionLayer. According to the paper, h_it should first be fed through a one-layer MLP with a tanh activation to obtain u_it, i.e. u_it = tanh(W·h_it + b), and the attention weights are then computed over u_it. Is this happening in the code and I have missed it, or has it been (intentionally) left out? Please clarify.
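For reference, here is a rough sketch of what the paper's formulation could look like as a Keras custom layer. This is not taken from this repo's code; the layer name, the attention_dim parameter, and the context-vector weight u_w are illustrative assumptions based on the HAN paper's equations (u_it = tanh(W·h_it + b), α_it = softmax(u_itᵀ·u_w), s_i = Σ_t α_it·h_it):

```python
import tensorflow as tf
from tensorflow.keras import backend as K
from tensorflow.keras.layers import Layer


class AttentionLayer(Layer):
    """Sketch of HAN-style attention with the one-layer MLP projection."""

    def __init__(self, attention_dim=100, **kwargs):
        self.attention_dim = attention_dim
        super().__init__(**kwargs)

    def build(self, input_shape):
        # W, b: the one-layer MLP that maps h_it to the hidden representation u_it
        self.W = self.add_weight(name="W",
                                 shape=(input_shape[-1], self.attention_dim),
                                 initializer="glorot_uniform", trainable=True)
        self.b = self.add_weight(name="b",
                                 shape=(self.attention_dim,),
                                 initializer="zeros", trainable=True)
        # u_w: context vector used to score each u_it (assumed name)
        self.u_w = self.add_weight(name="u_w",
                                   shape=(self.attention_dim,),
                                   initializer="glorot_uniform", trainable=True)
        super().build(input_shape)

    def call(self, h):
        # h: GRU outputs h_it with shape (batch, timesteps, hidden)
        u = K.tanh(K.dot(h, self.W) + self.b)       # u_it = tanh(W h_it + b)
        scores = K.dot(u, K.expand_dims(self.u_w))  # u_it . u_w -> (batch, timesteps, 1)
        alpha = K.softmax(scores, axis=1)           # attention weights alpha_it
        return K.sum(h * alpha, axis=1)             # weighted sum s_i over timesteps

    def compute_output_shape(self, input_shape):
        return (input_shape[0], input_shape[-1])
```

If the repo's current call operates on h_it directly (skipping the tanh MLP), the difference would be the two lines computing u above.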