-
Notifications
You must be signed in to change notification settings - Fork 216
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Questions about the hw-modulated attention in DAB-DETR #193
Comments
|
Could you give a more detailed explanation about how this works? My personal understanding of the "H=1, W=3" in Figure 6 of the DAB paper is that "href/hq = 1, wref/wq = 3", in which larger wq will lead to smaller W. If I misunderstood something, what is the definition of H and W in Figure6? Thanks. |
The results in Fig 6 are examples. "H=1, W=3" means hq =1, wq = 3. We suppose the href and wref are 1. |
I couldn't understand this phenomenon theoretically. If the origin value of attention map at a fix point is calculated by (PE(x)*PE(xref)wref/wq + ... When we increase wq to wq'=3wq, the new value should decrease, which will result in a narrower shape attention map. Could you explain why larger wq leads to wider atten map with the formulation theoretically? Thanks. |
I have two questions about hw-modulated attention equation (Eq.(6) in DAB-DETR):
The text was updated successfully, but these errors were encountered: