You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Say there is a long document, then two users ask two different questions based on the document. These two questions are no way similar, targeting on different part of the document. In this case, can snapkv compress the context robustly?
The text was updated successfully, but these errors were encountered:
In our observation, we found out that the attention allocation depends on the nature of questions. SnapKV needs to compress individually for different turns/questions.
Say there is a long document, then two users ask two different questions based on the document. These two questions are no way similar, targeting on different part of the document. In this case, can snapkv compress the context robustly?
The text was updated successfully, but these errors were encountered: