Multi-document setting #2

Open
rhythmcao opened this issue Jan 22, 2025 · 1 comment

Comments

@rhythmcao

Thanks a lot for providing this meaningful benchmark.

I noticed that, according to Table 2 of the paper, 10.9% of the questions involve multiple documents. However, when I download the test.jsonl file from HF, the data does not seem to have a multiple-document field. It only includes the pid, which I assume is the paper ID on OpenReview.
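For reference, here is a minimal sketch of how the available fields can be checked. It assumes test.jsonl is a JSON Lines file with one JSON object per line; `count_fields` is a hypothetical helper, and apart from `pid` the keys in the toy records are made up for illustration, not the benchmark's actual schema.

```python
import json
from collections import Counter


def count_fields(lines):
    """Count how often each top-level key appears across JSONL records."""
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        counts.update(record.keys())
    return counts


# Toy records standing in for test.jsonl; only "pid" is known from the
# real file, the other keys are assumptions for this example.
sample = [
    '{"pid": "abc", "question": "q1", "answer": "a1"}',
    '{"pid": "def", "question": "q2", "answer": "a2"}',
]
field_counts = count_fields(sample)
print(sorted(field_counts))

# On the real file (path is an assumption):
# with open("test.jsonl", encoding="utf-8") as f:
#     print(count_fields(f))
```

If no key resembling a document list shows up in the counts, the multi-document information is presumably not part of the released file.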

@rhythmcao

By the way, if external documents are implicitly mentioned in a question and need to be consulted to answer it, is it possible for me to obtain the concrete sources or URLs? Alternatively, how can I distinguish the question categories (defined in Table 2) so that I can run a more fine-grained evaluation on the different splits?

