Skip to content

Commit

Permalink
Update workshop_programme.md
Browse files Browse the repository at this point in the history
  • Loading branch information
vernadankers authored Nov 15, 2024
1 parent 4f22e37 commit 141e334
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions _pages/workshop_programme.md
Original file line number Diff line number Diff line change
Expand Up @@ -58,7 +58,7 @@ Dojun Park, Jiwoo Lee (presenter), Seohyun Park, Hyeyun Jeong, Youngeun Koo, Soo
Bastian Bunzeck (presenter), Sina Zarrieß

- <b>[MMLU-SR: A Benchmark for Stress-Testing Reasoning Capability of Large Language Models](https://aclanthology.org/2024.genbench-1.5.pdf)</b><br>
Wentian Wang, Sarthak Jain, Paul Kantor, Jacob Feldman, Lazaros Gallos, Hao Wang
Presenter: Hengyi Wang, Authors: Wentian Wang, Sarthak Jain, Paul Kantor, Jacob Feldman, Lazaros Gallos, Hao Wang

## <span style="color:grey"> 12:30-1:45 PM —</span> Lunch break

Expand Down Expand Up @@ -110,7 +110,7 @@ Wentian Wang, Sarthak Jain, Paul Kantor, Jacob Feldman, Lazaros Gallos, Hao Wang
</li>
<li>
<span style="color:#ffffff; background-color: #74849c; border-radius:4px; padding:3px">GenBench CBT</span> <a href="https://aclanthology.org/2024.genbench-1.5.pdf"><b>MMLU-SR: A Benchmark for Stress-Testing Reasoning Capability of Large Language Models</b></a> <br>
Wentian Wang, Sarthak Jain, Paul Kantor, Jacob Feldman, Lazaros Gallos, Hao Wang
Presenter: Hengyi Wang, Authors: Wentian Wang, Sarthak Jain, Paul Kantor, Jacob Feldman, Lazaros Gallos, Hao Wang
</li>
<li>
<span style="color:#ffffff; background-color: #0ccfbb; border-radius:4px; padding:3px">GenBench Non-archival</span> <a href="/assets/extended_abstracts_2023/18_A_Peek_into_Token_Bias_Larg.pdf"><b>A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners</b></a> <br>
Expand Down

0 comments on commit 141e334

Please sign in to comment.