You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The final average accuracy achieved is 59.2%, which shows a significant gap compared to the reported accuracy of 66.3% in the paper. Could you help me identify any potential issues?
Code to calculate average accuracy:
# Defining the provided data to calculate the total `num_samples` and combined `acc`data= [
{"num_samples": 118, "acc": 68.6},
{"num_samples": 351, "acc": 57.0},
{"num_samples": 14, "acc": 50.0},
{"num_samples": 14, "acc": 57.1},
{"num_samples": 91, "acc": 59.3},
]
# Calculating total `num_samples` and weighted `acc`total_samples=sum(item["num_samples"] foritemindata)
weighted_acc=sum(item["num_samples"] *item["acc"] foritemindata) /total_samplestotal_samples, weighted_acc
The text was updated successfully, but these errors were encountered:
I used the default command:
With the following default setup:
Attempting to reproduce results on the GaoKao dataset, I observed the following:
The final average accuracy achieved is 59.2%, which shows a significant gap compared to the reported accuracy of 66.3% in the paper. Could you help me identify any potential issues?
Code to calculate average accuracy:
The text was updated successfully, but these errors were encountered: