Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WebVoyager Baseline Agent & Benchmark #282

Open
wants to merge 54 commits into
base: main
Choose a base branch
from

fix expel sh

a94ee8c
Select commit
Loading
Failed to load commit list.
Open

WebVoyager Baseline Agent & Benchmark #282

fix expel sh
a94ee8c
Select commit
Loading
Failed to load commit list.
Codecov / codecov/project failed Feb 3, 2025 in 0s

80.11% (target 95.00%)

View this Pull Request on Codecov

80.11% (target 95.00%)

Details

Codecov Report

Attention: Patch coverage is 8.13810% with 745 lines in your changes missing coverage. Please review.

Files with missing lines Patch % Lines
...l/benchmarks/computer_use/webvoyager/webvoyager.py 0.00% 330 Missing ⚠️
...nchmarks/computer_use/webvoyager/utils_webarena.py 0.00% 187 Missing ⚠️
...benchmarks/computer_use/webvoyager/data_manager.py 0.00% 125 Missing ⚠️
...ential/benchmarks/computer_use/webvoyager/utils.py 0.00% 55 Missing ⚠️
...l/agents/computer_use/webvoyager_baseline/agent.py 0.00% 41 Missing ⚠️
...uter_use/webvoyager_baseline/strategies/general.py 89.39% 7 Missing ⚠️

❌ Your project check has failed because the head coverage (80.11%) is below the target coverage (95.00%). You can increase the head coverage or adjust the target coverage.

Files with missing lines Coverage Δ
...ter_use/webvoyager_baseline/functional_webarena.py 20.21% <100.00%> (+20.21%) ⬆️
.../agents/computer_use/webvoyager_baseline/output.py 100.00% <100.00%> (ø)
agential/agents/expel/prompts.py 100.00% <ø> (ø)
...al/benchmarks/computer_use/osworld/data_manager.py 96.19% <100.00%> (-0.96%) ⬇️
...gential/benchmarks/computer_use/osworld/osworld.py 21.66% <ø> (ø)
...uter_use/webvoyager_baseline/strategies/general.py 89.39% <89.39%> (ø)
...l/agents/computer_use/webvoyager_baseline/agent.py 0.00% <0.00%> (ø)
...ential/benchmarks/computer_use/webvoyager/utils.py 0.00% <0.00%> (ø)
...benchmarks/computer_use/webvoyager/data_manager.py 0.00% <0.00%> (ø)
...nchmarks/computer_use/webvoyager/utils_webarena.py 0.00% <0.00%> (ø)
... and 1 more

... and 2 files with indirect coverage changes