Hub
Docs
Try for Free
xiangyi-li
/
webarena
mirrored 17 minutes ago
Benchmark Card
Files and versions
Leaderboard
like
0
configs
-
test_evaluators.py
10.3 kB
test_helper_functions.py
946 B
main
/
tests
test_evaluation_harness
remove exact from evalutor names
2 years ago
update test example due to html escape
2 years ago
Shuyan Zhou
Update README.md
daee18d
remove beartype for efficency purpose
2 years ago