HN Companion
new
|
best
|
ask
|
show
|
jobs
Loading item...
Browser Agent Benchmark: Comparing LLM models for web automation | HN Companion
HN Companion
new
|
best
|
ask
|
show
|
jobs
Browser Agent Benchmark: Comparing LLM models for web automation
(
browser-use.com
)
13 points
by
MagMueller
4 days ago
|
5 comments
View on Hacker News
wiradikusuma
4 days ago
[–]
Since we're in this topic, can anyone suggest good AI-based tool for exploratory (fuzzy?) web testing?
pixel_popping
4 days ago
[–]
It's lacking the best model (Opus 4.5) on the benchmark tho.
djohnston
3 days ago
[–]
Yeah but then their own product might not score the highest.
pixel_popping
2 days ago
[–]
Exactly why I'm pointing it out, which feels a bit corrupt, but understandable.
djohnston
2 days ago
[–]
tbh i was a bit cranky yesterday - even if they are #2 on a legit benchmark that would be impressive