Hacker News
shadowgovt
on March 14, 2024
on:
LaVague: Open-source Large Action Model to automat...
This has the potential to be a step towards the missing scripting language for graphical interfaces, which is great.
DanyWin
on March 14, 2024
Thanks! Funny thing: we did not use vision models, only text, working from the HTML of the current page. However, we intend to add vision to boost performance.
jerpint
on March 14, 2024
Interesting that it's not vision-based. I suspect you will get much better performance once vision is incorporated, e.g. using LLaVA-style models.