> How could we give LLMs the ability to "pay attention" to different parts of im... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		QuadmasterXLII on Nov 19, 2023 \| parent \| context \| favorite \| on: Comparing humans, GPT-4, and GPT-4V on abstraction... > How could we give LLMs the ability to "pay attention" to different parts of images, as needed, so they can make back-and-forth comparisons between parts of different images to solve these kinds of visual reasoning tasks? I’ve got good news

phh on Nov 19, 2023 | [–]

It's even all we need

oefnak on Nov 19, 2023 | [–]

What is it?

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact