As an example of how hard this is their bottom left image A herd of elephants walking across a dry grass field is somewhat wrong. The field is not dry, and there is not much grass. However, they classified it as Describes without errors.
What's interesting about this is slightly wrong image analysis might yield a reasonable description.
What's interesting about this is slightly wrong image analysis might yield a reasonable description.