The scores below were obtained from GPT-4V evaluation.