Researchers found that vision-language models, widely used to analyze medical images, do not understand negation words like ‘no’ and ‘not.’ This could cause them to fail unexpectedly when asked to retrieve medical images that contain certain objects but not others.
Posted by: Dr.Durant. When reposting, please credit the source: https://robotalks.cn/study-shows-vision-language-models-cant-handle-queries-with-negation-words-2/