Visual Processing, Visual Perception, Multimodal Perception, Illusions, Spatial Awareness, Visual Cognition, Virtual Reality Effects, Color Perception, Interpretation, Pattern Recognition, Reality Distortion, Object Recognition, Bayesian Inference, Multisensory Perception, Active Perception, Cognition, Human Perception, Action Dynamics, Action Generation, Spatial Reasoning, Visual Crowding, Attention Mechanisms
The authors present tools to test and reduce instruction-induced mistakes in image-text models.