The Psychology of Empathy in Visual Perception
Image shown: Berthe Morisot, The Cradle, 1872, France (DETAIL)
Empathy is frequently referenced in art discussions as an emotional connection that arises when an image resonates with the viewer. While often treated as a vague or ineffable experience, psychological research defines empathy in more specific terms. It consists of distinct cognitive and affective components. Cognitive empathy refers to the ability to understand another person’s mental or emotional state. Affective empathy involves internally sharing or mirroring aspects of that state (Decety & Jackson, 2004).
Both components contribute to how viewers perceive and respond to visual imagery, particularly when subjects involve care, vulnerability, or interpersonal interaction. Although the idea that images can evoke emotion is widely accepted, the perceptual mechanisms that support empathic response are not always clearly articulated. Understanding how static visual images give rise to empathic experience requires closer examination of how visual perception, attention, and cognition interact.
The Neuroscience of Seeing Emotion
Empathic response begins with visual perception rather than conscious emotional interpretation. Before viewers can respond emotionally, the visual system must detect, organize, and prioritize socially meaningful information. Several neural systems play key roles in this process. The fusiform face area supports rapid recognition of facial features. The superior temporal sulcus is involved in processing gaze direction, facial movement, and body posture, while the amygdala contributes to evaluating emotional relevance, particularly in relation to vulnerability or threat (Vuilleumier, 2005; Pessoa, 2008).
These systems often activate rapidly, sometimes before conscious awareness. Visual features such as visible eyes, direct gaze, and clearly articulated hand positions tend to receive priority. When such cues are obscured or ambiguous, the brain’s ability to infer emotional state is reduced (LaBar & Cabeza, 2006).

In visual art, emotional response does not arise solely from symbolic meaning. It emerges from the viewer’s interaction with perceptual structures that the visual system has learned, through both evolution and experience, to associate with human behavior and intention.
Mirror Neurons and Embodied Simulation
Research on the mirror neuron system has contributed significantly to contemporary models of empathy. This system was first identified in the 1980s and 1990s by neurophysiologists Giacomo Rizzolatti, Giuseppe Di Pellegrino, Luciano Fadiga, Leonardo Fogassi, and Vittorio Gallese at the University of Parma. They discovered that mirror neurons activate both when an individual performs an action and when observing that same action performed by others (Rizzolatti & Craighero, 2004). This activity supports what Gallese describes as embodied simulation, in which observers internally model aspects of observed actions (Gallese, Keysers, & Rizzolatti, 2004).
This framework is particularly relevant when viewing images depicting physical interaction, effort, or touch. Observing a gesture, such as supporting another person or reaching outward, can activate neural systems associated with performing similar actions. The viewer does not merely observe the gesture visually; they engage with it at a sensorimotor level. However, the effectiveness of this process depends on perceptual clarity. Gestures that are structurally clear and visually legible tend to produce stronger embodied responses than those that are poorly defined or visually ambiguous.
Compositional Strategies That Amplify Empathic Response
Empathy in visual imagery is strongly shaped by compositional organization. Spatial relationships, figure arrangement, and visual hierarchy all influence how emotional content can be perceived. Close proximity between figures can often signal intimacy or support. Overlapping forms, shared points of contact, and compressed spatial arrangements increase the likelihood that viewers will interpret emotional connection. Framing also plays a role. Narrow framing can concentrate attention on an interaction, while expansive framing may diffuse emotional focus.
Light, value structure, and edge relationships further influence emotional tone by potential contribution to how viewers perceive unity or separation within an image. Gradual value transitions and softer edge relationships often promote a sense of continuity and emotional closeness, whereas abrupt contrasts and sharp edges tend to signal division, tension, or emotional distance. These perceptual effects are supported by Gestalt principles of visual organization, which describe how the brain tends to group visual elements based on proximity, similarity, continuity, and closure (Wertheimer, 1923). For example, when light gradients create smooth tonal transitions between figures and backgrounds, the principle of continuity encourages the viewer to perceive those elements as belonging to a unified whole. Similarly, when adjacent forms share value or edge softness, the principles of similarity and proximity lead the visual system to interpret them as relationally connected. In contrast, strong value separations or interrupting contours may disrupt this grouping, fostering a sense of tension or detachment. These perceptual tendencies subtly guide empathic interpretation by suggesting either cohesion or fragmentation in the emotional landscape of the image.
Historical works provide numerous examples. Rembrandt used controlled lighting to isolate moments of tenderness. Mary Cassatt emphasized gesture and spatial closeness to convey care. Japanese ukiyo-e prints often suggest familial intimacy through restrained gesture and simplified spatial organization, demonstrating how empathy can be conveyed through minimal yet precise visual cues.

Cross-Cultural Universals and Cultural Variations in Empathy
Research on facial expression recognition indicates that certain emotional expressions, such as joy or distress, are often recognized across cultures (Ekman, 1992). However, interpretations of interpersonal behavior vary widely depending on cultural context. Norms surrounding physical proximity, eye contact, and gesture differ significantly across societies (Matsumoto, 2007). A gesture perceived as comforting in one cultural context may be perceived as intrusive in another. Emotional restraint may convey dignity in one culture while appearing distant elsewhere. Artists working across cultural boundaries must remain sensitive to these variations.
While the biological foundations of empathic response are widely shared, the visual conventions used to convey compassion or care are culturally mediated. Empathy remains effective when visual cues are coherent within the viewer’s cultural framework.
The Role of Attention: What Viewers Notice First
Empathy depends not only on what is depicted, but on where attention is directed. Eye-tracking studies consistently show that viewers tend to fixate first on faces, eyes, hands, and areas of physical interaction (Yarbus, 1967; Henderson, 2003). These regions carry dense social and emotional information. In addition, artists can influence attentional flow through contrast, edge control, directional cues, and lighting. When these tools are applied deliberately, viewers are more likely to engage with an image’s emotional core.

These attentional priorities reflect evolutionary pressures. The visual system is tuned to detect socially relevant information efficiently. Empathic response follows the path of attention established by perceptual organization.
Empathy and Memory: Why Compassionate Images Endure
Modern evidence suggests that emotionally significant images are more likely to be retained in memory. Neuroscientific research shows that emotionally salient stimuli engage the amygdala and support memory consolidation processes involving the hippocampus (LaBar & Cabeza, 2006). Images depicting caregiving, vulnerability, or social bonding often persist because these themes align with fundamental biological priorities. Social connection has high adaptive value, and images that clearly convey it are more likely to remain culturally meaningful over time.
Compassion alone does not guarantee lasting impact. However, when compassionate themes are presented with perceptual clarity and structural coherence, they are more likely to be remembered and revisited. Empathy in visual perception is neither mysterious nor purely subjective (although, of course, subjectivity does play a role). It is supported by identifiable perceptual and cognitive processes that govern how viewers attend to, interpret, and remember visual information. Compassionate images resonate when they align with these systems through clear structure and intentional organization.
When artists understand how visual perception operates, empathy becomes a form of visual communication rather than an abstract emotional goal. It is achieved not through ambiguity, but through clarity, attention, and deliberate construction.
References and Bibliography
Primary Scientific and Psychological Sources
Decety, J., & Jackson, P. L. (2004). The functional architecture of human empathy. Behavioral and Cognitive Neuroscience Reviews, 3(2), 71–100.
Ekman, P. (1992). An argument for basic emotions. Cognition and Emotion, 6(3–4), 169–200.
Gallese, V., Keysers, C., & Rizzolatti, G. (2004). A unifying view of the basis of social cognition. Trends in Cognitive Sciences, 8(9), 396–403.
Henderson, J. M. (2003). Human gaze control during real-world scene perception. Trends in Cognitive Sciences, 7(11), 498–504.
LaBar, K. S., & Cabeza, R. (2006). Cognitive neuroscience of emotional memory. Nature Reviews Neuroscience, 7(1), 54–64.
Pessoa, L. (2008). On the relationship between emotion and cognition. Nature Reviews Neuroscience, 9(2), 148–158.
Rizzolatti, G., & Craighero, L. (2004). The mirror-neuron system. Annual Review of Neuroscience, 27, 169–192.
Vuilleumier, P. (2005). How brains beware: neural mechanisms of emotional attention. Trends in Cognitive Sciences, 9(12), 585–594.
Yarbus, A. L. (1967). Eye movements and vision. Plenum Press.
Perception, Vision Science, and Cognitive Context
Gibson, J. J. (1979). The ecological approach to visual perception. Houghton Mifflin.
Palmer, S. E. (1999). Vision science: Photons to phenomenology. MIT Press.
Purves, D., Lotto, R. B., Williams, S. M., Nundy, S., & Yang, Z. (2001). Why we see things the way we do: Evidence for a wholly empirical strategy of vision. Philosophical Transactions of the Royal Society B, 356(1407), 285–297.
Wertheimer, M. (1923). Laws of organization in perceptual forms. In W. D. Ellis (Ed.), A source book of Gestalt psychology (pp. 71–88). Routledge.
Cultural and Cross-Cultural Studies
Jack, R. E., Caldara, R., & Schyns, P. G. (2012). Internal representations reveal cultural diversity in expectations of facial expressions of emotion. Journal of Experimental Psychology: General, 141(1), 19–25.
Matsumoto, D. (2007). Culture, context, and behavior. Journal of Personality, 75(6), 1285–1320.
ÀNI Art Academies / Waichulis Curriculum Sources
Waichulis, A. J. (2019). What does realistic look like? ÀNI Art Academies.
Waichulis, A. J. (2016). A primer on pictorial composition. ÀNI Art Academies.
Waichulis, A. J. (2025). ÀNI art academies teaching manual.
Waichulis, A. J. (2010). ÀNI LOD drawing instructional companion text. ÀNI Art Academies.
Waichulis, A. J. (2010). ÀNI LOP painting instructional companion text. ÀNI Art Academies.

2 Comments
Excellent read! Thank you.
Thank you! I very much enjoyed researching, learning, and working on this article.