Contextual Facilitation of Colour Recognition: Penetrating Beliefs or Colour-Shape Associations? (Ruch Filozoficzny, LXXV, 2, 2019, 199-208)

Maciej Witek

My aim in this paper is to defend the view that the processes underlying early vision are informationally encapsulated. Following Marr (1982) and Pylyshyn (1999) I take early vision to be a cognitive process that takes sensory information as its input and produces the so-called primal sketches or shallow visual outputs: informational states that represent visual objects in terms of their shape, location, size, colour and luminosity. Recently, some researchers (Schirillo 1999, Macpherson 2012) have attempted to undermine the idea of the informational encapsulation of early vision by referring to experiments that seem to show that colour recognition is affected by the subject's beliefs about the typical colour of objects. In my view, however, one can reconcile the results of these experiments with the position that early vision is informationally encapsulated. Namely, I put forth a hypothesis according to which the early vision system has access to a local database that I call the mental palette and define as a network of associative links whose nodes stands for shapes and colours. The function of the palette is to facilitate colour recognition without employing central processes. I also describe two experiments by which the mental palette hypothesis can be tested.

4th draft, 30/09/2012 Maciej Witek Institute of Philosophy, University of Szczecin, Poland Contextual Facilitation of Colour Recognition: Penetrating Beliefs or Colour-Shape Associations? * According to Jerry Fodor (1983) and Zenon W. Pylyshyn (1999, 2003), the early visual system — whose function is to produce the primal sketches of distal layouts — is cognitively impenetrable. To call it cognitively impenetrable or informationally encapsulated is to maintain that its operations cannot be affected by what the subject believes, expects and desires. That is to say, the computational process that mediates the production of primal sketches has no access to the subject's central database or his background beliefs. Rather, it is an input-driven process whose job is to integrate the transduced signals coming from the subject's senses into what Fodor calls shallow outputs or percepts: information-carrying states that represent their distal objects in terms of their locations, sizes, shapes and colours.1 The only non-sensory information the system can use comes from its private or local database. To argue for the cognitive impenetrabilty of early vision is to reject the popular idea that there is a continuity between perception and cognition or, in other words, that the process of perceptual integration is top-down through and through. It is also to claim that there is a level of perceptual representation which is (a) shallow, (b) phenomenally conscious and (c) cognitively impenetrable in that its phenomenal content or character cannot be affected by the top-down processes. The paradigmatic examples of such representations are provided by the so-called persistent visual illusions, that is, by visual experiences whose phenomenal content remains unchanged despite strong beliefs to the contrary. For example, even though the subjects know that the Müller-Lyer lines are not equal in length, they cannot stop seeing one line as longer than the other. Consider, by analogy, the checker shadow illusion: despite their knowing that there is no difference in shade between the shadowed square and the non-shadowed one, the subjects cannot * The work on this paper was supported by a scholarship from the Bednarowski Trust to the University of Glasgow (8 April — 19th May 2010). 1 Admittedly, Fodor adds that percepts represent their object in terms of what he calls “basic perceptual categories”, e.g. “dog” (Fodor 1983: 94-97). Let us assume, however, that early vision — as far as it is consider as an encapsulated process — involves no categorisation or object recognition at all (see Raftopoulos 2001: 434). That is to say, its proper products are shallow in that their phenomenal character involves only shapes, sizes, locations and colours. th 1 help seeing them as different in shade. Let us assume that the overall phenomenal content of a visual experience — or, for simplicity, the experience as such — comprises two aspects: shallow and deep. The former represents the relevant distal stimulus in terms of its size, shape, location, colour and luminosity, whereas the latter represents it as falling under certain categories, such as “dog”, “rabbit”, “duck”, and so on. The proponent of the modular account of early vision would say, therefore, that only deep aspects of visual experiences are cognitively penetrable; their shallow aspects, by contrast, result from informationally encapsulated, data-driven computations. There are, however, some experimental results that seem to undermine the idea of the cognitive impenetrability of early vision. There have been reported cases of (a) shallow and (b) phenomenally conscious visual representations that seems to be (c') affected by the top-down factors, i.e., by what the subjects believe, expect or desire. 2 Typically, these cases involve two subjects focusing their visual attention on the same distal stimulus but having different visual experiences or, more precisely, producing visual experiences that differ significantly in their shallow aspects. One reaction to such findings is to account for the reported differences in terms of beliefs that penetrate the process of early perceptual integration; that is to say, the subjects produce experiences that differ in their shallow aspects because the subjects have different beliefs about what they see. Of course, the proponents of the modular account of early vision attempt to defuse such counterexamples by arguing that the allegedly penetrating beliefs affect either the (i) preperceptual or (ii) post-perceptual stages of vision, that is, either (i) manipulate the subject's visual attention or (ii) contribute to what we call the deep aspect of experience or even to the reportable content of the subjects' introspective judgements.3 In her “Cognitive Penetration of Colour Experience: Rethinking the Issue in Light of an Indirect Mechanism” Fiona Macpherson (2012) argues that there is one case of the alleged cognitive penetrability of visual experience that cannot be explained away along the lines of the modular theory. What she has in mind is the effect reported by John L. Delk and Samuel Fillenbaum4 that suggests, it seems, that the subject's colour experiences are sensitive to the subject's beliefs about the typical colours of objects. Delk and Fillenbaum: took a sheet of paper of a uniform orangish colour. From it, they cut out shapes of various object. Some of these objects were characteristically red: a heart, a pair of lips, an apple. Some were not characteristically red: an oval, a circle, an ellipse, a horse's head, a bell, a mushroom, a square. One at a time, the cutout shapes were placed in front of a coloured background that could be altered by the subject. The background could be altered from a yellow colour through orange to a red colour by twisting a nob. The subjects were instructed to change the background colour until it 2 3 4 See Bruner and Postman 1949 and Bruner and Goodman 1947. For a discussion of these strategies see Pylyshyn 1999. See Delk and Fillenbaum 1965. 2 was the same colour as the cutout shape in front of it, so that the cutout shape could no longer be distinguished from the background. When the cutout shape of an object that had a characteristically red colour was placed in front of the background, subjects altered the background colour so that is was more of a red colour that the colour that they made the background when the cutout shape was of an object that didn't have a characteristically red colour. When the cutout shape was of an object that didn't have a characteristically red colour the subjects made the background less red and more yellow.5 In short, depending on the shape of the cutouts, the subject see some of them as orangish and others as reddish. According to Macpherson, to explain this observed regularity one has no alternative but to acknowledge that the subject's visual experiences are penetrated by his or her beliefs about the typical colours of objects. In other words, one has to agree that the shallow aspect of visual experience is cognitively penetrable.6 In my view, however, what is responsible for the regularity reported by Delk and Fillenbaum are not the subject's cognitive states but, rather, the internal organization of his or her early vision system. Admittedly, the subject may possess beliefs to the effect that hearts and tomatoes are characteristically red and ovals and bells are not. This fact, nevertheless, has nothing to do with the effect described by Delk and Fillenbaum. Therefore, rather than explaining away the case by arguing that the penetrating beliefs bears on either the pre-perceptual or post-perceptual stages of visual information processing, I claim that the mechanism that mediate colour recognition involves no penetrating beliefs at all. Undoubtedly, the mechanism is sensitive to the shape of the processed object. To acknowledge this, nevertheless, is not to say that it is affected by the subject's beliefs about the typical colours of objects. The mechanism is like priming in that it employs the subject's implicit rather than explicit memory.7 That is to say, it is autonomous and sub-personal rather than cognitively penetrable and personal. In what follows I offer a sketchy model of the autonomous mechanism of colour recognition. I also suggest how the model can be tested empirically. It is worth stressing that with this model in hand not only are we in a position to explain the Delk and Fillenbaum effect away, but we are able to consider it as a positive evidence for the modular account of early vision. Let me start with an analogy. Studies on speech perception show that the reaction time for lexical decisions (i.e., word/non-word decisions) depends on whether the target words are predictable in the sentential contexts in which they occur.8 For example, “rope” is recognized as a word more quickly when it follows the phrase “The guide threw the lady a …” than it follows the 5 Macpherson 2012: 38. In a similar vein, James A. Shirillo (1999) argues that given the results of Delk and Fillenbaum's experiment we have no alternative but to accept the idea of the cognitive penetrability of colour recognition. 7 For a discussion of priming as an implicit mechanism see Bar 2000. 8 See Swinney 1979, Forster 1981, Fodor 1983: 76-82 and Fodor 1986: 130-135. 6 3 phrase “The priest threw the lady a …”. 9 The cloze value of “rope” — that is, the value that reflects its predictability in a given context — is high in the first case whereas it is low in the second. At first sight, the data under discussion seem to support the view that language perception is cognitively penetrable, that is, that the process whose job is to represent linguistic stimuli in terms of their phonetic, lexical and grammatical properties can be affected by semantic or background information. One can maintain, for example, that the process that mediates the subject's lexical decisions is penetrated by his or her beliefs, e.g., by his or her belief to the effect that priests, unlike guides, have to do with ropes. In short, one can conclude that lexical decision is informed by topdown information flow. It turns out, however, that we can escape this conclusion and stick to the view that language perception is a purely bottom-up process. In his Modularity of the Mind Fodor puts forth the hypothesis according to which lexical analysis has access to a local database that he calls the mental lexicon. The mental lexicon can be depicted as a network of lexeme-to-lexeme associations that mediate our lexical access: Suppose the mental lexicon is a sort of connected graph, with lexical items at the nodes and with paths from each item to several others. We can think of accessing an item in the lexicon as, in effect, exciting the corresponding node; and we can assume that one of the consequences of accessing a node is that excitation spreads along the pathways that lead from it. Assume, finally, that when excitation spreads through the portion of lexical network, response thresholds for the excited nodes are correspondingly lowered. Accessing a given lexical item will thus decrease the response times for items to which it is connected.10 Let us assume that the language perception system computes a hierarchy of intermediate representations — phonetic, lexical and syntactic — that are subsequently integrated in a linguistic percept or structural description. The job of the mental lexicon, then, is to facilitate the production of the lexical representation of a verbal stimulus. Let us also take for granted that utterances are processed one word at a time. Therefore, the lexical analyses of the phrase “The guide threw the lady a …” results, among others, in accessing the node GUIDE and, consistently, in exciting a number of its neighbouring lexical nodes. In particular, it results in exciting the node ROPE, which becomes highly accessible or, in other words, is assigned a high cloze value. In short, the subject's lexical decisions are contextually facilitated. Appearances to the contrary, however, the mechanism that mediates this facilitation is associative rather than intelligent; that is to say, it makes use of the associated relations between lexical items — the totality of which makes up the mental lexicon — rather than of the beliefs that form the subject's background knowledge. The structure of the mental lexicon is shaped by the subject's linguistic experience and as such mimics the structure of his or 9 10 See Fodor 1986: 131. Fodor 1983: 80. See also Fodor 1986. 4 her knowledge. It is the former, not the latter, however, that underlies the fast and subpersonal process of lexical analyses. The process, let us add, that is cognitively impenetrable. Let us return to the issue of colour recognition. In my view, the Delk and Fillenbaum effect can be accounted for by adopting a strategy similar to the one successfully used by Fodor. I assume, namely, that the early vision system computes a collection of intermediate information-carrying states that represent a distal object in terms, respectively, of its size, shape, location, colour and luminosity, and, next, integrates these representations into a percept or primary sketch. Following Fodor11, I also assume that there is top-down information flow within the system; for example, the intermediate representation that identifies the processed stimulus by its colour can affect the representation that identifies it by its shape, and vice versa. What mediates these interactions, I think, is the system's local database that can be called the mental pallet. We can think of it as a network or graph whose nodes stands for shapes or colours. More precisely, the mental pallet consists of a number of shape-colour associations that make up its associative structure. Some of the associations are strong, whereas others are weak. For example, the node standing for the leaf-like shape is strongly associated with the node representing green and weakly associated with the nodes that stand for yellow, red and brown, respectively. Note that the associations under discussion reflect regularities to be found in nature. That is to say, most leaves we encounter are green and some leaves are yellow, red or brown. Consider now a subject who is shown a leaf-shaped cutout and recognizes its shape. Assume that one of the consequences of recognizing the shape is that of exciting the relevant shape-node in the system's mental pallet. The excitation spreads along the relevant shape-colour associations. As a result, the colour-nodes that are linked with the excited shape-node become accessible and ready for use by the colour recognition subsystem. Now, let us reconsider an example of the Delk and Fillenbaum effect: a subject who is presented with a heart-shaped orangish cutout sees it as reddish (that is, he or she alters the background in front of which the cutout is placed so that the background is more reddish than orangish). How is it possible? According to the mental pallet hypothesis, the subject's early vision system recognizes the shape of the cutout and, as a result, excites the node HEART-SHAPE of the subject's mental pallet. The excitation spreads along the relevant shape-colour associations. In particular, it makes the node RED highly accessible and ready for use by the colour-recognition subsystem. That is to say, the response threshold for red is significantly lowered and, as a result, the subject sees the cutout as reddish. His or her colour-recognition mechanism is influenced by the information of the typical colour of heart-shaped objects. Note, however, that the information does not take the form of a penetrating belief; rather, it is coded in the form of a relevant shape-colour 11 See Fodor 1983: 76. 5 association that is built in to the architecture of the subject's mental pallet. Therefore, if the mental pallet hypothesis is true, the Delk and Fillenbaum effect has nothing to do with the alleged cognitive penetration of early vision. What it suggests, rather, is that there is top-down information flow within the early vision system; the flow, let us add, that is mediated by the associative structure of the mental pallet. In other words, the colour recognition tasks are contextually facilitated. What underlies the facilitation, however, is an associative rather than intelligent process that can be likened to perceptual rather than conceptual priming (i.e., to those forms of priming that depend on implicit rather than explicit memory). Note that this moral can be easily reconciled with what Delk and Fillenbaum's experiment suggests. They conclude: One is led to the conclusion that past association of color and form does in some way influence perceived color, since that is the one respect in which the figures did clearly differ.12 Note that offering the modular account of the Delk and Fillenbaum effect I assume that there are two parallel or twin databases that store roughly the same information of the typical colours of object. The first database is local. It takes the form of the mental pallet, that is, of a network of the colour-shape associations. The second database is global or central. It takes the form of a body of beliefs about the typical colours of objects. One can ask, therefore, what is the point of having these two twin databases since, it seems, one would suffice? Note, first, that the associative mechanism that uses the mental pallet works faster than its intelligent counterpart. It works faster for two reasons. First, it does not involve the identification of the stimuli it processes in terms of such categories as “heart”, “leaf”, “tomato”, and so on; that is to say, it operates on the shallow aspects of visual experiences, since it associates colours with shapes rather than with objects. Second, it does not face the frame problem, that is, the information that can influence its operations is constrained by the associative architecture of the mental pallet. The intelligent mechanism, by contrast, would have to decide which beliefs about the possible colours of, say, hearts, have to be regarded as bearing on the process of visual integration. That is why it is good to have such an associative and unintelligent system. It works fast and facilitate our performance on the colour recognition tasks (note, for example, that in poor light conditions it is easier to recognize an object's shape rather than its colour). Second, it is worth noting that even though the structure of the mental pallet results from learning by conditioning13, it cannot be modified by our experience in a flash. Normally, it takes time to alter some of the shape-colour association and, as a result, to revise the structure of the 12 13 Delk and Fillenbaum 1965: 293, my emphasis — M.W. It is not precluded, however, that the central part of the pallet is innate. 6 mental pallet. That is why it is good to have the second, intelligent mechanism. It learns faster, accommodating the structure and content of its database to new evidence. Its function is to evaluate and, if it is necessary, to correct the outputs of the early vision system. Let me end with two methodological remarks. Note, first, that if the Delk and Fillenbaum effect is persistent — that is, if it does not disappear when the subject is told that the cutouts are in fact orangish — it can be regarded as supporting rather than undermining the modular account of early vision. Second, the mental pallet hypothesis can be empirically tested. (i) One can check whether some amnesic patients — more precisely, patients that are unable to recollect beliefs about the typical colours of objects — are sensitive to the Delk and Fillenbaum illusion. If they are, then it is reasonable to suppose that the mechanism that is responsible for the Delk and Fillenbaum effect is autonomous, that is, that it performs its operations independently of our central processes. In this respect, it can be likened to implicit rather than explicit memory. (ii) One can also ask the normal subjects to learn a number of propositions about the typical colours of cutouts whose shapes are artificial though easily recognizable. Next, the subjects' explicit memory can be tested, that is, they can be asked to recollect the colours of the cutouts they were informed of. It would be very interesting to check whether the new beliefs so formed are likely to influence the subjects' performance on the Delk and Fillenbaum task. If they are not, then it is reasonable to suppose that the colour recognition system works independently of the subject's explicit memory or, in other words, that it is cognitively impenetrable. References: Bar, Moshe, 2000, “Conscious and Nonconscious Processing of Visual Object Identity”, In: Y. Rosetti and A. Revousuo (Eds.), Dissociations: Interactions between dissociable conscious and nonconscious processing, Amsterdam: John Benjamins, 153-174. Bruner, Jerome S. and Goodman Cecile C., 1947, “Value and need as organizing factors in perception”, Journal of Abnormal and Social Psychology 42, 33-44. Bruner, Jerome S. and Postman Leo, 1949, “Perception, Cognition, and Behavior”, Journal of Personality 18(1), 14-31.” Delk John L. and Fillenbaum, Samuel, 1965, “Differences in Perceived Color as a Function of Characteristic Color”, The American Journal of Psychology 78(2), 290-293. Fodor, Jerry, 1983, The Modularity of Mind, Cambridge, Mass.: MIT Press. Fodor, Jerry, 1986, “Modularity if Mind: Jerry Fodor's Response”, In: Z.W. Pylyshyn and W. Demopoulos (Eds.), Meaning and Cognitive Structure, Norwood, New Jersey: Ablex Publishing Corporation, 129-135. 7 Forster, K.I., 1981, “Priming and the effects of sentence and lexical contexts on naming time: Evidence for autonomous lexical processing”, The Quarterly Journal of Experimental Psychology Section A 33(4), 465-495. Macpherson, Fiona, 2012, “Cognitive Penetration of Colour Experience: Rethinking the Issue in Light of an Indirect Mechanism”, Philosophy and Phenomenological Research 84(1), 24-62. Pylyshyn, Zenon W., 1999, “Is vision continuous with cognition? The case for cognitive impenetrability of visual perception”, Behavioral and Brain Sciences, 22, 341-423. Pylyshyn, Zenon W., 2003, Seeing and Visualizing, Cambridge, Mass.: MIT Press. Raftopoulos, Athanassios, 2001, “Is perception informationally encapsulated? The issue of the theory-ladeness of perception”, Cognitive Scienc, 25, 423-451. Schirillo, James A., 1999, “Color memory penetrates early vision”, Behavioral and Brain Sciences 22, 393. Swinney, David A., 1979, “Lexical Access during Sentence Comprehension: (Re)Consideration of Context Effects”, Journal of Verbal Learning and Verbal Behavior 18, 645-659. 8

RELATED PAPERS

RELATED TOPICS

Log In

Contextual Facilitation of Colour Recognition: Penetrating Beliefs or Colour-Shape Associations? (Ruch Filozoficzny, LXXV, 2, 2019, 199-208)

Contextual Facilitation of Colour Recognition: Penetrating Beliefs or Colour-Shape Associations? (Ruch Filozoficzny, LXXV, 2, 2019, 199-208)

Related Papers

RELATED PAPERS

RELATED TOPICS