A set of hypotheses is formulated for a connectionist approach to cognitive modeling. These hypotheses are shown to be incompatible with the hypotheses underlying traditional cognitive models. The connectionist models considered are massively parallel numerical computational systems that are a kind of continuous dynamical system. The numerical variables in the system correspond semantically to fine-grained features below the level of the concepts consciously used to describe the task domain. The level of analysis is intermediate between those of symbolic cognitive models (...) and neural models. The explanations of behavior provided are like those traditional in the physical sciences, unlike the explanations provided by symbolic models. (shrink)
There is disagreement over the notion of representation in cognitive science. Many investigators equate representations with symbols, that is, with syntactically defined elements in an internal symbol system. In recent years there have been two challenges to this orthodoxy. First, a number of philosophers, including many outside the symbolist orthodoxy, have argued that "representation" should be understood in its classical sense, as denoting a "stands for" relation between representation and represented. Second, there has been a growing challenge to orthodoxy under (...) the banner of connectionism. Although this connectionist challenge has evoked emotionally-charged rebukes from the symbolist camp, connectionists as a group have not articulated a conception of representation to replace the symbolist view. Nonetheless, most agree on the need for a nonsymbolic notion of representation. This paper advances a "stands for" sense of representation as primary by incorporating it into a general approach to cognitive science consonant with the connectionism. The idea is to marry connectionism to a particular version of functionalism, viz., one that builds its notion of "function" on the similarity between functional analysis in biology and in psychology. It builds on my earlier work adapting and revising Marr's tri-level approach to cognition. My proposal frees Marr's analysis from its moorings in the orthodox symbolic view of representation and allies it with a notion of functional analysis akin to that proposed by Cummins, developed further by Haugeland, and invoked by Millikan and Dretske. Unlike these authors, however, I do not wed representation to a general belief-desire analysis of behavior. Rather, I follow what I take to be the lesson of psychological practice, according to which the investigator does not seek to explain behavior in general but seeks to analyze the cognitive capacities that underlie behavior, such as vision, memory, learning, and linguistic capacities. Accordingly, ascriptions of representational content are made not by working back from belief-desire ascriptions, but in the context of forming psychological models to account for specific cognitive capacities. Because conceptions of representation in cognitive science typically are embedded in a general approach to the study of cognition, arguments for the comparative plausibility of a particular conception of representation must scout these larger frameworks. I therefore begin by characterizing the interlocking set of assumptions that gave life to the orthodox symbolist approach, paying special attention to the complementarity between representation and process that naturally arises from those assumptions. I then consider two versions of an alternative approach to representation: Dretske's and my own. Finally, I urge the merits of the "cognitive capacities" over the "belief-desire" approach to the subject-matter of cognitive science. (shrink)
The aim of this paper is to demonstrate a _prima facie_ tension between our commonsense conception of ourselves as thinkers and the connectionist programme for modelling cognitive processes. The language of thought hypothesis plays a pivotal role. The connectionist paradigm is opposed to the language of thought; and there is an argument for the language of thought that draws on features of the commonsense scheme of thoughts, concepts, and inference. Most of the paper (Sections 3-7) is taken up with the (...) argument for the language of thought hypothesis. The argument for an opposition between connectionism and the language of thought comes towards the end (Section 8), along with some discussion of the potential eliminativist consequences (Sections 9 and. (shrink)
There is disagreement over the notion of representation in cognitive science. Many investigators equate representations with symbols, that is, with syntactically defined elements in an internal symbol system. In recent years there have been two challenges to this orthodoxy. First, a number of philosophers, including many outside the symbolist orthodoxy, have argued that "representation" should be understood in its classical sense, as denoting a "stands for" relation between representation and represented. Second, there has been a growing challenge to orthodoxy under (...) the banner of connectionism. Although this connectionist challenge has evoked emotionally-charged rebukes from the symbolist camp, connectionists as a group have not articulated a conception of representation to replace the symbolist view. Nonetheless, most agree on the need for a nonsymbolic notion of representation. This paper advances a "stands for" sense of representation as primary by incorporating it into a general approach to cognitive science consonant with the connectionism. The idea is to marry connectionism to a particular version of functionalism, viz., one that builds its notion of "function" on the similarity between functional analysis in biology and in psychology. It builds on my earlier work adapting and revising Marr's tri-level approach to cognition. My proposal frees Marr's analysis from its moorings in the orthodox symbolic view of representation and allies it with a notion of functional analysis akin to that proposed by Cummins, developed further by Haugeland, and invoked by Millikan and Dretske. Unlike these authors, however, I do not wed representation to a general belief-desire analysis of behavior. Rather, I follow what I take to be the lesson of psychological practice, according to which the investigator does not seek to explain behavior in general but seeks to analyze the cognitive capacities that underlie behavior, such as vision, memory, learning, and linguistic capacities. Accordingly, ascriptions of representational content are made not by working back from belief-desire ascriptions, but in the context of forming psychological models to account for specific cognitive capacities. Because conceptions of representation in cognitive science typically are embedded in a general approach to the study of cognition, arguments for the comparative plausibility of a particular conception of representation must scout these larger frameworks. I therefore begin by characterizing the interlocking set of assumptions that gave life to the orthodox symbolist approach, paying special attention to the complementarity between representation and process that naturally arises from those assumptions. I then consider two versions of an alternative approach to representation: Dretske's and my own. Finally, I urge the merits of the "cognitive capacities" over the "belief-desire" approach to the subject-matter of cognitive science. (shrink)
Philosophers have argued that on the prevailing theory of mind, functionalism, the fact that mental states are multiply realizable or can be instantiated in a variety of different physical forms, at least in principle, shows that materialism or physical is probably false. A similar argument rejects the relevance to psychology of connectionism, which holds that mental states are embodied and and constituted by connectionist neural networks. These arguments, I argue, fall before reductios ad absurdam, proving too much -- they (...) apply as well to genes, which are multiply realizable, but the reduction of which to DNA is one the core cases of scientific reductive explanation, a reduction if anything is. -/- I suggest that psychology, like biology, be what I call "provincialized," abandon claims to universal validity, except as an idealization, and treat different classes of cognizers differently from an an explanatory perspective. This would permit specifies specific, or more precisely, provincial reductions of different psychologies that might be multiply realized if such reductions were available. Connectionism may be the foundation of a reduction of human psychology, and thus biology and connectionism retain their relevance to psychology, and physicalism or materialism is consistent with functionalism. (shrink)
Human agents draw a variety of inferences effortlessly, spontaneously, and with remarkable efficiency – as though these inferences were a reflexive response of their cognitive apparatus. Furthermore, these inferences are drawn with reference to a large body of background knowledge. This remarkable human ability seems paradoxical given the complexity of reasoning reported by researchers in artificial intelligence. It also poses a challenge for cognitive science and computational neuroscience: How can a system of simple and slow neuronlike elements represent a large (...) body of systemic knowledge and perform a range of inferences with such speed? We describe a computational model that takes a step toward addressing the cognitive science challenge and resolving the artificial intelligence paradox. We show how a connectionist network can encode millions of facts and rules involving n-ary predicates and variables and perform a class of inferences in a few hundred milliseconds. Efficient reasoning requires the rapid representation and propagation of dynamic bindings. Our model (which we refer to as SHRUTI) achieves this by representing (1) dynamic bindings as the synchronous firing of appropriate nodes, (2) rules as interconnection patterns that direct the propagation of rhythmic activity, and (3) long-term facts as temporal pattern-matching subnetworks. The model is consistent with recent neurophysiological evidence that synchronous activity occurs in the brain and may play a representational role in neural information processing. The model also makes specific psychologically significant predictions about the nature of reflexive reasoning. It identifies constraints on the form of rules that may participate in such reasoning and relates the capacity of the working memory underlying reflexive reasoning to biological parameters such as the lowest frequency at which nodes can sustain synchronous oscillations and the coarseness of synchronization. (shrink)
The philosophy of cognitive science has recently become one of the most exciting and fastest growing domains of philosophical inquiry and analysis. Until the early 1980s, nearly all of the models developed treated cognitive processes -- like problem solving, language comprehension, memory, and higher visual processing -- as rule-governed symbol manipulation. However, this situation has changed dramatically over the last half dozen years. In that period there has been an enormous shift of attention toward connectionist models of cognition that are (...) inspired by the network-like architecture of the brain. Because of their unique architecture and style of processing, connectionist systems are generally regarded as radically different from the more traditional symbol manipulation models. This collection was designed to provide philosophers who have been working in the area of cognitive science with a forum for expressing their views on these recent developments. Because the symbol-manipulating paradigm has been so important to the work of contemporary philosophers, many have watched the emergence of connectionism with considerable interest. The contributors take very different stands toward connectionism, but all agree that the potential exists for a radical shift in the way many philosophers think of various aspects of cognition. Exploring this potential and other philosophical dimensions of connectionist research is the aim of this volume. (shrink)
Philosophy and Memory Traces defends two theories of autobiographical memory. One is a bewildering historical view of memories as dynamic patterns in fleeting animal spirits, nervous fluids which rummaged through the pores of brain and body. The other is new connectionism, in which memories are 'stored' only superpositionally, and reconstructed rather than reproduced. Both models, argues John Sutton, depart from static archival metaphors by employing distributed representation, which brings interference and confusion between memory traces. Both raise urgent issues about (...) control of the personal past, and about relations between self and body. Sutton demonstrates the role of bizarre body fluids in moral physiology, as philosophers from Descartes and Locke to Coleridge struggled to control their own innards and impose cognitive discipline on 'the phantasmal chaos of association'. Going on to defend connectionism against Fodor and critics of passive mental representations, he shows how problems of the self are implicated in cognitive science. (shrink)
Over the last decade, fully distributed models have become dominant in connectionist psychological modelling, whereas the virtues of localist models have been underestimated. This target article illustrates some of the benefits of localist modelling. Localist models are characterized by the presence of localist representations rather than the absence of distributed representations. A generalized localist model is proposed that exhibits many of the properties of fully distributed models. It can be applied to a number of problems that are difficult for fully (...) distributed models, and its applicability can be extended through comparisons with a number of classic mathematical models of behaviour. There are reasons why localist models have been underused, though these often misconstrue the localist position. In particular, many conclusions about connectionist representation, based on neuroscientific observation, can be called into question. There are still some problems inherent in the application of fully distributed systems and some inadequacies in proposed solutions to these problems. In the domain of psychological modelling, localist modelling is to be preferred. Key Words: choice; competition; connectionist modelling; consolidation; distributed; localist; neural networks; reaction-time. (shrink)
Critics of the computational connectionism of the last decade suggest that it shares undesirable features with earlier empiricist or associationist approaches, and with behaviourist theories of learning. To assess the accuracy of this charge the works of earlier writers are examined for the presence of such features, and brief accounts of those found are given for Herbert Spencer, William James and the learning theorists Thorndike, Pavlov and Hull. The idea that cognition depends on associative connections among large networks of (...) neurons is indeed one with precedents, although the implications of this for psychological issues have been interpreted variously — not all versions of connectionism are alike. (shrink)
In the first section of the article, we examine some recent criticisms of the connectionist enterprise: first, that connectionist models are fundamentally behaviorist in nature (and, therefore, non-cognitive), and second that connectionist models are fundamentally associationist in nature (and, therefore, cognitively weak). We argue that, for a limited class of connectionist models (feed-forward, pattern-associator models), the first criticism is unavoidable. With respect to the second criticism, we propose that connectionist modelsare fundamentally associationist but that this is appropriate for building models (...) of human cognition. However, we do accept the point that there are cognitive capacities for which any purely associative model cannot provide a satisfactory account. The implication that we draw from is this is not that associationist models and mechanisms should be scrapped, but rather that they should be enhanced.In the next section of the article, we identify a set of connectionist approaches which are characterized by “active symbols” — recurrent circuits which are the basis of knowledge representation. We claim that such approaches avoid criticisms of behaviorism and are, in principle, capable of supporting full cognition. In the final section of the article, we speculate at some length about what we believe would be the characteristics of a fully realized active symbol system. This includes both potential problems and possible solutions (for example, mechanisms needed to control activity in a complex recurrent network) as well as the promise of such systems (in particular, the emergence of knowledge structures which would constitute genuine internal models). (shrink)
Language learning requires linguistic input, but several studies have found that knowledge of second language rules does not seem to improve with more language exposure. One reason for this is that previous studies did not factor out variation due to the different rules tested. To examine this issue, we reanalyzed grammaticality judgment scores in Flege, Yeni-Komshian, and Liu's study of L2 learners using rule-related predictors and found that, in addition to the overall drop in performance due to a sensitive period, (...) L2 knowledge increased with years of input. Knowledge of different grammar rules was negatively associated with input frequency of those rules. To better understand these effects, we modeled the results using a connectionist model that was trained using Korean as a first language and then English as an L2. To explain the sensitive period in L2 learning, the model's learning rate was reduced in an age-related manner. By assigning different learning rates for syntax and lexical learning, we were able to model the difference between early and late L2 learners in input sensitivity. The model's learning mechanism allowed transfer between the L1 and L2, and this helped to explain the differences between different rules in the grammaticality judgment task. This work demonstrates that an L1 model of learning and processing can be adapted to provide an explicit account of how the input and the sensitive period interact in L2 learning. (shrink)
This paper offers both a theoretical and an experimental perspective on the relationship between connectionist and Classical (symbol-processing) models. Firstly, a serious flaw in Fodor and Pylyshyn’s argument against connectionism is pointed out: if, in fact, a part of their argument is valid, then it establishes a conclusion quite different from that which they intend, a conclusion which is demonstrably false. The source of this flaw is traced to an underestimation of the differences between localist and distributed representation. It (...) has been claimed that distributed representations cannot support systematic operations, or that if they can, then they will be mere implementations of traditional ideas. This paper presents experimental evidence against this conclusion: distributed representations can be used to support direct structure-sensitive operations, in a man- ner quite unlike the Classical approach. Finally, it is argued that even if Fodor and Pylyshyn’s argument that connectionist models of compositionality must be mere implementations were correct, then this would still not be a serious argument against connectionism as a theory of mind. (shrink)
It is often assumed that similar domain-specific behavioural impairments found in cases of adult brain damage and developmental disorders correspond to similar underlying causes, and can serve as convergent evidence for the modular structure of the normal adult cognitive system. We argue that this correspondence is contingent on an unsupported assumption that atypical development can produce selective deficits while the rest of the system develops normally (Residual Normality), and that this assumption tends to bias data collection in the field. Based (...) on a review of connectionist models of acquired and developmental disorders in the domains of reading and past tense, as well as on new simulations, we explore the computational viability of Residual Normality and the potential role of development in producing behavioural deficits. Simulations demonstrate that damage to a developmental model can produce very different effects depending on whether it occurs prior to or following the training process. Because developmental disorders typically involve damage prior to learning, we conclude that the developmental process is a key component of the explanation of endstate impairments in such disorders. Further simulations demonstrate that in simple connectionist learning systems, the assumption of Residual Normality is undermined by processes of compensation or alteration elsewhere in the system. We outline the precise computational conditions required for Residual Normality to hold in development, and suggest that in many cases it is an unlikely hypothesis. We conclude that in developmental disorders, inferences from behavioural deficits to underlying structure crucially depend on developmental conditions, and that the process of ontogenetic development cannot be ignored in constructing models of developmental disorders. Key Words: Acquired and developmental disorders; connectionist models; modularity; past tense; reading. (shrink)
A competence model describes the abstract structure of a solution to some problem. or class of problems, facing the would-be intelligent system. Competence models can be quite derailed, specifying far more than merely the function to be computed. But for all that, they are pitched at some level of abstraction from the details of any particular algorithm or processing strategy which may be said to realize the competence. Indeed, it is the point and virtue of such models to specify some (...) equivalence class of algorithms/processing strategies so that the common properties highlighted by the chosen class may feature in psychologically interesting accounts. A question arises concerning the type of relation a theorist might expect to hold between such a competence model and a psychologically real processing strategy. Classical work in cognitive science expects the actual processing to depend on explicit or tacit knowledge of the competence theory. Connectionist work, for reasons to be explained, represents a departure from this norm. But the precise way in which a connectionist approach may disturb the satisfying classical symmetry of competence and processing has yet to be properly specified. A standard ?Newtonian? connectionist account, due to Paul Smolensky, is discussed and contrasted with a somewhat different ?rogue? account. A standard connectionist understanding has it that a classical competence theory describes an idealized subset of a network's behaviour. But the network's behaviour is not to be explained by its embodying explicit or tacit knowledge of the information laid out in the competence theory. A rogue model, by contrast, posits either two systems, or two aspects of a single system, such that one system does indeed embody the knowledge laid out in the competence theory. (shrink)
This volume provides an introduction to and review of key contemporary debates concerning connectionism, and the nature of explanation and methodology in cognitive psychology. The first debate centers on the question of whether human cognition is best modeled by classical or by connectionist architectures. The second centres on the question of the compatibility between folk, or commonsense, psychological explanation and explanations based on connectionist models of cognition. Each of the two sections includes a classic reading along with important responses, (...) and concludes with a specially commissioned reply by the main contributor. The editorial introductions provide a comprehensive survey and map through the debates. (shrink)
Recent experimental findings reveal dissociations of conscious and nonconscious performance in many fields of psychological research, suggesting that conscious and nonconscious effects result from qualitatively different processes. A connectionist view of these processes is put forward in which consciousness is the consequence of construction processes taking place in three types of working memory in a specific type of recurrent neural network. The recurrences arise by feeding back output to the input of a central (representational) network. They are assumed to be (...) intemalizations of motor-sensory feedback through the environment. In this manner, a subvocal-phonological, a visuo-spatial, and a somatosensory working memory may have developed. Representations in the central network, which constitutes long-term memory, can be kept active by rehearsal in the feedback loops. The sequentially recurrent architecture allows for recursive symbolic operations and the formation of (auditory, visual, or somatic) models of the external world which can be maintained, transformed and temporarily combined with other information in working memory. Moreover, the quasi-input from the loop directs subsequent attentional processing. The view may contribute to a formal framework to accommodate findings from disparate fields such as working memory, sequential reasoning, and conscious and nonconscious processes in memory and emotion. In theory, but probably not very soon in practice, such connectionist models might simulate aspects of consciousness. (shrink)
This paper investigates connectionism's potential to solve the frame problem. The frame problem arises in the context of modelling the human ability to see the relevant consequences of events in a situation. It has been claimed to be unsolvable for classical cognitive science, but easily manageable for connectionism. We will focus on a representational approach to the frame problem which advocates the use of intrinsic representations. We argue that although connectionism's distributed representations may look promising from this (...) perspective, doubts can be raised about the potential of distributed representations to allow large amounts of complexly structured information to be adequately encoded and processed. It is questionable whether connectionist models that are claimed to effectively represent structured information can be scaled up to a realistic extent. We conclude that the frame problem provides a difficulty to connectionism that is no less serious than the obstacle it constitutes for classical cognitive science. (shrink)
It is not widely realised that Turing was probably the first person to consider building computing machines out of simple, neuron-like elements connected together into networks in a largely random manner. Turing called his networks unorganised machines. By the application of what he described as appropriate interference, mimicking education an unorganised machine can be trained to perform any task that a Turing machine can carry out, provided the number of neurons is sufficient. Turing proposed simulating both the behaviour of the (...) network and the training process by means of a computer program. We outline Turing's connectionist project of 1948. (shrink)
Fodor and Pylyshyn's critique of connectionism has posed a challenge to connectionists: Adequately explain such nomological regularities as systematicity and productivity without postulating a "language of thought" (LOT). Some connectionists like Smolensky took the challenge very seriously, and attempted to meet it by developing models that were supposed to be non-classical. At the core of these attempts lies the claim that connectionist models can provide a representational system with a combinatorial syntax and processes sensitive to syntactic structure. They are (...) not implementation models because, it is claimed, the way they obtain syntax and structure sensitivity is not "concatenative," hence "radically different" from the way classicists handle them. In this paper, I offer an analysis of what it is to physically satisfy/realize a formal system. In this context, I examine the minimal truth-conditions of LOT Hypothesis. From my analysis it will follow that concatenative realization of formal systems is irrelevant to LOTH since the very notion of LOT is indifferent to such an implementation level issue as concatenation. I will conclude that to the extent to which they can explain the law-like cognitive regularities, a certain class of connectionist models proposed as radical alternatives to the classical LOT paradigm will in fact turn out to be LOT models, even though new and potentially very exciting ones. (shrink)
When cognitive scientists apply computational theory to the problem of phenomenal consciousness, as many of them have been doing recently, there are two fundamentally distinct approaches available. Either consciousness is to be explained in terms of the nature of the representational vehicles the brain deploys; or it is to be explained in terms of the computational processes defined over these vehicles. We call versions of these two approaches _vehicle_ and _process_ theories of consciousness, respectively. However, while there may be space (...) for vehicle theories of consciousness in cognitive science, they are relatively rare. This is because of the influence exerted, on the one hand, by a large body of research which purports to show that the explicit representation of information in the brain and conscious experience are _dissociable_, and on the other, by the _classical_ computational theory of mind – the theory that takes human cognition to be a species of symbol manipulation. But two recent developments in cognitive science combine to suggest that a reappraisal of this situation is in order. First, a number of theorists have recently been highly critical of the experimental methodologies employed in the dissociation studies – so critical, in fact, it’s no longer reasonable to assume that the dissociability of conscious experience and explicit representation has been adequately demonstrated. Second, classicism, as a theory of human cognition, is no longer as dominant in cognitive science as it once was. It now has a lively competitor in the form of _connectionism; _and connectionism, unlike classicism, does have the computational resources to support a robust vehicle theory of consciousness. In this paper we develop and defend this connectionist vehicle theory of consciousness. It takes the form of the following simple empirical hypothesis: _phenomenal experience consists in the explicit_ _representation of information in neurally realized PDP networks_.. (shrink)
The relation between logic and thought has long been controversial, but has recently influenced theorizing about the nature of mental processes in cognitive science. One prominent tradition argues that to explain the systematicity of thought we must posit syntactically structured representations inside the cognitive system which can be operated upon by structure sensitive rules similar to those employed in systems of natural deduction. I have argued elsewhere that the systematicity of human thought might better be explained as resulting from the (...) fact that we have learned natural languages which are themselves syntactically structured. According to this view, symbols of natural language are external to the cognitive processing system and what the cognitive system must learn to do is produce and comprehend such symbols. In this paper I pursue that idea by arguing that ability in natural deduction itself may rely on pattern recognition abilities that enable us to operate on external symbols rather than encodings of rules that might be applied to internal representations. To support this suggestion, I present a series of experiments with connectionist networks that have been trained to construct simple natural deductions in sentential logic. These networks not only succeed in reconstructing the derivations on which they have been trained, but in constructing new derivations that are only similar to the ones on which they have been trained. (shrink)
This is an overview of recent philosophical discussion about connectionism and the foundations of cognitive science. Connectionist modeling in cognitive science is described. Three broad conceptions of the mind are characterized, and their comparative strengths and weaknesses are discussed: (1) the classical computational conception in cognitive science; (2) a popular foundational interpretation of connectionism that John Tienson and I call “non-sentential computationalism”; and (3) an alternative interpretation of connectionism we call “dynamical cognition.” Also discussed are two recent (...) philosophical attempts to enlist connectionism in defense of eliminativism about folk psychology. (shrink)
The introduction of connectionist or parallel distributed processing (PDP) systems to model cognitive functions has raised the question of the possible relations between these models and traditional information processing models which employ rules to manipulate representations. After presenting a brief account of PDP models and two ways in which they are commonly interpreted by those seeking to use them to explain cognitive functions, I present two ways one might relate these models to traditional information processing models and so not totally (...) repudiate the tradition of modelling cognition through systems of rules and representations. The proposal that seems most promising is that PDP-type structures might provide the underlying framework in which a rule and representation model might be implemented. To show how one might pursue such a strategy, I discuss recent research by Barsalou on the instability of concepts and show how that might be accounted for in a system whose microstructure had a PDP architecture. I also outline how adopting a multi-leveled view of the mind, where on one level the mind employed a PDP-type system and at another level constituted a rule processing system, would allow researchers to relocate some problems which seemed difficult to explain at one level, such as the capacity for concept learning, to another level where it could be handled in a straightforward manner. (shrink)
In this paper, I define tacit knowledge as a kind of causal-explanatory structure, mirroring the derivational structure in the theory that is tacitly known. On this definition, tacit knowledge does not have to be explicitly represented. I then take the notion of a modular theory, and project the idea of modularity to several different levels of description: in particular, to the processing level and the neurophysiological level. The fundamental description of a connectionist network lies at a level between the processing (...) level and the physiological level. At this level, connectionism involves a characteristic departure from modularity, and a correlative absence of syntactic structure. This is linked to the fact that tacit knowledge descriptions of networks are only approximately true. A consequence is that strict causal systematicity in cognitive processes poses a problem for the connectionist programme. (shrink)
We examine two connectionist networks—a fractal learning neural network (FLNN) and a Simple Recurrent Network (SRN)—that are trained to process center-embedded symbol sequences. Previous work provides evidence that connectionist networks trained on infinite-state languages tend to form fractal encodings. Most such work focuses on simple counting recursion cases (e.g., anbn), which are not comparable to the complex recursive patterns seen in natural language syntax. Here, we consider exponential state growth cases (including mirror recursion), describe a new training scheme that seems (...) to facilitate learning, and note that the connectionist learning of these cases has a continuous metamorphosis property that looks very different from what is achievable with symbolic encodings. We identify a property—ragged progressive generalization—which helps make this difference clearer. We suggest two conclusions. First, the fractal analysis of these more complex learning cases reveals the possibility of comparing connectionist networks and symbolic models of grammatical structure in a principled way—this helps remove the black box character of connectionist networks and indicates how the theory they support is different from symbolic approaches. Second, the findings indicate the value of future, linked mathematical and empirical work on these models—something that is more possible now than it was 10 years ago. (shrink)
One new tradition that has emerged from early research on autonomous robots is embodied cognitive science. This paper describes the relationship between embodied cognitive science and a related tradition, synthetic psychology. It is argued that while both are synthetic, embodied cognitive science is antirepresentational while synthetic psychology still appeals to representations. It is further argued that modern connectionism offers a medium for conducting synthetic psychology, provided that researchers analyze the internal representations that their networks develop. The paper then provides (...) a detailed example of the synthetic approach by showing how the construction (and subsequent analysis) of a connectionist network can be used to contribute to a theory of how humans solve Piaget's classic balance scale task. (shrink)
Recently some philosophers have urged that connectionist artificial intelligence is (potentially) eliminative for the propositional attitudes of folk psychology. At the same time, however, these philosophers have also insisted that since philosophy of science has failed to provide criteria distinguishing ontologically retentive from eliminative theory changes, the resulting eliminativism is not principled. Application of some resources developed within the semantic view of scientific theories, particularly recent formal work on the theory reduction relation, reveals these philosophers to be wrong in this (...) second contention, yet by and large correct in the first. (shrink)
Ramsey, Stick and Garon (1991) argue that if the correct theory of mind is some parallel distributed processing theory, then folk psychology must be false. Their idea is that if the nodes and connections that encode one representation are causally active then all representations encoded by the same set of nodes and connections are also causally active. We present a clear, and concrete, counterexample to RSG's argument. In conclusion, we suggest that folk psychology and connectionism are best understood as (...) complementary theories. Each has different limitations, yet each will co-evolve with the other in an overlapping domain of 'normal' psychology. (shrink)
Fodor's and Pylyshyn's stand on systematicity in thought and language has been debated and criticized. Van Gelder and Niklasson, among others, have argued that Fodor and Pylyshyn offer no precise definition of systematicity. However, our concern here is with a learning based formulation of that concept. In particular, Hadley has proposed that a network exhibits strong semantic systematicity when, as a result of training, it can assign appropriate meaning representations to novel sentences (both simple and embedded) which contain words in (...) syntactic positions they did not occupy during training. The experience of researchers indicates that strong systematicity in any form is difficult to achieve in connectionist systems.Herein we describe a network which displays strong semantic systematicity in response to Hebbian, connectionist training. During training, two-thirds of all nouns are presented only in a single syntactic position (either as grammatical subject or object). Yet, during testing, the network correctly interprets thousands of sentences containing those nouns in novel positions. In addition, the network generalizes to novel levels of embedding. Successful training requires a, corpus of about 1000 sentences, and network training is quite rapid. The architecture and learning algorithms are purely connectionist, but classical insights are discernible in one respect, viz, that complex semantic representations spatially contain their semantic constituents. However, in other important respects, the architecture is distinctly non-classical. (shrink)
Recent work in the methodology of connectionist explanation has I'ocrrsccl on the notion of levels of explanation. Specific issucs in conncctionisrn hcrc intersect with rvider areas of debate in the philosophy of psychology and thc philosophy of science generally. The issues I raise in this chapter, then, are not unique to cognitive science; but they arise in new and important contexts when connectionism is taken seriously as a model of cognition. The general questions are the relation between levels and (...) the status of levels which have no obvious relation to others. In speaking of levels, what is the connection, if there is one, between explanation and ontology? Which, if any, conccpt of reduction is applicable to connectionist systems? What kind of legitinrtcy can the constructs of common sense psychology, or of that vclsion ol intentional realism represented by classical symbol-systems n I, hirvc irr ir full-scale connectionist theory of mind? (shrink)
Recently, connectionist models have been developed that seem to exhibit structuresensitive cognitive capacities without executing a program. This paper examines one such model and argues that it does execute a program. The argument proceeds by showing that what is essential to running a program is preserving the functional structure of the program. It has generally been assumed that this can only be done by systems possessing a certain temporalcausal organization. However, counterfactualpreserving functional architecture can be instantiated in other ways, for (...) example geometrically, which are realizable by connectionist networks. (shrink)
Fodor and Pylyshyn (1988) have argued that the cognitive architecture is not Connectionist. Their argument takes the following form: (1) the cognitive architecture is Classical; (2) Classicalism and Connectionism are incompatible; (3) therefore the cognitive architecture is not Connectionist. In this essay I argue that Fodor and Pylyshyn's defenses of (1) and (2) are inadequate. Their argument for (1), based on their claim that Classicalism best explains the systematicity of cognitive capacities, is an invalid instance of inference to the (...) best explanation. And their argument for (2) turns out to be question-begging. The upshot is that, while Fodor and Pylyshyn have presented Connectionists with the important empirical challenge of explaining systematicity, they have failed to provide sufficient reason for inferring that the cognitive architecture is Classical and not Connectionist. (shrink)
Not long ago the standard view in cognitive science was that representations are symbols in an internal representational system or language of thought and that psychological processes are computations defined over such representations. This orthodoxy has been challenged by adherents of functional analysis and by connectionists. Functional analysis as practiced by Marr is consistent with an analysis of representation that grants primacy to a stands for conception of representation. Connectionism is also compatible with this notion of representation; when conjoined (...) with functional analysis, it provides a means of analyzing psychological systems in term of rules and representations without becoming committed to symbolism. Direct theorists, who rejected the orthodox symbolist conception of representation because it violated their strictures against cognitive mediational mechanisms, should find it possible to accept rules-and-representations and information-processing analyses of the mechanisms of information pickup couched in terms of functional analysis. (shrink)
At present, the prevailing Connectionist methodology forrepresenting rules is toimplicitly embody rules in neurally-wired networks. That is, the methodology adopts the stance that rules must either be hard-wired or trained into neural structures, rather than represented via explicit symbolic structures. Even recent attempts to implementproduction systems within connectionist networks have assumed that condition-action rules (or rule schema) are to be embodied in thestructure of individual networks. Such networks must be grown or trained over a significant span of time. However, arguments (...) are presented herein that humanssometimes follow rules which arevery rapidly assignedexplicit internal representations, and that humans possessgeneral mechanisms capable of interpreting and following such rules. In particular, arguments are presented that thespeed with which humans are able to follow rules ofnovel structure demonstrates the existence of general-purpose rule following mechanisms. It is further argued that the existence of general-purpose rule following mechanisms strongly indicates that explicit rule following is not anisolated phenomenon, but may well be a common and important aspect of cognition. The relationship of the foregoing conclusions to Smolensky''s view of explicit rule following is also explored. The arguments presented here are pragmatic in nature, and are contrasted with thekind of arguments developed by Fodor and Pylyshyn in their recent, influential paper. (shrink)
In Connectionism and the Philosophy of Psychology, Horgan and Tienson (1996) argue that cognitive processes, pace classicism, are not governed by exceptionless, representation-level rules; they are instead the work of defeasible cognitive tendencies subserved by the non-linear dynamics of the brains neural networks. Many theorists are sympathetic with the dynamical characterisation of connectionism and the general (re)conception of cognition that it affords. But in all the excitement surrounding the connectionist revolution in cognitive science, it has largely gone unnoticed (...) that connectionism adds to the traditional focus on computational processes, a new focus one on the vehicles of mental representation, on the entities that carry content through the mind. Indeed, if Horgan and Tiensons dynamical characterisation of connectionism is on the right track, then so intimate is the relationship between computational processes and representational vehicles, that connectionist cognitive science is committed to a resemblance theory of mental content. (shrink)
This paper discusses the relation between cognitive and implementational levels of analysis. Chalmers (1990, 1993) argues that a connectionist implementation of a classical cognitive architecture possesses a compositional semantics, and therefore undercuts Fodor and Pylyshyn's (1988) argument that connectionist networks cannot possess a compositional semantics. I argue that Chalmers argument misconstrues the relation between cognitive and implementational levels of analysis. This paper clarifies the distinction, and shows that while Fodor and Pylyshyn's argument survives Chalmers' critique, it cannot be used to (...) establish the irrelevance of neurophysiological implementation to cognitive modeling; some aspects of Chater and Oaksford's (1990) response to Fodor and Pylyshyn, though not all, are therefore cogent. (shrink)
Along with the increasing popularity of connectionist language models has come a number of provocative suggestions about the challenge these models present to Chomsky's arguments for nativism. The aim of this paper is to assess these claims. We begin by reconstructing Chomsky's argument from the poverty of the stimulus and arguing that it is best understood as three related arguments, with increasingly strong conclusions. Next, we provide a brief introduction to connectionism and give a quick survey of recent efforts (...) to develop networks that model various aspects of human linguistic behavior. Finally, we explore the implications of this research for Chomsky's arguments. Our claim is that the relation between connectionism and Chomsky's views on innate knowledge is more complicated than many have assumed, and that even if these models enjoy considerable success the threat they pose for linguistic nativism is small. (shrink)
This paper presents considerations in favour of the view that traditional (classical) architectures can be seen as emergent features of connectionist networks with distributed representation. A recent paper by William Bechtel (1988) which argues for a similar conclusion is unsatisfactory in that it fails to consider whether the compositional syntax and semantics attributed to mental representations by classical models can emerge within a connectionist network. The compatibility of the two paradigms hinges largely, I suggest, on how this question is answered. (...) Focusing on the issue of syntax, I argue that while such structure is lacking in connectionist models with local representation, it can be accommodated within networks where representation is distributed. I discuss an important paper by Smolenski (1988) which attempts to show how connectionists can incorporate the relevant syntactic structure, suggesting that some criticisms levelled against that paper by Fodor & Pylyshyn (1988) are wanting. I then go on to indicate a strategy by which a compositional syntax and semantics can be defined for the sort of network that Smolenski describes. I conclude that since the connectionist can respect the central tenets of classicism, the two approaches are compatible with one another. (shrink)
Classical symbolic computational models of cognition are at variance with the empirical findings in the cognitive psychology of memory and inference. Standard symbolic computers are well suited to remembering arbitrary lists of symbols and performing logical inferences. In contrast, human performance on such tasks is extremely limited. Standard models donot easily capture content addressable memory or context sensitive defeasible inference, which are natural and effortless for people. We argue that Connectionism provides a more natural framework in which to model (...) this behaviour. In addition to capturing the gross human performance profile, Connectionist systems seem well suited to accounting for the systematic patterns of errors observed in the human data. We take these arguments to counter Fodor and Pylyshyn's (1988) recent claim that Connectionism is, in principle, irrelevant to psychology. (shrink)
Marinov''s critique I argue, is vitiated by its failure to recognize the distinctive role of superposition within the distributed connectionist paradigm. The use of so-called subsymbolic distributed encodings alone is not, I agree, enough to justify treating distributed connectionism as a distinctive approach. It has always been clear that microfeatural decomposition is both possible and actual within the confines of recognizably classical approaches. When such approaches also involve statistically-driven learning algorithms — as in the case of ID3 — the (...) fundamental differences become even harder to spot. To see them, it is necessary to consider not just the nature of an acquired input-output function but the nature of the representational scheme underlying it. Differences between such schemes make themselves best felt outside the domain of immediate problem solving. It is in the more extended contexts of performance DURING learning and cognitive change as a result of SUBSEQUENT training on new tasks (or simultaneous training on several tasks) that the effects of superpositional storage techniques come to the fore. I conclude that subsymbols, distribution and statistically driven learning alone are indeed not of the essence. But connectionism is not just about subsymbols and distribution. It is about the generation of whole subsymbol SYSTEMS in which multiple distributed representations are created and superposed. (shrink)
There is currently a debate over whether cognitive architecture is classical or connectionist in nature. One finds the following three comparisons between classical architecture and connectionist architecture made in the pro-connectionist literature in this debate: (1) connectionist architecture is neurally plausible and classical architecture is not; (2) connectionist architecture is far better suited to model pattern recognition capacities than is classical architecture; and (3) connectionist architecture is far better suited to model the acquisition of pattern recognition capacities by learning than (...) is classical architecture. If true, (1)–(3) would yield a compelling case against the view that cognitive architecture is classical, and would offer some reason to think that cognitive architecture may be connectionist. We first present the case for (1)–(3) in the very words of connectionist enthusiasts. We then argue that the currently available evidence fails to support any of (1)–(3). (shrink)
This paper reviews four significant advances on the feedforward architecture that has dominated discussions of connectionism. The first involves introducing modularity into networks by employing procedures whereby different networks learn to perform different components of a task, and a Gating Network determines which network is best equiped to respond to a given input. The second consists in the use of recurrent inputs whereby information from a previous cycle of processing is made available on later cycles. The third development involves (...) developing compressed representations of strings in which there is no longer an explicit encoding of the components but where information about the structure of the original string can be recovered and so is present functionally. The final advance entails using connectionist learning procedures not just to change weights in networks but to change the patterns used as inputs to the network. These advances significantly increase the usefulness of connectionist networks for modeling human cognitive performance by, among other things, providing tools for explaining the productivity and systematicity of some mental activities, and developing representations that are sensitive to the content they are to represent. (shrink)
This article compares the potential of classical and connectionist computational concepts for explanations of innate infant knowledge and of its development. It focuses on issues relating to: the perceptual process; the control and form(s) of perceptual-behavioural coordination; the role of environmental structure in the organization of action; and the construction of novel forms of activity. There is significant compatibility between connectionist and classical views of computation, though classical concepts are, at present, better able to provide a comprehensive computational view of (...) the infant. However, Varela's “enaction” perspective poses a significant challenge for both approaches. (shrink)
In an influential critique, Jerry Fodor and Zenon Pylyshyn point to the existence of a potentially devastating dilemma for connectionism (Fodor and Pylyshyn [1988]). Either connectionist models consist in mere associations of unstructured representations, or they consist in processes involving complex representations. If the former, connectionism is mere associationism, and will not be capable of accounting for very much of cognition. If the latter, then connectionist models concern only the implementation of cognitive processes, and are, therefore, not informative (...) at the level of cognition. I shall argue that Fodor and Pylyshyn's argument is based on a crucial misunderstanding, the same misunderstanding which motivates the entire language of thought hypothesis. (shrink)