A fundamental question in reading research concerns whether attention is allocated strictly serially, supporting lexical processing of one word at a time, or in parallel, supporting concurrent lexical processing of two or more words (Reichle, Liversedge, Pollatsek, & Rayner, 2009). The origins of this debate are reviewed. We then report three simulations to address this question using artificial reading agents (Liu & Reichle, 2010; Reichle & Laurent, 2006) that learn to dynamically allocate attention to 1–4 words to “read” as efficiently (...) as possible. These simulation results indicate that the agents strongly preferred serial word processing, although they occasionally attended to more than one word concurrently. The reason for this preference is discussed, along with implications for the debate about how humans allocate attention during reading. (shrink)
Rolls defends a higher-order thought theory of phenomenal consciousness, mapping the distinction between conscious and non-conscious states onto a distinction between two types of action and corresponding neural pathways. Only one type of action involves higher-order thought and consequently consciousness. This account of consciousness has implausible consequences for the nature of stimulus-reinforcement learning.
Sagvolden, Johansen, Aase, and Russell (Sagvolden et al.) examine attention-deficit/hyperactivity disorder (ADHD) at levels of analysis ranging from neurotransmitters to behavior. At the behavioral level they attribute aspects of ADHD to anomalies of delay-of-reinforcement gradients. With a normal gradient, responses followed after a long delay by a reinforcer may share in the effects of that reinforcer; with a diminished or steepened gradient they may fail to do so. Steepened gradients differentially select rapidly emitted responses (hyperactivity), and they limit the (...) effectiveness with which extended stimuli become conditioned reinforcers, so that observing behavior is less well maintained (attention deficit). Impulsiveness also follows from steepened gradients, which increase the effectiveness of smaller, more immediate consequences relative to larger, more delayed ones. Individuals who vary in the degree to which their delay gradients are steepened will show different balances between hyperactivity and attention deficit. Given the range of ADHD phenomena addressed, it may be unnecessary to appeal to additional behavioral processes such as extinction deficit. Extinction deficit is more likely a derivative of attention deficit, in that failure to attend to stimuli differentially correlated with extinction should slow its progress. The account suggests how relatively small differences in delay gradients early in development might engender behavioral interactions leading to very large differences later on. The steepened gradients presumably originate in properties of neurotransmitter function, but behavioral interventions that use consistently short delays of reinforcement to build higher-order behavioral units as a scaffolding to support complex cognitive and social skills may nonetheless be feasible. Key Words: ADHD; attention deficit; delay gradient; exponential decay; extinction deficit; hyperactivity; impulsiveness; intervention; observing responses; self-control. Footnotes1000 Jeffrey Gray was the BBS Editor for this treatment. When he accepted Sagvolden et al. for publication, he invited Catania, who was one of the reviewers of the original submission, to prepare a precommentary. Com-mentators were then invited to respond to the Sagvolden et al. article, to the Catania precommentary, or to both. (shrink)
We agree with Rachlin's argument that altruism is best understood as a case of self-control, and that a behavioral analysis is appropriate. However, the appeal to teleological behaviorism and the value of behavioral patterns may be unnecessary. Instead, we argue that altruism can generally be explained with traditional behavioral principles such as negative reinforcement, conditioned reinforcement, and rule-governed behavior.
Notwithstanding the many strengths of the dynamic developmental theory, there remain challenges to be overcome before it can be incorporated into a true causal model of attention-deficit/hyperactivity disorder (ADHD). These include the development of reliable measures of reinforcement delay gradients, the validation of shortened reinforcement delay as an endophenotype, and the integration of this pathway with other potential pathways.
The paper presents an approach for developing multi-agent reinforcement learning systems that are made up of a coalition of modular agents. We focus on learning to segment sequences (sequential decision tasks) to create modular structures, through a bidding process that is based on reinforcements received during task execution. The approach segments sequences (and divides them up among agents) to facilitate the learning of the overall task. Notably, our approach does not rely on a priori knowledge or a priori structures. (...) Initial experiments demonstrated the basic promise of the approach. This work shows how bidding and reinforcement learning can be usefully combined, thus pointing to a new research direction. (shrink)
Maintaining adequate performance in dynamic and uncertain settings has been a perennial stumbling block for intelligent systems. Nevertheless, any system intended for real-world deployment must be able to accommodate unexpected change—that is, it must be perturbation tolerant. We have found that metacognitive monitoring and control—the ability of a system to self-monitor its own decision-making processes and ongoing performance, and to make targeted changes to its beliefs and action-determining components—can play an important role in helping intelligent systems cope with the perturbations (...) that are the inevitable result of real-world deployment. In this article we present the results of several experiments demonstrating the efficacy of metacognition in improving the perturbation tolerance of reinforcement learners, and discuss a general theory of metacognitive monitoring and control, in a form we call the metacognitive loop. (shrink)
Reinforcement schemes are a class of non-Markovian stochastic processes. Their non-Markovian nature allows them to model some kind of memory of the past. One subclass of such models are those in which the past is exponentially discounted or forgotten. Often, models in this subclass have the property of becoming trapped with probability 1 in some degenerate state. While previous work has concentrated on such limit results, we concentrate here on a contrary effect, namely that the time to become trapped (...) may increase exponentially in 1/x as the discount rate, 1− x, approaches 1. As a result, the time to become trapped may easily exceed the lifetime of the simulation or of the physical data being modeled. In such a case, the quasi-stationary behavior is more germane. We apply our results to a model of social network formation based on ternary (three-person) interactions with uniform positive reinforcement. (shrink)
Abstra,ct— This paper will discuss learning in hybrid models that goes beyond simple rule extraction from backpropagation networks. Although simple rule extraction has received a lot of research attention, to further develop hybrid learning models that include both symbolic and subsymbolic knowledge and that learn autonomously, it is necessary to study autonomous learning of both subsymbolic and symbolic knowledge in integrated architectures. This paper will describe knowledge extraction from neural reinforcement learning. It includes two approaches towards extracting plan knowledge: (...) the extraction of explicit, symbolic rules from neural reinforcement learning, and the extraction of complete plans. This work points to the creation of a general framework for achieving the subsymbolic to symbolic transition in an integrated autonomous learning framework. (shrink)
This paper presents a GA-based multi-agent reinforce- ment learning bidding approach (GMARLB) for perform- ing multi-agent reinforcement learning. GMARLB inte- grates reinforcement learning, bidding and genetic algo- rithms. The general idea of our multi-agent systems is as follows: There are a number of individual agents in a team, each agent of the team has two modules: Q module and CQ module. Each agent can select actions to be performed at each step, which are done by the Q module. (...) While the CQ module determines at each step whether the agent should continue or relinquish control. Once an agent relinquishes its control, a new agent is selected by bidding algorithms. We applied GA-based GMARLB to the Backgammon game. The experimental results show GMARLB can achieve a su- perior level of performance in game-playing, outperforming PubEval, while the system uses zero built-in knowledge. (shrink)
traction from reinforcement learners It addresses two ap proaches towards knowledge extraction the extraction of ex plicit symbolic rules from neural reinforcement learners and the extraction of complete plans from such learners The advantages of such knowledge extraction include the improvement of learning especially with the rule extraction approach and the improvement of the usability of re sults of learning..
Altruism can be understood in terms of traditional principles of reinforcement if an outcome that is beneficial to another person reinforces the behavior of the actor who produces it. This account depends on a generalization of reinforcement across persons and might be more amenable to experimental investigation than the one proposed by Rachlin.
We investigate a simple stochastic model of social network formation by the process of reinforcement learning with discounting of the past. In the limit, for any value of the discounting parameter, small, stable cliques are formed. However, the time it takes to reach the limiting state in which cliques have formed is very sensitive to the discounting parameter. Depending on this value, the limiting result may or may not be a good predictor for realistic observation times.
methods to improve reinforcement learning are identi ed and discussed in some detail Each demonstrates to some extent the advantages of combining RL and symbolic meth ods These methods point to the potentials and the chal lenges of this line of research..
Reinforcement learning approaches to cognitive modeling represent task acquisition as learning to choose the sequence of steps that accomplishes the task while maximizing a reward. However, an apparently unrecognized problem for modelers is choosing when, what, and how much to reward; that is, when (the moment: end of trial, subtask, or some other interval of task performance), what (the objective function: e.g., performance time or performance accuracy), and how much (the magnitude: with binary, categorical, or continuous values). In this (...) article, we explore the problem space of these three parameters in the context of a task whose completion entails some combination of 36 state–action pairs, where all intermediate states (i.e., after the initial state and prior to the end state) represent progressive but partial completion of the task. Different choices produce profoundly different learning paths and outcomes, with the strongest effect for moment. Unfortunately, there is little discussion in the literature of the effect of such choices. This absence is disappointing, as the choice of when, what, and how much needs to be made by a modeler for every learning model. (shrink)
Rolls presents a theory of emotion based on the premise that emotions are evoked by events that are capable of being instrumental reinforcers and punishers. As support for this theory is drawn almost entirely from experiments in non-human primates, valuable insights into the relationship between punishment and reinforcement systems, and the nature of instrumentality, may have been overlooked.
Rachlin rightly highlights behavioural reinforcement, conditional cooperation, and framing. However, genes may explain part of the variance in altruistic behaviour. Framing cannot be used to support his theory of altruism. Reinforcement of acts is not identical to reinforcement of patterns of acts. Further, many patterns of acts could be reinforced, and Rachlin's altruism is not the most likely candidate.
This commentary considers factors that may account for the inconsistency between the behavioral momentum formulation and the partial reinforcement extinction effect. The method of testing, the variability of the schedule, the nature of the response-contingency, and response effort are considered. Some applications to real-world problems are also discussed.
Abstract There is a fundamental gap between people's assertions as to what is right or wrong and their actual behaviour. This has been traditionally attributed to akrasia or weakness of the will. This paper examines this concept, and the related positive concept of KRAT, and considers what moral education can do about it. Claims by R.B. Braithwaite and others that religious traditions can provide reinforcement are examined and attention is directed to some important qualifications. The implications for moral education (...) are considered, and it is argued that attempts by moral educationists to get to grips with this weakness must involve their own participation in moral debate and in decisions concerning the moral ethos of society. Anyone who observes human behaviour through moral spectacles soon discovers that there is a significant gap between people's assertions as to what is right or wrong and their actual behaviour. This gap is traditionally attributed to akrasia or weakness of the will. Any attempt to get to grips with moral education must (a) take cognizance of this phenomena and (b) consider what, if anything, can be done about it. In this paper I aim to do three things: (i) to draw attention to some significant factors associated with akrasia, (ii) to explore claims that religious traditions can help to overcome such weakness and (iii) to draw some general conclusions from these points for moral education. (shrink)
forcement learning algorithms that generate only reactive policies and existing probabilistic planning algorithms that requires a substantial amount of a priori knowledge in order to plan we devise a two stage bottom up learning to plan process in which rst reinforcement learn ing dynamic programming is applied without the use of a priori domain speci c knowledge to acquire a reactive policy and then explicit plans are extracted from the learned reactive policy Plan extraction is based on a beam (...) search algorithm that performs temporal projection in a restricted fashion guided by the value functions resulting from reinforcement learn ing dynamic programming.. (shrink)
This paper addresses weighting and partitioning in complex reinforcement learning tasks, with the aim of facilitating learning. The paper presents some ideas regarding weighting of multiple agents and extends them into partitioning an input/state space into multiple regions with di erential weighting in these regions, to exploit di erential characteristics of regions and di erential characteristics of agents to reduce the learning complexity of agents (and their function approximators) and thus to facilitate the learning overall. It analyzes, in (...) class='Hi'>reinforcement learning tasks, di erent ways of partitioning a task and using agents selectively based on partitioning. Based on the analysis, some heuristic methods are described and experimentally tested. We nd that some o -line heuristic methods performed the best, signi cantly better than single-agent models. (shrink)
The absence of a clear influence of the responses modified by new connections created by LTP on the development of these connections casts doubt on an essential role of LTP in learning and memory formation without any association with reinforcement. The evidence for the involvement of the monoaminergic systems in synaptic potentiation in the cerebral cortex during learning is adduced, and their role in reinforcement system function is discussed.
The concept of “intrinsic reinforcement” stretches the use of “reinforcement” beyond where it is valuable. The concept of the “self-system,” though fuzzy at the edges, can cover experience as well as the behaviour of altruistic acts.
This paper addresses automatic partitioning in complex reinforcement learning tasks with multiple agents, without a priori domain knowledge regarding task structures. Partitioning a state/input space into multiple regions helps to exploit the di erential characteristics of regions and di erential characteristics of agents, thus facilitating learning and reducing the complexity of agents especially when function approximators are used. We develop a method for optimizing the partitioning of the space through experience without the use of a priori domain knowledge. The (...) method is experimentally tested and compared to a number of other algorithms. As expected, we found that the multi-agent method with automatic partitioning outperformed single-agent learning. (shrink)
We consider the following signaling game. Nature plays first from the set {1, 2}. Player 1 (the Sender) sees this and plays from the set {A, B}. Player 2 (the Receiver) sees only Player 1’s play and plays from the set {1, 2}. Both players win if Player 2’s play equals Nature’s play and lose otherwise. Players are told whether they have won or lost, and the game is repeated. An urn scheme for learning coordination in this game is as (...) follows. Each node of the desicion tree for Players 1 and 2 contains an urn with balls of two colors for the two possible decisions. Players make decisions by drawing from the appropriate urns. After a win, each ball that was drawn is reinforced by adding another of the same color to the urn. A number of equilibria are possible for this game other than the optimal ones. However, we show that the urn scheme achieves asymptotically optimal coordination. (shrink)
Rolls presents a good integrative summary of the neural bases of emotions, adds new findings and insights, and takes a stance on controversial issues such as separate or distinct brain systems for processing emotion information and for planning and action. This commentary raises questions about his explanations of emotion activation, response to novelty, the evolution of emotions, and the phenomenal experience of emotions in human consciousness.
We welcome Soltis' use of evolutionary signaling theory, but question his interpretations of colic as a signal of vigor and his explanation of abnormal high-pitched crying as a signal of poor infant quality. Instead, we suggest that these phenomena may be suboptimal by-products of a generally adaptive learning process by which infants adjust their crying levels in relation to parental responsiveness.
Major contributions emanating from Sagvolden et al.'s theory include elucidation of the role in attention-deficit/hyperactivity disorder (ADHD) of temporal information processing, social learning, and response extinction learning. Key issues include a need for clearer explanation of the relative role of impulsivity versus response suppression/inhibition in the dual process model, and delineation of genotype-environment correlations versus interactions in the social and experiential mechanisms posited.
Reinforcement learning (RL) models of decision-making cannot account for human decisions in the absence of prior reward or punishment. We propose a mechanism for choosing among available options based on goal-option association strengths, where association strengths between objects represent previously experienced object proximity. The proposed mechanism, Goal-Proximity Decision-making (GPD), is implemented within the ACT-R cognitive framework. GPD is found to be more efficient than RL in three maze-navigation simulations. GPD advantages over RL seem to grow as task difficulty is (...) increased. An experiment is presented where participants are asked to make choices in the absence of prior reward. GPD captures human performance in this experiment better than RL. (shrink)
Abstract Peter Salins's The Ecology of Housing Destruction and Salins's co?authored work with Gerald Mildner, Scarcity by Design, provide fascinating evidence of the unintentionally harmful effects of urban policies, and their role as catalysts for further harmful policies designed to ameliorate previous harms. The result is a web of counterproductive regulations confronting bewildered policymakers and frustrated citizens. In his The Federal Government and Urban Housing, on the other hand, R. Allen Hays not only misses the existence of these complex policy (...) dynamics, but fails to assess accurately the consequences of the housing policies he favors. His work does, however, offer useful insights into the nature of policy decisions. (shrink)
Contents 1. Introduction 2. Reward-Guided Decision Making 3. Content in the Model 4. How to Deflate a Metarepresentational Reading Proust and Carruthers on metacognitive feelings 5. A Deflationary Treatment of RPEs? 5.1 Dispensing with prediction errors 5.2 What is use of the RPE focused on? 5.3 Alternative explanations—worldly correlates 5.4 Contrast cases 6. Conclusion Appendix: Temporal Difference Learning Algorithms.
What is the role of self-concept in motivating moral behavior? On one account, when people are primed to perceive themselves as “do-gooders”, conscious access to this positive self-concept will reinforce good behavior. On an alternative account, when people are reminded that they have done their “good deed for the day”, they will feel licensed to behave worse. In the current study, when participants were asked to recall their own good deeds (positive self-concept), their subsequent charitable donations were nearly twice that (...) of participants who recalled bad deeds, or recent conversation topics, consistent with an account of moral reinforcement. In addition, among participants reporting good deeds, those who did not note whether they were recognized or unrecognized by other people donated significantly more than participants who took note of others’ responses. In sum, when people are primed to see themselves as good people, who do good for goodness’ sake, not to obtain public credit, they may be motivated to do more good. (shrink)
This commentary focuses on the stimulus-reinforcer hypothesis of resistance to change. The overall context of reinforcement can account for resistance to extinction. There are ways to systematically test the hypothesis that Pavlovian contingencies account for the behavioral “mass” of discriminated operant behavior.
Any behavior belongs to innumerable overlapping types. Any adequate theory of emergence and retention of behavior, whether psychological or biological, must give us not only a general mechanism – reinforcement or selection, for example – but a reason why that mechanism applies to a particular behavior in terms of one of its types rather than others. Why is it as this type that the behavior is reinforced or selected?
Recent evidence from our lab indicates that LTP shares an important property with memory consolidation: it is consolidated by natural reinforcement. Nevertheless, the hypothesis, that LTP-like mechanisms or other forms of enhanced synaptic efficacy are basic elements in learning is not unequivocally supported. Skepticism aside, LTP is an accessible experimental model that is optimally equipped for the investigation of the cellular and molecular machinery involved in synaptic weight changes.
Many philosophers are building a solid case in favour of the knowledge account of assertion (KAA). According to KAA, if one asserts that P one represents oneself as knowing that P. KAA has recently received support from linguistic data about prompting challenges, parenthetical positioning and predictions. In this article, I add another argument to this rapidly growing list: an argument from what I will call ‘reinforcing parenthesis’.
Intraspecific aggression (IA), in service to dominance, has far deeper roots in animal behavior and human evolution than does predation. The reinforcing properties of such aggression are most likely to be a major source of human cruelty.
Deviant forms of human thought may provide insight into epistemic standards, such as rationality. A comparative analysis of paranoia and reinforced dogmatism suggests that reinforced dogmatism, such as pseudo-science a-la-Popper, demonstrates a primary epistemic lack of critical rationality, that is, of testability, whereas paranoia demonstrates a lack of range of alternative statements leading secondarily to a lack of testability. This reflects the importance to both epistemology and psychiatry of epistemic standards in addition to testability, such as relevance to problems, and (...) emphasizes the distinction of the context of introduction from the contexts of discovery and of justification. Key Words: context of introduction paranoia reinforced dogmatism relevance testability. (shrink)
In this paper I argue against a criticism by Matthew Weiner to Grice’s thesis that cancellability is a necessary condition for conversational implicature. I argue that the purported counterexamples fail because the supposed failed cancellation in the cases Weiner presents is not meant as a cancellation but as a reinforcement of the implicature. I moreover point out that there are special situations in which the supposed cancellation may really work as a cancellation.
In the present paper I wish to argue that psychological egoism may well have a basis in the empirical facts of human psychology. Certain contemporary learning theorists, e.g., Hull and Skinner, have put forward behavioristic theories of the origin and functioning of human motives which posit a certain number of basically "selfish, " unlearned primary drives or motives (like hunger, thirst, sleep, elimination, and sex), explain all other, higher-order, drives or motives as derived genetically from the primary ones via certain (...) "laws of reinforcement," and, further, deny the "functional autonomy" of those higher-order drives or motive. Now it is a hotly debated issue in contemporary Learning Theory whether any theory such as we have described briefly above could adequately explain adult human behavior. I shall, however, argue only that a theory of the above kind may well be true, and that from such a theory, fortified only by one additional psychological premise, the truth of egoism (non-altruism) logically follows. I hope to show, thereby, that the question of psychological egoism is still an open empirical issue, however fallacious be the philosophical arguments for it. (shrink)
Psychopathy is a developmental disorder associated with specific forms of emotional dysfunction and an increased risk for both frustration-based reactive aggression and goal-directed instrumental antisocial behavior. While the full behavioral manifestation of the disorder is under considerable social influence, the basis of this disorder appears to be genetic. At the neural level, individuals with psychopathy show atypical responding within the amygdala and ventromedial prefrontal cortex (vmPFC). Moreover, the roles of the amygdala in stimulus-reinforcement learning and responding to emotional expressions (...) and vmPFC in the representation of reinforcement expectancies are compromised. The implications of these functional impairments for responsibility are discussed. (shrink)
The study examines employee, managerial, and ethics officer perceptions regarding their companies codes of ethics. The study moves beyond examining the mere existence of a code of ethics to consider the role that code content and code process (i.e. creation, implementation, and administration) might play with respect to the effectiveness of codes in influencing behavior. Fifty-seven in-depth, semi-structured interviews of employees, managers, and ethics officers were conducted at four large Canadian companies. The factors viewed by respondents to be important with (...) respect to code effectiveness include: provisions of examples; readability; tone; relevance; realism; senior management support; training; reinforcement; living up to standards; reporting requirement; anonymous phone line; communicating violations; and enforcement. The factors found to be potentially important include: justification for provisions; employee involvement; and sign-off requirements. Factors found not to be important include: objectives for the code; prior distribution; testing; and relating ones performance review to compliance with the code. (shrink)
Managing ethical behavior is a one of the most pervasive and complex problems facing business organizations today. Employees' decisions to behave ethically or unethically are influenced by a myriad of individual and situational factors. Background, personality, decision history, managerial philosophy, and reinforcement are but a few of the factors which have been identified by researchers as determinants of employees' behavior when faced with ethical dilemmas. The literature related to ethical behavior is reviewed in this article, and a model for (...) understanding ethical behavior in business organizations is proposed. It is concluded that managing ethics in business organizations requires that managers engage in a concentrated effort which involves espousing ethics, behaving ethically, developing screening mechanisms, providing ethical training, creating ethics units and reinforcing ethical behavior. (shrink)
Jerry Fodor argues that Darwin was wrong about "natural selection" because (1) it is only a tautology rather than a scientific law that can support counterfactuals ("If X had happened, Y would have happened") and because (2) only minds can select. Hence Darwin's analogy with "artificial selection" by animal breeders was misleading and evolutionary explanation is nothing but post-hoc historical narrative. I argue that Darwin was right on all counts. Until Darwin's "tautology," it had been believed that either (a) God (...) had created all organisms as they are, or (b) organisms had always been as they are. Darwin revealed instead that (c) organisms have heritable traits that evolved across time through random variation, with survival and reproduction in (changing) environments determining (mindlessly) which variants were successfully transmitted to the next generation. This not only provided the (true) alternative (c), but also the methodology for investigating which traits had been adaptive, how and why; it also led to the discovery of the genetic mechanism of the encoding, variation and evolution of heritable traits. Fodor also draws erroneous conclusions from the analogy between Darwinian evolution and Skinnerian reinforcement learning. Fodor’s skepticism about both evolution and learning may be motivated by an overgeneralization of Chomsky’s “poverty of the stimulus argument” -- from the origin of Universal Grammar (UG) to the origin of the “concepts” underlying word meaning, which, Fodor thinks, must be “endogenous,” rather than evolved or learned. (shrink)
A book symposium on Peter, Carruthers. Phenomenal Consciousness: A Naturalistic Theory. Cambridge: Cambridge University Press, 2000. Contents: Author's précis Colin Allen, Evolving Phenomenal Consciousness - Carruthers's reply. José Luis Bermúdez, Commentary - Carruthers's reply - Reply to Carruthers: Properties, first-order representationalism and reinforcement. Joseph Levine, Commentary - Carruthers's reply. William Seager, Dispositions and Consciousness - Carruthers's reply.
Philosophers of science have offered different accounts of what it means for one scientific theory to reduce to another. I propose a more or less friendly amendment to Kenneth Schaffner’s “General Reduction-Replacement” model of scientific unification. Schaffner interprets scientific unification broadly in terms of a continuum from theory reduction to theory replacement. As such, his account leaves no place on its continuum for type irreducible and irreplaceable theories. The same is true for other accounts that incorporate Schaffner's continuum, for example, (...) those developed by Paul Churchland, Clifford Hooker, and John Bickle. Yet I believe a more general account of scientific unification should include type irreducible and irreplaceable theories in an account of their partial reduction, specifically, when there is a reduction of their tokens. Thus I propose a “Reduction-Reception-Replacement” model wherein type irreducible and irreplaceable theories are accepted or received for the purpose of unifying domains of particulars. I also suggest a link between this kind of token reduction and mechanistic explanation. (shrink)
Morality for the purposes of this paper consists of sets of rules or principles intended for the general regulation of conduct for all. Intuitionist accounts of morality are rejected as making reasoned analysis of morals impossible. In many interactions, there is partial conflict and partial cooperation. From the general social point of view, the rational thing to propose is that we steer clear of conflict and promote cooperation. This is what it is rational to propose to reinforce, and to assist (...) in reinforcing in society; it is not necessarily what it is individually rational to do. Even so, given the general situation, the rationality of its reinforcement will typically support the rationality of individual action as well. Game theory makes it possible to clarify these interactions, and these proposals for social solutions. (shrink)
Everyone wants to be virtuous, but recent psychological investigations suggest that this may not be possible. Mark Alfano challenges this theory and asks, not whether character is empirically adequate, but what characters human beings could have and develop. Although psychology suggests that most people do not have robust character traits such as courage, honesty and open-mindedness, Alfano argues that we have reason to attribute these virtues to people because such attributions function as self-fulfilling prophecies – children become more studious if (...) they are told that they are hard-working and adults become more generous if they are told that they are generous. He argues that we should think of virtue and character as social constructs: there is no such thing as virtue without social reinforcement. His original and provocative book will interest a wide range of readers in contemporary ethics, epistemology, moral psychology and empirically informed philosophy. (shrink)
Two different kinds of rules are needed in the regulation of human conduct in the sphere of global interaction. There is a need for global ethics and also a need for a global ethic. The first exists but needs reinforcement. The second also exists but not sufficiently widely and therefore needs a fashioning out in some contexts. Because ethics and an ethic are grammatically cognate and are both concerned with behavior, it is easy to conflate the two. Accordingly, clarity (...) will be sought below about the distinction between them. The hope is that such clarity might help in directing efforts in search of harmony and other good things on our planet. (shrink)
In China, guanxi is the basis on which Chinese exchange a lifetime of favors, resources, and business leverage. Guanxi is considered a unique construct and a product of Confucian values and the contemporary political and socioeconomic system in Chinese society. With its cultural embeddings guanxi , as the social norm of conduct, functions as complex adaptive systems that expand and interconnect to become well-knit social networks; meanwhile the functions of well-fixing and self-reinforcement of the guanxi networks ( chuens ) (...) are synergetically activated internally, externally, and interactively, which shows their extreme flexibility of adaptation. Taking as a case study an outside direct investment (ODI) Taiwanese firm in China, we address and conduct a survey to examine the effect of guanxi on management. Results of the research suggest that guanxi is not limited to interpersonal links; it is also the switch that activates social networks and that reconciles interpersonal and internetwork mismatches to influence management efficacy. (shrink)
Biobanks for long-term research pose challenges to the legal and ethical validity of consent to participate. Different models of consent have been proposed to answer some of these challenges. This paper contributes to this discussion by considering the meaning and value of consent to participants in biobanks. Empirical data from a qualitative study is used to provide a participant view of the consent process and to demonstrate that, despite limited understanding of the research, consent provides the research participants with some (...) level of control and a form of self determination that they value. Participation is framed as a moral act of a responsible citizen providing reinforcement of self identity. Consent symbolizes the trust invested in researchers and research institutions to use the biobank for the public good. The paper argues that consent continues to play an important role in biobank participation and that a participant view should inform proposals to modify consent processes. (shrink)
This article investigates the intersections of secrecy/interiority, the state, and speech/expression, and their implications for the rights of women. I propose a critique of commercial pornography that reanimates MacKinnon's claim that pornography and American democracy are in a relationship of mutual reinforcement, and incorporates poststructuralist (Lyotard, Baudrillard, and Butler) commitments to secrecy and unintelligibility, as well as their role in the production of pleasure.
This chapter is an essay in a volume that examines constitutional law in the United States through the lens of H.L.A. Hart's "rule of recognition" model of a legal system. My chapter focuses on a feature of constitutional practice that has been rarely examined: how jurists and scholars argue about interpretive methods. Although a vast body of scholarship provides arguments for or against various interpretive methods -- such as textualism, originalism, "living constitutionalism," structure-and-relationship reasoning, representation reinforcement, minimalism, and so (...) forth -- very little scholarship shifts to the meta-level and asks: What are the considerations that jurists and scholars bring to bear in arguing that one or another interpretive method is legally favored? And can we "make sense" of this body of argument? Is there a model of legal discourse that both accurately describes how U.S. jurists and scholars actually argue about interpretive methods, and that vindicates this discourse (in the sense of seeing these actors as making valid arguments)? I find that Hart's rule-of-recognition model fails to describe or vindicate how U.S. jurists and scholars argue about interpretive methods. The problem, in a nutshell, is that Hart sees legal argument as asserting or presupposing the social fact of contemporary official acceptance of a rule of recognition. By contrast, jurists and scholars typically point to social facts other than contemporary official acceptance in arguing for the legal status of an interpretive method -- for example, the fact that the method is supported by Framers' intent, or by U.S. culture and tradition, or by precedent. Further, jurists and scholars very often argue that some interpretive method is legally favored even though the method is controversial. On Hart's model, such a claim is problematic -- because, on his model, the content of the rule of recognition is not controversial, but rather a matter of consensus among officials. The upshot may just be that Hart's model is a failure. However, another possibility is to adopt an "error theory" of U.S. constitutional discourse. It may perhaps be the case that U.S. jurists and scholars often make claims for the favorable legal status of some interpretive method that are inconsistent with the best understanding of the nature of law. (shrink)
: This essay provides a critical analysis of rape prevention since the 1980s. I argue that we must challenge rape prevention's habitual reinforcement of the notion that fear is a woman's best line of defense. I suggest changes that must be made in the anti-rape movement if we are to move past fear. Ultimately, I raise the question of what, if not vague threats and scare tactics, constitutes prevention.
That parthood is a transitive relation is among the most basic principles of classical mereology. Alas, it is also very controversial. In a recent paper, Ingvar Johansson has put forward a novel diagnosis of the problem, along with a corresponding solution. The diagnosis is on the right track, I argue, but the solution is misleading. And once the pieces are properly put together, we end up with a reinforcement of the standard defense of transitivity on behalf of classical mereology.
In management theory and business practice, the dealing with diversity, especially a diverse workforce, has played a prominent role in recent years. In a globalizing economy companies recognized potential benefits of a multicultural workforce and tried to create more inclusive work environments. However, many organizations have been disappointed with the results they have achieved in their efforts to meet the diversity challenge [Cox: 2001, Creating the Multicultural Organization (Jossey-Bass, San Francisco)]. We see the reason for this in the fact that (...) while much attention has been paid to the strategic dimension of diversity policies, systems, and processes, much less thought has been given to the normative dimension, the norms and values involved. Given the fact that diversity is essentially about cultural norms and values, appropriate reflection work becomes a fundamental task to create a truly inclusive work environment where people from diverse backgrounds feel respected and recognized. Therefore, we focus in this article on the challenge of building an inclusive diversity culture showing that such a culture of inclusion has to be built on solid moral grounds. We present a conceptual framework of inclusion based on a moral theory of recognition and introduce the founding principles of reciprocal understanding, standpoint plurality and mutual enabling, trust and integrity. After revealing barriers that hinder a culture of inclusion from emerging we shed light on the process of developing such a culture which involves four essential transformational stages: The first phase focuses on raising awareness, building understanding and encouraging reflection. The second phase deals with the development of a vision of inclusion as an important step to define the change direction. In a third phase key management concepts and principles should be re-thought. This leads to the fourth, action-oriented phase, that focuses on an integrated Human Relations Management (HRM)1 system that helps implement change by doing both, translating the founding principles via competencies into observable and measurable behavior and fostering the development, reinforcement and recognition of inclusive behavior. (shrink)
This study examines the impact of the strength of an accounting firm’s ethical environment (presence and reinforcement vis-à-vis the presence of a code of conduct) on the quality of auditor judgment, across different levels of audit expertise. Using a 2 × 2 full factorial ‹between subjects’ experimental design, with audit managers and audit seniors, the impact of different levels of strength of the ethical environment on auditor judgments was assessed with a realistic audit scenario, requiring participants to make judgments (...) in respect of an inventory writedown. Based on prior research, and as hypothesized, participants possessing greater auditing experience made higher quality technical judgments. While there were no significant differences between the quality of audit judgments made by participants in the stronger ethical environment, over-all results indicate that managers are more sensitive to differences in the strength of the ethical environment than seniors. This is consistent with the hypothesis, and with prior research which suggests that the impact of the code will only be significant if it has been bilaterally internalized by individuals. This has important implications for accounting firms and regulators, given that the International Standard on Quality Control 1, requires the communication and reinforcement of ethical principles as part of firms' quality control processes. It suggests that firms will need to carefully consider the means by which they communicate and reinforce ethical principles, as it is possible to differentially impact auditors of different rank. (shrink)
What does it mean for emotion to be well-constituted? What distinguishes good feeling from (just) feeling good? Is there such a distinction at all? The answer to these questions becomes clearer if we realize that for an emotion to be all it seems, it must be responsible as well as responsive to what it is about. It may be that good feeling depends on feeling truly if we are to be really moved, moved in the way that avoids the need (...) for constant, fretful replenishment and reinforcement. To be sound, emotions may need to be capable of genuineness, depth, and other kinds of integrity. And that, in turn, may require certain virtues of mind, such as truthfulness, temperateness, and even courage, that are more familiar at the level of action. The governing aim of this book is to demonstrate that there can be problems of a structural kind with the adequacy of emotions and the emotional life. (shrink)
Attention-deficit/hyperactivity disorder (ADHD) is currently defined as a cognitive/behavioral developmental disorder where all clinical criteria are behavioral. Inattentiveness, overactivity, and impulsiveness are presently regarded as the main clinical symptoms. The dynamic developmental behavioral theory is based on the hypothesis that altered dopaminergic function plays a pivotal role by failing to modulate nondopaminergic (primarily glutamate and GABA) signal transmission appropriately. A hypofunctioning mesolimbic dopamine branch produces altered reinforcement of behavior and deficient extinction of previously reinforced behavior. This gives rise to (...) delay aversion, development of hyperactivity in novel situations, impulsiveness, deficient sustained attention, increased behavioral variability, and failure to “inhibit” responses (“disinhibition”). A hypofunctioning mesocortical dopamine branch will cause attention response deficiencies (deficient orienting responses, impaired saccadic eye movements, and poorer attention responses toward a target) and poor behavioral planning (poor executive functions). A hypofunctioning nigrostriatal dopamine branch will cause impaired modulation of motor functions and deficient nondeclarative habit learning and memory. These impairments will give rise to apparent developmental delay, clumsiness, neurological “soft signs,” and a “failure to inhibit” responses when quick reactions are required. Hypofunctioning dopamine branches represent the main individual predispositions in the present theory. The theory predicts that behavior and symptoms in ADHD result from the interplay between individual predispositions and the surroundings. The exact ADHD symptoms at a particular time in life will vary and be influenced by factors having positive or negative effects on symptom development. Altered or deficient learning and motor functions will produce special needs for optimal parenting and societal styles. Medication will to some degree normalize the underlying dopamine dysfunction and reduce the special needs of these children. The theory describes how individual predispositions interact with these conditions to produce behavioral, emotional, and cognitive effects that can turn into relatively stable behavioral patterns. Key Words: catecholamine; clumsiness; dopamine; hyperkinesis; hyperkinetic disorder; impulsivity; monoamine; neuromodulator; overactivity; pollutants; reinforcement; reward; verbally governed behavior; soft signs; variability. (shrink)
Behaving ethically depends on the ability to recognize that ethical issues exist, to see from an ethical point of view. This ability to see and respond ethically may be related more to attributes of corporate culture than to attributes of individual employees. Efforts to increase ethical standards and decrease pressure to behave unethically should therefore concentrate on the organization and its culture. The purpose of this paper is to discuss how total quality (TQ) techniques can facilitate the development of a (...) cooperative corporate culture that promotes and encourages ethical behavior throughout an organization. (shrink)
Rolls's preliminary definitions of emotion and speculative restriction of consciousness, including emotional sentience, to humans, display behaviorist prejudice. Reinforcement and causation are not by themselves sufficient conceptual resources to define either emotion or the directedness of thought and motivated action. For any adequate definition of emotion or delimitation of consciousness, new physiology, such as Rolls is contributing to, and also the resources of other fields, will be required.
In this paper I examine how the constituent elements of a firm's organizational structure affect the ethical behavior of workers. The formal features of organizations I examine are the compensation practices, performance and evaluation systems, and decision-making assignments. I argue that the formal organizational structure, which is distinguished from corporate culture, is necessary, though not sufficient, in solving ethical problems within firms. At best the formal structure should not undermine the ethical actions of workers. When combined with a strong culture, (...) however, the organizational structure may be sufficient in promoting ethical conduct. While helpful, ethics training and corporate codes are neither necessary nor sufficient in promoting ethical behavior within firms. (shrink)
There is a conflict of interest in behaviorism between diction and content, between clean speech and effective speech, between what we say and what we know. This article gives a framework for speech that is both clean and effective, that respects graded validation of hypotheses, and that favors distinction over doctrine. The article begins with the description of SDT, a mathematical model of discrimination based on statistical decision theory, which serves as leitmotif. It adopts Skinner's distinction between tacts and mands, (...) the former as responses under the predominant control of the stimulus and the latter as responses under the predominant control of the reinforcer. To analyze behavior is to understand the relative contribution of each of these loci of control, measured as d' and C, respectively. SDT is then applied to causal attributions. It is shown that Skinner's fundamental model of behavior, the three-term contingency, is itself a causal structure, with the initiating stimulus an efficient cause, the reinforcer a final cause, and the response and its various components the substrate upon which these act. In light of these correspondences, covert (mental) processes are viewed as links in a causal chain, under joint control of initiating and reinforcing stimuli. Their ascription is an inference, made with confidence when the links rise to the surface and with dubiety as they sink to the abyss. There exists no threshold at which the links become a different kind of thing; there are only gradients of clarity and confidence about what we take them to be. The host to these processes has a privileged but corrigible perspective on them and on the history of reinforcement that led to them. Skinner's model of the operant is a useful causal model of many nested levels, including covert processes such as cognition. In the avatar of SDT his model provides a tool for qualifying verbal behavior, including descriptions of cognition. (shrink)
Background Requirements for organ donation after cardiac or imminent death have been introduced to address the transplantable organs shortage in the United States. Organ procurement organizations (OPOs) increasingly use the Internet for organ donation consent. Methods An analysis of OPO Web sites available to the public for enrollment and consent for organ donation. The Web sites and consent forms were examined for the minimal information recommended by the United States Department of Health and Human Services for informed consent. Content scores (...) were calculated as percentages of data elements in four information categories: donor knowledge, donor consent reinforcement, donation promotion, and informed consent. Results There were 60 Web sites for organ donation enrollment serving the 52 states. The median percent (10 percentile-90 percentile) content scores of the Web sites for donor knowledge, donor consent reinforcement, and donation promotion were 33% (20–47), 79% (57–86), and 75% (50–100), respectively. The informed consent score was 0% (0–33). The content scores for donor knowledge and informed consent were significantly lower than donor consent reinforcement and donation promotion for all Web sites (P < .05). The content scores for the four categories were similar among the 11 regions of the United Network for Organ Sharing. Conclusion The Web sites and consent forms for public enrollment in organ donation do not fulfill the necessary requirements for informed consent. The Web sites predominantly provide positive reinforcement and promotional information rather than the transparent disclosure of organ donation process. Independent regulatory oversight is essential to ensure that Internet enrollment for organ donation complies with legal and ethical standards for informed consent. (shrink)
The Hospital for Rehabilitation, Stavern, in Norway has treated patients with physical symptoms with no organic cause, so called conversion disorder patients, for over a decade. For four years research on the treatment has been carried out. Patients with conversion disorder seem not to fit in traditional somatic hospitals because their patienthood depends upon psychiatric diagnosis. Ironically, they appear not to belong in psychiatric hospitals because of their physical symptoms. The treatment offered these patients at hospitals for rehabilitation is adapted (...) physical activity consisting of behaviour elements such as positive reinforcement of normal function and lack of positive reinforcement at dysfunction. The pedagogical approach is seen as crucial in the successful rehabilitation of the patients. The disorder and treatment can be understood by using theories about the ecstatic body, radical behaviourism and phenomenology. When patients have problems in behaviour concerning both body and mind, it would be natural to employ both in the road to recovery. This article describes the various treatments and discusses them from phenomenological, ethical and philosophical perspectives. (shrink)
We study a simple game theoretic model of information transfer which we consider to be a baseline model for capturing strategic aspects of epistemological questions. In particular, we focus on the question whether simple learning rules lead to an efficient transfer of information. We find that reinforcement learning, which is based exclusively on payoff experiences, is inadequate to generate efficient networks of information transfer. Fictitious play, the game theoretic counterpart to Carnapian inductive logic and a more sophisticated kind of (...) learning, suffices to produce efficiency in information transfer. (shrink)
The dynamic developmental theory (DDT) has benefited from the insights of the commentators, particularly in terms of the implications for the proposed steepened delay gradients in attention-deficit/hyperactivity disorder (ADHD). The introduction of modified memory processes as a basis for the delay gradients improved the links to aspects of ADHD. However, it remains unclear whether the hyperactive-impulsive and inattentive subtypes are separate subgroups or may be explained as different outcomes of the same genetic factors and thus explicable by the same principles. (...) The DDT suggests that altered reinforcement and extinction processes define an endophenotype in ADHD that can be related dimensionally to inattention, hyperactivity, and impulsivity. The relation between the suggested endophenotype, characterized by changes in basic learning mechanisms, and other endophenotypes characterized by delay aversion or response disinhibition, needs to be tested in future studies. (shrink)
Altruism by definition involves the self's evaluation of costs and benefits of an act of the self, which must include cost to the self and benefits to the other. Reinforcement value to the self of such acts is greater than the costs to the self. Without consideration of a self-system of evaluation, there is little meaning to altruistic acts.
Internal mechanisms, especially those implicating the self, are crucial for the egoism-altruism debate. Self-liking is extended to close others and can be extended, through socialization and reinforcement experiences, to non-close others: Altruistic responses are directed toward others who are included in the self. The process of self-extension can account for cross-situational variability, contextual variability, and individual differences in altruistic behavior.
"Objectivity" and "rationality" of science do not depend on freedom from all "presuppositions", but are inextricably bound with the employment of background beliefs, so long as those background beliefs satisfy certain constraints. These latter have developed through application of the same kind of reasoning that they themselves dictate, and change in response to changes in the reasoning-patterns which they themselves generate. This interaction of constraints and reasoning does not eventuate in a vicious circle; rather, what results is a mutual (...) class='Hi'>reinforcement, itself rational in a sense that is coherent with well-founded everyday intuitions from which the ideas in question (objectivity and rationality) are descendants, and that makes intelligible the progressive nature of science. (shrink)
Rotter’s theory of internal-external locus of control evolved from Carl Jung’s work. In Psychological Types (1923), Jung defined two opposing tendencies in personality introversion and extroversion. While both tendencies are present in all individuals, one tends to dominate the other. The internal–external control construct was conceived as a generalized expectancy to perceive reinforcement either as contingent upon one’s own behaviors (internal control) or as the result of forces beyond one’s control, such as chance, fate, or powerful others (external control) (...) (Lefcourt, 1981, p. 15). Locus of control refers to those causes to which individuals attribute their successes and failures. Individuals are responsive to some external motivators (e.g. better jobs, promotions, higher salaries), but the more powerful are internal pressures (the desire for increased job satisfaction, self-esteem, and quality of life). Research indicates that an individual’s internal–external locus of control impacts their ethical behavior in an organization. Rotter’s I–E Scale (1966, Psychological Monographics 80(1), 1–28), a 29 item forced choice instrument, is the most widely used instrument to measure the degree of internality versus externality. Each respondent’s score for this scale had a potential range from 0 to 23. As there are six filler items used to mask the intent of the questionnaire a score of 23 being extremely external in nature and a score of zero being extremely internal in nature. The I Scale (Internal Scale) measures the extent to which people believe that they have control over their own lives; the E Scale (External Scale) measures the extent to which people believe that they do not have control over their own lives. This study utilized the I–E Scale. Jones (Unpublished Doctoral Dissertation) and Deflumeri (Unpublished Doctoral Dissertation) investigated the likelihood of an individual to engage in unethical behavior in an organization. This research suggests that when employees perceive that locus of control resides internally they themselves decide what is appropriate behavior, but with an external locus of control, employees will look at others to decide appropriate behavior. The researcher in this study investigated the relationship between managers’ locus of control and their moral reasoning. (shrink)
The reinforcement/extinction disorder hypothesis (Sagvolden et al.) is an important counterweight to the executive dysfunction model of attention-deficit/hyperactivity disorder (ADHD). However, like that model, it conceptualises ADHD as pathophysiologically homogeneous, resulting from a common core dysfunction. Recent studies reporting neuropsychological heterogeneity suggest that this common core dysfunction may be the scientific equivalent of a red herring.
Many studies have demonstrated differential resistance to change in the context of negative behavioral contrast. That is, as a result of introducing a disruptor, response rates decrease to a greater extent when the maintaining reinforcement schedule is leaner. Resistance to change also applies to positive contrast, in that increases in response rate are greater in leaner schedules. The negative contrast effects seen in studies of intrinsically motivated behavior reflect an increase in resistance to change as a result of adding (...) extrinsic reinforcers. (shrink)
Signaling games with reinforcement learning have been used to model the evolution of term languages (Lewis 1969, Conven- tion. Cambridge, MA: Harvard University Press; Skyrms 2006, “Signals” Presidential Address. Philosophy of Science Association for PSA). In this article, syntactic games, extensions of David Lewis’s original sender– receiver game, are used to illustrate how a language that exploits available syntactic structure might evolve to code for states of the world. The evolution of a language occurs in the context of available (...) vocabulary and syntax—the role played by each component is compared in the context of simple reinforcement learning. (shrink)
The concept of emotion as defined by Rolls is based upon reinforcement mechanisms and their underlying neural networks. He shows how these networks process signals at many levels, through both separate and convergent pathways essential for adaptive action. While many behavioral issues related to emotion are omitted from his review, he succeeds admirably in summarizing both the “current state of the art” in single unit analyses and in pointing out how future research directions may be crafted.
In the metaphor of behavioral momentum, the rate of a free operant in the presence of a discriminative stimulus is analogous to the velocity of a moving body, and resistance to change measures an aspect of behavior that is analogous to its inertial mass. An extension of the metaphor suggests that preference measures an analog to the gravitational mass of that body. The independent functions relating resistance to change and preference to the conditions of reinforcement may be construed as (...) convergent measures of a single construct, analogous to physical mass, that represents the effects of a history of exposure to the signaled conditions of reinforcement and that unifies the traditionally separate notions of the strength of learning and the value of incentives. Research guided by the momentum metaphor encompasses the effects of reinforcement on response rate, resistance to change, and preference and has implications for clinical interventions, drug addiction, and self-control. In addition, its principles can be seen as a modern, quantitative version of Thorndike's (1911) Law of Effect, providing a new perspective on some of the challenges to his postulation of strengthening by reinforcement. Key Words: behavioral momentum; clinical interventions; drug addiction; preference; reinforcement; resistance to change; response strength; self-control. (shrink)
We present a skill learning model CLARION. Different from existing models of high-level skill learning that use a topdown approach (that is, turning declarative knowledge into procedural knowledge), we adopt a bottom-up approach toward low-level skill learning, where procedural knowledge develops first and declarative knowledge develops later. CLAR- ION is formed by integrating connectionist, reinforcement, and symbolic learning methods to perform on-line learning. We compare the model with human data in a minefield navigation task. A match between the model (...) and human data is found in several respects. (shrink)
In the attempt of defending an interpretation of David Hume's moral and political philosophy connected to classical utilitarianism, intervenes in a key way the so called problem of the " Sensitive Knave " raised by this author at the end of his more utilitarian work, the Enquiry Concerning the Principles of Morals. According to the classic interpretation of this fragment, the utilitarian rationality in politics would clash with morality turning useless the latter. Therefore, in the political area the defense of (...) a moral utilitarianism would be an auto contradictory task. In order to show that, first, Hume does not say anything similar to this and second, that even indicates the way of overcoming this apparent contradiction between morality and rationality, we analyze briefly the arguments from which there comes basically this "anti-utilitarian" standard interpretation, and we defend an interpretation of the humean discussion on the problem of the supposed conflict between morality and rationality, or of rational incentives for immoral behavior, which allows to explain better Hume's position on this problem. Finally, we propose an instance of overcoming the contradiction morality/ rationality by a rule adjusted utilitarianism centered on the idea of the "progressive development of artificial institutions of reinforcement of morality", that Hume himself would suggest in other places in which he approaches the topic of the apparent contradiction between "morality" and "knavery". We propose also possible lines of future development of this idea, between them its use to clarify the relation of David Hume's thought with certain forms of contemporary liberalism. (shrink)
Hull et al.'s analysis of operant behavior in terms of interaction and replication does not seem consistent with a genuine selection model. The putative replicators do not replicate, and the overall process is more reminiscent of directed mutation than of natural selection. General analogies between natural selection and operant reinforcement are too superficial to be of much scientific use.
I present a game-theoretical multi-agent system to simulate the evolutionary process responsible for the pragmatic phenomenon division of pragmatic labour (DOPL), a linguistic convention emerging from evolutionary forces. Each agent is positioned on a toroid lattice and communicates via signaling games , where the choice of an interlocutor depends on the Manhattan distance between them. In this framework I compare two learning dynamics: reinforcement learning (RL) and belief learning (BL). An agent’s experiences from previous plays influence his communication behaviour, (...) and RL agents act in a non-rational way whereas BL agents display a small degree of rationality by using best response dynamics . The complete system simulates an evolutionary process of communication strategies, which agents can learn in a structured spatial society. The significant questions are: what circumstances could lead to an evolutionary process that doesn’t result in the expected DOPL convention; and to what extent is interlocutor rationality necessary for the emergence of a society-wide convention à la DOPL? (shrink)
In recent years, many scholars have suggested that the Baldwin effect may play an important role in the evolution of language. However, the Baldwin effect is a multifaceted and controversial process and the assessment of its connection with language is difficult without a formal model. This paper provides a first step in this direction. We examine a game-theoretic model of the interaction between plasticity (represented by Herrnstein reinforcement learning) and evolution in the context of a simple language (...) game. Additionally, we describe three distinct aspects of the Baldwin effect: the Simpson–Baldwin effect, the Baldwin expediting effect and the Baldwin optimizing effect. We find that a simple model of the evolution of language lends theoretical plausibility to the existence of the Simpson–Baldwin and the Baldwin optimizing effects in this arena, but not the Baldwin expediting effect. (shrink)
We clarify the arguments in Neural organization: Structure, function, and dynamics, acknowledge important contributions cited by our critics, and respond to their criticisms by charting directions for further development of our integrated approach to theoretical and empirical studies of neural organization. We first discuss functional organization in general (behavior versus cognitive functioning, the need to study body and brain together, function in ontogeny and phylogeny) and then focus on schema theory (noting that schema theory is not just a top-down theory (...) and discussing the transition from action-oriented perception to cognition). We then turn to dynamical organization, with a focus first on neural modeling and dynamics (clarifying the multiple functions of neurons and brain regions, and looking further at various forms of dynamics) and second on learning, development, and self-organization (looking at monoaminergic systems, reinforcement, self-organization, postnatal development, and disease). We close with a brief philosophical discussion of postmodernism and reductionism. (shrink)
The general aim of this work is to show the importance of the adressee's mind as planned by the author of a speech act or of a norm; in particular, how important are the expected motivations for goal adoption. We show that speech acts differ from one another for the different motivations the speaker is attempting to obtain from the hearer. The description of the participants' social positions is not sufficient. Important conflicts can arise which are not relative to what (...) to do, but to the different motives requested by the speaker. This view is applied to norms, pointing out that what is required by a norm is not only a behaviour but also a mental attitude, and that the prescribed mind might be even more important than the prescribed behaviour. Norms don't want just behavioural conformity, but also that this conformity implies an acknowledgement and a reinforcement of both the authority and the norm itself. Norms ask for submission. Any form of norm is aimed at influencing the addressee by changing his or her mind. (shrink)
Existing models of strategic decision making typically assume that only the attributes of the currently played game need be considered when reaching a decision. The results presented in this article demonstrate that the so-called “cooperativeness” of the previously played prisoner’s dilemma games influence choices and predictions in the current prisoner’s dilemma game, which suggests that games are not considered independently. These effects involved reinforcement-based assimilation to the previous choices and also a perceptual contrast of the present game with preceding (...) games, depending on the range and the rank of their cooperativeness. A. Parducci’s (1965) range frequency theory and H. Helson’s (1964) adaptation level theory are plausible theories of relative judgment of magnitude information, which could provide an account of these context effects. (shrink)
psychological terms, the [Japanese] system relies on positive rather than negative reinforcement, emphasizing loving acceptance in exchange for genuine repentance. An analogue of what the Japanese policeman wants the offender to feel is the tearful relief of a child when confession of wrongdoing to his parents results in a gentle laugh and a warm hug. In relation to American policemen, Japanese officers want to be known for the warmth of their care rather than the strictness of their enforcement.1 Much (...) of the most disturbing police behaviour stems from two connected facts: the system's overwhelming dependence on admissions of guilt, and the absence of checks on police power in the interrogation room. In Japan, the conditions of interrogation?the duration and intensity of questioning, the duty to endure questioning even after the right to silence has been invoked, and the unavailability of defence lawyers?means that an ?overborne will? is more than merely an occasional problem.2. (shrink)
Many analogies exist between the process of evolution by natural selection and of learning by reinforcement and punishment. A full extension of the evolutionary analogy to learning to include analogues of the fitness, genotype, development, environmental influences, and phenotype concepts makes possible a single theory of the learning process able to encompass all of the elementary procedures known to yield learning.
Does it make sense, and is it at all plausible, to view the moral obligation to keep particular promises and do what is called for by particular agreements such as contracts as being founded on a general "Social Contract" -- i.e., to give a contractarian account of promise-keeping? This paper argues that it does. Borrowing from Hume, David Lewis, Gilbert Harman, and David Gauthier, I provide a sketch of what the "social contract" is (not, e.g., either a real or a (...) hypothetical meeting of all with all) -- namely, a form of commitment, makable by any individual, but with commitment likewise to social reinforcement. Then it is argued that Searle's familiar thesis is in error in that it leaves out the latter factor as well as implicitly calling for a version of the former. Then it is argued that this general understanding of morality can reasonably issue in a specific rule about agreements that would be one among others in a contractarian moral platform. (shrink)
Raising the issue of corporate moral agency in our examination of the morality of corporate speech is important for two fundamentalreasons. Each reason suggests we exercise caution in conflating corporations and individuals as the law often does. First, raising the issue of corporate moral agency is important to the aim of providing a framework for ethically evaluating corporate speech. It is tempting to proceed as if the nature of corporate speech is self-evident. But this is hardly the case. Corporations are (...) not natural persons, and we mustn’t assume corporate speech is indistinguishable from human speech. Before we can ethically evaluate corporate speech, we must first clarify what corporate speech is. This requires an understanding of the fundamental nature of the corporate entity, including its moral status. Second, raising the issue of corporate moral agency is important if we wish to promotemorally responsible corporate speech. Any diminished moral capacity on the corporation’s part would suggest a core role for the strong legal reinforcement of any ethical aspirations here. In promoting morally responsible corporate speech, ethical injunctions uncoupled from an effective legal regime may hold only limited promise. (shrink)
Two important assumptions of behavioral momentum theory are contradicted by existing data. Resistance to change is not due simply to the Pavlovian contingency between a discriminative stimulus and the rate of reinforcement in its presence, because variations in the response-reinforcer contingency, independent of the stimulus-reinforcer contingency, produce differential resistance to change. Resistance to change is also not clearly related to measures of preference, in that several experiments show the two measures to dissociate.
Stanovich & West's target article undervalues the power of implicit learning (particularly reinforcement learning). Implicit learning may allow the learning of more rational responses–and sometimes even generalisation of knowledge–in contexts where explicit, abstract knowledge proves only of limited value, such as for economic decision-making. Four other comments are made.
The missing ingredients in efforts to develop neural networks and artificial intelligence (AI) that can emulate human intelligence have been the evolutionary processes of performing tasks at increased orders of hierarchical complexity. Stacked neural networks based on the Model of Hierarchical Complexity could emulate evolution's actual learning processes and behavioral reinforcement. Theoretically, this should result in stability and reduce certain programming demands. The eventual success of such methods begs questions of humans' survival in the face of androids of superior (...) intelligence and physical composition. These raise future moral questions worthy of speculation. (shrink)
The Hospital for Rehabilitation, Stavern, in Norway has treated patients with physical symptoms with no organic cause, so called conversion disorder patients, for over a decade. For four years research on the treatment has been carried out. Patients with conversion disorder seem not to fit in traditional somatic hospitals because their patienthood depends upon psychiatric diagnosis. Ironically, they appear not to belong in psychiatric hospitals because of their physical symptoms. The treatment offered these patients at hospitals for rehabilitation is adapted (...) physical activity consisting of behaviour elements such as positive reinforcement of normal function and lack of positive reinforcement at dysfunction. The pedagogical approach is seen as crucial in the successful rehabilitation of the patients. The disorder and treatment can be understood by using theories about the ecstatic body, radical behaviourism and phenomenology. When patients have problems in behaviour concerning both body and mind, it would be natural to employ both in the road to recovery. This article describes the various treatments and discusses them from phenomenological, ethical and philosophical perspectives. (shrink)