Search results for 'Reinforcement' (try it on Scholar)

1000+ found
Order:
  1.  5
    Vladislav D. Veksler, Christopher W. Myers & Kevin A. Gluck (2014). SAwSu: An Integrated Model of Associative and Reinforcement Learning. Cognitive Science 38 (3):580-598.
    Successfully explaining and replicating the complexity and generality of human and animal learning will require the integration of a variety of learning mechanisms. Here, we introduce a computational model which integrates associative learning (AL) and reinforcement learning (RL). We contrast the integrated model with standalone AL and RL models in three simulation studies. First, a synthetic grid-navigation task is employed to highlight performance advantages for the integrated model in an environment where the reward structure is both diverse and dynamic. (...)
    Direct download (4 more)  
     
    Export citation  
     
    My bibliography   1 citation  
  2.  8
    Christian P. Janssen & Wayne D. Gray (2012). When, What, and How Much to Reward in Reinforcement Learning-Based Models of Cognition. Cognitive Science 36 (2):333-358.
    Reinforcement learning approaches to cognitive modeling represent task acquisition as learning to choose the sequence of steps that accomplishes the task while maximizing a reward. However, an apparently unrecognized problem for modelers is choosing when, what, and how much to reward; that is, when (the moment: end of trial, subtask, or some other interval of task performance), what (the objective function: e.g., performance time or performance accuracy), and how much (the magnitude: with binary, categorical, or continuous values). In this (...)
    Direct download (7 more)  
     
    Export citation  
     
    My bibliography   1 citation  
  3.  1
    Yunfeng Zhang, Jaehyon Paik & Peter Pirolli (2015). Reinforcement Learning and Counterfactual Reasoning Explain Adaptive Behavior in a Changing Environment. Topics in Cognitive Science 7 (2):368-381.
    Animals routinely adapt to changes in the environment in order to survive. Though reinforcement learning may play a role in such adaptation, it is not clear that it is the only mechanism involved, as it is not well suited to producing rapid, relatively immediate changes in strategies in response to environmental changes. This research proposes that counterfactual reasoning might be an additional mechanism that facilitates change detection. An experiment is conducted in which a task state changes over time and (...)
    Direct download (4 more)  
     
    Export citation  
     
    My bibliography  
  4.  1
    Samuel J. Gershman & Yael Niv (2015). Novelty and Inductive Generalization in Human Reinforcement Learning. Topics in Cognitive Science 7 (3):391-415.
    In reinforcement learning, a decision maker searching for the most rewarding option is often faced with the question: What is the value of an option that has never been tried before? One way to frame this question is as an inductive problem: How can I generalize my previous experience with one set of options to a novel option? We show how hierarchical Bayesian inference can be used to solve this problem, and we describe an equivalence between the Bayesian model (...)
    Direct download (4 more)  
     
    Export citation  
     
    My bibliography  
  5.  8
    James A. Dinsmoor (1983). Observing and Conditioned Reinforcement. Behavioral and Brain Sciences 6 (4):693.
    Direct download (2 more)  
     
    Export citation  
     
    My bibliography   59 citations  
  6.  2
    Allan R. Wagner (1961). Effects of Amount and Percentage of Reinforcement and Number of Acquisition Trials on Conditioning and Extinction. Journal of Experimental Psychology 62 (3):234.
  7.  14
    Neal E. Miller (1948). Studies of Fear as an Acquirable Drive: I. Fear as Motivation and Fear-Reduction as Reinforcement in the Learning of New Responses. Journal of Experimental Psychology 38 (1):89.
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   48 citations  
  8.  3
    Douglas Anger (1956). The Dependence of Interresponse Times Upon the Relative Reinforcement of Different Interresponse Times. Journal of Experimental Psychology 52 (3):145.
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   48 citations  
  9.  6
    O. H. Mowrer & H. Jones (1945). Habit Strength as a Function of the Pattern of Reinforcement. Journal of Experimental Psychology 35 (4):293.
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   53 citations  
  10. Stewart H. Hulse Jr (1958). Amount and Percentage of Reinforcement and Duration of Goal Confinement in Conditioning and Extinction. Journal of Experimental Psychology 56 (1):48.
  11.  4
    M. David Egger & Neal E. Miller (1962). Secondary Reinforcement in Rats as a Function of Information Value and Reliability of the Stimulus. Journal of Experimental Psychology 64 (2):97.
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   24 citations  
  12.  3
    David R. Thomas, David L. Berman & George E. Serednesky (1968). Information Value and Stimulus Configuring as Factors in Conditioned Reinforcement. Journal of Experimental Psychology 76 (2p1):181.
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   20 citations  
  13.  6
    Charles B. Ferster (1953). Sustained Behavior Under Delayed Reinforcement. Journal of Experimental Psychology 45 (4):218.
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   25 citations  
  14. Arnold H. Buss (1956). Reversal and Nonreversal Shifts in Concept Formation with Partial Reinforcement Eliminated. Journal of Experimental Psychology 52 (3):162.
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   26 citations  
  15.  8
    David A. Grant & Esta Berg (1948). A Behavioral Analysis of Degree of Reinforcement and Ease of Shifting to New Responses in a Weigl-Type Card-Sorting Problem. Journal of Experimental Psychology 38 (4):404.
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   3 citations  
  16. David R. Powell Jr & Charles C. Perkins Jr (1957). Strength of Secondary Reinforcement as a Determiner of the Effects of Duration of Goal Response on Learning. Journal of Experimental Psychology 53 (2):106.
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   19 citations  
  17.  1
    C. T. Perin (1943). A Quantitative Investigation of the Delay-of-Reinforcement Gradient. Journal of Experimental Psychology 32 (1):37.
  18.  2
    E. J. Capaldi (1974). Partial Reward Either Following or Preceding Consistent Reward: A Case of Reinforcement Level. Journal of Experimental Psychology 102 (6):954.
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   11 citations  
  19.  10
    G. Robert Grice (1948). The Relation of Secondary Reinforcement to Delayed Reward in Visual Discrimination Learning. Journal of Experimental Psychology 38 (1):1.
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   6 citations  
  20.  3
    George Collier & Melvin H. Marx (1959). Changes in Performance as a Function of Shifts in the Magnitude of Reinforcement. Journal of Experimental Psychology 57 (5):305.
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   7 citations  
  21. James E. Spivey (1967). Resistance to Extinction as a Function of Number of N-R Transitions and Percentage of Reinforcement. Journal of Experimental Psychology 75 (1):43.
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   7 citations  
  22.  21
    Yanping Liu, Erik D. Reichle & Ding‐Guo Gao (2013). Using Reinforcement Learning to Examine Dynamic Attention Allocation During Reading. Cognitive Science 37 (8):1507-1540.
    A fundamental question in reading research concerns whether attention is allocated strictly serially, supporting lexical processing of one word at a time, or in parallel, supporting concurrent lexical processing of two or more words (Reichle, Liversedge, Pollatsek, & Rayner, 2009). The origins of this debate are reviewed. We then report three simulations to address this question using artificial reading agents (Liu & Reichle, 2010; Reichle & Laurent, 2006) that learn to dynamically allocate attention to 1–4 words to “read” as efficiently (...)
    Direct download (7 more)  
     
    Export citation  
     
    My bibliography  
  23.  4
    William B. Pavlik & Peter L. Carlton (1965). A Reversed Partial-Reinforcement Effect. Journal of Experimental Psychology 70 (4):417.
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   5 citations  
  24.  3
    Peter R. Killeen (1994). Mathematical Principles of Reinforcement. Behavioral and Brain Sciences 17 (1):105.
    Direct download (4 more)  
     
    Export citation  
     
    My bibliography   2 citations  
  25.  7
    Roger L. Mellgren & Dennis G. Dyck (1972). Partial Reinforcement Effect, Reverse Partial Reinforcement Effect, and Generalized Partial Reinforcement Effect Within Subjects. Journal of Experimental Psychology 92 (3):339.
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   3 citations  
  26.  4
    C. I. Hovland (1937). The Generalization of Conditioned Responses. IV. The Effects of Varying Amounts of Reinforcement Upon the Degree of Generalization of Conditioned Responses. [REVIEW] Journal of Experimental Psychology 21 (3):261.
  27.  1
    David A. Stevens & Laurence D. Fechter (1968). Relative Strengths of Approach and Avoidance Tendencies in Discrimination Learning of Rats Trained Under Two Types of Reinforcement. Journal of Experimental Psychology 76 (3p1):489.
  28.  5
    Kent Henderson (1966). Within-Subjects Partial-Reinforcement Effects in Acquisition and in Later Discrimination Learning. Journal of Experimental Psychology 72 (5):704.
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   3 citations  
  29. Joseph A. Sgro & Solomon Weinstock (1963). Effects of Delay on Subsequent Running Under Immediate Reinforcement. Journal of Experimental Psychology 66 (3):260.
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   5 citations  
  30.  1
    David Zeaman (1949). Response Latency as a Function of the Amount of Reinforcement. Journal of Experimental Psychology 39 (4):466.
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   5 citations  
  31.  1
    Norman Stein & Richard Landis (1973). Mediating Role of Human Collateral Behavior During a Spaced-Responding Schedule of Reinforcement. Journal of Experimental Psychology 97 (1):28.
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   3 citations  
  32.  6
    Lloyd G. Humphreys (1939). The Effect of Random Alternation of Reinforcement on the Acquisition and Extinction of Conditioned Eyelid Reactions. Journal of Experimental Psychology 25 (2):141.
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   2 citations  
  33.  5
    Gail Ditkoff & Ronald Ley (1974). Effects of Positive and Negative Force-Contingent Reinforcement on the Frustration Effect in Humans. Journal of Experimental Psychology 102 (5):818.
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   1 citation  
  34.  6
    Dennis G. Dyck, Roger L. Mellgren & Jeffrey A. Seybert (1973). Within-Subject Partial Reinforcement Effects: Differential Extinction Following Nondifferential Percentage of Reinforcement in Acquisition. Journal of Experimental Psychology 97 (3):391.
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   1 citation  
  35.  4
    Morton P. Friedman, Edward C. Carterette & Norman H. Anderson (1968). Long-Term Probability Learning with a Random Schedule of Reinforcement. Journal of Experimental Psychology 78 (3p1):442.
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   1 citation  
  36.  1
    Virginia F. Sheffield (1949). Extinction as a Function of Partial Reinforcement and Distribution of Practice. Journal of Experimental Psychology 39 (4):511.
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   2 citations  
  37.  1
    Harry Fowler & Milton A. Trapold (1962). Escape Performance as a Function of Delay of Reinforcement. Journal of Experimental Psychology 63 (5):464.
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   3 citations  
  38.  3
    Arnold H. Buss (1950). A Study of Concept Formation as a Function of Reinforcement and Stimulus Generalization. Journal of Experimental Psychology 40 (4):494.
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   1 citation  
  39. Robert F. Weiss, Jenny L. Boyer, James T. Colwick & Dennis J. Moran (1971). A Delay of Reinforcement Gradient and Correlated Reinforcement in the Instrumental Conditioning of Conversational Behavior. Journal of Experimental Psychology 90 (1):33.
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   3 citations  
  40.  3
    Alvin R. Mahrer (1956). The Role of Expectancy in Delayed Reinforcement. Journal of Experimental Psychology 52 (2):101.
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   2 citations  
  41.  1
    Norman Guttman (1959). Generalization Gradients Around Stimuli Associated with Different Reinforcement Schedules. Journal of Experimental Psychology 58 (5):335.
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   2 citations  
  42.  3
    Norman Guttman (1953). Operant Conditioning, Extinction, and Periodic Reinforcement in Relation to Concentration of Sucrose Used as Reinforcing Agent. Journal of Experimental Psychology 46 (4):213.
  43.  2
    Clark L. Hull (1947). Reactively Heterogeneous Compound Trial-and-Error Learning with Distributed Trials and Terminal Reinforcement. Journal of Experimental Psychology 37 (2):118.
    No categories
    Direct download (5 more)  
     
    Export citation  
     
    My bibliography  
  44.  2
    Bradley Reynolds (1950). Resistance to Extinction as a Function of the Amount of Reinforcement Present During Acquisition. Journal of Experimental Psychology 40 (1):46.
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography  
  45.  2
    A. Grant Young (1969). Resistance to Extinction as a Function of Partial Reinforcement and Bar Weighting: A Within-S Design. Journal of Experimental Psychology 79 (2p1):363.
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   2 citations  
  46.  6
    R. H. Day, T. S. Wong & Charles I. Brooks (1971). "Radial and Tangential Movement Directions as Determinants of the Haptic Illusion in an L Figure"/ "Frustration Considerations of the Small-Trials Partial Reinforcement Effect: Experience with Nonreward and Intertrial Reinforcement": Errata. Journal of Experimental Psychology 90 (2):344-344.
    Direct download (2 more)  
     
    Export citation  
     
    My bibliography  
  47.  1
    Charles F. Flaherty & John W. Davenport (1972). Successive Brightness Discrimination in Rats Following Regular Versus Random Intermittent Reinforcement. Journal of Experimental Psychology 96 (1):1.
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   2 citations  
  48.  2
    Abram Amsel, James J. Hug & C. Thomas Surridge (1968). Number of Food Pellets, Goal Approaches, and the Partial Reinforcement Effect After Minimal Acquisition. Journal of Experimental Psychology 77 (4):530.
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   1 citation  
  49.  1
    Patrick Suppes & Raymond W. Frankmann (1961). Test of Stimulus Sampling Theory for a Continuum of Responses with Unimodal Noncontingent Determinate Reinforcement. Journal of Experimental Psychology 61 (2):122.
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography   2 citations  
  50.  5
    G. Robert Grice & Herbert M. Goldman (1955). Generalized Extinction and Secondary Reinforcement in Visual Discrimination Learning with Delayed Reward. Journal of Experimental Psychology 50 (3):197.
    Direct download (3 more)  
     
    Export citation  
     
    My bibliography  
1 — 50 / 1000