학습 심리 발제
(Behavioral Perspective)
학습 이란 ? a stable change within an organism caused by an experience or set of experiences
A. Introduction
1. Watson (early 1900s)
-founder of behaviorism
2. Philosophy
a. study behaviors: observable
b. mental processes exist, but can’t be studied
c. describe learning in stimulus – response terms
d. association : 근접성, 빈도, 감정강도
B. Classical Conditioning : (무의식적인) 예측 반응 조건화
0. an anticipatory biological response :
– Classical Conditioning은 생명체가 생존에 중요한 event가 일어날때 일어 나야되는 반응을 적은 신경만을 갖고도 처리할 수있게 하기 위한 자동 예측 반응이다. – 반사행동
– Classical conditioning comes down squarely on the “parallel, emotional, heart, intuitive, unconscious” side.
– ex : an unpleasant emotional reaction
1. Reflexes
-simple, relatively automatic stimulus-response sequence
-must be mediated by the nervous system
2. Pavlov (Unconditioned Reflexes -learning-> conditional reflexes)
a. physiologist studying digestion reflexes using dogs
b. found that dogs began to salivate before food was placed in mouths
c. varied conditions prior to feeding—> Bell
d. ex : Pavlov’s dog and Watson’s fingertip
e. conditional as meaning conditional upon learning or dependent upon conditions. The conditional stimulus is what cues the conditional response.
3. Paradigm
a. Conditioned Reflex 조건 반사 (CS->CR)
-depends on the unique conditions of learning environment
1. Conditioned Stimulus (the bell sound)
2. Conditioned Response (salivation)
b. Unconditioned Reflex 무조건 반사 (US->UR)
-the original, unlearned reflex
1. Unconditioned Stimulus (food placed in mouth)
2. Unconditioned Response (salivation)
c. difference
– CR and UCR occur at a different strength
4. Variations of stimuli and responses
a. forward conditioning :
– delay conditioning 지연 조건형성
– trace conditioning 흔적 조건형성
b. simultaneous conditiong 동시 조건형성
c. backeard conditiong 후향 조건 형성
d. temporal conditiong – the background, or context, can serve as the CS
e. unpaired conditiong – to study non-associative behavioral responses, such as sensitization 민감화
f. cs-alone extinction
g. 학습 효과 : DC > SC or TC > BC
5. This established the Science of learning (Observable!)
6. Terms and Concepts
a. Extinction – 소거 ( vs 망각)
-conditioned reflex established
-presentation of conditioned stimulus without unconditioned stimulus
-eventual “unlearning of association” (but not really!
see spontaneous recovery)
b. Spontaneous Recovery (Thomas Landauer)
1. a single re-presentation of UCS & CS pairing results in full conditioned reflex
2. after time elapses, some recovery takes place without any pairing
c. Generalization (overgeneralization) – Albert
-stimuli similar to CS will elicit the CR
-more similar = greater response or greater likelihood of response
-colors, smells, tones
d. Discrimination Training (동물이나 유아의 감각/인지능력 판별)
-notices differences between situations rather than similarities
-can train to discriminate between two similar stimuli
-always pair 1000Hz tone with food
-always present 1200Hz tone without food
-release from habituation & experimental neurosis (by excessively fine discriminations)
e. 2차 조건 형성 (second/high-order conditioning) Rescola
-CS1+US->UCR, CS1 -> CR, CS1+CS2->CR1 then CS2->CR2
-간접경험
f. 역 조건 형성 (encounter-conditioning) – 공포 CS + 긍정적 CS -> 공포 CS 소거
g. Robert A. Rescorla (1988 American Psychologist article)
1. The textbook description emphasized contiguity [closeness in time] of the conditional stimulus and unconditional stimulus, while modern studies emphasize the informative or predictive nature of the conditional stimulus.
2. The textbook description gave the impression any two stimuli could be associated as conditional stimulus and unconditional stimulus. Modern research shows that some stimuli are much easier to associate with a particular biological response than others.
3. The old textbook description gave the impression conditioning was slow and gradual, requiring many repetitions or trials. Rescorla wrote, “Although conditioning can sometimes be slow, in fact most modern conditioning preparations routinely show rapid learning” requiring from 1 to 8 trials.
-Classical conditioning “is intimately involved in the control of central psychological processes, such as emotions and motivation.”
-Classical conditioning is now known to be involved with the immune system, sexual anticipation, tolerance to addictive drugs, and much more.
7. Contiguity 근접성과 Contingency 수반성
a. Rescola의 실험
b. Garcia의 맛-혐오 실험 : 근접하지 않아도 학습이 수행된다.
c. Kamin의 blocking 차폐 실험 cf: overshadowing
C. Operant Conditioning : 의지(의식적인) 반응 조건화
0. contrasts between classical and operant conditioning
-Memory for classically conditioned responses occurs throughout the nervous system at the neural level
-Memory for patterns of operant responses (i.e. complex non-instinctive behavior) typically requires the hippocampus, the part of the brain responsible for event memory.
-An operant is defined by its effect.
-Operant and classical conditioning effects are mingled in course of normal animal behavior.
1. Operant responses
-operate on environment to produce some change
-any behavioral response that has an effect on the environment
2. Law of Effect
a. Thorndike (1898) - 기능주의
b. cat learned to respond in appropriate manner to get food
[diagram] puzzle box
c. Responses that produce a satisfying effect in a particular situation become more likely
to occur again in that situation, and responses that produce a discomforting effect
become less likely to occur again in that situation
d. 한 행동의 강도(빈도, 지속기간 등)는 과거에 그 행동이 초래했던 결과에 좌우된다
e. trial and error learning
3. Pavlov versus Thorndike
a. Pavlov only interested in environmental events which precede the response
-stimulus leads to response
-Push button X and get response Y
b. Thorndike interested in the environmental events which occur after the response (a consequence of the response)
-response leads to stimulus
4. Burrhus Fredric Skinner
-major proponent of behaviorism over past 50 years
-dev
eloped the Skinner Box
[diagram]
-made immediate multiple responses possible
-made quantification of responses easy
-developed the terminology of operant conditioning
-reinforcer, rather than “satisfaction” or “reward”
definition: a stimulus that increases the frequency of a response
5. General Applications
-this is how we learn many skills, cause-and-effect relationships
6. Shaping 행동 조성/ Backward chaining 행동 연쇄화
-reinforce successively closer approximations to the desired response
Exercise: Hot/Cold to condition a volunteer
7. Reinforcement Schedules
[diagram]
a. Fixed ratio
-response must be emitted a certain fixed number of times before a reinforcer occurs
-e.g., every 5th response reinforced
-e.g., selling on commission with quotas putting money in a pop machine FR-1
b. Variable ratio
-response must be emitted a certain average number of times before a reinforcer occurs
-e.g., average every 5th response, but vary ratio
-e.g., slot machines at casinos
c. Fixed interval
-fixed period of time must elapse after each reinforced response before another reinforcement can be obtained
-e.g., reinforced for first response after every 10 seconds
-e.g., getting paid every 2 weeks
d. Variable interval
-average period of time must elapse after each reinforced response before another reinforcement can be obtained
-e.g., reinforced for first response on the average after every 10 seconds, but vary interval
-e.g., rarely occur in a natural setting call about a job application, line is busy, keep calling at regular intervals
e. continuous reinforcement vs. partial reinforcement
f. Effects of different types of simple schedules
– Ratio schedules produce higher rates of responding than interval schedules.
– Variable schedules produce higher rates and greater resistance to extinction than most fixed schedules.
– The variable ratio schedule produces both the highest rate of responding and the greatest resistance to extinction.
– Partial reinforcement schedules are more resistant to extinction than continuous reinforcement schedules
8. Extinction
-the decline in response rate that results when the operant response is no longer followed by a reinforcer
-different reinforcement schedules have different rates of extinction
9. Stimulus Control of Operant Behavior
a. Importance of context
b. Discrimination (e.g., police car and speeding)
c. Discriminant stimulus
-one which signals the availability of a reinforcer or punisher
10. Chaining and Secondary Reinforcement
a. Primary reinforcers
-based on some biological drive or need, such as the need for food, water, attention, or a comfortable temperature
-Primary reinforcers are unlearned. (innately reinforcing)
b. Chain of reinforcers
-Secondary reinforcers are learned or symbolic reinforcers.
-training rat in college
-grades—>education—>better job—>money—>food
11. Consequences
-change the future frequency of a response
a. Reinforcer – leads to increased frequency of response
1. Positive – introduce a favorable stimulus (추가할수록 강화)
e.g., water, money, good grade를 추가 할수있다는 믿음에 행동이 강화되기
2. Negative – remove an unfavorable stimulus (제거할수록 강화) – escape training
e.g., pebble in shoe, adjust walk, brother hitting you, scream for mom을 제거할수있다는 믿음에 행동이 강화되기
b. Punisher – leads to decreased frequency of response
1. Positive – introduce an unfavorable stimulus, a stimulus is added
e.g., shock, hit, scare
2. Negative – remove a favorable stimulus ; Response Cost
e.g., prison, fines, groundings
12. Premack principle
-high-probability behaviors (those performed frequently under conditions of free choice) can be used to reinforce low-probability behaviors.
-As a rule, preferred behaviors can be used to reinforce unpreferred behaviors.
-숙제(선호도 낮은)하면 TV볼수(선호도 높은)있게해 줄께.
D. 회피 및 도피 학습 (보충)
1. 회피 학습
2. 도피 학습
E. Cognitive Learning 인지 학습 (보충)
1. Cognitive Map 인지도 (Tolman)
2. Insight Learning 통찰학습 (Wolfgang Köhler)
3. Observational Learning 관찰학습 (Bandura)
– attention(주의) – retention(파지) – reproduction(재생) – motivation(동기)