February 1, 2024
PMAP 8521: Program evaluation
Andrew Young School of Policy Studies
Regression FAQs
Regression FAQs
Regression with R
Regression FAQs
Regression with R
Measuring outcomes
Regression FAQs
Regression with R
Measuring outcomes
(Maybe)
DAGs
How was the 0.05 significance
threshold determined?
Could we say something is significant
if p > 0.05, but just note that it is at
a higher p-value?
Or does it have to fall under 0.05?
Why all this convoluted
logic of null worlds?
Why does this matter for evaluation?
Statistical power!
Do we care about the actual coefficients
or just whether or not they're significant?
How does significance relate to causation?
If we can't use statistics to assert causation
how are we going to use this information
in program evaluation?
What counts as a "good" R²?
Simpson's Paradox
Evaluation is good, but expensive
"Evaluation thinking"
Evaluation is good, but expensive
"Evaluation thinking"
Too much evaluation is bad
Taming programs
Outcome variable
Thing you're measuring
Outcome variable
Thing you're measuring
Outcome change
∆ in thing you're measuring over time
Outcome variable
Thing you're measuring
Outcome change
∆ in thing you're measuring over time
Program effect
∆ in thing you're measuring over time because of the program
Abstraction
Causal thinking is necessary—
even for descriptive work!
Necessity of causal thinking: Mention the McElreath tweet on birth certificate introduction and death ages: https://twitter.com/rlmcelreath/status/1427564280744976384
"Every time I get a haircut, I become more mature!"
"Every time I get a haircut, I become more mature!"
E[Maturity∣do(Get haircut)]
Getting older opens a backdoor path
But what does that mean,
"opening a backdoor path"?
How does statistical association
get passed through paths?
How do I know which of these is which?
d-separation
Except for the one arrow between X and Y,
no statistical association can flow between X and Y
This is identification—
all alternative stories are ruled out
and the relationship is isolated
How exactly do colliders
mess up your results?
It looks like you can
still get the effect of X on Y
Effect of race on police use of force
using administrative data
Effect of race on police use of force
using administrative data
Keyboard shortcuts
↑, ←, Pg Up, k | Go to previous slide |
↓, →, Pg Dn, Space, j | Go to next slide |
Home | Go to first slide |
End | Go to last slide |
Number + Return | Go to specific slide |
b / m / f | Toggle blackout / mirrored / fullscreen mode |
c | Clone slideshow |
p | Toggle presenter mode |
t | Restart the presentation timer |
?, h | Toggle this help |
o | Tile View: Overview of Slides |
Esc | Back to slideshow |
February 1, 2024
PMAP 8521: Program evaluation
Andrew Young School of Policy Studies
Regression FAQs
Regression FAQs
Regression with R
Regression FAQs
Regression with R
Measuring outcomes
Regression FAQs
Regression with R
Measuring outcomes
(Maybe)
DAGs
How was the 0.05 significance
threshold determined?
Could we say something is significant
if p > 0.05, but just note that it is at
a higher p-value?
Or does it have to fall under 0.05?
Why all this convoluted
logic of null worlds?
Why does this matter for evaluation?
Statistical power!
Do we care about the actual coefficients
or just whether or not they're significant?
How does significance relate to causation?
If we can't use statistics to assert causation
how are we going to use this information
in program evaluation?
What counts as a "good" R²?
Simpson's Paradox
Evaluation is good, but expensive
"Evaluation thinking"
Evaluation is good, but expensive
"Evaluation thinking"
Too much evaluation is bad
Taming programs
Outcome variable
Thing you're measuring
Outcome variable
Thing you're measuring
Outcome change
∆ in thing you're measuring over time
Outcome variable
Thing you're measuring
Outcome change
∆ in thing you're measuring over time
Program effect
∆ in thing you're measuring over time because of the program
Abstraction
Causal thinking is necessary—
even for descriptive work!
Necessity of causal thinking: Mention the McElreath tweet on birth certificate introduction and death ages: https://twitter.com/rlmcelreath/status/1427564280744976384
"Every time I get a haircut, I become more mature!"
"Every time I get a haircut, I become more mature!"
E[Maturity∣do(Get haircut)]
Getting older opens a backdoor path
But what does that mean,
"opening a backdoor path"?
How does statistical association
get passed through paths?
How do I know which of these is which?
d-separation
Except for the one arrow between X and Y,
no statistical association can flow between X and Y
This is identification—
all alternative stories are ruled out
and the relationship is isolated
How exactly do colliders
mess up your results?
It looks like you can
still get the effect of X on Y
Effect of race on police use of force
using administrative data
Effect of race on police use of force
using administrative data