Data, Inference, and Decision-Making with Integrity
Describe one way a graph can be manipulated to deceive:
Given data: 2, 4, 4, 5, 9. Mean = ; Median = ; s ≈
P(rolling a 5 on fair die) = ; P(not 5) =
Bayes: P(A|B) = [P(B|A)P(A)] / P(B)
In a class, 60% are right-handed, 20% of right-handers play piano, 30% of left-handers play piano. Find P(piano) overall:
Binomial: n=5, p=0.3, k=2 ⇒ P =
Give one example of a biased survey design and how to fix it:
For population mean (σ known): CI = x̄ ± z* (σ/√n)
Interpretation: If we repeated sampling, about C% of intervals would capture μ.
Sample mean 50, σ=10, n=25, 95% CI:
Beware p-hacking and cherry-picking α.
Is a coin fair? You flip 40 times, get 28 heads. Set up H₀/H₁ and describe next steps:
r = 0.85 between study hours and test scores. What does this mean? What cautions remain?
How can you guard against misuse of statistics in your field of interest?
U2: Mean=4.8; Median=4; s≈2.68
U3: 1/6 and 5/6
U4: P(piano)=0.6*0.2 + 0.4*0.3 = 0.24
U5: Binomial P ≈ C(5,2)(0.3)²(0.7)³
U7: 50 ± 1.96*(10/5) = 50 ± 3.92 → (46.08, 53.92)
U10: r interpretation: strong positive linear relationship; still no causation proof.