1. Original Paper (Bartlett & Mendelson, 2002) Title : "Rademacher and Gaussian Complexities: Risk Bounds and Structural Results" Authors : Peter Bartlett and Shahar Mendelson Journal : Journal of Machine Learning Research (JMLR) , 2002. PDF : Download from JMLR Key Contributions : Introduces Rademacher complexity as a measure of hypothesis class complexity. Provides data-dependent generalization bounds (tighter than VC bounds). Connects Rademacher complexity to Gaussian complexity and covering numbers. 2. Why Rademacher Complexity? Rademacher complexity measures the ability of a hypothesis class to fit random noise , defined as: R n ( H ) = E σ , D n [ sup h ∈ H 1 n ∑ i = 1 n σ i h ( x i ) ] R n ( H ) = E σ , D n [ h ∈ H sup n 1 i = 1 ∑ n σ i h ( x i ) ] where σ i σ i are random Rademacher variables ( ± 1 ± 1 with equal probability). Interpretation : A high Rademacher complexity...