Population structure, differential bias and genomic control in a large-scale, case-control association study.
Clayton DG., Walker NM., Smyth DJ., Pask R., Cooper JD., Maier LM., Smink LJ., Lam AC., Ovington NR., Stevens HE., Nutland S., Howson JMM., Faham M., Moorhead M., Jones HB., Falkowski M., Hardenbol P., Willis TD., Todd JA.
The main problems in drawing causal inferences from epidemiological case-control studies are confounding by unmeasured extraneous factors, selection bias and differential misclassification of exposure. In genetics the first of these, in the form of population structure, has dominated recent debate. Population structure explained part of the significant +11.2% inflation of test statistics we observed in an analysis of 6,322 nonsynonymous SNPs in 816 cases of type 1 diabetes and 877 population-based controls from Great Britain. The remainder of the inflation resulted from differential bias in genotype scoring between case and control DNA samples, which originated from two laboratories, causing false-positive associations. To avoid excluding SNPs and losing valuable information, we extended the genomic control method by applying a variable downweighting to each SNP.