- Spain, Sarah L;
- Carvajal-Carmona, Luis G;
- Howarth, Kimberley M;
- Jones, Angela M;
- Su, Zhan;
- Cazier, Jean-Baptiste;
- Williams, Jennet;
- Aaltonen, Lauri A;
- Pharoah, Paul;
- Kerr, David J;
- Cheadle, Jeremy;
- Li, Li;
- Casey, Graham;
- Vodicka, Pavel;
- Sieber, Oliver;
- Lipton, Lara;
- Gibbs, Peter;
- Martin, Nicholas G;
- Montgomery, Grant W;
- Young, Joanne;
- Baird, Paul N;
- Morreau, Hans;
- van Wezel, Tom;
- Ruiz-Ponte, Clara;
- Fernandez-Rozadilla, Ceres;
- Carracedo, Angel;
- Castells, Antoni;
- Castellvi-Bel, Sergi;
- Dunlop, Malcolm;
- Houlston, Richard S;
- Tomlinson, Ian PM
In genome-wide association studies (GWASs) of colorectal cancer, we have identified two genomic regions in which pairs of tagging-single nucleotide polymorphisms (tagSNPs) are associated with disease; these comprise chromosomes 1q41 (rs6691170, rs6687758) and 12q13.13 (rs7163702, rs11169552). We investigated these regions further, aiming to determine whether they contain more than one independent association signal and/or to identify the SNPs most strongly associated with disease. Genotyping of additional sample sets at the original tagSNPs showed that, for both regions, the two tagSNPs were unlikely to identify a single haplotype on which the functional variation lay. Conversely, one of the pair of SNPs did not fully capture the association signal in each region. We therefore undertook more detailed analyses, using imputation, logistic regression, genealogical analysis using the GENECLUSTER program and haplotype analysis. In the 1q41 region, the SNP rs11118883 emerged as a strong candidate based on all these analyses, sufficient to account for the signals at both rs6691170 and rs6687758. rs11118883 lies within a region with strong evidence of transcriptional regulatory activity and has been associated with expression of PDGFRB mRNA. For 12q13.13, a complex situation was found: SNP rs7972465 showed stronger association than either rs11169552 or rs7136702, and GENECLUSTER found no good evidence for a two-SNP model. However, logistic regression and haplotype analyses supported a two-SNP model, in which a signal at the SNP rs706793 was added to that at rs11169552. Post-GWAS fine-mapping studies are challenging, but the use of multiple tools can assist in identifying candidate functional variants in at least some cases.