Tagging SNP-set selection with maximum information based on linkage disequilibrium structure in genome-wide association studies

    loading  Checking for direct PDF access through Ovid



Effective tagging single-nucleotide polymorphism (SNP)-set selection is crucial to SNP-set analysis in genome-wide association studies (GWAS). Most of the existing tagging SNP-set selection methods cannot make full use of the information hidden in common or rare variants associated diseases. It is noticed that some SNPs have overlapping genetic information owing to linkage disequilibrium (LD) structure between SNPs. Therefore, when testing the association between SNPs and disease susceptibility, it is sufficient to elect the representative SNPs (called tag SNP-set or tagSNP-set) with maximum information.


It is proposed a new tagSNP-set selection method based on LD information between SNPs, namely TagSNP-Set with Maximum Information. Compared with classical SNP-set analytical method, our method not only has higher power, but also can minimize the number of selected tagSNPs and maximize the information provided by selected tagSNPs with less genotyping cost and lower time complexity.



Supplementary information:

Supplementary data are available at Bioinformatics online.

Related Topics

    loading  Loading Related Articles