The GWAS catalog is a free online database that compiles data of genome-wide association studies (GWAS), summarizing unstructured data from different literature sources into accessible high quality data.[1] It was created by the National Human Genome Research Institute (NHGRI) in 2008 and have become a collaborative project between the NHGRI and the European Bioinformatics Institute (EBI) since 2010.[1] As of September 2018, it has included 71,673 SNP–trait associations in 3,567 publications.[2]
A GWAS identifies genetic loci associated with common traits and disease through the analysis of categorized variants across the genome and the catalog provides information from all published GWAS results that meet its criteria.[3] The catalog contains publication information, study groups information (origin, size) and SNP-disease association information (including SNP identifier, P-value, gene and risk allele).[4] Over the years, the GWAS catalog has enhanced its data release frequency by adding features such as graphical user interface, ontology-supported search functionality and a curation interface.[3]
The GWAS catalog is widely used to identify causal variants and understand disease mechanisms by biologists, bioinformaticians and other researchers.[4][5][6][7][8] Some GWAS identified common genomic loci that are associated diseases include: cardiovascular disease, inflammatory bowel disease, type 2 diabetes and breast cancer.[3]
Accessibility of data
The public can gain access to the GWAS Catalog’ s data in three ways:[4]
- The NHGRI web interface’s search: provide information on traits and study publication and an tab-delimited file that is available for download.[4]
- Interactive interface: provide a visualization of all SNP-associated traits in the GWAS catalog as well as SNPs’ positions on human chromosomes.[4] And all SNPs are associated with a particular trait are displayed with web links to related literature from different databases.[4]
- Ensembl, the UCSC Genome Browser, the PheGenI and other data portals provide access to the GWAS catalog through providing web links.[4]
Applications
Some current applications of the GWAS Catalog include the use of studies on the genetics of human diseases [5][6] and the heritability of human traits.[7] The GWAS catalog data can also be used as a pool of markers for SNP studies.[8]
References
- 1 2 "About the GWAS Catalog". GWAS Catalog. Retrieved 2019-06-17.
- ↑ Buniello A, MacArthur JA, Cerezo M, Harris LW, Hayhurst J, Malangone C, et al. (January 2019). "The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019". Nucleic Acids Research. 47 (D1): D1005–D1012. doi:10.1093/nar/gky1120. PMC 6323933. PMID 30445434.
- 1 2 3 MacArthur J, Bowler E, Cerezo M, Gil L, Hall P, Hastings E, et al. (January 2017). "The new NHGRI-EBI Catalog of published genome-wide association studies (GWAS Catalog)". Nucleic Acids Research. 45 (D1): D896–D901. doi:10.1093/nar/gkw1133. PMC 5210590. PMID 27899670.
- 1 2 3 4 5 6 7 Welter D, MacArthur J, Morales J, Burdett T, Hall P, Junkins H, et al. (January 2014). "The NHGRI GWAS Catalog, a curated resource of SNP-trait associations". Nucleic Acids Research. 42 (Database issue): D1001-6. doi:10.1093/nar/gkt1229. PMC 3965119. PMID 24316577.
- 1 2 Morales J, Welter D, Bowler EH, Cerezo M, Harris LW, McMahon AC, et al. (February 2018). "A standardized framework for representation of ancestry data in genomics studies, with application to the NHGRI-EBI GWAS Catalog". Genome Biology. 19 (1): 21. doi:10.1186/s13059-018-1396-2. PMC 5815218. PMID 29448949.
- 1 2 Loos RJ, Yeo GS (January 2014). "The bigger picture of FTO: the first GWAS-identified obesity gene". Nature Reviews. Endocrinology. 10 (1): 51–61. doi:10.1038/nrendo.2013.227. PMC 4188449. PMID 24247219.
- 1 2 López-Cortegano E, Caballero A (July 2019). "Inferring the Nature of Missing Heritability in Human Traits Using Data from the GWAS Catalog". Genetics. 212 (3): 891–904. doi:10.1534/genetics.119.302077. PMC 6614893. PMID 31123044. Retrieved 7 Nov 2019.
- 1 2 Pal LR, Moult J (July 2015). "Genetic Basis of Common Human Disease: Insight into the Role of Missense SNPs from Genome-Wide Association Studies". Journal of Molecular Biology. 427 (13): 2271–89. doi:10.1016/j.jmb.2015.04.014. PMC 4893807. PMID 25937569.