Mapping copy number variation by population-scale genome sequencing
Mills, Ryan E.; Walter, Klaudia; Stewart, Chip; Handsaker, Robert E.; Chen, Ken; Alkan, Can; Abyzov, Alexej; Yoon, Seungtai Chris; Ye, Kai; Cheetham, R. Keira; Chinwalla, Asif; Conrad, Donald F.; Fu, Yutao; Grubert, Fabian; Hajirasouliha, Iman; Hormozdiari, Fereydoun; Iakoucheva, Lilia M.; Iqbal, Zamin; Kang, Shuli; Kidd, Jeffrey M.; Konkel, Miriam K.; Korn, Joshua; Khurana, Ekta; Kural, Deniz; Lam, Hugo Y. K.; Leng, Jing; Li, Ruiqiang; Li, Yingrui; Lin, Chang-Yun; Luo, Ruibang; Mu, Xinmeng Jasmine; Nemesh, James; Peckham, Heather E.; Rausch, Tobias; Scally, Aylwyn; Shi, Xinghua; Stromberg, Michael P.; Stütz, Adrian M.; Urban, Alexander Eckehart; Walker, Jerilyn A.; Wu, Jiantao; Zhang, Yujun; Zhang, Zhengdong D.; Batzer, Mark A.; Ding, Li; Marth, Gabor T.; McVean, Gil; Sebat, Jonathan; Snyder, Michael; Wang, Jun; Ye, Kenny; Eichler, Evan E.; Gerstein, Mark B.; Hurles, Matthew E.; Lee, Charles; McCarroll, Steven A.; Korbel, Jan O.; 1000 Genomes Project; Mills, Ryan E.; Department of Pathology, Brigham and Womenâ s Hospital and Harvard Medical School, Boston, Massachusetts, USA; Walter, Klaudia; The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA, UK; Stewart, Chip; Department of Biology, Boston College, Boston, Massachusetts, USA; Handsaker, Robert E.; Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, Massachusetts, USA; Chen, Ken; The Genome Center at Washington University, St. Louis, Missouri, USA; Alkan, Can; Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington, USA; Howard Hughes Medical Institute, University of Washington, Seattle, Washington, USA; Abyzov, Alexej; Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, USA; Yoon, Seungtai Chris; Seaver Autism Center and Department of Psychiatry, Mount Sinai School of Medicine, New York, New York, USA; Ye, Kai; Departments of Molecular Epidemiology, Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, The Netherlands; Cheetham, R. Keira; Illumina Cambridge Ltd, Chesterford Research Park, Little Chesterford, Saffron Walden CB10 1XL, UK; Chinwalla, Asif; The Genome Center at Washington University, St. Louis, Missouri, USA; Conrad, Donald F.; The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA, UK; Fu, Yutao; Life Technologies, Beverly, Massachusetts, USA; Grubert, Fabian; Department of Genetics, Stanford University, Stanford, California, USA; Hajirasouliha, Iman; School of Computing Science, Simon Fraser University, Burnaby, British Columbia, Canada; Hormozdiari, Fereydoun; School of Computing Science, Simon Fraser University, Burnaby, British Columbia, Canada; Iakoucheva, Lilia M.; Department of Psychiatry, Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, University of California, San Diego, La Jolla, California, USA; Iqbal, Zamin; Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK; Kang, Shuli; Department of Psychiatry, Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, University of California, San Diego, La Jolla, California, USA; Kidd, Jeffrey M.; Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington, USA; Konkel, Miriam K.; Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, USA; Korn, Joshua; Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, Massachusetts, USA; Khurana, Ekta; Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, USA; Molecular Biophysics and Biochemistry Department, Yale University, New Haven, Connecticut, USA; Kural, Deniz; Department of Biology, Boston College, Boston, Massachusetts, USA; Lam, Hugo Y. K.; Department of Genetics, Stanford University, Stanford, California, USA; Leng, Jing; Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, USA; Li, Ruiqiang; BGI-Shenzhen, Shenzhen 518083, China; Li, Yingrui; BGI-Shenzhen, Shenzhen 518083, China; Lin, Chang-Yun; Albert Einstein College of Medicine, Bronx, New York, USA; Luo, Ruibang; BGI-Shenzhen, Shenzhen 518083, China; Mu, Xinmeng Jasmine; Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, USA; Nemesh, James; Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, Massachusetts, USA; Peckham, Heather E.; Life Technologies, Beverly, Massachusetts, USA; Rausch, Tobias; Genome Biology Research Unit, European Molecular Biology Laboratory, Heidelberg, Germany; Scally, Aylwyn; The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA, UK; Shi, Xinghua; Department of Pathology, Brigham and Womenâ s Hospital and Harvard Medical School, Boston, Massachusetts, USA; Stromberg, Michael P.; Department of Biology, Boston College, Boston, Massachusetts, USA; Stütz, Adrian M.; Genome Biology Research Unit, European Molecular Biology Laboratory, Heidelberg, Germany; Urban, Alexander Eckehart; Department of Genetics, Stanford University, Stanford, California, USA; Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, California, USA; Walker, Jerilyn A.; Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, USA; Wu, Jiantao; Department of Biology, Boston College, Boston, Massachusetts, USA; Zhang, Yujun; The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA, UK; Zhang, Zhengdong D.; Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, USA; Batzer, Mark A.; Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, USA; Ding, Li; The Genome Center at Washington University, St. Louis, Missouri, USA; Department of Genetics, Washington University, St Louis, Missouri, USA; Marth, Gabor T.; Department of Biology, Boston College, Boston, Massachusetts, USA; McVean, Gil; Department of Statistics, University of Oxford, OX3 7BN, UK; Sebat, Jonathan; Department of Psychiatry, Department of Cellular and Molecular Medicine, Institute for Genomic Medicine, University of California, San Diego, La Jolla, California, USA; Snyder, Michael; Department of Genetics, Stanford University, Stanford, California, USA; Wang, Jun; BGI-Shenzhen, Shenzhen 518083, China; Department of Biology, University of Copenhagen, Copenhagen, Denmark; Ye, Kenny; Albert Einstein College of Medicine, Bronx, New York, USA; Eichler, Evan E.; Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington, USA; Howard Hughes Medical Institute, University of Washington, Seattle, Washington, USA; Gerstein, Mark B.; Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, USA; Molecular Biophysics and Biochemistry Department, Yale University, New Haven, Connecticut, USA; Department of Computer Science, Yale University, New Haven, Connecticut, USA; Hurles, Matthew E.; The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA, UK; Lee, Charles; Department of Pathology, Brigham and Womenâ s Hospital and Harvard Medical School, Boston, Massachusetts, USA; McCarroll, Steven A.; Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, Massachusetts, USA; Department of Genetics, Harvard Medical School, Boston, Massachusetts, USA; Korbel, Jan O.; Genome Biology Research Unit, European Molecular Biology Laboratory, Heidelberg, Germany
Журнал:
Nature
Дата:
2011
Аннотация:
Genomic structural variants (SVs) are abundant in humans, differing from other forms of variation in extent, origin and functional impact. Despite progress in SV characterization, the nucleotide resolution architecture of most SVs remains unknown. We constructed a map of unbalanced SVs (that is, copy number variants) based on whole genome DNA sequencing data from 185 human genomes, integrating evidence from complementary SV discovery approaches with extensive experimental validations. Our map encompassed 22,025 deletions and 6,000 additional SVs, including insertions and tandem duplications. Most SVs (53%) were mapped to nucleotide resolution, which facilitated analysing their origin and functional impact. We examined numerous whole and partial gene deletions with a genotyping approach and observed a depletion of gene disruptions amongst high frequency deletions. Furthermore, we observed differences in the size spectra of SVs originating from distinct formation mechanisms, and constructed a map of SV hotspots formed by common mechanisms. Our analytical framework and SV map serves as a resource for sequencing-based association studies.
973.1Кб