Analysis of the 9p21.3 sequence associated with coronary artery disease reveals a tendency for duplication in a CAD patient
Metrics: PDF 1272 views | HTML 2972 views | ?
Natalay Kouprina1, Mikhail Liskovykh1,*, Nicholas C.O. Lee1,*, Vladimir N. Noskov1, Joshua J. Waterfall2, Robert L. Walker2, Paul S. Meltzer2, Eric J. Topol3 and Vladimir Larionov1
1Developmental Therapeutics Branch, National Cancer Institute, Bethesda, MD 20892, USA
2Genetics Branch, National Cancer Institute, Bethesda, MD 20892, USA
3The Scripps Translational Science Institute, The Scripps Research Institute and Scripps Health, La Jolla, CA 92037, USA
*These authors have contributed equally to this work
Natalay Kouprina, email: [email protected]
Keywords: TAR-cloning; segmental duplication; genome alterations; 9p21; CAD interval
Received: November 29, 2017 Accepted: February 10, 2018 Epub: February 26, 2018 Published: March 16, 2018
Tandem segmental duplications (SDs) greater than 10 kb are widespread in complex genomes. They provide material for gene divergence and evolutionary adaptation, while formation of specific de novo SDs is a hallmark of cancer and some human diseases. Most SDs map to distinct genomic regions termed ‘duplication blocks’. SDs organization within these blocks is often poorly characterized as they are mosaics of ancestral duplicons juxtaposed with younger duplicons arising from more recent duplication events. Structural and functional analysis of SDs is further hampered as long repetitive DNA structures are underrepresented in existing BAC and YAC libraries. We applied Transformation-Associated Recombination (TAR) cloning, a versatile technique for large DNA manipulation, to selectively isolate the coronary artery disease (CAD) interval sequence within the 9p21.3 chromosome locus from a patient with coronary artery disease and normal individuals. Four tandem head-to-tail duplicons, each ~50 kb long, were recovered in the patient but not in normal individuals. Sequence analysis revealed that the repeats varied by 10-15 SNPs between each other and by 82 SNPs between the human genome sequence (version hg19). SNPs polymorphism within the junctions between repeats allowed two junction types to be distinguished, Type 1 and Type 2, which were found at a 2:1 ratio. The junction sequences contained an Alu element, a sequence previously shown to play a role in duplication. Knowledge of structural variation in the CAD interval from more patients could help link this locus to cardiovascular diseases susceptibility, and maybe relevant to other cases of regional amplification, including cancer.
All site content, except where otherwise noted, is licensed under a Creative Commons Attribution 4.0 License.