Aiming off the target: recycling target capture sequencing reads for investigating repetitive DNA
Thomas, William Wayt
MetadataShow full item record
AbstractWith the advance of high-throughput sequencing (HTS), reduced-representation methods such as target capture sequencing (TCS) emerged as cost-efficient ways of gathering genomic information, particularly from coding regions. As the off-target reads from such sequencing are expected to be similar to genome skimming (GS), we assessed the quality of repeat characterization in plant genomes using this data. Repeat composition obtained from TCS datasets of five Rhynchospora (Cyperaceae) species were compared with GS data from the same taxa. In addition, a FISH probe was designed based on the most abundant satellite found in the TCS dataset of Rhynchospora cephalotes. Finally, repeat-based phylogenies of the five Rhynchospora species were constructed based on the GS and TCS dataset and the topologies were compared with a gene-alignment based phylogenetic tree. All the major repetitive DNA families were identified in TCS, including repeats that showed abundances as low as 0.01% in the GS data. Rank correlation between GS and TCS repeat abundances were moderately high (r = 0.58-0.85), increasing after filtering out the targeted loci from the raw TCS reads (r = 0.66-0.92). Repeat data obtained by TCS was also reliable to develop a cytogenetic probe of a new variant of the holocentromeric satellite Tyba. Repeat-based phylogenies from TCS data were congruent with those obtained from GS data and the gene-alignment tree. Our results show that off-target TCS reads can be recycled to identify repeats for cyto- and phylogenomic investigations. Given the growing availability of TCS reads, driven by global phylogenomic projects, our strategy represents a way to recycle genomic data and contribute to a better characterization of plant biodiversity.
CitationCosta L, Marques A, Buddenhagen C, Thomas WW, Huettel B, Schubert V, Dodsworth S, Houben A, Souza G, Pedrosa-Harand A (2021) 'Aiming off the target: recycling target capture sequencing reads for investigating repetitive DNA', Annals of Botany, (), pp.-.
PublisherOxford University Press
JournalAnnals of Botany
- The repetitive DNA landscape in Avena (Poaceae): chromosome and genome evolution defined by major repeat classes in whole-genome sequence reads.
- Authors: Liu Q, Li X, Zhou X, Li M, Zhang F, Schwarzacher T, Heslop-Harrison JS
- Issue date: 2019 May 30
- De novo identification of satellite DNAs in the sequenced genomes of Drosophila virilis and D. americana using the RepeatExplorer and TAREAN pipelines.
- Authors: Silva BSML, Heringer P, Dias GB, Svartman M, Kuhn GCS
- Issue date: 2019
- Holocentromeres in Rhynchospora are associated with genome-wide centromere-specific repeat arrays interspersed among euchromatin.
- Authors: Marques A, Ribeiro T, Neumann P, Macas J, Novák P, Schubert V, Pellino M, Fuchs J, Ma W, Kuhlmann M, Brandt R, Vanzela AL, Beseda T, Šimková H, Pedrosa-Harand A, Houben A
- Issue date: 2015 Nov 3
- Graph-based clustering and characterization of repetitive sequences in next-generation sequencing data.
- Authors: Novák P, Neumann P, Macas J
- Issue date: 2010 Jul 15
- RepeatExplorer: a Galaxy-based web server for genome-wide characterization of eukaryotic repetitive elements from next-generation sequence reads.
- Authors: Novák P, Neumann P, Pech J, Steinhaisl J, Macas J
- Issue date: 2013 Mar 15