Search

Article
Peer Reviewed

Separating homeologs by phasing in the tetraploid wheat transcriptome

UC Davis Previously Published Works (2013)

Abstract Background The high level of identity among duplicated homoeologous genomes in tetraploid pasta wheat presents substantial challenges for de novo transcriptome assembly. To solve this problem, we develop a specialized bioinformatics workflow that optimizes transcriptome assembly and separation of merged homoeologs. To evaluate our strategy, we sequence and assemble the transcriptome of one of the diploid ancestors of pasta wheat, and compare both assemblies with a benchmark set of 13,472 full-length, non-redundant bread wheat cDNAs. Results A total of 489 million 100 bp paired-end reads from tetraploid wheat assemble in 140,118 contigs, including 96% of the benchmark cDNAs. We used a comparative genomics approach to annotate 66,633 open reading frames. The multiple k-mer assembly strategy increases the proportion of cDNAs assembled full-length in a single contig by 22% relative to the best single k-mer size. Homoeologs are separated using a post-assembly pipeline that includes polymorphism identification, phasing of SNPs, read sorting, and re-assembly of phased reads. Using a reference set of genes, we determine that 98.7% of SNPs analyzed are correctly separated by phasing. Conclusions Our study shows that de novo transcriptome assembly of tetraploid wheat benefit from multiple k-mer assembly strategies more than diploid wheat. Our results also demonstrate that phasing approaches originally designed for heterozygous diploid organisms can be used to separate the close homoeologous genomes of tetraploid wheat. The predicted tetraploid wheat proteome and gene models provide a valuable tool for the wheat research community and for those interested in comparative genomic studies.

Cover page: Separating homeologs by phasing in the tetraploid wheat transcriptome

Article
Peer Reviewed

The genetic architecture of genome‐wide recombination rate variation in allopolyploid wheat revealed by nested association mapping

UC Davis Previously Published Works (2018)

Recombination affects the fate of alleles in populations by imposing constraints on the reshuffling of genetic information. Understanding the genetic basis of these constraints is critical for manipulating the recombination process to improve the resolution of genetic mapping, and reducing the negative effects of linkage drag and deleterious genetic load in breeding. Using sequence-based genotyping of a wheat nested association mapping (NAM) population of 2,100 recombinant inbred lines created by crossing 29 diverse lines, we mapped QTL affecting the distribution and frequency of 102 000 crossovers (CO). Genome-wide recombination rate variation was mostly defined by rare alleles with small effects together explaining up to 48.6% of variation. Most QTL were additive and showed predominantly trans-acting effects. The QTL affecting the proximal COs also acted additively without increasing the frequency of distal COs. We showed that the regions with decreased recombination carry more single nucleotide polymorphisms (SNPs) with possible deleterious effects than the regions with a high recombination rate. Therefore, our study offers insights into the genetic basis of recombination rate variation in wheat and its effect on the distribution of deleterious SNPs across the genome. The identified trans-acting additive QTL can be utilized to manipulate CO frequency and distribution in the large polyploid wheat genome opening the possibility to improve the efficiency of gene pyramiding and reducing the deleterious genetic load in the low-recombining pericentromeric regions of chromosomes.

Cover page: The genetic architecture of genome‐wide recombination rate variation in allopolyploid wheat revealed by nested association mapping

Article
Peer Reviewed

Variation in the AvrSr35 gene determines Sr35 resistance against wheat stem rust race Ug99

UC Davis Previously Published Works (2017)

Puccinia graminis f. sp. tritici (Pgt) causes wheat stem rust, a devastating fungal disease. The Sr35 resistance gene confers immunity against this pathogen's most virulent races, including Ug99. We used comparative whole-genome sequencing of chemically mutagenized and natural Pgt isolates to identify a fungal gene named AvrSr35 that is required for Sr35 avirulence. The AvrSr35 gene encodes a secreted protein capable of interacting with Sr35 and triggering the immune response. We show that the origin of Pgt isolates virulent on Sr35 is associated with the nonfunctionalization of the AvrSr35 gene by the insertion of a mobile element. The discovery of AvrSr35 provides a new tool for Pgt surveillance, identification of host susceptibility targets, and characterization of the molecular determinants of immunity in wheat.

Cover page: Variation in the AvrSr35 gene determines Sr35 resistance against wheat stem rust race Ug99

Article
Peer Reviewed

A haplotype map of allohexaploid wheat reveals distinct patterns of selection on homoeologous genomes

UC Davis Previously Published Works (2015)

Background

Bread wheat is an allopolyploid species with a large, highly repetitive genome. To investigate the impact of selection on variants distributed among homoeologous wheat genomes and to build a foundation for understanding genotype-phenotype relationships, we performed population-scale re-sequencing of a diverse panel of wheat lines.

Results

A sample of 62 diverse lines was re-sequenced using the whole exome capture and genotyping-by-sequencing approaches. We describe the allele frequency, functional significance, and chromosomal distribution of 1.57 million single nucleotide polymorphisms and 161,719 small indels. Our results suggest that duplicated homoeologous genes are under purifying selection. We find contrasting patterns of variation and inter-variant associations among wheat genomes; this, in addition to demographic factors, could be explained by differences in the effect of directional selection on duplicated homoeologs. Only a small fraction of the homoeologous regions harboring selected variants overlapped among the wheat genomes in any given wheat line. These selected regions are enriched for loci associated with agronomic traits detected in genome-wide association studies.

Conclusions

Evidence suggests that directional selection in allopolyploids rarely acted on multiple parallel advantageous mutations across homoeologous regions, likely indicating that a fitness benefit could be obtained by a mutation at any one of the homoeologs. Additional advantageous variants in other homoelogs probably either contributed little benefit, or were unavailable in populations subjected to directional selection. We hypothesize that allopolyploidy may have increased the likelihood of beneficial allele recovery by broadening the set of possible selection targets.

Article
Peer Reviewed

A haplotype map of allohexaploid wheat reveals distinct patterns of selection on homoeologous genomes

UC Davis Previously Published Works (2015)

Background

Bread wheat is an allopolyploid species with a large, highly repetitive genome. To investigate the impact of selection on variants distributed among homoeologous wheat genomes and to build a foundation for understanding genotype-phenotype relationships, we performed population-scale re-sequencing of a diverse panel of wheat lines.

Results

A sample of 62 diverse lines was re-sequenced using the whole exome capture and genotyping-by-sequencing approaches. We describe the allele frequency, functional significance, and chromosomal distribution of 1.57 million single nucleotide polymorphisms and 161,719 small indels. Our results suggest that duplicated homoeologous genes are under purifying selection. We find contrasting patterns of variation and inter-variant associations among wheat genomes; this, in addition to demographic factors, could be explained by differences in the effect of directional selection on duplicated homoeologs. Only a small fraction of the homoeologous regions harboring selected variants overlapped among the wheat genomes in any given wheat line. These selected regions are enriched for loci associated with agronomic traits detected in genome-wide association studies.

Conclusions

Evidence suggests that directional selection in allopolyploids rarely acted on multiple parallel advantageous mutations across homoeologous regions, likely indicating that a fitness benefit could be obtained by a mutation at any one of the homoeologs. Additional advantageous variants in other homoelogs probably either contributed little benefit, or were unavailable in populations subjected to directional selection. We hypothesize that allopolyploidy may have increased the likelihood of beneficial allele recovery by broadening the set of possible selection targets.

Article
Peer Reviewed

Genome-wide comparative diversity uncovers multiple targets of selection for improvement in hexaploid wheat landraces and cultivars

UC Davis Previously Published Works (2013)

Domesticated crops experience strong human-mediated selection aimed at developing high-yielding varieties adapted to local conditions. To detect regions of the wheat genome subject to selection during improvement, we developed a high-throughput array to interrogate 9,000 gene-associated single-nucleotide polymorphisms (SNP) in a worldwide sample of 2,994 accessions of hexaploid wheat including landraces and modern cultivars. Using a SNP-based diversity map we characterized the impact of crop improvement on genomic and geographic patterns of genetic diversity. We found evidence of a small population bottleneck and extensive use of ancestral variation often traceable to founders of cultivars from diverse geographic regions. Analyzing genetic differentiation among populations and the extent of haplotype sharing, we identified allelic variants subjected to selection during improvement. Selective sweeps were found around genes involved in the regulation of flowering time and phenology. An introgression of a wild relative-derived gene conferring resistance to a fungal pathogen was detected by haplotype-based analysis. Comparing selective sweeps identified in different populations, we show that selection likely acts on distinct targets or multiple functionally equivalent alleles in different portions of the geographic range of wheat. The majority of the selected alleles were present at low frequency in local populations, suggesting either weak selection pressure or temporal variation in the targets of directional selection during breeding probably associated with changing agricultural practices or environmental conditions. The developed SNP chip and map of genetic variation provide a resource for advancing wheat breeding and supporting future population genomic and genome-wide association studies in wheat.

Article
Peer Reviewed

Characterization of polyploid wheat genomic diversity using a high‐density 90 000 single nucleotide polymorphism array

UC Davis Previously Published Works (2014)

High-density single nucleotide polymorphism (SNP) genotyping arrays are a powerful tool for studying genomic patterns of diversity, inferring ancestral relationships between individuals in populations and studying marker-trait associations in mapping experiments. We developed a genotyping array including about 90,000 gene-associated SNPs and used it to characterize genetic variation in allohexaploid and allotetraploid wheat populations. The array includes a significant fraction of common genome-wide distributed SNPs that are represented in populations of diverse geographical origin. We used density-based spatial clustering algorithms to enable high-throughput genotype calling in complex data sets obtained for polyploid wheat. We show that these model-free clustering algorithms provide accurate genotype calling in the presence of multiple clusters including clusters with low signal intensity resulting from significant sequence divergence at the target SNP site or gene deletions. Assays that detect low-intensity clusters can provide insight into the distribution of presence-absence variation (PAV) in wheat populations. A total of 46 977 SNPs from the wheat 90K array were genetically mapped using a combination of eight mapping populations. The developed array and cluster identification algorithms provide an opportunity to infer detailed haplotype structure in polyploid wheat and will serve as an invaluable resource for diversity studies and investigating the genetic basis of trait variation in wheat.

Cover page: Characterization of polyploid wheat genomic diversity using a high‐density 90 000 single nucleotide polymorphism array

Article
Peer Reviewed

A chromosome-based draft sequence of the hexaploid bread wheat (Triticum aestivum) genome

UC Riverside Previously Published Works (2014)

An ordered draft sequence of the 17-gigabase hexaploid bread wheat (Triticum aestivum) genome has been produced by sequencing isolated chromosome arms. We have annotated 124,201 gene loci distributed nearly evenly across the homeologous chromosomes and subgenomes. Comparative gene analysis of wheat subgenomes and extant diploid and tetraploid wheat relatives showed that high sequence similarity and structural conservation are retained, with limited gene loss, after polyploidization. However, across the genomes there was evidence of dynamic gene gain, loss, and duplication since the divergence of the wheat lineages. A high degree of transcriptional autonomy and no global dominance was found for the subgenomes. These insights into the genome biology of a polyploid crop provide a springboard for faster gene isolation, rapid genetic marker development, and precise breeding to meet the needs of increasing food demand worldwide.

Cover page: A chromosome-based draft sequence of the hexaploid bread wheat (Triticum aestivum) genome