Search

Article
Peer Reviewed

High-throughput genetic clustering of type 2 diabetes loci reveals heterogeneous mechanistic pathways of metabolic disease.

UC San Diego Previously Published Works (2023)

AIMS/HYPOTHESIS: Type 2 diabetes is highly polygenic and influenced by multiple biological pathways. Rapid expansion in the number of type 2 diabetes loci can be leveraged to identify such pathways. METHODS: We developed a high-throughput pipeline to enable clustering of type 2 diabetes loci based on variant-trait associations. Our pipeline extracted summary statistics from genome-wide association studies (GWAS) for type 2 diabetes and related traits to generate a matrix of 323 variants × 64 trait associations and applied Bayesian non-negative matrix factorisation (bNMF) to identify genetic components of type 2 diabetes. Epigenomic enrichment analysis was performed in 28 cell types and single pancreatic cells. We generated cluster-specific polygenic scores and performed regression analysis in an independent cohort (N=25,419) to assess for clinical relevance. RESULTS: We identified ten clusters of genetic loci, recapturing the five from our prior analysis as well as novel clusters related to beta cell dysfunction, pronounced insulin secretion, and levels of alkaline phosphatase, lipoprotein A and sex hormone-binding globulin. Four clusters related to mechanisms of insulin deficiency, five to insulin resistance and one had an unclear mechanism. The clusters displayed tissue-specific epigenomic enrichment, notably with the two beta cell clusters differentially enriched in functional and stressed pancreatic beta cell states. Additionally, cluster-specific polygenic scores were differentially associated with patient clinical characteristics and outcomes. The pipeline was applied to coronary artery disease and chronic kidney disease, identifying multiple overlapping clusters with type 2 diabetes. CONCLUSIONS/INTERPRETATION: Our approach stratifies type 2 diabetes loci into physiologically interpretable genetic clusters associated with distinct tissues and clinical outcomes. The pipeline allows for efficient updating as additional GWAS become available and can be readily applied to other conditions, facilitating clinical translation of GWAS findings. Software to perform this clustering pipeline is freely available.

Cover page: High-throughput genetic clustering of type 2 diabetes loci reveals heterogeneous mechanistic pathways of metabolic disease.

Article
Peer Reviewed

The repertoire of mutational signatures in human cancer

UCLA Previously Published Works (2020)

Somatic mutations in cancer genomes are caused by multiple mutational processes, each of which generates a characteristic mutational signature¹. Here, as part of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium² of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA), we characterized mutational signatures using 84,729,690 somatic mutations from 4,645 whole-genome and 19,184 exome sequences that encompass most types of cancer. We identified 49 single-base-substitution, 11 doublet-base-substitution, 4 clustered-base-substitution and 17 small insertion-and-deletion signatures. The substantial size of our dataset, compared with previous analyses^3-15, enabled the discovery of new signatures, the separation of overlapping signatures and the decomposition of signatures into components that may represent associated-but distinct-DNA damage, repair and/or replication mechanisms. By estimating the contribution of each signature to the mutational catalogues of individual cancer genomes, we revealed associations of signatures to exogenous or endogenous exposures, as well as to defective DNA-maintenance processes. However, many signatures are of unknown cause. This analysis provides a systematic perspective on the repertoire of mutational processes that contribute to the development of human cancer.

Cover page: The repertoire of mutational signatures in human cancer

Article
Peer Reviewed

Biomarker correlates with response to NY-ESO-1 TCR T cells in patients with synovial sarcoma

UC Irvine Previously Published Works (2022)

Autologous T cells transduced to express a high affinity T-cell receptor specific to NY-ESO-1 (letetresgene autoleucel, lete-cel) show promise in the treatment of metastatic synovial sarcoma, with 50% overall response rate. The efficacy of lete-cel treatment in 45 synovial sarcoma patients (NCT01343043) has been previously reported, however, biomarkers predictive of response and resistance remain to be better defined. This post-hoc analysis identifies associations of response to lete-cel with lymphodepleting chemotherapy regimen (LDR), product attributes, cell expansion, cytokines, and tumor gene expression. Responders have higher IL-15 levels pre-infusion (p = 0.011) and receive a higher number of transduced effector memory (CD45RA- CCR7-) CD8 + cells per kg (p = 0.039). Post-infusion, responders have increased IFNγ, IL-6, and peak cell expansion (p < 0.01, p < 0.01, and p = 0.016, respectively). Analysis of tumor samples post-treatment illustrates lete-cel infiltration and a decrease in expression of macrophage genes, suggesting remodeling of the tumor microenvironment. Here we report potential predictive and pharmacodynamic markers of lete-cel response that may inform LDR, cell dose, and strategies to enhance anticancer efficacy.

Cover page: Biomarker correlates with response to NY-ESO-1 TCR T cells in patients with synovial sarcoma

Article
Peer Reviewed

The Integrated Genomic Landscape of Thymic Epithelial Tumors

UC San Francisco Previously Published Works (2018)

Thymic epithelial tumors (TETs) are one of the rarest adult malignancies. Among TETs, thymoma is the most predominant, characterized by a unique association with autoimmune diseases, followed by thymic carcinoma, which is less common but more clinically aggressive. Using multi-platform omics analyses on 117 TETs, we define four subtypes of these tumors defined by genomic hallmarks and an association with survival and World Health Organization histological subtype. We further demonstrate a marked prevalence of a thymoma-specific mutated oncogene, GTF2I, and explore its biological effects on multi-platform analysis. We further observe enrichment of mutations in HRAS, NRAS, and TP53. Last, we identify a molecular link between thymoma and the autoimmune disease myasthenia gravis, characterized by tumoral overexpression of muscle autoantigens, and increased aneuploidy.

Cover page: The Integrated Genomic Landscape of Thymic Epithelial Tumors

Article
Peer Reviewed

Next-generation characterization of the Cancer Cell Line Encyclopedia

UC San Francisco Previously Published Works (2019)

Large panels of comprehensively characterized human cancer models, including the Cancer Cell Line Encyclopedia (CCLE), have provided a rigorous framework with which to study genetic variants, candidate targets, and small-molecule and biological therapeutics and to identify new marker-driven cancer dependencies. To improve our understanding of the molecular features that contribute to cancer phenotypes, including drug responses, here we have expanded the characterizations of cancer cell lines to include genetic, RNA splicing, DNA methylation, histone H3 modification, microRNA expression and reverse-phase protein array data for 1,072 cell lines from individuals of various lineages and ethnicities. Integration of these data with functional characterizations such as drug-sensitivity, short hairpin RNA knockdown and CRISPR-Cas9 knockout data reveals potential targets for cancer drugs and associated biomarkers. Together, this dataset and an accompanying public data portal provide a resource for the acceleration of cancer research using model cancer cell lines.

Cover page: Next-generation characterization of the Cancer Cell Line Encyclopedia

Article
Peer Reviewed

Analyses of non-coding somatic drivers in 2,658 cancer whole genomes.

UC Santa Cruz Previously Published Works (2020)

The discovery of drivers of cancer has traditionally focused on protein-coding genes^1-4. Here we present analyses of driver point mutations and structural variants in non-coding regions across 2,658 genomes from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium⁵ of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA). For point mutations, we developed a statistically rigorous strategy for combining significance levels from multiple methods of driver discovery that overcomes the limitations of individual methods. For structural variants, we present two methods of driver discovery, and identify regions that are significantly affected by recurrent breakpoints and recurrent somatic juxtapositions. Our analyses confirm previously reported drivers^6,7, raise doubts about others and identify novel candidates, including point mutations in the 5' region of TP53, in the 3' untranslated regions of NFKBIZ and TOB1, focal deletions in BRD4 and rearrangements in the loci of AKR1C genes. We show that although point mutations and structural variants that drive cancer are less frequent in non-coding genes and regulatory sequences than in protein-coding genes, additional examples of these drivers will be found as more cancer genomes become available.

Cover page: Analyses of non-coding somatic drivers in 2,658 cancer whole genomes.

Article
Peer Reviewed

Comprehensive Molecular Characterization of Muscle-Invasive Bladder Cancer

UC San Francisco Previously Published Works (2017)

We report a comprehensive analysis of 412 muscle-invasive bladder cancers characterized by multiple TCGA analytical platforms. Fifty-eight genes were significantly mutated, and the overall mutational load was associated with APOBEC-signature mutagenesis. Clustering by mutation signature identified a high-mutation subset with 75% 5-year survival. mRNA expression clustering refined prior clustering analyses and identified a poor-survival "neuronal" subtype in which the majority of tumors lacked small cell or neuroendocrine histology. Clustering by mRNA, long non-coding RNA (lncRNA), and miRNA expression converged to identify subsets with differential epithelial-mesenchymal transition status, carcinoma in situ scores, histologic features, and survival. Our analyses identified 5 expression subtypes that may stratify response to different treatments.

Cover page: Comprehensive Molecular Characterization of Muscle-Invasive Bladder Cancer

Article
Peer Reviewed

Comprehensive Molecular Portraits of Invasive Lobular Breast Cancer

UC San Francisco Previously Published Works (2015)

Invasive lobular carcinoma (ILC) is the second most prevalent histologic subtype of invasive breast cancer. Here, we comprehensively profiled 817 breast tumors, including 127 ILC, 490 ductal (IDC), and 88 mixed IDC/ILC. Besides E-cadherin loss, the best known ILC genetic hallmark, we identified mutations targeting PTEN, TBX3, and FOXA1 as ILC enriched features. PTEN loss associated with increased AKT phosphorylation, which was highest in ILC among all breast cancer subtypes. Spatially clustered FOXA1 mutations correlated with increased FOXA1 expression and activity. Conversely, GATA3 mutations and high expression characterized luminal A IDC, suggesting differential modulation of ER activity in ILC and IDC. Proliferation and immune-related signatures determined three ILC transcriptional subtypes associated with survival differences. Mixed IDC/ILC cases were molecularly classified as ILC-like and IDC-like revealing no true hybrid features. This multidimensional molecular atlas sheds new light on the genetic bases of ILC and provides potential clinical options.

Cover page: Comprehensive Molecular Portraits of Invasive Lobular Breast Cancer

Article
Peer Reviewed

Integrated Molecular Characterization of Uterine Carcinosarcoma

UC Santa Cruz Previously Published Works (2017)

We performed genomic, epigenomic, transcriptomic, and proteomic characterizations of uterine carcinosarcomas (UCSs). Cohort samples had extensive copy-number alterations and highly recurrent somatic mutations. Frequent mutations were found in TP53, PTEN, PIK3CA, PPP2R1A, FBXW7, and KRAS, similar to endometrioid and serous uterine carcinomas. Transcriptome sequencing identified a strong epithelial-to-mesenchymal transition (EMT) gene signature in a subset of cases that was attributable to epigenetic alterations at microRNA promoters. The range of EMT scores in UCS was the largest among all tumor types studied via The Cancer Genome Atlas. UCSs shared proteomic features with gynecologic carcinomas and sarcomas with intermediate EMT features. Multiple somatic mutations and copy-number alterations in genes that are therapeutic targets were identified.

Cover page: Integrated Molecular Characterization of Uterine Carcinosarcoma

Article
Peer Reviewed

Molecular Profiling Reveals Biologically Discrete Subsets and Pathways of Progression in Diffuse Glioma

UC Santa Cruz Previously Published Works (2016)

Therapy development for adult diffuse glioma is hindered by incomplete knowledge of somatic glioma driving alterations and suboptimal disease classification. We defined the complete set of genes associated with 1,122 diffuse grade II-III-IV gliomas from The Cancer Genome Atlas and used molecular profiles to improve disease classification, identify molecular correlations, and provide insights into the progression from low- to high-grade disease. Whole-genome sequencing data analysis determined that ATRX but not TERT promoter mutations are associated with increased telomere length. Recent advances in glioma classification based on IDH mutation and 1p/19q co-deletion status were recapitulated through analysis of DNA methylation profiles, which identified clinically relevant molecular subsets. A subtype of IDH mutant glioma was associated with DNA demethylation and poor outcome; a group of IDH-wild-type diffuse glioma showed molecular similarity to pilocytic astrocytoma and relatively favorable survival. Understanding of cohesive disease groups may aid improved clinical outcomes.

Cover page: Molecular Profiling Reveals Biologically Discrete Subsets and Pathways of Progression in Diffuse Glioma