James R. Glass
Person information
- affiliation: Massachusetts Institute of Technology (MIT), CSAIL, Cambridge, MA, USA
Other persons with the same name
- Jim Glass 0002 — Electric Power Board of Chattanooga, TN, USA
2020 – today
- 2024
- [c347]Junmo Kang, Hongyin Luo, Yada Zhu, Jacob A. Hansen, James R. Glass, David D. Cox, Alan Ritter, Rogério Feris, Leonid Karlinsky:
Self-Specialization: Uncovering Latent Expertise within Large Language Models. ACL (Findings) 2024: 2681-2706 - [c346]Cheng-Yu Hsieh, Yung-Sung Chuang, Chun-Liang Li, Zifeng Wang, Long T. Le, Abhishek Kumar, James R. Glass, Alexander Ratner, Chen-Yu Lee, Ranjay Krishna, Tomas Pfister:
Found in the middle: Calibrating Positional Attention Bias Improves Long Context Utilization. ACL (Findings) 2024: 14982-14995 - [c345]Brian Chen, Nina Shvetsova, Andrew Rouditchenko, Daniel Kondermann, Samuel Thomas, Shih-Fu Chang, Rogério Feris, James R. Glass, Hilde Kuehne:
What, When, and Where? Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions. CVPR 2024: 18419-18429 - [c344]Wei Fang, Yung-Sung Chuang, James R. Glass:
Joint Inference of Retrieval and Generation for Passage Re-ranking. EACL (Findings) 2024: 2289-2298 - [c343]Yung-Sung Chuang, Linlu Qiu, Cheng-Yu Hsieh, Ranjay Krishna, Yoon Kim, James R. Glass:
Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps. EMNLP 2024: 1419-1436 - [c342]Tianhua Zhang, Kun Li, Hongyin Luo, Xixin Wu, James R. Glass, Helen Meng:
Adaptive Query Rewriting: Aligning Rewriters through Marginal Probability of Conversational Answers. EMNLP 2024: 13444-13461 - [c341]Sameer Khurana, Nauman Dawalatabad, Antoine Laurent, Luis Vicente, Pablo Gimeno, Victoria Mingote, James R. Glass:
Cross-Lingual Transfer Learning for Low-Resource Speech Translation. ICASSP Workshops 2024: 670-674 - [c340]Alexander H. Liu, Sung-Lin Yeh, James R. Glass:
Revisiting Self-supervised Learning of Speech Representation from a Mutual Information Perspective. ICASSP 2024: 12051-12055 - [c339]Yuan Gong, Hongyin Luo, Alexander H. Liu, Leonid Karlinsky, James R. Glass:
Listen, Think, and Understand. ICLR 2024 - [c338]Yung-Sung Chuang, Yujia Xie, Hongyin Luo, Yoon Kim, James R. Glass, Pengcheng He:
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models. ICLR 2024 - [c337]Zhang-Wei Hong, Idan Shenfeld, Tsun-Hsuan Wang, Yung-Sung Chuang, Aldo Pareja, James R. Glass, Akash Srivastava, Pulkit Agrawal:
Curiosity-driven Red-teaming for Large Language Models. ICLR 2024 - [c336]Heng-Jui Chang, James R. Glass:
R-Spin: Efficient Speaker and Noise-invariant Representation Learning with Acoustic Pieces. NAACL-HLT 2024: 642-662 - [c335]Tianhua Zhang, Jiaxin Ge, Hongyin Luo, Yung-Sung Chuang, Mingye Gao, Yuan Gong, Yoon Kim, Xixin Wu, Helen Meng, Jim Glass:
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning. NAACL-HLT (Findings) 2024: 4131-4155 - [i156]Alexander H. Liu, Sung-Lin Yeh, James R. Glass:
Revisiting Self-supervised Learning of Speech Representation from a Mutual Information Perspective. CoRR abs/2401.08833 (2024) - [i155]Zhang-Wei Hong, Idan Shenfeld, Tsun-Hsuan Wang, Yung-Sung Chuang, Aldo Pareja, James R. Glass, Akash Srivastava, Pulkit Agrawal:
Curiosity-driven Red-teaming for Large Language Models. CoRR abs/2402.19464 (2024) - [i154]Philip Schroeder, Nathaniel Morgan, Hongyin Luo, James R. Glass:
THREAD: Thinking Deeper with Recursive Spawning. CoRR abs/2405.17402 (2024) - [i153]Andrew Rouditchenko, Yuan Gong, Samuel Thomas, Leonid Karlinsky, Hilde Kuehne, Rogério Feris, James R. Glass:
Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation. CoRR abs/2406.10082 (2024) - [i152]Tianhua Zhang, Kun Li, Hongyin Luo, Xixin Wu, James R. Glass, Helen Meng:
Adaptive Query Rewriting: Aligning Rewriters through Marginal Probability of Conversational Answers. CoRR abs/2406.10991 (2024) - [i151]Junmo Kang, Leonid Karlinsky, Hongyin Luo, Zhen Wang, Jacob A. Hansen, Jim Glass, David D. Cox, Rameswar Panda, Rogério Feris, Alan Ritter:
Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts. CoRR abs/2406.12034 (2024) - [i150]Cheng-Yu Hsieh, Yung-Sung Chuang, Chun-Liang Li, Zifeng Wang, Long T. Le, Abhishek Kumar, James R. Glass, Alexander Ratner, Chen-Yu Lee, Ranjay Krishna, Tomas Pfister:
Found in the Middle: Calibrating Positional Attention Bias Improves Long Context Utilization. CoRR abs/2406.16008 (2024) - [i149]Liming Wang, Yuan Gong, Nauman Dawalatabad, Marco Vilela, Katerina Placek, Brian Tracey, Yishu Gong, Alan Premasiri, Fernando Vieira, James R. Glass:
Automatic Prediction of Amyotrophic Lateral Sclerosis Progression using Longitudinal Speech Transformer. CoRR abs/2406.18625 (2024) - [i148]Yung-Sung Chuang, Linlu Qiu, Cheng-Yu Hsieh, Ranjay Krishna, Yoon Kim, James R. Glass:
Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps. CoRR abs/2407.07071 (2024) - [i147]Haibin Wu, Xuanjun Chen, Yi-Cheng Lin, Kai-Wei Chang, Jiawei Du, Ke-Han Lu, Alexander H. Liu, Ho-Lam Chung, Yuan-Kuei Wu, Dongchao Yang, Songxiang Liu, Yi-Chiao Wu, Xu Tan, James R. Glass, Shinji Watanabe, Hung-yi Lee:
Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural audio codec models. CoRR abs/2409.14085 (2024) - [i146]Zhenting Qi, Hongyin Luo, Xuliang Huang, Zhuokai Zhao, Yibo Jiang, Xiangjun Fan, Himabindu Lakkaraju, James R. Glass:
Quantifying Generalization Complexity for Large Language Models. CoRR abs/2410.01769 (2024) - [i145]Muhammad Jehanzeb Mirza, Mengjie Zhao, Zhuoyuan Mao, Sivan Doveh, Wei Lin, Paul Gavrikov, Michael Dorkenwald, Shiqi Yang, Saurav Jha, Hiromi Wakaki, Yuki Mitsufuji, Horst Possegger, Rogério Feris, Leonid Karlinsky, James R. Glass:
GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models. CoRR abs/2410.06154 (2024)
- 2023
- [c334]Tianxing He, Jingyu Zhang, Tianle Wang, Sachin Kumar, Kyunghyun Cho, James R. Glass, Yulia Tsvetkov:
On the Blind Spots of Model-Based Evaluation Metrics for Text Generation. ACL (1) 2023: 12067-12097 - [c333]Yung-Sung Chuang, Wei Fang, Shang-Wen Li, Wen-tau Yih, James R. Glass:
Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering. ACL (Findings) 2023: 12131-12147 - [c332]Jiaxin Ge, Hongyin Luo, Yoon Kim, James R. Glass:
Entailment as Robust Self-Learner. ACL (1) 2023: 13803-13817 - [c331]Yuan Gong, Alexander H. Liu, Hongyin Luo, Leonid Karlinsky, James R. Glass:
Joint Audio and Speech Understanding. ASRU 2023: 1-8 - [c330]Cheng-I Jeff Lai, Freda Shi, Puyuan Peng, Yoon Kim, Kevin Gimpel, Shiyu Chang, Yung-Sung Chuang, Saurabhchand Bhati, David D. Cox, David Harwath, Yang Zhang, Karen Livescu, James R. Glass:
Audio-Visual Neural Syntax Acquisition. ASRU 2023: 1-8 - [c329]Hongyin Luo, James R. Glass:
Logic Against Bias: Textual Entailment Mitigates Stereotypical Sentence Reasoning. EACL 2023: 1235-1246 - [c328]Hongyin Luo, Tianhua Zhang, Yung-Sung Chuang, Yuan Gong, Yoon Kim, Xixin Wu, Helen Meng, James R. Glass:
Search Augmented Instruction Learning. EMNLP (Findings) 2023: 3717-3729 - [c327]Nauman Dawalatabad, Sameer Khurana, Antoine Laurent, James R. Glass:
On Unsupervised Uncertainty-Driven Speech Pseudo-Label Filtering and Model Calibration. ICASSP 2023: 1-5 - [c326]Andrew Rouditchenko, Yung-Sung Chuang, Nina Shvetsova, Samuel Thomas, Rogério Feris, Brian Kingsbury, Leonid Karlinsky, David Harwath, Hilde Kuehne, James R. Glass:
C2KD: Cross-Lingual Cross-Modal Knowledge Distillation for Multilingual Text-Video Retrieval. ICASSP 2023: 1-5 - [c325]Yuan Gong, Andrew Rouditchenko, Alexander H. Liu, David Harwath, Leonid Karlinsky, Hilde Kuehne, James R. Glass:
Contrastive Audio-Visual Masked Autoencoder. ICLR 2023 - [c324]Andrew Rouditchenko, Sameer Khurana, Samuel Thomas, Rogério Feris, Leonid Karlinsky, Hilde Kuehne, David Harwath, Brian Kingsbury, James R. Glass:
Comparison of Multilingual Self-Supervised and Weakly-Supervised Speech Pre-Training for Adaptation to Unseen Languages. INTERSPEECH 2023: 2268-2272 - [c323]Yuan Gong, Sameer Khurana, Leonid Karlinsky, James R. Glass:
Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers. INTERSPEECH 2023: 2798-2802 - [c322]Heng-Jui Chang, Alexander H. Liu, James R. Glass:
Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clustering. INTERSPEECH 2023: 2983-2987 - [c321]Alexander H. Liu, Heng-Jui Chang, Michael Auli, Wei-Ning Hsu, James R. Glass:
DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning. NeurIPS 2023 - [c320]David Cheng-Han Chiang, Hung-yi Lee, Yung-Sung Chuang, James R. Glass:
Revealing the Blind Spot of Sentence Encoder Evaluation by HEROS. RepL4NLP@ACL 2023: 289-302 - [c319]Jingyu Zhang, James R. Glass, Tianxing He:
PCFG-Based Natural Language Interface Improves Generalization for Controlled Text Generation. *SEM@ACL 2023: 295-313 - [i144]Hongyin Luo, James R. Glass:
Logic Against Bias: Textual Entailment Mitigates Stereotypical Sentence Reasoning. CoRR abs/2303.05670 (2023) - [i143]Brian Chen, Nina Shvetsova, Andrew Rouditchenko, Daniel Kondermann, Samuel Thomas, Shih-Fu Chang, Rogério Feris, James R. Glass, Hilde Kuehne:
What, when, and where? - Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions. CoRR abs/2303.16990 (2023) - [i142]Tianhua Zhang, Hongyin Luo, Yung-Sung Chuang, Wei Fang, Luc Gaitskell, Thomas Hartvigsen, Xixin Wu, Danny Fox, Helen Meng, James R. Glass:
Interpretable Unified Language Checking. CoRR abs/2304.03728 (2023) - [i141]Alexander H. Liu, Heng-Jui Chang, Michael Auli, Wei-Ning Hsu, James R. Glass:
DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning. CoRR abs/2305.10005 (2023) - [i140]Yuan Gong, Hongyin Luo, Alexander H. Liu, Leonid Karlinsky, James R. Glass:
Listen, Think, and Understand. CoRR abs/2305.10790 (2023) - [i139]Heng-Jui Chang, Alexander H. Liu, James R. Glass:
Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clustering. CoRR abs/2305.11072 (2023) - [i138]Andrew Rouditchenko, Sameer Khurana, Samuel Thomas, Rogério Feris, Leonid Karlinsky, Hilde Kuehne, David Harwath, Brian Kingsbury, James R. Glass:
Comparison of Multilingual Self-Supervised and Weakly-Supervised Speech Pre-Training for Adaptation to Unseen Languages. CoRR abs/2305.12606 (2023) - [i137]Hongyin Luo, Yung-Sung Chuang, Yuan Gong, Tianhua Zhang, Yoon Kim, Xixin Wu, Danny Fox, Helen Meng, James R. Glass:
SAIL: Search-Augmented Instruction Learning. CoRR abs/2305.15225 (2023) - [i136]Yung-Sung Chuang, Wei Fang, Shang-Wen Li, Wen-tau Yih, James R. Glass:
Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering. CoRR abs/2305.17080 (2023) - [i135]Jiaxin Ge, Hongyin Luo, Yoon Kim, James R. Glass:
Entailment as Robust Self-Learner. CoRR abs/2305.17197 (2023) - [i134]Sameer Khurana, Nauman Dawalatabad, Antoine Laurent, Luis Vicente, Pablo Gimeno, Victoria Mingote, James R. Glass:
Improved Cross-Lingual Transfer Learning For Automatic Speech Translation. CoRR abs/2306.00789 (2023) - [i133]David Cheng-Han Chiang, Yung-Sung Chuang, James R. Glass, Hung-yi Lee:
Revealing the Blind Spot of Sentence Encoder Evaluation by HEROS. CoRR abs/2306.05083 (2023) - [i132]Yuan Gong, Sameer Khurana, Leonid Karlinsky, James R. Glass:
Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers. CoRR abs/2307.03183 (2023) - [i131]Yung-Sung Chuang, Yujia Xie, Hongyin Luo, Yoon Kim, James R. Glass, Pengcheng He:
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models. CoRR abs/2309.03883 (2023) - [i130]Tianhua Zhang, Jiaxin Ge, Hongyin Luo, Yung-Sung Chuang, Mingye Gao, Yuan Gong, Xixin Wu, Yoon Kim, Helen Meng, James R. Glass:
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning. CoRR abs/2309.10814 (2023) - [i129]Yuan Gong, Alexander H. Liu, Hongyin Luo, Leonid Karlinsky, James R. Glass:
Joint Audio and Speech Understanding. CoRR abs/2309.14405 (2023) - [i128]Junmo Kang, Hongyin Luo, Yada Zhu, James R. Glass, David D. Cox, Alan Ritter, Rogério Feris, Leonid Karlinsky:
Self-Specialization: Uncovering Latent Expertise within Large Language Models. CoRR abs/2310.00160 (2023) - [i127]Cheng-I Jeff Lai, Freda Shi, Puyuan Peng, Yoon Kim, Kevin Gimpel, Shiyu Chang, Yung-Sung Chuang, Saurabhchand Bhati, David D. Cox, David Harwath, Yang Zhang, Karen Livescu, James R. Glass:
Audio-Visual Neural Syntax Acquisition. CoRR abs/2310.07654 (2023) - [i126]Heng-Jui Chang, James R. Glass:
R-Spin: Efficient Speaker and Noise-invariant Representation Learning with Acoustic Pieces. CoRR abs/2311.09117 (2023)
- 2022
- [j36]Gene-Ping Yang, Sung-Lin Yeh, Yu-An Chung, James R. Glass, Hao Tang:
Autoregressive Predictive Coding: A Comprehensive Study. IEEE J. Sel. Top. Signal Process. 16(6): 1380-1390 (2022) - [j35]Sameer Khurana, Antoine Laurent, James R. Glass:
SAMU-XLSR: Semantically-Aligned Multimodal Utterance-Level Cross-Lingual Speech Representation. IEEE J. Sel. Top. Signal Process. 16(6): 1493-1504 (2022) - [j34]Yuan Gong, Alexander H. Liu, Andrew Rouditchenko, James R. Glass:
UAVM: Towards Unifying Audio and Visual Models. IEEE Signal Process. Lett. 29: 2437-2441 (2022) - [c318]Yuan Gong, Cheng-I Lai, Yu-An Chung, James R. Glass:
SSAST: Self-Supervised Audio Spectrogram Transformer. AAAI 2022: 10699-10709 - [c317]Alexander H. Liu, SouYoung Jin, Cheng-I Lai, Andrew Rouditchenko, Aude Oliva, James R. Glass:
Cross-Modal Discrete Representation Learning. ACL (1) 2022: 3013-3035 - [c316]Jiabao Ji, Yoon Kim, James R. Glass, Tianxing He:
Controlling the Focus of Pretrained Language Generation Models. ACL (Findings) 2022: 3291-3306 - [c315]Nina Shvetsova, Brian Chen, Andrew Rouditchenko, Samuel Thomas, Brian Kingsbury, Rogério Feris, David Harwath, James R. Glass, Hilde Kuehne:
Everything at Once - Multi-modal Fusion Transformer for Video Retrieval. CVPR 2022: 19988-19997 - [c314]Nauman Dawalatabad, Yuan Gong, Sameer Khurana, Rhoda Au, James R. Glass:
Detecting Dementia from Long Neuropsychological Interviews. EMNLP (Findings) 2022: 5270-5283 - [c313]Yuan Gong, Jin Yu, James R. Glass:
Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition. ICASSP 2022: 151-155 - [c312]Sameer Khurana, Antoine Laurent, James R. Glass:
Magic Dust for Cross-Lingual Adaptation of Monolingual Wav2vec-2.0. ICASSP 2022: 6647-6651 - [c311]R'mani Haulcy, Katerina Placek, Brian Tracey, Adam P. Vogel, James R. Glass:
Repetition Assessment for Speech and Language Disorders: A Study of the Logopenic Variant of Primary Progressive Aphasia. ICASSP 2022: 6932-6936 - [c310]Yuan Gong, Ziyi Chen, Iek-Heng Chu, Peng Chang, James R. Glass:
Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment. ICASSP 2022: 7262-7266 - [c309]Cheng-I Jeff Lai, Erica Cooper, Yang Zhang, Shiyu Chang, Kaizhi Qian, Yi-Lun Liao, Yung-Sung Chuang, Alexander H. Liu, Junichi Yamagishi, David D. Cox, James R. Glass:
On the Interplay between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis. ICASSP 2022: 8447-8451 - [c308]Alexander H. Liu, Cheng-I Lai, Wei-Ning Hsu, Michael Auli, Alexei Baevski, James R. Glass:
Simple and Effective Unsupervised Speech Synthesis. INTERSPEECH 2022: 843-847 - [c307]Christopher Song, David Harwath, Tuka Alhanai, James R. Glass:
Speak: A Toolkit Using Amazon Mechanical Turk to Collect and Validate Speech Audio Recordings. LREC 2022: 7253-7258 - [c306]Hongyin Luo, Shang-Wen Li, Mingye Gao, Seunghak Yu, James R. Glass:
Cooperative Self-training of Machine Reading Comprehension. NAACL-HLT 2022: 244-257 - [c305]Yung-Sung Chuang, Rumen Dangovski, Hongyin Luo, Yang Zhang, Shiyu Chang, Marin Soljacic, Shang-Wen Li, Scott Yih, Yoon Kim, James R. Glass:
DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings. NAACL-HLT 2022: 4207-4218 - [i125]Jiabao Ji, Yoon Kim, James R. Glass, Tianxing He:
Controlling the Focus of Pretrained Language Generation Models. CoRR abs/2203.01146 (2022) - [i124]Yuan Gong, Sameer Khurana, Andrew Rouditchenko, James R. Glass:
CMKD: CNN/Transformer-Based Cross-Model Knowledge Distillation for Audio Classification. CoRR abs/2203.06760 (2022) - [i123]Alexander H. Liu, Cheng-I Jeff Lai, Wei-Ning Hsu, Michael Auli, Alexei Baevski, James R. Glass:
Simple and Effective Unsupervised Speech Synthesis. CoRR abs/2204.02524 (2022) - [i122]Yung-Sung Chuang, Rumen Dangovski, Hongyin Luo, Yang Zhang, Shiyu Chang, Marin Soljacic, Shang-Wen Li, Wen-tau Yih, Yoon Kim, James R. Glass:
DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings. CoRR abs/2204.10298 (2022) - [i121]Yuan Gong, Ziyi Chen, Iek-Heng Chu, Peng Chang, James R. Glass:
Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment. CoRR abs/2205.03432 (2022) - [i120]Yuan Gong, Jin Yu, James R. Glass:
Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition. CoRR abs/2205.03433 (2022) - [i119]Sameer Khurana, Antoine Laurent, James R. Glass:
SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation. CoRR abs/2205.08180 (2022) - [i118]Vijay Gadepally, Gregory Angelides, Andrei Barbu, Andrew Bowne, Laura J. Brattain, Tamara Broderick, Armando Cabrera, Glenn Carl, Ronisha Carter, Miriam Cha, Emilie Cowen, Jesse Cummings, Bill Freeman, James R. Glass, Sam Goldberg, Mark Hamilton, Thomas Heldt, Kuan Wei Huang, Phillip Isola, Boris Katz, Jamie Koerner, Yen-Chen Lin, David Mayo, Kyle McAlpin, Taylor Perron, Jean E. Piou, Hrishikesh M. Rao, Hayley Reynolds, Kaira Samuel, Siddharth Samsi, Morgan Schmidt, Leslie Shing, Olga Simek, Brandon Swenson, Vivienne Sze, Jonathan Taylor, Paul Tylkin, Mark Veillette, Matthew L. Weiss, Allan B. Wollaber, Sophia Yuditskaya, Jeremy Kepner:
Developing a Series of AI Challenges for the United States Department of the Air Force. CoRR abs/2207.07033 (2022) - [i117]Yuan Gong, Alexander H. Liu, Andrew Rouditchenko, James R. Glass:
UAVM: A Unified Model for Audio-Visual Learning. CoRR abs/2208.00061 (2022) - [i116]Andrew Rouditchenko, Yung-Sung Chuang, Nina Shvetsova, Samuel Thomas, Rogério Feris, Brian Kingsbury, Leonid Karlinsky, David Harwath, Hilde Kuehne, James R. Glass:
C2KD: Cross-Lingual Cross-Modal Knowledge Distillation for Multilingual Text-Video Retrieval. CoRR abs/2210.03625 (2022) - [i115]Jingyu Zhang, James R. Glass, Tianxing He:
PCFG-based Natural Language Interface Improves Generalization for Controlled Text Generation. CoRR abs/2210.07431 (2022) - [i114]Yuan Gong, Andrew Rouditchenko, Alexander H. Liu, David Harwath, Leonid Karlinsky, Hilde Kuehne, James R. Glass:
Contrastive Audio-Visual Masked Autoencoder. CoRR abs/2210.07839 (2022) - [i113]Nauman Dawalatabad, Sameer Khurana, Antoine Laurent, James R. Glass:
On Unsupervised Uncertainty-Driven Speech Pseudo-Label Filtering and Model Calibration. CoRR abs/2211.07795 (2022) - [i112]Tianxing He, Jingyu Zhang, Tianle Wang, Sachin Kumar, Kyunghyun Cho, James R. Glass, Yulia Tsvetkov:
On the Blind Spots of Model-Based Evaluation Metrics for Text Generation. CoRR abs/2212.10020 (2022)
- 2021
- [j33]Yuan Gong, Yu-An Chung, James R. Glass:
PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3292-3306 (2021) - [c304]Wei-Ning Hsu, David Harwath, Tyler Miller, Christopher Song, James R. Glass:
Text-Free Image-to-Speech Synthesis Using Learned Segmental Units. ACL/IJCNLP (1) 2021: 5284-5300 - [c303]Mathew Monfort, SouYoung Jin, Alexander H. Liu, David Harwath, Rogério Feris, James R. Glass, Aude Oliva:
Spoken Moments: Learning Joint Audio-Visual Representations From Video Descriptions. CVPR 2021: 14871-14881 - [c302]Tianxing He, Jun Liu, Kyunghyun Cho, Myle Ott, Bing Liu, James R. Glass, Fuchun Peng:
Analyzing the Forgetting Problem in Pretrain-Finetuning of Open-domain Dialogue Response Models. EACL 2021: 1121-1133 - [c301]Tianxing He, Jingzhao Zhang, Zhiming Zhou, James R. Glass:
Exposure Bias versus Self-Recovery: Are Distortions Really Incremental for Autoregressive Text Generation? EMNLP (1) 2021: 5087-5102 - [c300]Yu-An Chung, Yonatan Belinkov, James R. Glass:
Similarity Analysis of Self-Supervised Speech Representations. ICASSP 2021: 3040-3044 - [c299]Cheng-I Lai, Yung-Sung Chuang, Hung-Yi Lee, Shang-Wen Li, James R. Glass:
Semi-Supervised Spoken Language Understanding via Self-Supervised Speech and Language Model Pretraining. ICASSP 2021: 7468-7472 - [c298]Brian Chen, Andrew Rouditchenko, Kevin Duarte, Hilde Kuehne, Samuel Thomas, Angie W. Boggust, Rameswar Panda, Brian Kingsbury, Rogério Feris, David Harwath, James R. Glass, Michael Picheny, Shih-Fu Chang:
Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos. ICCV 2021: 7992-8001 - [c297]Yuan Gong, Yu-An Chung, James R. Glass:
AST: Audio Spectrogram Transformer. Interspeech 2021: 571-575 - [c296]Andrew Rouditchenko, Angie W. Boggust, David Harwath, Brian Chen, Dhiraj Joshi, Samuel Thomas, Kartik Audhkhasi, Hilde Kuehne, Rameswar Panda, Rogério Schmidt Feris, Brian Kingsbury, Michael Picheny, Antonio Torralba, James R. Glass:
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos. Interspeech 2021: 1584-1588 - [c295]R'mani Haulcy, James R. Glass:
CLAC: A Speech Corpus of Healthy English Speakers. Interspeech 2021: 2966-2970 - [c294]Andrew Rouditchenko, Angie W. Boggust, David Harwath, Samuel Thomas, Hilde Kuehne, Brian Chen, Rameswar Panda, Rogério Feris, Brian Kingsbury, Michael Picheny, James R. Glass:
Cascaded Multilingual Audio-Visual Learning from Videos. Interspeech 2021: 3006-3010 - [c293]Hongyin Luo, James R. Glass, Garima Lalwani, Yi Zhang, Shang-Wen Li:
Joint Retrieval-Extraction Training for Evidence-Aware Dialog Response Selection. Interspeech 2021: 3241-3245 - [c292]Ian Palmer, Andrew Rouditchenko, Andrei Barbu, Boris Katz, James R. Glass:
Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset. Interspeech 2021: 3650-3654 - [c291]Alexander H. Liu, Yu-An Chung, James R. Glass:
Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies. Interspeech 2021: 3730-3734 - [c290]Cheng-I Jeff Lai, Yang Zhang, Alexander H. Liu, Shiyu Chang, Yi-Lun Liao, Yung-Sung Chuang, Kaizhi Qian, Sameer Khurana, David D. Cox, James R. Glass:
PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition. NeurIPS 2021: 21256-21272 - [c289]Seunghak Yu, Giovanni Da San Martino, Mitra Mohtarami, James R. Glass, Preslav Nakov:
Interpretable Propaganda Detection in News Articles. RANLP 2021: 1597-1605 - [i111]Hongyin Luo, Shang-Wen Li, James R. Glass:
Knowledge Grounded Conversational Symptom Detection with Graph Memory Networks. CoRR abs/2101.09773 (2021) - [i110]Yuan Gong, Yu-An Chung, James R. Glass:
PSLA: Improving Audio Event Classification with Pretraining, Sampling, Labeling, and Aggregation. CoRR abs/2102.01243 (2021) - [i109]Hongyin Luo, Shang-Wen Li, Seunghak Yu, James R. Glass:
Cooperative Learning of Zero-Shot Machine Reading Comprehension. CoRR abs/2103.07449 (2021) - [i108]Yuan Gong, Yu-An Chung, James R. Glass:
AST: Audio Spectrogram Transformer. CoRR abs/2104.01778 (2021) - [i107]Brian Chen, Andrew Rouditchenko, Kevin Duarte, Hilde Kuehne, Samuel Thomas, Angie W. Boggust, Rameswar Panda, Brian Kingsbury, Rogério Schmidt Feris, David Harwath, James R. Glass, Michael Picheny, Shih-Fu Chang:
Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos. CoRR abs/2104.12671 (2021) - [i106]Mathew Monfort, SouYoung Jin, Alexander H. Liu, David Harwath, Rogério Feris, James R. Glass, Aude Oliva:
Spoken Moments: Learning Joint Audio-Visual Representations from Video Descriptions. CoRR abs/2105.04489 (2021) - [i105]Alexander H. Liu, SouYoung Jin, Cheng-I Jeff Lai, Andrew Rouditchenko, Aude Oliva, James R. Glass:
Cross-Modal Discrete Representation Learning. CoRR abs/2106.05438 (2021) - [i104]Cheng-I Jeff Lai, Yang Zhang, Alexander H. Liu, Shiyu Chang, Yi-Lun Liao, Yung-Sung Chuang, Kaizhi Qian, Sameer Khurana, David D. Cox, James R. Glass:
PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition. CoRR abs/2106.05933 (2021) - [i103]Yung-Sung Chuang, Mingye Gao, Hongyin Luo, James R. Glass, Hung-Yi Lee, Yun-Nung Chen, Shang-Wen Li:
Mitigating Biases in Toxic Language Detection through Invariant Rationalization. CoRR abs/2106.07240 (2021) - [i102]Seunghak Yu, Giovanni Da San Martino, Mitra Mohtarami, James R. Glass, Preslav Nakov:
Interpretable Propaganda Detection in News Articles. CoRR abs/2108.12802 (2021) - [i101]Tianxing He, Kyunghyun Cho, James R. Glass:
An Empirical Study on Few-shot Knowledge Probing for Pretrained Language Models. CoRR abs/2109.02772 (2021) - [i100]Cheng-I Jeff Lai, Erica Cooper, Yang Zhang, Shiyu Chang, Kaizhi Qian, Yi-Lun Liao, Yung-Sung Chuang, Alexander H. Liu, Junichi Yamagishi, David D. Cox, James R. Glass:
On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis. CoRR abs/2110.01147 (2021) - [i99]Sameer Khurana, Antoine Laurent, James R. Glass:
Magic dust for cross-lingual adaptation of monolingual wav2vec-2.0. CoRR abs/2110.03560 (2021) - [i98]Ian Palmer, Andrew Rouditchenko, Andrei Barbu, Boris Katz, James R. Glass:
Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset. CoRR abs/2110.07575 (2021) - [i97]Yuan Gong, Cheng-I Jeff Lai, Yu-An Chung, James R. Glass:
SSAST: Self-Supervised Audio Spectrogram Transformer. CoRR abs/2110.09784 (2021) - [i96]Andrew Rouditchenko, Angie W. Boggust, David Harwath, Samuel Thomas, Hilde Kuehne, Brian Chen, Rameswar Panda, Rogério Feris, Brian Kingsbury, Michael Picheny, James R. Glass:
Cascaded Multilingual Audio-Visual Learning from Videos. CoRR abs/2111.04823 (2021) - [i95]Kevin Duarte, Brian Chen, Nina Shvetsova, Andrew Rouditchenko, Samuel Thomas, Alexander H. Liu, David Harwath, James R. Glass, Hilde Kuehne, Mubarak Shah:
Routing with Self-Attention for Multimodal Capsule Networks. CoRR abs/2112.00775 (2021) - [i94]Nina Shvetsova, Brian Chen, Andrew Rouditchenko, Samuel Thomas, Brian Kingsbury, Rogério Feris, David Harwath, James R. Glass, Hilde Kuehne:
Everything at Once - Multi-modal Fusion Transformer for Video Retrieval. CoRR abs/2112.04446 (2021)
- 2020
- [j32]Yonatan Belinkov, Nadir Durrani, Fahim Dalvi, Hassan Sajjad, James R. Glass:
On the Linguistic Representational Power of Neural Machine Translation Models. Comput. Linguistics 46(1): 1-52 (2020) - [j31]David Harwath, Adrià Recasens, Dídac Surís, Galen Chuang, Antonio Torralba, James R. Glass:
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input. Int. J. Comput. Vis. 128(3): 620-641 (2020) - [c288]Tianxing He, James R. Glass:
Negative Training for Neural Dialogue Response Generation. ACL 2020: 2044-2058 - [c287]Yu-An Chung, James R. Glass:
Improved Speech Representations with Multi-Target Autoregressive Predictive Coding. ACL 2020: 2353-2358 - [c286]Ramy Baly, Georgi Karadzhov, Jisun An, Haewoon Kwak, Yoan Dinkov, Ahmed Ali, James R. Glass, Preslav Nakov:
What Was Written vs. Who Read It: News Media Profiling Using Text Analysis and Social Media Context. ACL 2020: 3364-3374 - [c285]John M. Wu, Yonatan Belinkov, Hassan Sajjad, Nadir Durrani, Fahim Dalvi, James R. Glass:
Similarity Analysis of Contextual Word Representation Models. ACL 2020: 4638-4655 - [c284]Hongyin Luo, Shang-Wen Li, James R. Glass:
Knowledge Grounded Conversational Symptom Detection with Graph Memory Networks. ClinicalNLP@EMNLP 2020: 136-145 - [c283]Ramy Baly, Giovanni Da San Martino, James R. Glass, Preslav Nakov:
We Can Detect Your Bias: Predicting the Political Ideology of News Articles. EMNLP (1) 2020: 4982-4991 - [c282]Yu-An Chung, James R. Glass:
Generative Pre-Training for Speech with Autoregressive Predictive Coding. ICASSP 2020: 3497-3501 - [c281]Yasunori Ohishi, Akisato Kimura, Takahito Kawanishi, Kunio Kashino, David Harwath, James R. Glass:
Trilingual Semantic Embeddings of Visually Grounded Speech with Self-Attention Mechanisms. ICASSP 2020: 4352-4356 - [c280]François Grondin, Hao Tang, James R. Glass:
Audio-Visual Calibration with Polynomial Regression for 2-D Projection Using SVD-PHAT. ICASSP 2020: 4856-4860 - [c279]Jennifer Drexler, James R. Glass:
Learning a Subword Inventory Jointly with End-to-End Automatic Speech Recognition. ICASSP 2020: 6439-6443 - [c278]Suwon Shon, Ahmed Ali, Younes Samih, Hamdy Mubarak, James R. Glass:
ADI17: A Fine-Grained Arabic Dialect Identification Dataset. ICASSP 2020: 8244-8248 - [c277]David Harwath, Wei-Ning Hsu, James R. Glass:
Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech. ICLR 2020 - [c276]Moin Nadeem, Tianxing He, Kyunghyun Cho, James R. Glass:
A Systematic Characterization of Sampling Algorithms for Open-ended Language Generation. AACL/IJCNLP 2020: 334-346 - [c275]Michael Gump, Wei-Ning Hsu, James R. Glass:
Unsupervised Methods for Evaluating Speech Representations. INTERSPEECH 2020: 170-174 - [c274]Shammur A. Chowdhury, Ahmed Ali, Suwon Shon, James R. Glass:
What Does an End-to-End Dialect Identification Model Learn About Non-Dialectal Information? INTERSPEECH 2020: 462-466 - [c273]Yasunori Ohishi, Akisato Kimura, Takahito Kawanishi, Kunio Kashino, David Harwath, James R. Glass:
Pair Expansion for Learning Multilingual Semantic Embeddings Using Disjoint Visually-Grounded Speech Audio Datasets. INTERSPEECH 2020: 1486-1490 - [c272]Suwon Shon, James R. Glass:
Multimodal Association for Speaker Verification. INTERSPEECH 2020: 2247-2251 - [c271]Yu-An Chung, Hao Tang, James R. Glass:
Vector-Quantized Autoregressive Predictive Coding. INTERSPEECH 2020: 3760-3764 - [c270]Sameer Khurana, Antoine Laurent, Wei-Ning Hsu, Jan Chorowski, Adrian Lancucki, Ricard Marxer, James R. Glass:
A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning. INTERSPEECH 2020: 3790-3794 - [c269]Hongyin Luo, Shang-Wen Li, James R. Glass:
Prototypical Q Networks for Automatic Conversational Diagnosis and Few-Shot New Disease Adaption. INTERSPEECH 2020: 3895-3899 - [i93]François Grondin, Hao Tang, James R. Glass:
Audio-Visual Calibration with Polynomial Regression for 2-D Projection Using SVD-PHAT. CoRR abs/2002.01440 (2020) - [i92]Yu-An Chung, James R. Glass:
Improved Speech Representations with Multi-Target Autoregressive Predictive Coding. CoRR abs/2004.05274 (2020) - [i91]John M. Wu, Yonatan Belinkov, Hassan Sajjad, Nadir Durrani, Fahim Dalvi, James R. Glass:
Similarity Analysis of Contextual Word Representation Models. CoRR abs/2005.01172 (2020) - [i90]Ramy Baly, Georgi Karadzhov, Jisun An, Haewoon Kwak, Yoan Dinkov, Ahmed Ali, James R. Glass, Preslav Nakov:
What Was Written vs. Who Read It: News Media Profiling Using Text Analysis and Social Media Context. CoRR abs/2005.04518 (2020) - [i89]Yu-An Chung, Hao Tang, James R. Glass:
Vector-Quantized Autoregressive Predictive Coding. CoRR abs/2005.08392 (2020) - [i88]Hongyin Luo, Shang-Wen Li, James R. Glass:
Prototypical Q Networks for Automatic Conversational Diagnosis and Few-Shot New Disease Adaption. CoRR abs/2005.11153 (2020) - [i87]Sameer Khurana, Antoine Laurent, Wei-Ning Hsu, Jan Chorowski, Adrian Lancucki, Ricard Marxer, James R. Glass:
A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning. CoRR abs/2006.02547 (2020) - [i86]Sameer Khurana, Antoine Laurent, James R. Glass:
CSTNet: Contrastive Speech Translation Network for Self-Supervised Speech Representation Learning. CoRR abs/2006.02814 (2020) - [i85]Andrew Rouditchenko, Angie W. Boggust, David Harwath, Dhiraj Joshi, Samuel Thomas, Kartik Audhkhasi, Rogério Feris, Brian Kingsbury, Michael Picheny, Antonio Torralba, James R. Glass:
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos. CoRR abs/2006.09199 (2020) - [i84]Seunghak Yu, Tianxing He, James R. Glass:
Constructing a Knowledge Graph from Unstructured Documents without External Alignment. CoRR abs/2008.08995 (2020) - [i83]Moin Nadeem, Tianxing He, Kyunghyun Cho, James R. Glass:
A Systematic Characterization of Sampling Algorithms for Open-ended Language Generation. CoRR abs/2009.07243 (2020) - [i82]Ramy Baly, Giovanni Da San Martino, James R. Glass, Preslav Nakov:
We Can Detect Your Bias: Predicting the Political Ideology of News Articles. CoRR abs/2010.05338 (2020) - [i81]Yu-An Chung, Yonatan Belinkov, James R. Glass:
Similarity Analysis of Self-Supervised Speech Representations. CoRR abs/2010.11481 (2020) - [i80]Cheng-I Lai, Yung-Sung Chuang, Hung-yi Lee, Shang-wen Li, James R. Glass:
Semi-Supervised Spoken Language Understanding via Self-Supervised Speech and Language Model Pretraining. CoRR abs/2010.13826 (2020) - [i79]Alexander H. Liu, Yu-An Chung, James R. Glass:
Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies. CoRR abs/2011.00406 (2020) - [i78]Wei-Ning Hsu, David Harwath, Christopher Song, James R. Glass:
Text-Free Image-to-Speech Synthesis Using Learned Segmental Units. CoRR abs/2012.15454 (2020)
2010 – 2019
- 2019
- [j30]Salvatore Romeo, Giovanni Da San Martino, Yonatan Belinkov, Alberto Barrón-Cedeño, Mohamed Eldesouki, Kareem Darwish, Hamdy Mubarak, James R. Glass, Alessandro Moschitti:
Language processing and learning models for community question answering in Arabic. Inf. Process. Manag. 56(2): 274-290 (2019) - [j29]Pepa Atanasova, Preslav Nakov, Lluís Màrquez, Alberto Barrón-Cedeño, Georgi Karadzhov, Tsvetomila Mihaylova, Mitra Mohtarami, James R. Glass:
Automatic Fact-Checking Using Context and Discourse Information. ACM J. Data Inf. Qual. 11(3): 12:1-12:27 (2019) - [j28]Yonatan Belinkov, James R. Glass:
Analysis Methods in Neural Language Processing: A Survey. Trans. Assoc. Comput. Linguistics 7: 49-72 (2019) - [j27]Achintya Kumar Sarkar, Zheng-Hua Tan, Hao Tang, Suwon Shon, James R. Glass:
Time-Contrastive Learning Based Deep Bottleneck Features for Text-Dependent Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 27(8): 1267-1279 (2019) - [j26]Mandy Korpusik, James R. Glass:
Deep Learning for Database Mapping and Asking Clarification Questions in Dialogue Systems. IEEE ACM Trans. Audio Speech Lang. Process. 27(8): 1321-1334 (2019) - [c268]Fahim Dalvi, Nadir Durrani, Hassan Sajjad, Yonatan Belinkov, Anthony Bau, James R. Glass:
What Is One Grain of Sand in the Desert? Analyzing Individual Neurons in Deep NLP Models. AAAI 2019: 6309-6317 - [c267]Fahim Dalvi, Avery Nortonsmith, Anthony Bau, Yonatan Belinkov, Hassan Sajjad, Nadir Durrani, James R. Glass:
NeuroX: A Toolkit for Analyzing Individual Neurons in Neural Networks. AAAI 2019: 9851-9852 - [c266]Hongyin Luo, Lan Jiang, Yonatan Belinkov, James R. Glass:
Improving Neural Language Models by Segmenting, Attending, and Predicting the Future. ACL (1) 2019: 1483-1493 - [c265]Jennifer Drexler, James R. Glass:
Explicit Alignment of Text and Speech Encodings for Attention-Based End-to-End Speech Recognition. ASRU 2019: 913-919 - [c264]Ahmed Ali, Suwon Shon, Younes Samih, Hamdy Mubarak, Ahmed Abdelali, James R. Glass, Steve Renals, Khalid Choukri:
The MGB-5 Challenge: Recognition and Dialect Identification of Dialectal Arabic Speech. ASRU 2019: 1026-1033 - [c263]Angie W. Boggust, Kartik Audhkhasi, Dhiraj Joshi, David Harwath, Samuel Thomas, Rogério Schmidt Feris, Danny Gutfreund, Yang Zhang, Antonio Torralba, Michael Picheny, James R. Glass:
Grounding Spoken Words in Unlabeled Video. CVPR Workshops 2019: 29-32 - [c262]Didac Suris, Adrià Recasens, David Bau, David Harwath, James R. Glass, Antonio Torralba:
Learning Words by Drawing Images. CVPR 2019: 2029-2038 - [c261]François Grondin, Iwona Sobieraj, Mark D. Plumbley, James R. Glass:
Sound Event Localization and Detection Using CRNN on Pairs of Microphones. DCASE 2019: 84-88 - [c260]Yifan Zhang, Giovanni Da San Martino, Alberto Barrón-Cedeño, Salvatore Romeo, Jisun An, Haewoon Kwak, Todor Staykovski, Israa Jaradat, Georgi Karadzhov, Ramy Baly, Kareem Darwish, James R. Glass, Preslav Nakov:
Tanbih: Get To Know What You Are Reading. EMNLP/IJCNLP (3) 2019: 223-228 - [c259]Mitra Mohtarami, James R. Glass, Preslav Nakov:
Contrastive Language Adaptation for Cross-Lingual Stance Detection. EMNLP/IJCNLP (1) 2019: 4441-4451 - [c258]David Harwath, James R. Glass:
Towards Visually Grounded Sub-word Speech Unit Discovery. ICASSP 2019: 3017-3021 - [c257]Suwon Shon, Tae-Hyun Oh, James R. Glass:
Noise-tolerant Audio-visual Online Person Verification Using an Attention-based Neural Network Fusion. ICASSP 2019: 3995-3999 - [c256]François Grondin, James R. Glass:
SVD-PHAT: A Fast Sound Source Localization Method. ICASSP 2019: 4140-4144 - [c255]Wei-Ning Hsu, Yu Zhang, Ron J. Weiss, Yu-An Chung, Yuxuan Wang, Yonghui Wu, James R. Glass:
Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorization. ICASSP 2019: 5901-5905 - [c254]Suwon Shon, Ahmed Ali, James R. Glass:
Domain Attentive Fusion for End-to-end Dialect Identification with Unknown Target Domain. ICASSP 2019: 5951-5955 - [c253]Jennifer Drexler, James R. Glass:
Subword Regularization and Beam Search Decoding for End-to-end Automatic Speech Recognition. ICASSP 2019: 6266-6270 - [c252]Sameer Khurana, Shafiq Rayhan Joty, Ahmed Ali, James R. Glass:
A Factorial Deep Markov Model for Unsupervised Disentangled Representation Learning from Speech. ICASSP 2019: 6540-6544 - [c251]Yu-An Chung, Wei-Hung Weng, Schrasing Tong, James R. Glass:
Towards Unsupervised Speech-to-text Translation. ICASSP 2019: 7170-7174 - [c250]Mandy Korpusik, James R. Glass:
Dialogue State Tracking with Convolutional Semantic Taggers. ICASSP 2019: 7220-7224 - [c249]Anthony Bau, Yonatan Belinkov, Hassan Sajjad, Nadir Durrani, Fahim Dalvi, James R. Glass:
Identifying and Controlling Important Neurons in Neural Machine Translation. ICLR (Poster) 2019 - [c248]Tianxing He, James R. Glass:
Detecting Egregious Responses in Neural Sequence-to-sequence Models. ICLR (Poster) 2019 - [c247]Yonatan Belinkov, Ahmed Ali, James R. Glass:
Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition. INTERSPEECH 2019: 81-85 - [c246]Yu-An Chung, Wei-Ning Hsu, Hao Tang, James R. Glass:
An Unsupervised Autoregressive Model for Speech Representation Learning. INTERSPEECH 2019: 146-150 - [c245]Emmanuel Azuh, David Harwath, James R. Glass:
Towards Bilingual Lexicon Discovery From Visually Grounded Speech Audio. INTERSPEECH 2019: 276-280 - [c244]Suwon Shon, Najim Dehak, Douglas A. Reynolds, James R. Glass:
MCE 2018: The 1st Multi-Target Speaker Detection and Identification Challenge Evaluation. INTERSPEECH 2019: 356-360 - [c243]Hongyin Luo, Mitra Mohtarami, James R. Glass, Karthik Krishnamurthy, Brigitte Richardson:
Integrating Video Retrieval and Moment Detection in a Unified Corpus for Video Question Answering. INTERSPEECH 2019: 599-603 - [c242]Mandy Korpusik, Zoe Liu, James R. Glass:
A Comparison of Deep Learning Methods for Language Understanding. INTERSPEECH 2019: 849-853 - [c241]Logan Ford, Hao Tang, François Grondin, James R. Glass:
A Deep Residual Network for Large-Scale Acoustic Scene Analysis. INTERSPEECH 2019: 2568-2572 - [c240]François Grondin, James R. Glass:
Multiple Sound Source Localization with SVD-PHAT. INTERSPEECH 2019: 2698-2702 - [c239]Suwon Shon, Hao Tang, James R. Glass:
VoiceID Loss: Speech Enhancement for Speaker Verification. INTERSPEECH 2019: 2888-2892 - [c238]Wei-Ning Hsu, David Harwath, James R. Glass:
Transfer Learning from Audio-Visual Grounding to Speech Recognition. INTERSPEECH 2019: 3242-3246 - [c237]François Grondin, James R. Glass:
Fast and Robust 3-D Sound Source Localization with DSVD-PHAT. IROS 2019: 5352-5357 - [c236]Moin Nadeem, Wei Fang, Brian Xu, Mitra Mohtarami, James R. Glass:
FAKTA: An Automatic End-to-End Fact Checking System. NAACL-HLT (Demonstrations) 2019: 78-83 - [c235]Ramy Baly, Georgi Karadzhov, Abdelrhman Saleh, James R. Glass, Preslav Nakov:
Multi-Task Ordinal Regression for Jointly Predicting the Trustworthiness and the Leading Political Ideology of News Media. NAACL-HLT (1) 2019: 2109-2116 - [c234]Yonatan Belinkov, James R. Glass:
Analysis Methods in Neural Language Processing: A Survey. NAACL-HLT (1) 2019: 3348-3354 - [c233]Abdelrhman Saleh, Ramy Baly, Alberto Barrón-Cedeño, Giovanni Da San Martino, Mitra Mohtarami, Preslav Nakov, James R. Glass:
Team QCRI-MIT at SemEval-2019 Task 4: Propaganda Analysis Meets Hyperpartisan News Detection. SemEval@NAACL-HLT 2019: 1041-1046 - [i77]Brian Xu, Mitra Mohtarami, James R. Glass:
Adversarial Domain Adaptation for Stance Detection. CoRR abs/1902.02401 (2019) - [i76]David Harwath, James R. Glass:
Towards Visually Grounded Sub-Word Speech Unit Discovery. CoRR abs/1902.08213 (2019) - [i75]Tianxing He, James R. Glass:
Negative Training for Neural Dialogue Response Generation. CoRR abs/1903.02134 (2019) - [i74]Ramy Baly, Georgi Karadzhov, Abdelrhman Saleh, James R. Glass, Preslav Nakov:
Multi-Task Ordinal Regression for Jointly Predicting the Trustworthiness and the Leading Political Ideology of News Media. CoRR abs/1904.00542 (2019) - [i73]Yu-An Chung, Wei-Ning Hsu, Hao Tang, James R. Glass:
An Unsupervised Autoregressive Model for Speech Representation Learning. CoRR abs/1904.03240 (2019) - [i72]Abdelrhman Saleh, Ramy Baly, Alberto Barrón-Cedeño, Giovanni Da San Martino, Mitra Mohtarami, Preslav Nakov, James R. Glass:
Team QCRI-MIT at SemEval-2019 Task 4: Propaganda Analysis Meets Hyperpartisan News Detection. CoRR abs/1904.03513 (2019) - [i71]Suwon Shon, Hao Tang, James R. Glass:
VoiceID Loss: Speech Enhancement for Speaker Verification. CoRR abs/1904.03601 (2019) - [i70]Suwon Shon, Najim Dehak, Douglas A. Reynolds, James R. Glass:
MCE 2018: The 1st Multi-target Speaker Detection and Identification Challenge Evaluation. CoRR abs/1904.04240 (2019) - [i69]Achintya Kumar Sarkar, Zheng-Hua Tan, Hao Tang, Suwon Shon, James R. Glass:
Time-Contrastive Learning Based Deep Bottleneck Features for Text-Dependent Speaker Verification. CoRR abs/1905.04554 (2019) - [i68]Tianxing He, Jingzhao Zhang, Zhiming Zhou, James R. Glass:
Quantifying Exposure Bias for Neural Language Generation. CoRR abs/1905.10617 (2019) - [i67]Hongyin Luo, Lan Jiang, Yonatan Belinkov, James R. Glass:
Improving Neural Language Models by Segmenting, Attending, and Predicting the Future. CoRR abs/1906.01702 (2019) - [i66]Moin Nadeem, Wei Fang, Brian Xu, Mitra Mohtarami, James R. Glass:
FAKTA: An Automatic End-to-End Fact Checking System. CoRR abs/1906.04164 (2019) - [i65]Wei Fang, Yu-An Chung, James R. Glass:
Towards Transfer Learning for End-to-End Speech Synthesis from Deep Pre-Trained Language Models. CoRR abs/1906.07307 (2019) - [i64]Yonatan Belinkov, Ahmed Ali, James R. Glass:
Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition. CoRR abs/1907.04224 (2019) - [i63]Wei-Ning Hsu, David F. Harwath, James R. Glass:
Transfer Learning from Audio-Visual Grounding to Speech Recognition. CoRR abs/1907.04355 (2019) - [i62]François Grondin, James R. Glass:
Fast and Robust 3-D Sound Source Localization with DSVD-PHAT. CoRR abs/1907.12621 (2019) - [i61]Pepa Atanasova, Preslav Nakov, Lluís Màrquez, Alberto Barrón-Cedeño, Georgi Karadzhov, Tsvetomila Mihaylova, Mitra Mohtarami, James R. Glass:
Automatic Fact-Checking Using Context and Discourse Information. CoRR abs/1908.01328 (2019) - [i60]Sameer Khurana, Ahmed Ali, James R. Glass:
DARTS: Dialectal Arabic Transcription System. CoRR abs/1909.12163 (2019) - [i59]Yifan Zhang, Giovanni Da San Martino, Alberto Barrón-Cedeño, Salvatore Romeo, Jisun An, Haewoon Kwak, Todor Staykovski, Israa Jaradat, Georgi Karadzhov, Ramy Baly, Kareem Darwish, James R. Glass, Preslav Nakov:
Tanbih: Get To Know What You Are Reading. CoRR abs/1910.02028 (2019) - [i58]Mitra Mohtarami, James R. Glass, Preslav Nakov:
Contrastive Language Adaptation for Cross-Lingual Stance Detection. CoRR abs/1910.02076 (2019) - [i57]Tianxing He, Jun Liu, Kyunghyun Cho, Myle Ott, Bing Liu, James R. Glass, Fuchun Peng:
Mix-review: Alleviate Forgetting in the Pretrain-Finetune Framework for Neural Language Generation Models. CoRR abs/1910.07117 (2019) - [i56]François Grondin, James R. Glass, Iwona Sobieraj, Mark D. Plumbley:
Sound Event Localization and Detection Using CRNN on Pairs of Microphones. CoRR abs/1910.10049 (2019) - [i55]Yu-An Chung, James R. Glass:
Generative Pre-Training for Speech with Autoregressive Predictive Coding. CoRR abs/1910.12607 (2019) - [i54]Yonatan Belinkov, Nadir Durrani, Fahim Dalvi, Hassan Sajjad, James R. Glass:
On the Linguistic Representational Power of Neural Machine Translation Models. CoRR abs/1911.00317 (2019) - [i53]David Harwath, Wei-Ning Hsu, James R. Glass:
Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech. CoRR abs/1911.09602 (2019) - [i52]Preslav Nakov, Lluís Màrquez, Walid Magdy, Alessandro Moschitti, James R. Glass, Bilal Randeree:
SemEval-2015 Task 3: Answer Selection in Community Question Answering. CoRR abs/1911.11403 (2019) - [i51]Preslav Nakov, Lluís Màrquez, Alessandro Moschitti, Walid Magdy, Hamdy Mubarak, Abed Alhakim Freihat, James R. Glass, Bilal Randeree:
SemEval-2016 Task 3: Community Question Answering. CoRR abs/1912.01972 (2019)
- 2018
- [j25]Michael Price, James R. Glass, Anantha P. Chandrakasan:
A Low-Power Speech Recognizer and Voice Activity Detector Using Deep Neural Networks. IEEE J. Solid State Circuits 53(1): 66-75 (2018) - [c232]Tsvetomila Mihaylova, Preslav Nakov, Lluís Màrquez, Alberto Barrón-Cedeño, Mitra Mohtarami, Georgi Karadzhov, James R. Glass:
Fact Checking in Community Forums. AAAI 2018: 5309-5316 - [c231]David Harwath, Adrià Recasens, Dídac Surís, Galen Chuang, Antonio Torralba, James R. Glass:
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input. ECCV (6) 2018: 659-677 - [c230]Ramy Baly, Georgi Karadzhov, Dimitar Alexandrov, James R. Glass, Preslav Nakov:
Predicting Factuality of Reporting and Bias of News Media Sources. EMNLP 2018: 3528-3539 - [c229]Hongyin Luo, James R. Glass:
Learning Word Representations with Cross-Sentence Dependency for End-to-End Co-reference Resolution. EMNLP 2018: 4829-4833 - [c228]Skanda Koppula, James R. Glass, Anantha P. Chandrakasan:
Energy-Efficient Speaker Identification with Low-Precision Networks. ICASSP 2018: 2246-2250 - [c227]David Harwath, Galen Chuang, James R. Glass:
Vision as an Interlingua: Learning Multilingual Semantic Embeddings of Untranscribed Speech. ICASSP 2018: 4969-4973 - [c226]Maryam Najafian, Sameer Khurana, Suwon Shon, Ahmed Ali, James R. Glass:
Exploiting Convolutional Neural Networks for Phonotactic Based Dialect Identification. ICASSP 2018: 5174-5178 - [c225]Wei-Ning Hsu, James R. Glass:
Extracting Domain Invariant Features by Unsupervised Learning for Robust Automatic Speech Recognition. ICASSP 2018: 5614-5618 - [c224]Mandy Korpusik, James R. Glass:
Convolutional Neural Networks and Multitask Strategies for Semantic Mapping of Natural Language Input to a Structured Database. ICASSP 2018: 6174-6178 - [c223]Siqi Zheng, Jianzong Wang, Jing Xiao, Wei-Ning Hsu, James R. Glass:
A Noise-Robust Self-Adaptive Multitarget Speaker Detection System. ICPR 2018: 1068-1072 - [c222]Yu-An Chung, James R. Glass:
Speech2Vec: A Sequence-to-Sequence Framework for Learning Word Embeddings from Speech. INTERSPEECH 2018: 811-815 - [c221]Wei-Ning Hsu, James R. Glass:
Scalable Factorized Hierarchical Variational Autoencoder Training. INTERSPEECH 2018: 1462-1466 - [c220]Wei-Ning Hsu, Hao Tang, James R. Glass:
Unsupervised Adaptation with Interpretable Disentangled Representations for Distant Conversational Speech Recognition. INTERSPEECH 2018: 1576-1580 - [c219]Tuka Al Hanai, Mohammad M. Ghassemi, James R. Glass:
Detecting Depression with Audio/Text Sequence Modeling of Interviews. INTERSPEECH 2018: 1716-1720 - [c218]Hao Tang, Wei-Ning Hsu, François Grondin, James R. Glass:
A Study of Enhancement, Augmentation and Autoencoder Methods for Domain Adaptation in Distant Speech Recognition. INTERSPEECH 2018: 2928-2932 - [c217]Ramy Baly, Mitra Mohtarami, James R. Glass, Lluís Màrquez, Alessandro Moschitti, Preslav Nakov:
Integrating Stance Detection and Fact Checking in a Unified Corpus. NAACL-HLT (2) 2018: 21-27 - [c216]Adam Poliak, Yonatan Belinkov, James R. Glass, Benjamin Van Durme:
On the Evaluation of Semantic Phenomena in Neural Machine Translation Using Natural Language Inference. NAACL-HLT (2) 2018: 513-523 - [c215]Tuka Al Hanai, Rhoda Au, James R. Glass:
Role-specific Language Models for Processing Recorded Neuropsychological Exams. NAACL-HLT (2) 2018: 746-752 - [c214]Mitra Mohtarami, Ramy Baly, James R. Glass, Preslav Nakov, Lluís Màrquez, Alessandro Moschitti:
Automatic Stance Detection Using End-to-End Memory Networks. NAACL-HLT 2018: 767-776 - [c213]Yu-An Chung, Hung-yi Lee, James R. Glass:
Supervised and Unsupervised Transfer Learning for Question Answering. NAACL-HLT 2018: 1585-1594 - [c212]Yu-An Chung, Wei-Hung Weng, Schrasing Tong, James R. Glass:
Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces. NeurIPS 2018: 7365-7375 - [c211]Suwon Shon, Ahmed Ali, James R. Glass:
Convolutional Neural Network and Language Embeddings for End-to-End Dialect Recognition. Odyssey 2018: 98-104 - [c210]Hao Tang, James R. Glass:
On Training Recurrent Networks with Truncated Backpropagation Through Time in Speech Recognition. SLT 2018: 48-55 - [c209]Suwon Shon, Wei-Ning Hsu, James R. Glass:
Unsupervised Representation Learning of Speech for Dialect Identification. SLT 2018: 105-111 - [c208]Jennifer Drexler, James R. Glass:
Combining End-to-End and Adversarial Training for Low-Resource Speech Recognition. SLT 2018: 361-368 - [c207]Mandy Korpusik, James R. Glass:
Convolutional Neural Networks for Dialogue State Tracking without Pre-Trained Word Vectors or Semantic Dictionaries. SLT 2018: 884-891 - [c206]Suwon Shon, Hao Tang, James R. Glass:
Frame-Level Speaker Embeddings for Text-Independent Speaker Recognition and Analysis of End-to-End Model. SLT 2018: 1007-1013 - [c205]Marcos Zampieri, Shervin Malmasi, Preslav Nakov, Ahmed Ali, Suwon Shon, James R. Glass, Yves Scherrer, Tanja Samardzic, Nikola Ljubesic, Jörg Tiedemann, Chris van der Lee, Stefan Grondelaers, Nelleke Oostdijk, Dirk Speelman, Antal van den Bosch, Ritesh Kumar, Bornini Lahiri, Mayank Jain:
Language Identification and Morphosyntactic Tagging: The Second VarDial Evaluation Campaign. VarDial@COLING 2018: 1-17 - [i50]Yonatan Belinkov, Lluís Màrquez, Hassan Sajjad, Nadir Durrani, Fahim Dalvi, James R. Glass:
Evaluating Layers of Representation in Neural Machine Translation on Part-of-Speech and Semantic Tagging Tasks. CoRR abs/1801.07772 (2018) - [i49]Wei-Ning Hsu, James R. Glass:
Extracting Domain Invariant Features by Unsupervised Learning for Robust Automatic Speech Recognition. CoRR abs/1803.02551 (2018) - [i48]Tsvetomila Mihaylova, Preslav Nakov, Lluís Màrquez, Alberto Barrón-Cedeño, Mitra Mohtarami, Georgi Karadzhov, James R. Glass:
Fact Checking in Community Forums. CoRR abs/1803.03178 (2018) - [i47]Suwon Shon, Ahmed Ali, James R. Glass:
Convolutional Neural Networks and Language Embeddings for End-to-End Dialect Recognition. CoRR abs/1803.04567 (2018) - [i46]Yu-An Chung, James R. Glass:
Speech2Vec: A Sequence-to-Sequence Framework for Learning Word Embeddings from Speech. CoRR abs/1803.08976 (2018) - [i45]David Harwath, Adrià Recasens, Dídac Surís, Galen Chuang, Antonio Torralba, James R. Glass:
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input. CoRR abs/1804.01452 (2018) - [i44]David F. Harwath, Galen Chuang, James R. Glass:
Vision as an Interlingua: Learning Multilingual Semantic Embeddings of Untranscribed Speech. CoRR abs/1804.03052 (2018) - [i43]Wei-Ning Hsu, James R. Glass:
Scalable Factorized Hierarchical Variational Autoencoder Training. CoRR abs/1804.03201 (2018) - [i42]Mitra Mohtarami, Ramy Baly, James R. Glass, Preslav Nakov, Lluís Màrquez, Alessandro Moschitti:
Automatic Stance Detection Using End-to-End Memory Networks. CoRR abs/1804.07581 (2018) - [i41]Ramy Baly, Mitra Mohtarami, James R. Glass, Lluís Màrquez, Alessandro Moschitti, Preslav Nakov:
Integrating Stance Detection and Fact Checking in a Unified Corpus. CoRR abs/1804.08012 (2018) - [i40]Adam Poliak, Yonatan Belinkov, James R. Glass, Benjamin Van Durme:
On the Evaluation of Semantic Phenomena in Neural Machine Translation Using Natural Language Inference. CoRR abs/1804.09779 (2018) - [i39]Yu-An Chung, Wei-Hung Weng, Schrasing Tong, James R. Glass:
Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces. CoRR abs/1805.07467 (2018) - [i38]Wei-Ning Hsu, James R. Glass:
Disentangling by Partitioning: A Representation Learning Framework for Multimodal Sensory Data. CoRR abs/1805.11264 (2018) - [i37]Hao Tang, Wei-Ning Hsu, François Grondin, James R. Glass:
A Study of Enhancement, Augmentation, and Autoencoder Methods for Domain Adaptation in Distant Speech Recognition. CoRR abs/1806.04841 (2018) - [i36]Wei-Ning Hsu, Hao Tang, James R. Glass:
Unsupervised Adaptation with Interpretable Disentangled Representations for Distant Conversational Speech Recognition. CoRR abs/1806.04872 (2018) - [i35]Hao Tang, James R. Glass:
On Training Recurrent Networks with Truncated Backpropagation Through Time in Speech Recognition. CoRR abs/1807.03396 (2018) - [i34]Suwon Shon, Najim Dehak, Douglas A. Reynolds, James R. Glass:
MCE 2018: The 1st Multi-target Speaker Detection and Identification Challenge Evaluation (MCE) Plan, Dataset and Baseline System. CoRR abs/1807.06663 (2018) - [i33]Tianxing He, James R. Glass:
Detecting egregious responses in neural sequence-to-sequence models. CoRR abs/1809.04113 (2018) - [i32]Suwon Shon, Hao Tang, James R. Glass:
Frame-level speaker embeddings for text-independent speaker recognition and analysis of end-to-end model. CoRR abs/1809.04437 (2018) - [i31]Suwon Shon, Wei-Ning Hsu, James R. Glass:
Unsupervised Representation Learning of Speech for Dialect Identification. CoRR abs/1809.04458 (2018) - [i30]Ramy Baly, Georgi Karadzhov, Dimitar Alexandrov, James R. Glass, Preslav Nakov:
Predicting Factuality of Reporting and Bias of News Media Sources. CoRR abs/1810.01765 (2018) - [i29]Hao Tang, James R. Glass:
On The Inductive Bias of Words in Acoustics-to-Word Models. CoRR abs/1810.13407 (2018) - [i28]Anthony Bau, Yonatan Belinkov, Hassan Sajjad, Nadir Durrani, Fahim Dalvi, James R. Glass:
Identifying and Controlling Important Neurons in Neural Machine Translation. CoRR abs/1811.01157 (2018) - [i27]Yu-An Chung, Wei-Hung Weng, Schrasing Tong, James R. Glass:
Towards Unsupervised Speech-to-Text Translation. CoRR abs/1811.01307 (2018) - [i26]Suwon Shon, Tae-Hyun Oh, James R. Glass:
Noise-tolerant Audio-visual Online Person Verification using an Attention-based Neural Network Fusion. CoRR abs/1811.10813 (2018) - [i25]François Grondin, James R. Glass:
SVD-PHAT: A Fast Sound Source Localization Method. CoRR abs/1811.11785 (2018) - [i24]François Grondin, James R. Glass:
A Study of the Complexity and Accuracy of Direction of Arrival Estimation Methods Based on GCC-PHAT for a Pair of Close Microphones. CoRR abs/1811.11787 (2018) - [i23]Suwon Shon, Ahmed Ali, James R. Glass:
Domain Attentive Fusion for End-to-end Dialect Identification with Unknown Target Domain. CoRR abs/1812.01501 (2018) - [i22]Yonatan Belinkov, James R. Glass:
Analysis Methods in Neural Language Processing: A Survey. CoRR abs/1812.08951 (2018) - [i21]Fahim Dalvi, Nadir Durrani, Hassan Sajjad, Yonatan Belinkov, Anthony Bau, James R. Glass:
What Is One Grain of Sand in the Desert? Analyzing Individual Neurons in Deep NLP Models. CoRR abs/1812.09355 (2018) - [i20]Fahim Dalvi, Avery Nortonsmith, Anthony Bau, Yonatan Belinkov, Hassan Sajjad, Nadir Durrani, James R. Glass:
NeuroX: A Toolkit for Analyzing Individual Neurons in Neural Networks. CoRR abs/1812.09359 (2018)
- 2017
- [j24]Mandy Korpusik, James R. Glass:
Spoken Language Understanding for a Nutrition Dialogue System. IEEE ACM Trans. Audio Speech Lang. Process. 25(7): 1450-1461 (2017) - [c204]David Harwath, James R. Glass:
Learning Word-Like Units from Joint Audio-Visual Analysis. ACL (1) 2017: 506-517 - [c203]Yonatan Belinkov, Nadir Durrani, Fahim Dalvi, Hassan Sajjad, James R. Glass:
What do Neural Machine Translation Models Learn about Morphology? ACL (1) 2017: 861-872 - [c202]Wei-Ning Hsu, Yu Zhang, James R. Glass:
Unsupervised domain adaptation for robust speech recognition via variational autoencoder-based data augmentation. ASRU 2017: 16-23 - [c201]Maryam Najafian, Wei-Ning Hsu, Ahmed Ali, James R. Glass:
Automatic speech recognition of Arabic multi-genre broadcast media. ASRU 2017: 353-359 - [c200]Suwon Shon, Ahmed Ali, James R. Glass:
MIT-QCRI Arabic dialect identification system for the 2017 multi-genre broadcast challenge. ASRU 2017: 374-380 - [c199]Tuka Alhanai, Rhoda Au, James R. Glass:
Spoken language biomarkers for detecting cognitive impairment. ASRU 2017: 409-416 - [c198]Kenneth Leidal, David Harwath, James R. Glass:
Learning modality-invariant representations for speech and images. ASRU 2017: 424-429 - [c197]Mandy Korpusik, Zachary Collins, James R. Glass:
Semantic mapping of natural language input to database entries via convolutional neural networks. ICASSP 2017: 5685-5689 - [c196]Yonatan Belinkov, Lluís Màrquez, Hassan Sajjad, Nadir Durrani, Fahim Dalvi, James R. Glass:
Evaluating Layers of Representation in Neural Machine Translation on Part-of-Speech and Semantic Tagging Tasks. IJCNLP (1) 2017: 1-10 - [c195]Wei-Ning Hsu, Yu Zhang, James R. Glass:
Learning Latent Representations for Speech Generation and Transformation. INTERSPEECH 2017: 1273-1277 - [c194]Sameer Khurana, Maryam Najafian, Ahmed Ali, Tuka Al Hanai, Yonatan Belinkov, James R. Glass:
QMDIS: QCRI-MIT Advanced Dialect Identification System. INTERSPEECH 2017: 2591-2595 - [c193]Xue Feng, Brigitte Richardson, Scott Amman, James R. Glass:
An Environmental Feature Representation for Robust Speech Recognition and for Environment Identification. INTERSPEECH 2017: 3078-3082 - [c192]Mandy Korpusik, Zachary Collins, James R. Glass:
Character-Based Embedding Models and Reranking Strategies for Understanding Natural Language Meal Descriptions. INTERSPEECH 2017: 3320-3324 - [c191]Michael Price, James R. Glass, Anantha P. Chandrakasan:
14.4 A scalable speech recognizer with deep-neural-network acoustic models and voice-activated power gating. ISSCC 2017: 244-245 - [c190]Wei-Ning Hsu, Yu Zhang, James R. Glass:
Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data. NIPS 2017: 1878-1889 - [c189]Yonatan Belinkov, James R. Glass:
Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems. NIPS 2017: 2441-2451 - [i19]David F. Harwath, James R. Glass:
Learning Word-Like Units from Joint Audio-Visual Analysis. CoRR abs/1701.07481 (2017) - [i18]Hongyin Luo, Jie Fu, James R. Glass:
Bidirectional Backpropagation: Towards Biologically Plausible Error Signal Transmission in Neural Networks. CoRR abs/1702.07097 (2017) - [i17]Yonatan Belinkov, Nadir Durrani, Fahim Dalvi, Hassan Sajjad, James R. Glass:
What do Neural Machine Translation Models Learn about Morphology? CoRR abs/1704.03471 (2017) - [i16]Wei-Ning Hsu, Yu Zhang, James R. Glass:
Learning Latent Representations for Speech Generation and Transformation. CoRR abs/1704.04222 (2017) - [i15]Wei-Ning Hsu, Yu Zhang, James R. Glass:
Unsupervised Domain Adaptation for Robust Speech Recognition via Variational Autoencoder-Based Data Augmentation. CoRR abs/1707.06265 (2017) - [i14]Suwon Shon, Ahmed Ali, James R. Glass:
MIT-QCRI Arabic Dialect Identification System for the 2017 Multi-Genre Broadcast Challenge. CoRR abs/1709.00387 (2017) - [i13]Yonatan Belinkov, James R. Glass:
Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems. CoRR abs/1709.04482 (2017) - [i12]Wei-Ning Hsu, Yu Zhang, James R. Glass:
Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data. CoRR abs/1709.07902 (2017) - [i11]Tuka Alhanai, Rhoda Au, James R. Glass:
Spoken Language Biomarkers for Detecting Cognitive Impairment. CoRR abs/1710.07551 (2017) - [i10]Yu-An Chung, James R. Glass:
Learning Word Embeddings from Speech. CoRR abs/1711.01515 (2017) - [i9]Yu-An Chung, Hung-yi Lee, James R. Glass:
Supervised and Unsupervised Transfer Learning for Question Answering. CoRR abs/1711.05345 (2017) - [i8]Kenneth Leidal, David Harwath, James R. Glass:
Learning Modality-Invariant Representations for Speech and Images. CoRR abs/1712.03897 (2017)
- 2016
- [j23]Stephen H. Shum, David F. Harwath, Najim Dehak, James R. Glass:
On the Use of Acoustic Unit Discovery for Language Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 24(9): 1665-1676 (2016) - [c188]Salvatore Romeo, Giovanni Da San Martino, Alberto Barrón-Cedeño, Alessandro Moschitti, Yonatan Belinkov, Wei-Ning Hsu, Yu Zhang, Mitra Mohtarami, James R. Glass:
Neural Attention for Learning to Rank Questions in Community Question Answering. COLING 2016: 1734-1745 - [c187]Ekapol Chuangsuwanich, Yu Zhang, James R. Glass:
Multilingual data selection for training stacked bottleneck features. ICASSP 2016: 5410-5414 - [c186]Yu Zhang, Ekapol Chuangsuwanich, James R. Glass, Dong Yu:
Prediction-adaptation-correction recurrent neural networks for low-resource language speech recognition. ICASSP 2016: 5415-5419 - [c185]Yu Zhang, Guoguo Chen, Dong Yu, Kaisheng Yao, Sanjeev Khudanpur, James R. Glass:
Highway long short-term memory RNNs for distant speech recognition. ICASSP 2016: 5755-5759 - [c184]Mandy Korpusik, Calvin Huang, Michael Price, James R. Glass:
Distributional semantics for understanding spoken meal descriptions. ICASSP 2016: 6070-6074 - [c183]Ann Lee, Nancy F. Chen, James R. Glass:
Personalized mispronunciation detection and diagnosis based on unsupervised error pattern discovery. ICASSP 2016: 6145-6149 - [c182]Wei-Ning Hsu, Yu Zhang, Ann Lee, James R. Glass:
Exploiting Depth and Highway Connections in Convolutional Recurrent Deep Neural Networks for Speech Recognition. INTERSPEECH 2016: 395-399 - [c181]Michael Price, Anantha P. Chandrakasan, James R. Glass:
Memory-Efficient Modeling and Search Techniques for Hardware ASR Decoders. INTERSPEECH 2016: 1893-1897 - [c180]Ahmed Ali, Najim Dehak, Patrick Cardinal, Sameer Khurana, Sree Harsha Yella, James R. Glass, Peter Bell, Steve Renals:
Automatic Dialect Detection in Arabic Broadcast Speech. INTERSPEECH 2016: 2934-2938 - [c179]David F. Harwath, Antonio Torralba, James R. Glass:
Unsupervised Learning of Spoken Language with Visual Context. NIPS 2016: 1858-1866 - [c178]Henry Nassif, Mitra Mohtarami, James R. Glass:
Learning Semantic Relatedness in Community Question Answering Using Neural Models. Rep4NLP@ACL 2016: 137-147 - [c177]Preslav Nakov, Lluís Màrquez, Alessandro Moschitti, Walid Magdy, Hamdy Mubarak, Abed Alhakim Freihat, James R. Glass, Bilal Randeree:
SemEval-2016 Task 3: Community Question Answering. SemEval@NAACL-HLT 2016: 525-545 - [c176]Mitra Mohtarami, Yonatan Belinkov, Wei-Ning Hsu, Yu Zhang, Tao Lei, Kfir Bar, Scott Cyphers, James R. Glass:
SLS at SemEval-2016 Task 3: Neural-based Approaches for Ranking in Community Question Answering. SemEval@NAACL-HLT 2016: 828-835 - [c175]Ahmed Ali, Peter Bell, James R. Glass, Yacine Messaoui, Hamdy Mubarak, Steve Renals, Yifan Zhang:
The MGB-2 challenge: Arabic multi-dialect broadcast media recognition. SLT 2016: 279-284 - [c174]Tuka Al Hanai, Wei-Ning Hsu, James R. Glass:
Development of the MIT ASR system for the 2016 Arabic Multi-genre Broadcast Challenge. SLT 2016: 299-304 - [c173]Wei-Ning Hsu, Yu Zhang, James R. Glass:
A prioritized grid long short-term memory RNN for speech recognition. SLT 2016: 467-473 - [c172]Felix Sun, David F. Harwath, James R. Glass:
Look, listen, and decode: Multimodal speech recognition with images. SLT 2016: 573-578 - [c171]Yonatan Belinkov, James R. Glass:
A Character-level Convolutional Neural Network for Distinguishing Similar Languages and Dialects. VarDial@COLING 2016: 145-152 - [i7]Wei-Ning Hsu, Yu Zhang, James R. Glass:
Recurrent Neural Network Encoder with Attention for Community Question Answering. CoRR abs/1603.07044 (2016) - [i6]Ahmed Ali, Peter Bell, James R. Glass, Yacine Messaoui, Hamdy Mubarak, Steve Renals, Yifan Zhang:
The MGB-2 Challenge: Arabic Multi-Dialect Broadcast Media Recognition. CoRR abs/1609.05625 (2016) - [i5]Yonatan Belinkov, James R. Glass:
A Character-level Convolutional Neural Network for Distinguishing Similar Languages and Dialects. CoRR abs/1609.07568 (2016) - [i4]Yonatan Belinkov, James R. Glass:
Large-Scale Machine Translation between Arabic and Hebrew: Available Corpora and Initial Results. CoRR abs/1609.07701 (2016) - 2015
- [j22]Matthew R. Walter, Matthew E. Antone, Ekapol Chuangsuwanich, Andrew Correa, Randall Davis, Luke Fletcher, Emilio Frazzoli, Yuli Friedman, James R. Glass, Jonathan P. How, Jeong hwan Jeon, Sertac Karaman, Brandon Luders, Nicholas Roy, Stefanie Tellex, Seth J. Teller:
A Situationally Aware Voice-commandable Robotic Forklift Working Alongside People in Unstructured Outdoor Environments. J. Field Robotics 32(4): 590-628 (2015) - [j21]Michael Price, James R. Glass, Anantha P. Chandrakasan:
A 6 mW, 5,000-Word Real-Time Speech Recognizer Using WFST Models. IEEE J. Solid State Circuits 50(1): 102-112 (2015) - [j20]Chia-ying Lee, Timothy J. O'Donnell, James R. Glass:
Unsupervised Lexicon Discovery from Acoustic Input. Trans. Assoc. Comput. Linguistics 3: 389-403 (2015) - [j19]Lin-Shan Lee, James R. Glass, Hung-yi Lee, Chun-an Chan:
Spoken Content Retrieval - Beyond Cascading Speech Recognition with Text Retrieval. IEEE ACM Trans. Audio Speech Lang. Process. 23(9): 1389-1420 (2015) - [c170]David F. Harwath, James R. Glass:
Deep multimodal semantic embeddings for speech and images. ASRU 2015: 237-244 - [c169]Carrie J. Cai, Philip J. Guo, James R. Glass, Robert C. Miller:
Wait-Learning: Leveraging Wait Time for Second Language Education. CHI 2015: 3701-3710 - [c168]Yonatan Belinkov, James R. Glass:
Arabic Diacritization with Recurrent Neural Networks. EMNLP 2015: 2281-2285 - [c167]Xue Feng, Brigitte Richardson, Scott Amman, James R. Glass:
On using heterogeneous data for vehicle-based speech recognition: A DNN-based approach. ICASSP 2015: 4385-4389 - [c166]Ann Lee, James R. Glass:
Mispronunciation detection without nonnative training data. INTERSPEECH 2015: 643-647 - [c165]Patrick Cardinal, Najim Dehak, Yu Zhang, James R. Glass:
Speaker adaptation using the i-vector technique for bottleneck features. INTERSPEECH 2015: 2867-2871 - [c164]Abdulaziz Alghunaim, Mitra Mohtarami, Scott Cyphers, James R. Glass:
A Vector Space Approach for Aspect Based Sentiment Analysis. VS@HLT-NAACL 2015: 116-122 - [c163]Preslav Nakov, Lluís Màrquez, Walid Magdy, Alessandro Moschitti, James R. Glass, Bilal Randeree:
SemEval-2015 Task 3: Answer Selection in Community Question Answering. SemEval@NAACL-HLT 2015: 269-281 - [c162]Yonatan Belinkov, Mitra Mohtarami, Scott Cyphers, James R. Glass:
VectorSLU: A Continuous Word Vector Approach to Answer Selection in Community Question Answering Systems. SemEval@NAACL-HLT 2015: 282-287 - [i3]Yu Zhang, Guoguo Chen, Dong Yu, Kaisheng Yao, Sanjeev Khudanpur, James R. Glass:
Highway Long Short-Term Memory RNNs for Distant Speech Recognition. CoRR abs/1510.08983 (2015) - [i2]Yu Zhang, Ekapol Chuangsuwanich, James R. Glass, Dong Yu:
Prediction-Adaptation-Correction Recurrent Neural Networks for Low-Resource Language Speech Recognition. CoRR abs/1510.08985 (2015) - [i1]David F. Harwath, James R. Glass:
Deep Multimodal Semantic Embeddings for Speech and Images. CoRR abs/1511.03690 (2015) - 2014
- [j18]Mohamad Hasan Bahari, Najim Dehak, Hugo Van hamme, Lukás Burget, Ahmed Ali, Jim Glass:
Non-Negative Factor Analysis of Gaussian Mixture Model Weight Adaptation for Language and Dialect Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 22(7): 1117-1129 (2014) - [c161]Carrie J. Cai, Philip J. Guo, James R. Glass, Robert C. Miller:
Wait-learning: leveraging conversational dead time for second language education. CHI Extended Abstracts 2014: 2239-2244 - [c160]Brenden M. Lake, Chia-ying Lee, James R. Glass, Joshua B. Tenenbaum:
One-shot learning of generative speech concepts. CogSci 2014 - [c159]Iman Saleh, Scott Cyphers, James R. Glass, Shafiq R. Joty, Lluís Màrquez i Villodre, Alessandro Moschitti, Preslav Nakov:
A Study of using Syntactic and Semantic Structures for Concept Segmentation and Labeling. COLING 2014: 193-202 - [c158]Yu Zhang, Ekapol Chuangsuwanich, James R. Glass:
Extracting deep neural network bottleneck features using low-rank matrix factorization. ICASSP 2014: 185-189 - [c157]Xue Feng, Yaodong Zhang, James R. Glass:
Speech feature denoising and dereverberation via deep autoencoders for noisy reverberant speech recognition. ICASSP 2014: 1759-1763 - [c156]Anne Cutler, Yu Zhang, Ekapol Chuangsuwanich, James R. Glass:
Language ID-based training of multilingual stacked bottleneck features. INTERSPEECH 2014: 1-5 - [c155]Stephen H. Shum, Najim Dehak, James R. Glass:
Limited labels for unlimited data: active learning for speaker recognition. INTERSPEECH 2014: 383-387 - [c154]Patrick Cardinal, Ahmed Ali, Najim Dehak, Yu Zhang, Tuka Al Hanai, Yifan Zhang, James R. Glass, Stephan Vogel:
Recent advances in ASR applied to an Arabic transcription system for Al-Jazeera. INTERSPEECH 2014: 2088-2092 - [c153]Hung-yi Lee, Yu Zhang, Ekapol Chuangsuwanich, James R. Glass:
Graph-based re-ranking using acoustic feature similarity between search results for spoken term detection on low-resource languages. INTERSPEECH 2014: 2479-2483 - [c152]Tuka Al Hanai, James R. Glass:
Lexical modeling for Arabic ASR: a systematic approach. INTERSPEECH 2014: 2605-2609 - [c151]David F. Harwath, James R. Glass:
Speech recognition without a lexicon - bridging the gap between graphemic and phonetic systems. INTERSPEECH 2014: 2655-2659 - [c150]Ann Lee, James R. Glass:
Context-dependent pronunciation error pattern discovery with limited annotations. INTERSPEECH 2014: 2877-2881 - [c149]Michael Price, James R. Glass, Anantha P. Chandrakasan:
27.2 A 6mW 5K-Word real-time speech recognizer using WFST models. ISSCC 2014: 454-455 - [c148]Ahmed Ali, Yifan Zhang, Patrick Cardinal, Najim Dehak, Stephan Vogel, James R. Glass:
A complete KALDI recipe for building Arabic speech recognition systems. SLT 2014: 525-529 - [c147]Mandy Korpusik, Nicole Schmidt, Jennifer Drexler, Scott Cyphers, James R. Glass:
Data collection and language understanding of food descriptions. SLT 2014: 560-565 - 2013
- [j17]Ian McGraw, Ibrahim Badr, James R. Glass:
Learning Lexicons From Speech Using a Pronunciation Mixture Model. IEEE Trans. Speech Audio Process. 21(2): 357-366 (2013) - [j16]Stephen Shum, Najim Dehak, Réda Dehak, James R. Glass:
Unsupervised Methods for Speaker Diarization: An Integrated and Iterative Approach. IEEE Trans. Speech Audio Process. 21(10): 2015-2028 (2013) - [c146]Jingjing Liu, Panupong Pasupat, Yining Wang, Scott Cyphers, James R. Glass:
Query understanding enhanced by hierarchical parsing structures. ASRU 2013: 72-77 - [c145]Chia-ying Lee, Yu Zhang, James R. Glass:
Joint Learning of Phonetic Units and Word Pronunciations for ASR. EMNLP 2013: 182-192 - [c144]Ann Lee, Yaodong Zhang, James R. Glass:
Mispronunciation detection via dynamic time warping on deep belief network-based posteriorgrams. ICASSP 2013: 8227-8231 - [c143]Jingjing Liu, Panupong Pasupat, Scott Cyphers, James R. Glass:
Asgard: A portable architecture for multilingual dialogue systems. ICASSP 2013: 8386-8390 - [c142]David F. Harwath, Timothy J. Hazen, James R. Glass:
Zero resource spoken audio corpus analysis. ICASSP 2013: 8555-8559 - [c141]Xiao Fang, Najim Dehak, James R. Glass:
Bayesian distance metric learning on i-vector for speaker verification. INTERSPEECH 2013: 2514-2518 - [c140]William Li, Jim Glass, Nicholas Roy, Seth J. Teller:
Probabilistic Dialogue Modeling for Speech-Enabled Assistive Technology. SLPAT 2013: 67-72 - [c139]Ann Lee, James R. Glass:
Pronunciation assessment via a comparison-based system. SLaTE 2013: 122-126 - 2012
- [c138]Chia-ying Lee, James R. Glass:
A Nonparametric Bayesian Approach to Acoustic Model Discovery. ACL (1) 2012: 40-49 - [c137]Hung-An Chang, James R. Glass:
Evaluation of multi-level context-dependent acoustic model for large vocabulary speaker adaptation tasks. ICASSP 2012: 4313-4316 - [c136]Ekapol Chuangsuwanich, Shinji Watanabe, Takaaki Hori, Tomoharu Iwata, James R. Glass:
Handling uncertain observations in unsupervised topic-mixture language model adaptation. ICASSP 2012: 5033-5036 - [c135]Yaodong Zhang, Ruslan Salakhutdinov, Hung-An Chang, James R. Glass:
Resource configurable spoken query detection using Deep Boltzmann Machines. ICASSP 2012: 5161-5164 - [c134]Yaodong Zhang, Kiarash Adl, James R. Glass:
Fast spoken query detection using lower-bound Dynamic Time Warping on Graphical Processing Units. ICASSP 2012: 5173-5176 - [c133]Stephen Shum, Najim Dehak, Jim Glass:
On the Use of Spectral and Iterative Methods for Speaker Diarization. INTERSPEECH 2012: 482-485 - [c132]Ann Lee, James R. Glass:
Sentence Detection Using Multiple Annotations. INTERSPEECH 2012: 1848-1851 - [c131]Jingjing Liu, Scott Cyphers, Panupong Pasupat, Ian McGraw, James R. Glass:
A Conversational Movie Search System Based on Conditional Random Fields. INTERSPEECH 2012: 2454-2457 - [c130]Ian McGraw, Scott Cyphers, Panupong Pasupat, Jingjing Liu, James R. Glass:
Automating Crowd-supervised Learning for Spoken Language Systems. INTERSPEECH 2012: 2474-2477 - [c129]James R. Glass:
Towards unsupervised speech processing. ISSPA 2012: 1-4 - [c128]Ann Lee, James R. Glass:
A comparison-based approach to mispronunciation detection. SLT 2012: 382-387 - 2011
- [c127]Hung-An Chang, James R. Glass:
Multi-level context-dependent acoustic modeling for automatic speech recognition. ASRU 2011: 89-94 - [c126]Najim Dehak, Zahi N. Karam, Douglas A. Reynolds, Réda Dehak, William M. Campbell, James R. Glass:
A channel-blind system for speaker verification. ICASSP 2011: 4536-4539 - [c125]Yaodong Zhang, James R. Glass:
An inner-product lower-bound estimate for dynamic time warping. ICASSP 2011: 5660-5663 - [c124]Chia-ying Lee, James R. Glass, Oded Ghitza:
An Efferent-Inspired Auditory Model Front-End for Speech Recognition. INTERSPEECH 2011: 49-52 - [c123]Ibrahim Badr, Ian McGraw, James R. Glass:
Pronunciation Learning from Continuous Speech. INTERSPEECH 2011: 549-552 - [c122]Stephen Shum, Najim Dehak, Ekapol Chuangsuwanich, Douglas A. Reynolds, James R. Glass:
Exploiting Intra-Conversation Variability for Speaker Diarization. INTERSPEECH 2011: 945-948 - [c121]Yaodong Zhang, James R. Glass:
A Piecewise Aggregate Approximation Lower-Bound Estimate for Posteriorgram-Based Dynamic Time Warping. INTERSPEECH 2011: 1909-1912 - [c120]Ekapol Chuangsuwanich, James R. Glass:
Robust Voice Activity Detector for Real World Applications Using Harmonicity and Modulation Frequency. INTERSPEECH 2011: 2645-2648 - [c119]Chia-ying Lee, James R. Glass:
A Transcription Task for Crowdsourcing with Automatic Quality Control. INTERSPEECH 2011: 3041-3044 - [c118]Ian McGraw, James R. Glass, Stephanie Seneff:
Growing a Spoken Language Interface on Amazon Mechanical Turk. INTERSPEECH 2011: 3057-3060 - 2010
- [j15]Ji Ming, Timothy J. Hazen, James R. Glass:
Combining missing-feature theory, speech enhancement, and speaker-dependent/-independent modeling for speech separation. Comput. Speech Lang. 24(1): 67-76 (2010) - [j14]Zheng-Hua Tan, Reinhold Haeb-Umbach, Sadaoki Furui, James R. Glass, Maurizio Omologo:
Introduction to the Issue on Speech Processing for Natural Interaction With Intelligent Environments. IEEE J. Sel. Top. Signal Process. 4(5): 769-771 (2010) - [c117]Andrew Correa, Matthew R. Walter, Luke Fletcher, Jim Glass, Seth J. Teller, Randall Davis:
Multimodal interaction with an autonomous forklift. HRI 2010: 243-250 - [c116]Yaodong Zhang, James R. Glass:
Towards multi-speaker unsupervised speech pattern discovery. ICASSP 2010: 4366-4369 - [c115]Seth J. Teller, Matthew R. Walter, Matthew E. Antone, Andrew Correa, Randall Davis, Luke Fletcher, Emilio Frazzoli, Jim Glass, Jonathan P. How, Albert S. Huang, Jeong hwan Jeon, Sertac Karaman, Brandon Luders, Nicholas Roy, Tara N. Sainath:
A voice-commandable robotic forklift working alongside humans in minimally-prepared outdoor environments. ICRA 2010: 526-533 - [c114]Ibrahim Badr, Ian McGraw, James R. Glass:
Learning new word pronunciations from spoken examples. INTERSPEECH 2010: 2294-2297 - [c113]Ian McGraw, Chia-ying Lee, I. Lee Hetherington, Stephanie Seneff, James R. Glass:
Collecting Voices from the Cloud. LREC 2010 - [c112]Najim Dehak, Réda Dehak, James R. Glass, Douglas A. Reynolds, Patrick Kenny:
Cosine Similarity Scoring without Score Normalization Techniques. Odyssey 2010: 15 - [c111]Stephen Shum, Najim Dehak, Réda Dehak, James R. Glass:
Unsupervised Speaker Adaptation based on the Cosine Similarity for Text-Independent Speaker Verification. Odyssey 2010: 16 - [c110]Sean Liu, Stephanie Seneff, James R. Glass:
A collective data generation method for speech language models. SLT 2010: 223-228 - [c109]Ekapol Chuangsuwanich, D. Scott Cyphers, James R. Glass, Seth J. Teller:
Spoken command of large mobile robots in outdoor environments. SLT 2010: 306-311
2000 – 2009
- 2009
- [j13]Kate Saenko, Karen Livescu, James R. Glass, Trevor Darrell:
Multistream Articulatory Feature-Based Models for Visual Speech Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 31(9): 1700-1707 (2009) - [j12]Janet M. Baker, Li Deng, James R. Glass, Sanjeev Khudanpur, Chin-Hui Lee, Nelson Morgan, Douglas D. O'Shaughnessy:
Developments and directions in speech recognition and understanding, Part 1 [DSP Education]. IEEE Signal Process. Mag. 26(3): 75-80 (2009) - [j11]Janet M. Baker, Li Deng, Sanjeev Khudanpur, Chin-Hui Lee, James R. Glass, Nelson Morgan, Douglas D. O'Shaughnessy:
Updated MINDS report on speech recognition and understanding, Part 2 [DSP Education]. IEEE Signal Process. Mag. 26(4): 78-85 (2009) - [c108]Yaodong Zhang, James R. Glass:
Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams. ASRU 2009: 398-403 - [c107]Alexander Gruenstein, Jarrod Orszulak, Sean Liu, Shannon C. Roberts, Jeff Zabel, Bryan Reimer, Bruce Mehler, Stephanie Seneff, James R. Glass, Joseph F. Coughlin:
City browser: developing a conversational automotive HMI. CHI Extended Abstracts 2009: 4291-4296 - [c106]Ibrahim Badr, Rabih Zbib, James R. Glass:
Syntactic Phrase Reordering for English-to-Arabic Statistical Machine Translation. EACL 2009: 86-93 - [c105]Yaodong Zhang, James R. Glass:
Speech rhythm guided syllable nuclei detection. ICASSP 2009: 3797-3800 - [c104]Hung-An Chang, James R. Glass:
Discriminative training of hierarchical acoustic models for large vocabulary continuous speech recognition. ICASSP 2009: 4481-4484 - [c103]Karen Livescu, Bo Zhu, James R. Glass:
On the phonetic information in ultrasonic microphone signals. ICASSP 2009: 4621-4624 - [c102]Bo-June Paul Hsu, James R. Glass:
Language model parameter estimation using user transcriptions. ICASSP 2009: 4805-4808 - [c101]Hung-An Chang, James R. Glass:
A back-off discriminative acoustic model for automatic speech recognition. INTERSPEECH 2009: 232-235 - 2008
- [j10]A. S. Park, James R. Glass:
Unsupervised Pattern Discovery in Speech. IEEE Trans. Speech Audio Process. 16(1): 186-197 (2008) - [c100]Ibrahim Badr, Rabih Zbib, James R. Glass:
Segmentation for English-to-Arabic Statistical Machine Translation. ACL (2) 2008: 153-156 - [c99]Bo-June Paul Hsu, James R. Glass:
N-gram Weighting: Reducing Training Data Mismatch in Cross-Domain Language Model Estimation. EMNLP 2008: 829-838 - [c98]Ghinwa F. Choueiter, Mesrob I. Ohannessian, Stephanie Seneff, James R. Glass:
A turbo-style algorithm for lexical baseforms estimation. ICASSP 2008: 4313-4316 - [c97]Bo-June Paul Hsu, James R. Glass:
Iterative language model estimation: efficient data structure & algorithms. INTERSPEECH 2008: 841-844 - 2007
- [j9]Ghinwa F. Choueiter, James R. Glass:
An Implementation of Rational Wavelets and Filter Design for Phonetic Classification. IEEE Trans. Speech Audio Process. 15(3): 939-948 (2007) - [j8]Ji Ming, Timothy J. Hazen, James R. Glass, Douglas A. Reynolds:
Robust Speaker Recognition in Noisy Conditions. IEEE Trans. Speech Audio Process. 15(5): 1711-1723 (2007) - [c96]Igor Malioutov, Alex Park, Regina Barzilay, James R. Glass:
Making Sense of Sound: Unsupervised Topic Segmentation over Acoustic Input. ACL 2007 - [c95]Ghinwa F. Choueiter, Stephanie Seneff, James R. Glass:
Automatic lexical pronunciations generation and update. ASRU 2007: 225-230 - [c94]Hung-An Chang, James R. Glass:
Hierarchical large-margin Gaussian mixture models for phonetic classification. ASRU 2007: 272-277 - [c93]Ken Schutte, James R. Glass:
Speech recognition with localized time-frequency pattern detectors. ASRU 2007: 341-346 - [c92]Takaaki Hori, I. Lee Hetherington, Timothy J. Hazen, James R. Glass:
Open-Vocabulary Spoken Utterance Retrieval using Confusion Networks. ICASSP (4) 2007: 73-76 - [c91]Ryan Rifkin, Ken Schutte, Michelle Saad, Jake V. Bouvrie, James R. Glass:
Noise Robust Phonetic Classification with Linear Regularized Least Squares and Second-Order Features. ICASSP (4) 2007: 881-884 - [c90]Bo Zhu, Timothy J. Hazen, James R. Glass:
Multimodal speech recognition with ultrasonic sensors. INTERSPEECH 2007: 662-665 - [c89]Ghinwa F. Choueiter, Stephanie Seneff, James R. Glass:
New word acquisition using subword modeling. INTERSPEECH 2007: 1765-1768 - [c88]James R. Glass, Timothy J. Hazen, D. Scott Cyphers, Igor Malioutov, David Huynh, Regina Barzilay:
Recent progress in the MIT spoken lecture processing project. INTERSPEECH 2007: 2553-2556 - 2006
- [c87]Bo-June Paul Hsu, James R. Glass:
Style & Topic Language Model Adaptation Using HMM-LDA. EMNLP 2006: 373-381 - [c86]Alex Park, James R. Glass:
Unsupervised Word Acquisition from Speech using Pattern Discovery. ICASSP (1) 2006: 409-412 - [c85]I. Lee Hetherington, Han Shu, James R. Glass:
Flexible Multi-Stream Framework for Speech Recognition using Multi-Tape Finite-State Transducers. ICASSP (1) 2006: 417-420 - [c84]Ji Ming, Timothy J. Hazen, James R. Glass:
Speaker Verification Over Handheld Devices with Realistic Noisy Speech Data. ICASSP (1) 2006: 637-640 - [c83]Ji Ming, Timothy J. Hazen, James R. Glass:
Combining missing-feature theory, speech enhancement and speaker-dependent/-independent modeling for speech separation. INTERSPEECH 2006 - [c82]Bo-June Paul Hsu, James R. Glass:
Spoken Correction for Chinese Text Entry. ISCSLP (Selected Papers) 2006: 648-659 - [c81]Ji Ming, Timothy J. Hazen, James R. Glass:
A Comparative Study of Methods for Handheld Speaker Verification in Realistic Noisy Conditions. Odyssey 2006: 1-8 - [c80]Alex Park, James R. Glass:
A Novel DTW-Based Distance Measure for speaker Segmentation. SLT 2006: 22-25 - 2005
- [c79]Kate Saenko, Karen Livescu, James R. Glass, Trevor Darrell:
Production domain modeling of pronunciation for visual speech recognition. ICASSP (5) 2005: 473-476 - [c78]Alex Park, Timothy J. Hazen, James R. Glass:
Automatic Processing of Audio Lectures for Information Retrieval: Vocabulary Selection and Language Modeling. ICASSP (1) 2005: 497-500 - [c77]Ghinwa F. Choueiter, James R. Glass:
A Wavelet and Filter Bank Framework For Phonetic Classification. ICASSP (1) 2005: 933-936 - [c76]Kate Saenko, Karen Livescu, Michael Siracusa, Kevin W. Wilson, James R. Glass, Trevor Darrell:
Visual Speech Recognition with Loosely Synchronized Feature Streams. ICCV 2005: 1424-1431 - [c75]Ken Schutte, James R. Glass:
Robust detection of sonorant landmarks. INTERSPEECH 2005: 1005-1008 - [c74]Tony Ezzat, Ethan Meyers, James R. Glass, Tomaso A. Poggio:
Morphing spectral envelopes using audio flow. INTERSPEECH 2005: 2545-2548 - [c73]James R. Glass, Timothy J. Hazen, D. Scott Cyphers, Ken Schutte, Alex Park:
The MIT Spoken Lecture Processing Project. HLT/EMNLP 2005: 28-29 - 2004
- [c72]James R. Glass, Eugene Weinstein, D. Scott Cyphers, Joseph Polifroni, Grace Chung, Mikio Nakano:
A Framework for Developing Conversational User Interfaces. CADUI 2004: 347-358 - [c71]Kate Saenko, Trevor Darrell, James R. Glass:
Articulatory features for robust visual speech recognition. ICMI 2004: 152-158 - [c70]Timothy J. Hazen, Kate Saenko, Chia-Hao La, James R. Glass:
A segment-based audio-visual speech recognizer: data collection, development, and initial experiments. ICMI 2004: 235-242 - [c69]Karen Livescu, James R. Glass:
Feature-based pronunciation modeling with trainable asynchrony probabilities. INTERSPEECH 2004: 677-680 - [c68]Karen Livescu, James R. Glass:
Feature-based Pronunciation Modeling for Speech Recognition. HLT-NAACL (Short Papers) 2004 - 2003
- [j7]James R. Glass:
A probabilistic framework for segment-based speech recognition. Comput. Speech Lang. 17(2-3): 137-152 (2003) - [c67]Karen Livescu, James R. Glass, Jeff A. Bilmes:
Hidden feature models for speech recognition using dynamic Bayesian networks. INTERSPEECH 2003: 2529-2532 - 2002
- [c66]Issam Bazzi, James R. Glass:
A multi-class approach for modelling out-of-vocabulary words. INTERSPEECH 2002: 1613-1616 - [c65]Jon R. W. Yi, James R. Glass:
Information-theoretic criteria for unit selection synthesis. INTERSPEECH 2002: 2617-2620 - 2001
- [c64]Issam Bazzi, James R. Glass:
Learning units for domain-independent out-of-vocabulary word modelling. INTERSPEECH 2001: 61-64 - [c63]Mikio Nakano, Yasuhiro Minami, Stephanie Seneff, Timothy J. Hazen, D. Scott Cyphers, James R. Glass, Joseph Polifroni, Victor Zue:
Mokusei: a telephone-based Japanese conversational system in the weather domain. INTERSPEECH 2001: 1331-1334 - [c62]James R. Glass, Eugene Weinstein:
Speechbuilder: facilitating spoken dialogue system development. INTERSPEECH 2001: 1335-1338 - [c61]Karen Livescu, James R. Glass:
Segment-based recognition on the phonebook task: initial results and observations on duration modeling. INTERSPEECH 2001: 1437-1440 - 2000
- [j6]Victor W. Zue, James R. Glass:
Conversational interfaces: advances and challenges. Proc. IEEE 88(8): 1166-1180 (2000) - [j5]James R. Glass, Ronald Rosenfeld:
Guest editorial introduction to the special issue on language modeling and dialogue systems. IEEE Trans. Speech Audio Process. 8(1): 1-2 (2000) - [j4]Victor Zue, Stephanie Seneff, James R. Glass, Joseph Polifroni, Christine Pao, Timothy J. Hazen, I. Lee Hetherington:
JUPITER: a telephone-based conversational interface for weather information. IEEE Trans. Speech Audio Process. 8(1): 85-96 (2000) - [c60]Issam Bazzi, James R. Glass:
Heterogeneous lexical units for automatic speech recognition: preliminary investigations. ICASSP 2000: 1257-1260 - [c59]Karen Livescu, James R. Glass:
Lexical modeling of non-native speech for automatic speech recognition. ICASSP 2000: 1683-1686 - [c58]James R. Glass, Joseph Polifroni, Stephanie Seneff, Victor Zue:
Data collection and performance evaluation of spoken dialogue systems: the MIT experience. INTERSPEECH 2000: 1-4 - [c57]Jon R. W. Yi, James R. Glass, I. Lee Hetherington:
A flexible, scalable finite-state transducer architecture for corpus-based concatenative speech synthesis. INTERSPEECH 2000: 322-325 - [c56]Issam Bazzi, James R. Glass:
Modeling out-of-vocabulary words for robust speech recognition. INTERSPEECH 2000: 401-404
1990 – 1999
- 1999
- [c55]James R. Glass, Timothy J. Hazen, I. Lee Hetherington:
Real-time telephone-based speech recognition in the Jupiter domain. ICASSP 1999: 61-64 - 1998
- [c54]James R. Glass, Timothy J. Hazen:
Telephone-based conversational speech recognition in the JUPITER domain. ICSLP 1998 - [c53]Andrew K. Halberstadt, James R. Glass:
Heterogeneous measurements and multiple classifiers for speech recognition. ICSLP 1998 - [c52]Steven C. Lee, James R. Glass:
Real-time probabilistic segmentation for segment-based speech recognition. ICSLP 1998 - [c51]Christine Pao, Philipp Schmid, James R. Glass:
Confidence scoring for speech understanding systems. ICSLP 1998 - [c50]Jon R. W. Yi, James R. Glass:
Natural-sounding speech synthesis using variable-length units. ICSLP 1998 - [c49]Joseph Polifroni, Stephanie Seneff, James R. Glass, Timothy J. Hazen:
Evaluation methodology for a telephone-based conversational system. LREC 1998: 43-50 - 1997
- [c48]Chao Wang, James R. Glass, Helen M. Meng, Joseph Polifroni, Stephanie Seneff, Victor W. Zue:
YINHE: a Mandarin Chinese version of the GALAXY system. EUROSPEECH 1997: 351-354 - [c47]Andrew K. Halberstadt, James R. Glass:
Heterogeneous acoustic measurements for phonetic classification. EUROSPEECH 1997: 401-404 - [c46]Michael K. McCandless, James R. Glass:
MUSE: a scripting language for the development of interactive speech analysis and recognition tools. EUROSPEECH 1997: 629-632 - [c45]Jane W. Chang, James R. Glass:
Segmentation and modeling in segment-based recognition. EUROSPEECH 1997: 1199-1202 - [c44]Timothy J. Hazen, James R. Glass:
A comparison of novel techniques for instantaneous speaker adaptation. EUROSPEECH 1997: 2047-2050 - [c43]Victor W. Zue, Stephanie Seneff, James R. Glass, I. Lee Hetherington, Edward Hurley, Helen M. Meng, Christine Pao, Joseph Polifroni, Rafael Schloming, Philipp Schmid:
From interface to content: translingual access and delivery of on-line information. EUROSPEECH 1997: 2227-2230 - 1996
- [c42]Helen M. Meng, Senis Busayapongchai, James R. Glass, David Goddeau, I. Lee Hetherington, Edward Hurley, Christine Pao, Joseph Polifroni, Stephanie Seneff, Victor Zue:
WHEELS: a conversational system in the automobile classifieds domain. ICSLP 1996: 542-545 - [c41]Edward Hurley, Joseph Polifroni, James R. Glass:
Telephone data collection using the world wide web. ICSLP 1996: 1898-1901 - [c40]Victor Zue, Stephanie Seneff, Joseph Polifroni, Helen M. Meng, James R. Glass:
Multilingual human-computer interactions: from information access to language learning. ICSLP 1996: 2207-2210 - [c39]James R. Glass, Jane W. Chang, Michael K. McCandless:
A probabilistic framework for feature-based speech recognition. ICSLP 1996: 2277-2280 - 1995
- [j3]James R. Glass, Giovanni Flammia, David Goodine, Michael S. Phillips, Joseph Polifroni, Shinsuke Sakai, Stephanie Seneff, Victor Zue:
Multilingual spoken-language understanding in the MIT Voyager system. Speech Commun. 17(1-2): 1-18 (1995) - 1994
- [j2]Victor Zue, Stephanie Seneff, Joseph Polifroni, Michael S. Phillips, Christine Pao, David Goodine, David Goddeau, James R. Glass:
PEGASUS: A spoken dialogue interface for on-line air travel planning. Speech Communication 15(3-4): 331-340 (1994) - [c38]David Goddeau, Eric Brill, James R. Glass, Christine Pao, Michael S. Phillips, Joseph Polifroni, Stephanie Seneff, Victor W. Zue:
GALAXY: a human-language interface to on-line travel information. ICSLP 1994: 707-710 - [c37]Michael K. McCandless, James R. Glass:
Empirical acquisition of language models for speech recognition. ICSLP 1994: 835-838 - [c36]Giovanni Flammia, James R. Glass, Michael S. Phillips, Joseph Polifroni, Stephanie Seneff, Victor W. Zue:
Porting the bilingual voyager system to Italian. ICSLP 1994: 911-914 - [c35]James R. Glass, Joseph Polifroni, Stephanie Seneff:
Multilingual language generation across multiple domains. ICSLP 1994: 983-986 - [c34]William Goldenthal, James R. Glass:
Statistical trajectory models for phonetic recognition. ICSLP 1994: 1871-1874 - [c33]Victor Zue, Stephanie Seneff, Joseph Polifroni, Michael S. Phillips, Christine Pao, David Goddeau, James R. Glass, Eric Brill:
PEGASUS: A Spoken Language Interface for On-Line Air Travel Planning. HLT 1994 - 1993
- [c32]Hong C. Leung, Benjamin Chigier, James R. Glass:
A comparative study of signal representations and classification techniques for speech recognition. ICASSP (2) 1993: 680-683 - [c31]William Goldenthal, James R. Glass:
Modelling spectral dynamics for vowel classification. EUROSPEECH 1993: 289-292 - [c30]Michael K. McCandless, James R. Glass:
Empirical acquisition of word and phrase classes in the atis domain. EUROSPEECH 1993: 981-984 - [c29]I. Lee Hetherington, Michael S. Phillips, James R. Glass, Victor W. Zue:
A* word network search for continuous speech recognition. EUROSPEECH 1993: 1533-1536 - [c28]James R. Glass, David Goodine, Michael S. Phillips, Shinsuke Sakai, Stephanie Seneff, Victor W. Zue:
A bilingual Voyager system. EUROSPEECH 1993: 2063-2066 - 1992
- [c27]Rolf Carlson, James R. Glass:
Vowel classification based on analysis-by-synthesis. ICSLP 1992: 575-578 - [c26]Michael S. Phillips, James R. Glass, Joseph Polifroni, Victor Zue:
Collection and analyses of WSJ-CSR corpus at MIT. ICSLP 1992: 907-910 - [c25]Michael S. Phillips, James R. Glass, Joseph Polifroni, Victor Zue:
Collection and Analyses of WSJ-CSR Data at MIT. HLT 1992 - [c24]Victor Zue, James R. Glass, David Goddeau, David Goodine, Lynette Hirschman, Michael S. Phillips, Joseph Polifroni, Stephanie Seneff:
The MIT ATIS System: February 1992 Progress Report. HLT 1992 - 1991
- [c23]Victor Zue, James R. Glass, David Goodine, Hong C. Leung, Michael S. Phillips, Joseph Polifroni, Stephanie Seneff:
Integration of speech recognition and natural language processing in the MIT VOYAGER system. ICASSP 1991: 713-716 - [c22]Victor W. Zue, James R. Glass, David Goodine, Lynette Hirschman, Hong C. Leung, Michael S. Phillips, Joseph Polifroni, Stephanie Seneff:
The MIT ATIS system; preliminary development, spontaneous speech data collection, and performance evaluation. EUROSPEECH 1991: 537-540 - [c21]Michael S. Phillips, James R. Glass, Victor W. Zue:
Automatic learning of lexical representations for sub-word unit based speech recognition systems. EUROSPEECH 1991: 577-580 - [c20]Michael S. Phillips, James R. Glass, Victor Zue:
Modelling Context Dependency in Acoustic-Phonetic and Lexical Representations. HLT 1991 - [c19]Stephanie Seneff, James R. Glass, David Goddeau, David Goodine, Lynette Hirschman, Hong C. Leung, Michael S. Phillips, Joseph Polifroni, Victor Zue:
Development and Preliminary Evaluation of the MIT ATIS System. HLT 1991 - [c18]Victor Zue, James R. Glass, Dave Goddeau, David Goodine, Lynette Hirschman, Hong C. Leung, Michael S. Phillips, Joseph Polifroni, Stephanie Seneff:
Spoken language systems for human/machine interfaces. RIAO 1991: 936-955 - 1990
- [j1]Victor Zue, Stephanie Seneff, James R. Glass:
Speech database development at MIT: TIMIT and beyond. Speech Commun. 9(4): 351-356 (1990) - [c17]Victor Zue, James R. Glass, David Goodine, Michael Philips, Stephanie Seneff:
The SUMMIT speech recognition system: phonological modelling and lexical access. ICASSP 1990: 49-52 - [c16]Victor Zue, James R. Glass, David Goodine, Hong C. Leung, Michael S. Phillips, Joseph Polifroni, Stephanie Seneff:
The VOYAGER speech understanding system: preliminary development and evaluation. ICASSP 1990: 73-76 - [c15]Hong C. Leung, James R. Glass, Michael S. Phillips, Victor W. Zue:
Detection and classification of phonemes using context-independent error back-propagation. ICSLP 1990: 1061-1064 - [c14]Victor W. Zue, James R. Glass, Dave Goddeau, David Goodine, Hong C. Leung, Michael K. McCandless, Michael S. Phillips, Joseph Polifroni, Stephanie Seneff, Dave Whitney:
Recent progress on the MIT VOYAGER spoken language system. ICSLP 1990: 1317-1320 - [c13]Victor Zue, James R. Glass, David Goodine, Hong C. Leung, Michael K. McCandless, Michael S. Phillips, Joseph Polifroni, Stephanie Seneff:
Recent Progress on the VOYAGER System. HLT 1990 - [c12]Victor Zue, James R. Glass, David Goodine, Hong C. Leung, Michael S. Phillips, Joseph Polifroni, Stephanie Seneff:
Preliminary ATIS Development at MIT. HLT 1990 - [c11]Victor Zue, James R. Glass, David Goodine, Hong C. Leung, Michael S. Phillips, Joseph Polifroni, Stephanie Seneff:
Recent Progress on the SUMMIT System. HLT 1990 - [c10]Hong C. Leung, James R. Glass, Michael S. Phillips, Victor Zue:
Phonetic Classification and Recognition Using the Multi-Layer Perceptron. NIPS 1990: 248-254 - [c9]Victor Zue, James R. Glass, David Goodine, Lynette Hirschman, Hong C. Leung, Michael S. Phillips, Joseph Polifroni, Stephanie Seneff:
From Speech Recognition to Spoken Language Understanding. NIPS 1990: 255-261
1980 – 1989
- 1989
- [c8]Victor Zue, James R. Glass, Michael Philips, Stephanie Seneff:
Acoustic segmentation and phonetic classification in the SUMMIT system. ICASSP 1989: 389-392
- [c7]Victor Zue, Nancy A. Daly-Kelly, James R. Glass, David Goodine, Hong C. Leung, Michael S. Phillips, Joseph Polifroni, Stephanie Seneff, Michal Soclof:
The Collection and Preliminary Analysis of a Spontaneous Speech Database. HLT (2) 1989
- [c6]Victor Zue, James R. Glass, David Goodine, Hong C. Leung, Michael S. Phillips, Joseph Polifroni, Stephanie Seneff:
The Voyager Speech Understanding System: A Progress Report. HLT (2) 1989
- [c5]Victor Zue, James R. Glass, David Goodine, Hong C. Leung, Michael S. Phillips, Joseph Polifroni, Stephanie Seneff:
Preliminary Evaluation of the Voyager Spoken Language System. HLT (2) 1989
- [c4]Victor Zue, James R. Glass, Michael S. Phillips, Stephanie Seneff:
The MIT Summit Speech Recognition System: a Progress Report. HLT (1) 1989
- 1988
- [b1]James R. Glass:
Finding acoustic regularities in speech: applications to phonetic recognition. Massachusetts Institute of Technology, Cambridge, MA, USA, 1988
- [c3]James R. Glass, Victor W. Zue:
Multi-level acoustic segmentation of continuous speech. ICASSP 1988: 429-432
- 1986
- [c2]James R. Glass, Victor W. Zue:
Detection and recognition of nasal consonants in American English. ICASSP 1986: 2767-2770
- 1985
- [c1]James R. Glass, Victor W. Zue:
Detection of nasalized vowels in American English. ICASSP 1985: 1569-1572
last updated on 2024-11-19 20:48 CET by the dblp team
all metadata released as open data under CC0 1.0 license