default search action
Jan Skoglund
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c47]Alessandro Ragano, Jan Skoglund, Andrew Hines:
NOMAD: Unsupervised Learning of Perceptual Embeddings For Speech Enhancement and Non-Matching Reference Audio Quality Assessment. ICASSP 2024: 1011-1015 - [i18]Minje Kim, Jan Skoglund:
Neural Speech and Audio Coding. CoRR abs/2408.06954 (2024) - 2023
- [j9]Dong Yu, Yifan Gong, Michael A. Picheny, Bhuvana Ramabhadran, Dilek Hakkani-Tür, Rohit Prasad, Heiga Zen, Jan Skoglund, Jan Honza Cernocký, Lukás Burget, Abdelrahman Mohamed:
Twenty-Five Years of Evolution in Speech and Language Processing. IEEE Signal Process. Mag. 40(5): 27-39 (2023) - [c46]Teerapat Jenrungrot, Michael Chinen, W. Bastiaan Kleijn, Jan Skoglund, Zalán Borsos, Neil Zeghidour, Marco Tagliasacchi:
LMCodec: A Low Bitrate Speech Codec with Causal Transformer Models. ICASSP 2023: 1-5 - [c45]W. Bastiaan Kleijn, Michael Chinen, Felicia S. C. Lim, Jan Skoglund:
Multi-Channel Audio Signal Generation. ICASSP 2023: 1-5 - [c44]Hong-Goo Kang, Jan Skoglund, W. Bastiaan Kleijn, Andrew Storus, Hengchin Yeh:
A High-Rate Extension to Soundstream. WASPAA 2023: 1-5 - [i17]Teerapat Jenrungrot, Michael Chinen, W. Bastiaan Kleijn, Jan Skoglund, Zalán Borsos, Neil Zeghidour, Marco Tagliasacchi:
LMCodec: A Low Bitrate Speech Codec With Causal Transformer Models. CoRR abs/2303.12984 (2023) - [i16]Alessandro Ragano, Jan Skoglund, Andrew Hines:
NOMAD: Unsupervised Learning of Perceptual Embeddings for Speech Enhancement and Non-matching Reference Audio Quality Assessment. CoRR abs/2309.16284 (2023) - 2022
- [j8]Wissam A. Jassim, Jan Skoglund, Michael Chinen, Andrew Hines:
Speech quality assessment with WARP-Q: From similarity to subsequence dynamic time warp cost. IET Signal Process. 16(9): 1050-1070 (2022) - [j7]Neil Zeghidour, Alejandro Luebs, Ahmed Omran, Jan Skoglund, Marco Tagliasacchi:
SoundStream: An End-to-End Neural Audio Codec. IEEE ACM Trans. Audio Speech Lang. Process. 30: 495-507 (2022) - [c43]Ali Siahkoohi, Michael Chinen, Tom Denton, W. Bastiaan Kleijn, Jan Skoglund:
Ultra-Low-Bitrate Speech Coding with Pretrained Transformers. INTERSPEECH 2022: 4421-4425 - [c42]Michael Chinen, Jan Skoglund, Chandan K. A. Reddy, Alessandro Ragano, Andrew Hines:
Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset. INTERSPEECH 2022: 4531-4535 - [i15]Ali Siahkoohi, Michael Chinen, Tom Denton, W. Bastiaan Kleijn, Jan Skoglund:
Ultra-Low-Bitrate Speech Coding with Pretrained Transformers. CoRR abs/2207.02262 (2022) - [i14]Michael Chinen, Jan Skoglund, Chandan K. A. Reddy, Alessandro Ragano, Andrew Hines:
Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset. CoRR abs/2209.06358 (2022) - 2021
- [c41]Wissam A. Jassim, Jan Skoglund, Michael Chinen, Andrew Hines:
Warp-Q: Quality Prediction for Generative Neural Speech Codecs. ICASSP 2021: 401-405 - [c40]W. Bastiaan Kleijn, Andrew Storus, Michael Chinen, Tom Denton, Felicia S. C. Lim, Alejandro Luebs, Jan Skoglund, Hengchin Yeh:
Generative Speech Coding with Predictive Variance Regularization. ICASSP 2021: 6478-6482 - [i13]W. Bastiaan Kleijn, Andrew Storus, Michael Chinen, Tom Denton, Felicia S. C. Lim, Alejandro Luebs, Jan Skoglund, Hengchin Yeh:
Generative Speech Coding with Predictive Variance Regularization. CoRR abs/2102.09660 (2021) - [i12]Tom Denton, Alejandro Luebs, Felicia S. C. Lim, Andrew Storus, Hengchin Yeh, W. Bastiaan Kleijn, Jan Skoglund:
Handling Background Noise in Neural Speech Generation. CoRR abs/2102.11906 (2021) - [i11]Neil Zeghidour, Alejandro Luebs, Ahmed Omran, Jan Skoglund, Marco Tagliasacchi:
SoundStream: An End-to-End Neural Audio Codec. CoRR abs/2107.03312 (2021) - 2020
- [c39]Tom Denton, Alejandro Luebs, Michael Chinen, Felicia S. C. Lim, Andrew Storus, Hengchin Yeh, W. Bastiaan Kleijn, Jan Skoglund:
Handling Background Noise in Neural Speech Generation. ACSSC 2020: 667-671 - [c38]Felicia S. C. Lim, W. Bastiaan Kleijn, Michael Chinen, Jan Skoglund:
Robust Low Rate Speech Coding Based on Cloned Networks and Wavenet. ICASSP 2020: 6769-6773 - [c37]Jan Skoglund, Jean-Marc Valin:
Improving Opus Low Bit Rate Quality with Neural Speech Synthesis. INTERSPEECH 2020: 2847-2851 - [c36]Michael Chinen, Felicia S. C. Lim, Jan Skoglund, Nikita Gureev, Feargus O'Gorman, Andrew Hines:
ViSQOL v3: An Open Source Production Ready Objective Speech and Audio Metric. QoMEX 2020: 1-6 - [c35]Wissam A. Jassim, Jan Skoglund, Michael Chinen, Andrew Hines:
Speech Quality Factors for Traditional and Neural-Based Low Bit Rate Vocoders. QoMEX 2020: 1-6 - [i10]Wissam A. Jassim, Jan Skoglund, Michael Chinen, Andrew Hines:
Speech Quality Factors for Traditional and Neural-Based Low Bit Rate Vocoders. CoRR abs/2003.11882 (2020) - [i9]Michael Chinen, Felicia S. C. Lim, Jan Skoglund, Nikita Gureev, Feargus O'Gorman, Andrew Hines:
ViSQOL v3: An Open Source Production Ready Objective Speech and Audio Metric. CoRR abs/2004.09584 (2020)
2010 – 2019
- 2019
- [c34]Jean-Marc Valin, Jan Skoglund:
LPCNET: Improving Neural Speech Synthesis through Linear Prediction. ICASSP 2019: 5891-5895 - [c33]W. Bastiaan Kleijn, Felicia S. C. Lim, Michael Chinen, Jan Skoglund:
Salient Speech Representations Based on Cloned Networks. INTERSPEECH 2019: 919-923 - [c32]Jean-Marc Valin, Jan Skoglund:
A Real-Time Wideband Neural Vocoder at 1.6kb/s Using LPCNet. INTERSPEECH 2019: 3406-3410 - [c31]Michael Chinen, W. Bastiaan Kleijn, Felicia S. C. Lim, Jan Skoglund:
Generative Speech Enhancement Based on Cloned Networks. WASPAA 2019: 214-218 - [i8]Jean-Marc Valin, Jan Skoglund:
A Real-Time Wideband Neural Vocoder at 1.6 kb/s Using LPCNet. CoRR abs/1903.12087 (2019) - [i7]Jan Skoglund, Jean-Marc Valin:
Improving Opus Low Bit Rate Quality with Neural Speech Synthesis. CoRR abs/1905.04628 (2019) - [i6]W. Bastiaan Kleijn, Felicia S. C. Lim, Michael Chinen, Jan Skoglund:
Salient Speech Representations Based on Cloned Networks. CoRR abs/1908.07045 (2019) - [i5]Michael Chinen, W. Bastiaan Kleijn, Felicia S. C. Lim, Jan Skoglund:
Generative Speech Enhancement Based on Cloned Networks. CoRR abs/1909.04776 (2019) - 2018
- [j6]Jinkyu Lee, Jan Skoglund, Turaj Shabestary, Hong-Goo Kang:
Phase-Sensitive Joint Learning Algorithms for Deep Learning-Based Speech Enhancement. IEEE Signal Process. Lett. 25(8): 1276-1280 (2018) - [c30]W. Bastiaan Kleijn, Felicia S. C. Lim, Alejandro Luebs, Jan Skoglund, Florian Stimberg, Quan Wang, Thomas C. Walters:
Wavenet Based Low Rate Speech Coding. ICASSP 2018: 676-680 - [c29]Kevin W. Wilson, Michael Chinen, Jeremy Thorpe, Brian Patton, John R. Hershey, Rif A. Saurous, Jan Skoglund, Richard F. Lyon:
Exploring Tradeoffs in Models for Low-Latency Speech Enhancement. IWAENC 2018: 366-370 - [c28]Jan Skoglund:
Spatial Audio on the Web - Create, Compress, and Render. AVSU@MM 2018: 17 - [c27]W. Bastiaan Kleijn, Christopher Laguna, Alejandro Luebs, Andrew MacDonald, Jan Skoglund:
Beamforming with Partial Knowledge of the Acoustic Scenario. MMSP 2018: 1-6 - [c26]Miroslaw Narbutt, Andrew Allen, Jan Skoglund, Michael Chinen, Andrew Hines:
AMBIQUAL - a full reference objective quality metric for ambisonic spatial audio. QoMEX 2018: 1-6 - [i4]Jean-Marc Valin, Jan Skoglund:
LPCNet: Improving Neural Speech Synthesis Through Linear Prediction. CoRR abs/1810.11846 (2018) - [i3]Kevin W. Wilson, Michael Chinen, Jeremy Thorpe, Brian Patton, John R. Hershey, Rif A. Saurous, Jan Skoglund, Richard F. Lyon:
Exploring Tradeoffs in Models for Low-latency Speech Enhancement. CoRR abs/1811.07030 (2018) - [i2]Jan Skoglund, Michael Graczyk:
Ambisonics in an Ogg Opus Container. RFC 8486: 1-10 (2018) - 2017
- [c25]Yiteng Huang, Jan Skoglund, Alejandro Luebs:
Practically efficient nonlinear acoustic echo cancellers using cascaded block RLS and FLMS adaptive filters. ICASSP 2017: 596-600 - [c24]Miroslaw Narbutt, Seán O'Leary, Andrew Allen, Jan Skoglund, Andrew Hines:
Streaming VR for immersion: Quality aspects of compressed spatial audio. VSMM 2017: 1-6 - [c23]Christos Tzagkarakis, W. Bastiaan Kleijn, Jan Skoglund:
Joint wideband source localization and acquisition based on a grid-shift approach. WASPAA 2017: 81-85 - [c22]W. Bastiaan Kleijn, Andrew Allen, Jan Skoglund, Felicia Lim:
Incoherent idempotent ambisonics rendering. WASPAA 2017: 209-213 - [i1]W. Bastiaan Kleijn, Felicia S. C. Lim, Alejandro Luebs, Jan Skoglund, Florian Stimberg, Quan Wang, Thomas C. Walters:
Wavenet based low rate speech coding. CoRR abs/1712.01120 (2017) - 2016
- [c21]Yiteng Arden Huang, Alejandro Luebs, Jan Skoglund, W. Bastiaan Kleijn:
Globally optimized least-squares post-filtering for microphone array speech enhancement. ICASSP 2016: 380-384 - [c20]Herbert Buchner, Jan Skoglund, Simon J. Godsill:
An acoustic keystroke transient canceler for speech communication terminals using a semi-blind adaptive filter model. ICASSP 2016: 614-618 - [c19]Yiteng Arden Huang, Jan Skoglund, Alejandro Luebs:
Bi-magnitude processing framework for nonlinear acoustic echo cancellation on Android devices. IWAENC 2016: 1-5 - [c18]Hong-Goo Kang, Michael Graczyk, Jan Skoglund:
On pre-filtering strategies for the GCC-PHAT algorithm. IWAENC 2016: 1-5 - 2015
- [j5]Andrew Hines, Jan Skoglund, Anil C. Kokaram, Naomi Harte:
ViSQOL: an objective speech quality model. EURASIP J. Audio Speech Music. Process. 2015: 13 (2015) - [c17]James Eaton, Alastair H. Moore, Patrick A. Naylor, Jan Skoglund:
Direct-to-Reverberant Ratio estimation using a null-steered beamformer. ICASSP 2015: 46-50 - [c16]Simon J. Godsill, Herbert Buchner, Jan Skoglund:
Detection and suppression of keyboard transient noise in audio streams with auxiliary keybed microphone. ICASSP 2015: 379-383 - 2014
- [c15]Alastair H. Moore, Patrick A. Naylor, Jan Skoglund:
An analysis of the effect of larynx-synchronous averaging on dereverberation of voiced speech. EUSIPCO 2014: 924-928 - [c14]W. Bastiaan Kleijn, Turaj Zakizadeh Shabestary, Jan Skoglund:
Sinusoidal interpolation across missing data. IWAENC 2014: 70-74 - [c13]Andrew Hines, Eoin Gillen, Damien Kelly, Jan Skoglund, Anil C. Kokaram, Naomi Harte:
Perceived Audio Quality for Streaming Stereo Music. ACM Multimedia 2014: 1173-1176 - 2013
- [c12]Andrew Hines, Jan Skoglund, Anil C. Kokaram, Naomi Harte:
Robustness of speech quality metrics to background noise and network degradations: Comparing ViSQOL, PESQ and POLQA. ICASSP 2013: 3697-3701 - [c11]Andrew Hines, Jan Skoglund, Anil C. Kokaram, Naomi Harte:
Monitoring the effects of temporal clipping on voIP speech quality. INTERSPEECH 2013: 1188-1192 - [c10]Minyue Li, Jan Skoglund, W. Bastiaan Kleijn:
Rate-distortion optimization for multichannel audio compression. WASPAA 2013: 1-4 - 2012
- [c9]Andrew Hines, Jan Skoglund, Anil C. Kokaram, Naomi Harte:
ViSQOL: The Virtual Speech Quality Objective Listener. IWAENC 2012 - [c8]W. Bastiaan Kleijn, Jan Skoglund:
Improved Prediction of Nearly-Periodic Signals. IWAENC 2012
2000 – 2009
- 2000
- [j4]Jan Skoglund, W. Bastiaan Kleijn:
On time-frequency masking in voiced speech. IEEE Trans. Speech Audio Process. 8(4): 361-369 (2000) - [j3]Per Hedelin, Jan Skoglund:
Vector quantization based on Gaussian mixture models. IEEE Trans. Speech Audio Process. 8(4): 385-401 (2000) - [c7]Jan Skoglund, Richard V. Cox, John S. Collura:
A combined WI and MELP coder at 5.2 kbps. ICASSP 2000: 1387-1390
1990 – 1999
- 1999
- [j2]Thomas Eriksson, Jan Linden, Jan Skoglund:
Interframe LSF quantization for noisy channels. IEEE Trans. Speech Audio Process. 7(5): 495-509 (1999) - [c6]Per Hedelin, Jan Skoglund, Jonas Samuelsson:
Performance bounds for LPC spectrum quantization. ICASSP 1999: 677-680 - 1998
- [j1]Jan Skoglund:
Analysis and quantization of glottal pulse shapes. Speech Commun. 24(2): 133-152 (1998) - [c5]Mikael Skoglund, Jan Skoglund:
On nonlinear utilization of intervector dependency in vector quantization. ICASSP 1998: 361-364 - [c4]Jan Skoglund, W. Bastiaan Kleijn:
On the significance of temporal masking in speech coding. ICSLP 1998 - 1997
- [c3]Jan Skoglund, Jan Linden:
Predictive VQ for noisy channel spectrum coding: AR or MA? ICASSP 1997: 1351-1354 - 1996
- [c2]Thomas Eriksson, Jan Linden, Jan Skoglund:
Exploiting interframe correlation in spectral quantization: a study of different memory VQ schemes. ICASSP 1996: 765-768 - 1995
- [c1]Thomas Eriksson, Jan Linden, Jan Skoglund:
Vector quantization of glottal pulses. EUROSPEECH 1995: 225-228
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 22:10 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint