Erik Visser

2020 – today

2024
[c8]
https://dblp.org/rec/conf/icassp/SridharGVM24
Arvind Krishna Sridhar, Yinyi Guo, Erik Visser, Rehana Mahfuz:
Parameter Efficient Audio Captioning with Faithful Guidance Using Audio-Text Shared Latent Representation. ICASSP 2024: 1181-1185
[i13]
https://dblp.org/rec/journals/corr/abs-2409-06126
Kyungguen Byun, Jason Filos, Erik Visser, Sunkuk Moon:
VC-ENHANCE: Speech Restoration with Integrated Noise Suppression and Voice Conversion. CoRR abs/2409.06126 (2024)
[i12]
https://dblp.org/rec/journals/corr/abs-2409-06223
Arvind Krishna Sridhar, Yinyi Guo, Erik Visser:
Enhancing Temporal Understanding in Audio Question Answering for Large Audio Language Models. CoRR abs/2409.06223 (2024)
[i11]
https://dblp.org/rec/journals/corr/abs-2409-08489
Rehana Mahfuz, Yinyi Guo, Erik Visser:
Confidence Calibration for Audio Captioning Models. CoRR abs/2409.08489 (2024)
2023
[c7]
https://dblp.org/rec/conf/icassp/MahfuzGV23
Rehana Mahfuz, Yinyi Guo, Erik Visser:
Improving Audio Captioning Using Semantic Similarity Metrics. ICASSP 2023: 1-5
[c6]
https://dblp.org/rec/conf/interspeech/KerpicciNZV23
Mine Kerpicci, Van Nguyen, Shuhua Zhang, Erik Visser:
Application of Knowledge Distillation to Multi-Task Speech Representation Learning. INTERSPEECH 2023: 2813-2817
[i10]
https://dblp.org/rec/journals/corr/abs-2309-02730
Hyungseob Lim, Kyungguen Byun, Sunkuk Moon, Erik Visser:
Stylebook: Content-Dependent Speaking Style Modeling for Any-to-Any Voice Conversion using Only Speech Data. CoRR abs/2309.02730 (2023)
[i9]
https://dblp.org/rec/journals/corr/abs-2309-03326
Rehana Mahfuz, Yinyi Guo, Arvind Krishna Sridhar, Erik Visser:
Detecting False Alarms and Misses in Audio Captions. CoRR abs/2309.03326 (2023)
[i8]
https://dblp.org/rec/journals/corr/abs-2309-03340
Arvind Krishna Sridhar, Yinyi Guo, Erik Visser, Rehana Mahfuz:
Parameter Efficient Audio Captioning With Faithful Guidance Using Audio-text Shared Latent Representation. CoRR abs/2309.03340 (2023)
[i7]
https://dblp.org/rec/journals/corr/abs-2309-03364
Kyungguen Byun, Sunkuk Moon, Erik Visser:
Highly Controllable Diffusion-based Any-to-Any Voice Conversion Model with Frame-level Prosody Feature. CoRR abs/2309.03364 (2023)
2022
[c5]
https://dblp.org/rec/conf/icassp/HussainNZV22
Shehzeen Hussain, Van Nguyen, Shuhua Zhang, Erik Visser:
Multi-Task Voice Activated Framework Using Self-Supervised Learning. ICASSP 2022: 6137-6141
[i6]
https://dblp.org/rec/journals/corr/abs-2209-09316
Ravi Choudhary, Arvind Krishna Sridhar, Erik Visser:
Activity report analysis with automatic single or multispan answer extraction. CoRR abs/2209.09316 (2022)
[i5]
https://dblp.org/rec/journals/corr/abs-2210-16470
Rehana Mahfuz, Yinyi Guo, Erik Visser:
Improving Audio Captioning Using Semantic Similarity Metrics. CoRR abs/2210.16470 (2022)
[i4]
https://dblp.org/rec/journals/corr/abs-2210-16611
Mine Kerpicci, Van Nguyen, Shuhua Zhang, Erik Visser:
Application of Knowledge Distillation to Multi-task Speech Representation Learning. CoRR abs/2210.16611 (2022)
[i3]
https://dblp.org/rec/journals/corr/abs-2212-02712
Arvind Krishna Sridhar, Erik Visser:
Improved Beam Search for Hallucination Mitigation in Abstractive Summarization. CoRR abs/2212.02712 (2022)
2021
[i2]
https://dblp.org/rec/journals/corr/abs-2110-01077
Shehzeen Hussain, Van Nguyen, Shuhua Zhang, Erik Visser:
Multi-task Voice Activated Framework using Self-supervised Learning. CoRR abs/2110.01077 (2021)
2020
[c4]
https://dblp.org/rec/conf/icmcs/KohSGHV20
Eunjeong Koh, Fatemeh Saki, Yinyi Guo, Cheng-Yu Hung, Erik Visser:
Incremental Learning Algorithm For Sound Event Detection. ICME 2020: 1-6
[i1]
https://dblp.org/rec/journals/corr/abs-2003-12175
Eunjeong Koh, Fatemeh Saki, Yinyi Guo, Cheng-Yu Hung, Erik Visser:
Incremental Learning Algorithm for Sound Event Detection. CoRR abs/2003.12175 (2020)

2010 – 2019

2019
[c3]
https://dblp.org/rec/conf/dcase/SakiGHKDMKV19
Fatemeh Saki, Yinyi Guo, Cheng-Yu Hung, Lae-Hoon Kim, Manyu Deshpande, Sunkuk Moon, Eunjeong Koh, Erik Visser:
Open-set Evolving Acoustic Scene Classification System. DCASE 2019: 219-223

2000 – 2009

2007
[c2]
https://dblp.org/rec/conf/icassp/Visser07
Erik Visser:
Frequency Domain Passive Broadband Speaker Localization using a Permutation-Free Blind Source Separation Algorithm. ICASSP (2) 2007: 673-676
2006
[c1]
https://dblp.org/rec/conf/interspeech/Visser06
Erik Visser:
Geometrically constrained permutation-free source separation in an undercomplete speech unmixing scenario. INTERSPEECH 2006
