WASPAA 2023: New Paltz, NY, USA
- IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023, New Paltz, NY, USA, October 22-25, 2023. IEEE 2023, ISBN 979-8-3503-2372-6
- Ayal Schwartz, Elior Hadad, Sharon Gannot, Shlomo E. Chazan:
Array Configuration Mismatch in Deep DOA Estimation: Towards Robust Training. 1-5
- Bastiaan Tamm, Rik Vandenberghe, Hugo Van hamme:
Analysis of XLS-R for Speech Quality Assessment. 1-5
- Aryan Chaudhary, Vinayak Abrol:
Towards on-Device Keyword Spotting using Low-Footprint Quaternion Neural Models. 1-5
- Gal Itzhak, Israel Cohen:
Region-of-Interest Oriented Constant-Beamwidth Beamforming with Rectangular Arrays. 1-5
- Chang-Bin Jeon, Kyogu Lee:
Music De-Limiter Networks Via Sample-Wise Gain Inversion. 1-5
- Da-Hee Yang, Donghyun Kim, Joon-Hyuk Chang:
Masked Frequency Modeling for Improving Packet Loss Concealment in Speech Transmission Systems. 1-5
- Kenta Ogawa, Shun Sawada, Kouichi Katsurada, Hidehumi Ohmura:
Automatic Detection of Poor Tone Quality in Classical Guitar Playing Using Deep Anomaly Detection Method. 1-5
- Afagh Farhadi, Laurel H. Carney:
Predicting Thresholds in an Auditory Overshoot Paradigm Using a Computational Subcortical Model with Efferent Feedback. 1-5
- Leny Vinceslas, Matteo Scerbo, Hüseyin Hacihabiboglu, Zoran Cvetkovic, Enzo De Sena:
Low-Complexity Higher Order Scattering Delay Networks. 1-5
- Yurii Iotov, Sidsel Marie Nørholm, Valiantsin Belyi, Mads Græsbøll Christensen:
Adaptive Sparse Linear Prediction in Fixed-Filter ANC Headphone Applications for Multi-Speaker Speech Reduction. 1-5
- Shuai Tao, Yang Xiang, Himavanth Reddy, Jesper Rindom Jensen, Mads Græsbøll Christensen:
Single Channel Speech Presence Probability Estimation based on Hybrid Global-Local Information. 1-5
- Tre DiPassio, Michael C. Heilemann, Benjamin Thompson, Mark F. Bocko:
Estimating the Direction of Arrival of a Spoken Wake Word Using a Single Sensor on an Elastic Panel. 1-5
- Devansh Zurale, Shlomo Dubnov:
Learning Sub-Dimensional HRTF Representations Towards Individualization Applications - Traditional and Deep Learning Approaches. 1-5
- Richard Füg, Bernd Edler:
Temporal Noise Shaping on MDCT Subband Signals for Transform Audio Coding. 1-5
- Pablo M. Delgado, Jürgen Herre:
An Improved Metric of Informational Masking for Perceptual Audio Quality Measurement. 1-5
- Dimitrios Bralios, Efthymios Tzinis, Paris Smaragdis:
Complete and Separate: Conditional Separation with Missing Target Source Attribute Completion. 1-5
- Eric Guizzo, Tillman Weyde, Giacomo Tarroni, Danilo Comminiello:
Quaternion Anti-Transfer Learning for Speech Emotion Recognition. 1-5
- Michael Neri, Archontis Politis, Daniel Krause, Marco Carli, Tuomas Virtanen:
Single-Channel Speaker Distance Estimation in Reverberant Environments. 1-5
- Yuma Koizumi, Heiga Zen, Shigeki Karita, Yifan Ding, Kohei Yatabe, Nobuyuki Morioka, Yu Zhang, Wei Han, Ankur Bapna, Michiel Bacchiani:
Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text Representations. 1-5
- François G. Germain, Gordon Wichern, Jonathan Le Roux:
Hyperbolic Unsupervised Anomalous Sound Detection. 1-5
- Elisa Tengan, Thomas Dietzen, Filip Elvander, Toon van Waterschoot:
Multi-Source Direction-of-Arrival Estimation using Group-Sparse Fitting of Steered Response Power Maps. 1-5
- Yoshiki Masuyama, Xuankai Chang, Wangyou Zhang, Samuele Cornell, Zhong-Qiu Wang, Nobutaka Ono, Yanmin Qian, Shinji Watanabe:
Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation. 1-5
- Atsushi Miyashita, Tomoki Toda:
Differentiable Representation of Warping Based on Lie Group Theory. 1-5
- Hong-Goo Kang, Jan Skoglund, W. Bastiaan Kleijn, Andrew Storus, Hengchin Yeh:
A High-Rate Extension to Soundstream. 1-5
- Jarin Ritu, Ethan Barnes, Riley Martell, Alexandra Van Dine, Joshua Peeples:
Histogram Layer Time Delay Neural Networks for Passive Sonar Classification. 1-5
- James A. King, Arshdeep Singh, Mark D. Plumbley:
Compressing Audio CNNs with Graph Centrality Based Filter Pruning. 1-5
- Keisuke Kimura, Shoichi Koyama, Hiroshi Saruwatari:
Perceptual Quality Enhancement of Sound Field Synthesis Based on Combination of Pressure and Amplitude Matching. 1-5
- Ivan Shanin, Simon Dixon:
Annotating Jazz Recordings Using Lead Sheet Alignment with Deep Chroma Features. 1-5
- Jean-Marie Lemercier, Simon Welker, Timo Gerkmann:
Diffusion Posterior Sampling for Informed Single-Channel Dereverberation. 1-5
- Ahmed Alghamdi, Leonard Moen, Wai-Yip Chan, Daniel Fogerty, Jesper Jensen:
Correlation Based Glimpse Proportion Index. 1-5
- Yoshiki Masuyama, Natsuki Ueno, Nobutaka Ono:
Signal Reconstruction from Mel-Spectrogram Based on Bi-Level Consistency of Full-Band Magnitude and Phase. 1-5
- Enric Gusó, Joanna Luberadzka, Martí Baig, Umut Sayin Saraç, Xavier Serra:
An Objective Evaluation of Hearing Aids and DNN-Based Binaural Speech Enhancement in Complex Acoustic Scenes. 1-5
- Julia Wilkins, Justin Salamon, Magdalena Fuentes, Juan Pablo Bello, Oriol Nieto:
Bridging High-Quality Audio and Video Via Language for Sound Effects Retrieval from Visual Queries. 1-5
- Byeongho Jo, Seungkwon Beack:
Hybrid Noise Shaping for Audio Coding Using Perfectly Overlapped Window. 1-5
- Yinghao Aaron Li, Cong Han, Nima Mesgarani:
SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs. 1-5
- Saurjya Sarkar, Louise Thorpe, Emmanouil Benetos, Mark Sandler:
Leveraging Synthetic Data for Improving Chamber Ensemble Separation. 1-5
- Jin Woo Lee, Hyeong-Seok Choi, Kyogu Lee:
AECSQI: Referenceless Acoustic Echo Cancellation Measures Using Speech Quality and Intelligibility Improvement. 1-5
- Jiarui Hai, Mounya Elhilali:
Diff-Pitcher: Diffusion-Based Singing Voice Pitch Correction. 1-5
- Ricardo Falcón Pérez, Gordon Wichern, François G. Germain, Jonathan Le Roux:
Location as Supervision for Weakly Supervised Multi-Channel Source Separation of Machine Sounds. 1-5
- Ilyass Moummad, Nicolas Farrugia:
Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning. 1-5
- Vincent Lostanlen, Daniel Haider, Han Han, Mathieu Lagrange, Péter Balázs, Martin Ehler:
Fitting Auditory Filterbanks with Multiresolution Neural Networks. 1-5
- Menglu Li, Xiao-Ping Zhang:
Robust Audio Anti-Spoofing System Based on Low-Frequency Sub-Band Information. 1-5
- Rajesh R, Padmanabhan Rajan:
Neural Networks for Interference Reduction in Multi-Track Recordings. 1-5
- Ante Jukic, Jagadeesh Balam, Boris Ginsburg:
Flexible Multichannel Speech Enhancement for Noise-Robust Frontend. 1-5
- Alice Sokolova, Baris Aksanli, Fred Harris, Harinath Garudadri:
Consolidating Compression and Revisiting Expansion: an Alternative Amplification Rule for Wide Dynamic Range Compression. 1-5
- Bowen Zhi, Alisha Sharma, Dmitry N. Zotkin, Ramani Duraiswami:
A Differentiable Image Source Model for Room Acoustics Optimization. 1-5
- Archontis Politis, Lauros Pajunen, Jussi Leppänen, Sujeet Mate, Antti J. Eronen:
Wide-Area 6DOF Rendering of Multi-Point Ambisonic Recordings Based on Interpolation of Spatial Parameters. 1-5
- Mohamed Elminshawi, Srikanth Raj Chetupalli, Emanuël A. P. Habets:
Slim-Tasnet: A Slimmable Neural Network for Speech Separation. 1-5
- Martin Strauss, Nicola Pia, Nagashree K. S. Rao, Bernd Edler:
SEFGAN: Harvesting the Power of Normalizing Flows and GANs for Efficient High-Quality Speech Enhancement. 1-5
- Maximilian Schäfer, Karolina Prawda, Rudolf Rabenstein, Sebastian J. Schlecht:
Distribution of Modal Damping in Absorptive Shoebox Rooms. 1-5
- Rui Wang, Tomoki Toda:
Directional Target Speaker Extraction under Noisy Underdetermined Conditions through Conditional Variational Autoencoder with Global Style Tokens. 1-5
- Pil Moo Byun, Jeong-Hwan Choi, Joon-Hyuk Chang:
Class Activation Mapping-Driven Data Augmentation: Masking Significant Regions for Enhanced Acoustic Scene Classification. 1-5
- Taejun Kim, Juhan Nam:
All-in-One Metrical and Functional Structure Analysis with Neighborhood Attentions on Demixed Audio. 1-5
- Henri Gode, Simon Doclo:
Covariance Blocking and Whitening Method for Successive Relative Transfer Function Vector Estimation in Multi-Speaker Scenarios. 1-5
- Jan Büthe, Jean-Marc Valin, Ahmed Mustafa:
Lace: A Light-Weight, Causal Model for Enhancing Coded Speech Through Adaptive Convolutions. 1-5
- Cyrus Vahidi, Shubhr Singh, Emmanouil Benetos, Huy Phan, Dan Stowell, György Fazekas, Mathieu Lagrange:
Perceptual Musical Similarity Metric Learning with Graph Neural Networks. 1-5
- Diep Luong, Minh Tran, Shayan Gharib, Konstantinos Drossos, Tuomas Virtanen:
Representation Learning for Audio Privacy Preservation Using Source Separation and Robust Adversarial Learning. 1-5
- Nils L. Westhausen, Bernd T. Meyer:
Low Bit Rate Binaural Link for Improved Ultra Low-Latency Low-Complexity Multichannel Speech Enhancement in Hearing Aids. 1-5
- Shoichi Koyama, Masaki Nakada, Juliano G. C. Ribeiro, Hiroshi Saruwatari:
Kernel Interpolation of Incident Sound Field in Region Including Scattering Objects. 1-5
- Matthew Rice, Christian J. Steinmetz, George Fazekas, Joshua D. Reiss:
General Purpose Audio Effect Removal. 1-5
- Samuel F. Potter, Monte Hoover, Dmitry N. Zotkin, Ramani Duraiswami:
Computing Acoustic Onsets Via an Eikonal Solver. 1-5
- Andrew Wiggins, Youngmoo E. Kim:
A Differentiable Acoustic Guitar Model for String-Specific Polyphonic Synthesis. 1-5
- Hao-Wen Dong, Xiaoyu Liu, Jordi Pons, Gautam Bhattacharya, Santiago Pascual, Joan Serrà, Taylor Berg-Kirkpatrick, Julian J. McAuley:
CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models. 1-5
- Amir Ivry, Israel Cohen, Baruch Berdugo:
Deep Adaptation Control for Stereophonic Acoustic Echo Cancellation. 1-5
- Yaakov Buchris, Israel Cohen, Alon Amar:
Design of Frequency-Invariant Beamformers with Sparse Concentric Circular Arrays. 1-5
- Wo Jae Lee, Emanuele Coviello:
A Novel Method to Detect Instrumental Music in a Large Scale Music Catalog. 1-5
- George Close, Thomas Hain, Stefan Goetze:
The Effect of Spoken Language on Speech Enhancement Using Self-Supervised Speech Representation Loss Functions. 1-5
- Carlotta Anemüller, Oliver Thiergart, Emanuël A. P. Habets:
Neural Audio Decorrelation Using Generative Adversarial Networks. 1-5
- Aditya Arie Nugraha, Diego Di Carlo, Yoshiaki Bando, Mathieu Fontaine, Kazuyoshi Yoshii:
Time-Domain Audio Source Separation Based on Gaussian Processes with Deep Kernel Learning. 1-5
- Zhi Zhong, Hao Shi, Masato Hirano, Kazuki Shimada, Kazuya Tateishi, Takashi Shibuya, Shusuke Takahashi, Yuki Mitsufuji:
Extending Audio Masked Autoencoders toward Audio Restoration. 1-5
- Ryan M. Corey:
Mixed-Delay Distributed Beamforming for Own-Speech Separation in Hearing Devices with Wireless Remote Microphones. 1-5
- Axel Marmoret, Jérémy E. Cohen, Frédéric Bimbot:
Convolutive Block-Matching Segmentation Algorithm with Application to Music Structure Analysis. 1-5
- Mark R. P. Thomas, Jan-Hendrik Hanschke:
Inverted Cardioid Topology for Multi-Radius Spherical Microphone Arrays. 1-5
- Yutong Wen, You Zhang, Zhiyao Duan:
Mitigating Cross-Database Differences for Learning Unified HRTF Representation. 1-5
- Christoph Hold, Leo McCormack, Archontis Politis, Ville Pulkki:
Optimizing Higher-Order Directional Audio Coding with Adaptive Mixing and Energy Matching for Ambisonic Compression and Upmixing. 1-5
- Benjamin Stahl, Alois Sontacchi:
Multichannel Subband-Fullband Gated Convolutional Recurrent Neural Network for Direction-Based Speech Enhancement with Head-Mounted Microphone Arrays. 1-5
- Davide Berghi, Philip J. B. Jackson:
Audio Inputs for Active Speaker Detection and Localization Via Microphone Array. 1-5
- Shivam Saini, Jürgen Peissig:
Blind Room Acoustic Parameters Estimation Using Mobile Audio Transformer. 1-5
- Ernst Seidel, Pejman Mowlaee, Tim Fingscheidt:
Efficient Deep Acoustic Echo Suppression with Condition-Aware Training. 1-5
- Wiebke Middelberg, Henri Gode, Simon Doclo:
Relative Transfer Function Vector Estimation for Acoustic Sensor Networks Exploiting Covariance Matrix Structure. 1-5
- Sungho Lee, Hyeong-Seok Choi, Kyogu Lee:
Yet Another Generative Model for Room Impulse Response Estimation. 1-5
- Saksham Singh Kushwaha, Irán R. Román, Magdalena Fuentes, Juan Pablo Bello:
Sound Source Distance Estimation in Diverse and Dynamic Acoustic Conditions. 1-5
- Zhepei Wang, Cem Subakan, Krishna Subramani, Junkai Wu, Tiago Tavares, Fábio Ayres, Paris Smaragdis:
Unsupervised Improvement of Audio-Text Cross-Modal Representations. 1-5