Athanasios Katsamanis
Person information
- affiliation: National Technical University of Athens, Greece
- affiliation: University of Southern California, Signal Analysis and Interpretation Laboratory
2020 – today
- 2024
  - [j12] Georgios Paraskevopoulos, Theodoros Kouzelis, Georgios Rouvalis, Athanasios Katsamanis, Vassilis Katsouros, Alexandros Potamianos: Sample-Efficient Unsupervised Domain Adaptation of Speech Recognition Systems: A Case Study for Modern Greek. IEEE ACM Trans. Audio Speech Lang. Process. 32: 286-299 (2024)
  - [i14] Georgios Paraskevopoulos, Chara Tsoukala, Athanasios Katsamanis, Vassilis Katsouros: The Greek podcast corpus: Competitive speech models for low-resourced languages with weakly supervised data. CoRR abs/2406.15284 (2024)
  - [i13] Leon Voukoutis, Dimitris Roussis, Georgios Paraskevopoulos, Sokratis Sofianopoulos, Prokopis Prokopidis, Vassilis Papavasileiou, Athanasios Katsamanis, Stelios Piperidis, Vassilis Katsouros: Meltemi: The first open Large Language Model for Greek. CoRR abs/2407.20743 (2024)
  - [i12] Spyridoula Stamouli, Georgios Paraskevopoulos, Nassos Katsamanis: A Conversational AI Assistant for Teaching and Learning. ERCIM News 2024(136) (2024)
  - [i11] Chara Tsoukala, Georgios Paraskevopoulos, Athanasios Katsamanis: Revolutionising Theatre Archives: Using Large Language Models to Interact with Structured Archival Content. ERCIM News 2024(136) (2024)
  - [i10] Diego Collarana Vargas, Nassos Katsamanis: Large Language Models - Introduction to the Special Theme. ERCIM News 2024(136) (2024)
- 2023
  - [c68] Panagiotis Paraskevas Filntisis, George Retsinas, Foivos Paraperas Papantoniou, Athanasios Katsamanis, Anastasios Roussos, Petros Maragos: SPECTRE: Visual Speech-Informed Perceptual 3D Facial Expression Reconstruction from Videos. CVPR Workshops 2023: 5745-5755
  - [c67] Nikolaos Antoniou, Athanasios Katsamanis, Theodoros Giannakopoulos, Shrikanth Narayanan: Designing and Evaluating Speech Emotion Recognition Systems: A Reality Check Case Study with IEMOCAP. ICASSP 2023: 1-5
  - [c66] Zehra Shah, Shiang Qi, Fei Wang, Mahtab Farrokh, Mashrura Tasnim, Eleni Stroulia, Russell Greiner, Manos Plitsis, Athanasios Katsamanis: Exploring Language-Agnostic Speech Representations Using Domain Knowledge for Detecting Alzheimer's Dementia. ICASSP 2023: 1-2
  - [c65] Theodoros Kouzelis, Georgios Paraskevopoulos, Athanasios Katsamanis, Vassilis Katsouros: Weakly-supervised forced alignment of disfluent speech using phoneme-level modeling. INTERSPEECH 2023: 1563-1567
  - [c64] Thomas Melistas, Lefteris Kapelonis, Nikolaos Antoniou, Petros Mitseas, Dimitris Sgouropoulos, Theodoros Giannakopoulos, Athanasios Katsamanis, Shrikanth Narayanan: Cross-Lingual Features for Alzheimer's Dementia Detection from Speech. INTERSPEECH 2023: 3008-3012
  - [i9] Georgios Paraskevopoulos, Theodoros Kouzelis, Georgios Rouvalis, Athanasios Katsamanis, Vassilis Katsouros, Alexandros Potamianos: Sample-Efficient Unsupervised Domain Adaptation of Speech Recognition Systems A case study for Modern Greek. CoRR abs/2301.00304 (2023)
  - [i8] Nikolaos Antoniou, Athanasios Katsamanis, Theodoros Giannakopoulos, Shrikanth Narayanan: Designing and Evaluating Speech Emotion Recognition Systems: A reality check case study with IEMOCAP. CoRR abs/2304.00860 (2023)
  - [i7] Theodoris Kouzelis, Grigoris Bastas, Athanasios Katsamanis, Alexandros Potamianos: Efficient Audio Captioning Transformer with Patchout and Text Guidance. CoRR abs/2304.02916 (2023)
  - [i6] Theodoros Kouzelis, Georgios Paraskevopoulos, Athanasios Katsamanis, Vassilis Katsouros: Weakly-supervised forced alignment of disfluent speech using phoneme-level modeling. CoRR abs/2306.00996 (2023)
- 2022
  - [c63] Aggelina Chatziagapi, Dimitris Sgouropoulos, Constantinos Karouzos, Thomas Melistas, Theodoros Giannakopoulos, Athanasios Katsamanis, Shrikanth Narayanan: Audio and ASR-based Filled Pause Detection. ACII 2022: 1-7
  - [c62] Alkiviadis Katsalis, Konstantinos Christantonis, Charalampos Tsioustas, Pantelis I. Kaplanoglou, Maximos A. Kaliakatsos-Papakostas, Athanasios Katsamanis, Konstantinos I. Diamantaras, Vassilis Katsouros, Evita F. Fotinea, Depy Panga, Dimitra Loupi: NLP-Theatre: Employing Speech Recognition Technologies for Improving Accessibility and Augmenting the Theatrical Experience. IntelliSys (2) 2022: 507-526
  - [c61] Gerasimos Chatzoudis, Manos Plitsis, Spyridoula Stamouli, Athanasia-Lida Dimou, Nassos Katsamanis, Vassilis Katsouros: Zero-Shot Cross-lingual Aphasia Detection using Automatic Speech Recognition. INTERSPEECH 2022: 2178-2182
  - [c60] Grigoris Bastas, Maximos A. Kaliakatsos-Papakostas, Georgios Paraskevopoulos, Pantelis I. Kaplanoglou, Konstantinos Christantonis, Charalampos Tsioustas, Dimitris Mastrogiannopoulos, Depy Panga, Evita F. Fotinea, Athanasios Katsamanis, Vassilis Katsouros, Konstantinos I. Diamantaras, Petros Maragos: Towards a DHH Accessible Theater: Real-Time Synchronization of Subtitles and Sign Language Videos with ASR and NLP Solutions. PETRA 2022: 653-661
  - [c59] Martino Ciaperoni, Aristides Gionis, Athanasios Katsamanis, Panagiotis Karras: SIEVE: A Space-Efficient Algorithm for Viterbi Decoding. SIGMOD Conference 2022: 1136-1145
  - [c58] Efthymios Georgiou, Kosmas Kritsis, Georgios Paraskevopoulos, Athanasios Katsamanis, Vassilis Katsouros, Alexandros Potamianos: Regotron: Regularizing the Tacotron2 Architecture Via Monotonic Alignment Loss. SLT 2022: 977-983
  - [i5] Gerasimos Chatzoudis, Manos Plitsis, Spyridoula Stamouli, Athanasia-Lida Dimou, Athanasios Katsamanis, Vassilis Katsouros: Zero-Shot Cross-lingual Aphasia Detection using Automatic Speech Recognition. CoRR abs/2204.00448 (2022)
  - [i4] Efthymios Georgiou, Kosmas Kritsis, Georgios Paraskevopoulos, Athanasios Katsamanis, Vassilis Katsouros, Alexandros Potamianos: Regotron: Regularizing the Tacotron2 architecture via monotonic alignment loss. CoRR abs/2204.13437 (2022)
  - [i3] Panagiotis Paraskevas Filntisis, George Retsinas, Foivos Paraperas Papantoniou, Athanasios Katsamanis, Anastasios Roussos, Petros Maragos: Visual Speech-Aware Perceptual 3D Facial Expression Reconstruction from Videos. CoRR abs/2207.11094 (2022)
- 2021
  - [i2] Efthymios Georgiou, Athanasios Katsamanis: AudioVisual Speech Synthesis: A brief literature review. CoRR abs/2103.03927 (2021)
  - [i1] Emmanouil Zaranis, Georgios Paraskevopoulos, Athanasios Katsamanis, Alexandros Potamianos: EmpBot: A T5-based Empathetic Chatbot focusing on Sentiments. CoRR abs/2111.00310 (2021)
2010 – 2019
- 2019
  - [j11] Antigoni Tsiami, Petros Koutras, Athanasios Katsamanis, Argiro Vatakis, Petros Maragos: A behaviorally inspired fusion approach for computational audiovisual saliency modeling. Signal Process. Image Commun. 76: 186-200 (2019)
  - [c57] Theodoros Giannakopoulos, Spiros Dimopoulos, Georgios Pantazopoulos, Aggelina Chatziagapi, Dimitris Sgouropoulos, Athanasios Katsamanis, Alexandros Potamianos, Shrikanth S. Narayanan: Using Oliver API for emotion-aware movie content characterization. CBMI 2019: 1-4
  - [c56] Aggelina Chatziagapi, Georgios Paraskevopoulos, Dimitris Sgouropoulos, Georgios Pantazopoulos, Malvina Nikandrou, Theodoros Giannakopoulos, Athanasios Katsamanis, Alexandros Potamianos, Shrikanth Narayanan: Data Augmentation Using GANs for Speech Emotion Recognition. INTERSPEECH 2019: 171-175
- 2018
  - [c55] Ioannis K. Douros, Athanasios Katsamanis, Petros Maragos: Multi-View Audio-Articulatory Features for Phonetic Recognition on RTMRI-TIMIT Database. ICASSP 2018: 5514-5518
- 2017
  - [j10] Isidoros Rodomagoulakis, Athanasios Katsamanis, Gerasimos Potamianos, Panagiotis Giannoulis, Antigoni Tsiami, Petros Maragos: Room-localized spoken command recognition in multi-room, multi-microphone environments. Comput. Speech Lang. 46: 419-443 (2017)
  - [j9] Panagiotis Paraskevas Filntisis, Athanasios Katsamanis, Pirros Tsiakoulis, Petros Maragos: Video-realistic expressive audio-visual speech synthesis for the Greek language. Speech Commun. 95: 137-152 (2017)
  - [j8] James Gibson, Athanasios Katsamanis, Francisco Romero, Bo Xiao, Panayiotis G. Georgiou, Shrikanth S. Narayanan: Multiple Instance Learning for Behavioral Coding. IEEE Trans. Affect. Comput. 8(1): 81-94 (2017)
  - [c54] Panagiotis Paraskevas Filntisis, Athanasios Katsamanis, Petros Maragos: Photorealistic adaptation and interpolation of facial expressions using HMMS and AAMS for audio-visual speech synthesis. ICIP 2017: 2941-2945
  - [c53] Panagiotis Paraskevas Filntisis, Athanasios Katsamanis, Petros Maragos: Demonstration of an HMM-based photorealistic expressive audio-visual speech synthesis system. ICIP 2017: 4588
  - [p5] Athanasios Katsamanis, Vassilis Pitsikalis, Stavros Theodorakis, Petros Maragos: Multimodal gesture recognition. The Handbook of Multimodal-Multisensor Interfaces, Volume 1 (1) 2017: 449-487
  - [p4] Vassilis Pitsikalis, Athanasios Katsamanis, Stavros Theodorakis, Petros Maragos: Multimodal Gesture Recognition via Multiple Hypotheses Rescoring. Gesture Recognition 2017: 467-496
- 2016
  - [c52] Panagiotis Giannoulis, Gerasimos Potamianos, Petros Maragos, Athanasios Katsamanis: Improved Dictionary Selection and Detection Schemes in Sparse-CNMF-Based Overlapping Acoustic Event Detection. DCASE 2016: 25-29
  - [c51] Isidoros Rodomagoulakis, Nikolaos Kardaris, Vassilis Pitsikalis, E. Mavroudi, Athanasios Katsamanis, Antigoni Tsiami, Petros Maragos: Multimodal human action recognition in assistive human-robot interaction. ICASSP 2016: 2702-2706
  - [c50] Antigoni Tsiami, Athanasios Katsamanis, Petros Maragos, Argiro Vatakis: Towards a behaviorally-validated computational audiovisual saliency model. ICASSP 2016: 2847-2851
  - [c49] Georgia Panagiotaropoulou, Petros Koutras, Athanasios Katsamanis, Petros Maragos, Athanasia Zlatintsi, Athanassios Protopapas, Efstratios Karavasilis, Nikolaos Smyrnis: FMRI-based perceptual validation of a computational model for visual and auditory saliency in videos. ICIP 2016: 699-703
  - [c48] Alessio Brutti, Antigoni Tsiami, Athanasios Katsamanis, Petros Maragos: A Phase-Based Time-Frequency Masking for Multi-Channel Speech Enhancement in Domestic Environments. INTERSPEECH 2016: 2875-2879
  - [p3] Petros Maragos, Vassilis Pitsikalis, Athanasios Katsamanis, Georgios Pavlakos, Stavros Theodorakis: On Shape Recognition and Language. Perspectives in Shape Analysis 2016: 321-344
- 2015
  - [j7] Vassilis Pitsikalis, Athanasios Katsamanis, Stavros Theodorakis, Petros Maragos: Multimodal gesture recognition via multiple hypotheses rescoring. J. Mach. Learn. Res. 16: 255-284 (2015)
  - [c47] Angeliki Metallinou, Athanasios Katsamanis, Martin Wöllmer, Florian Eyben, Björn W. Schuller, Shrikanth S. Narayanan: Context-sensitive learning for enhanced audiovisual emotion classification (Extended abstract). ACII 2015: 463-469
  - [c46] Panagiotis Giannoulis, Alessio Brutti, Marco Matassoni, Alberto Abad, Athanasios Katsamanis, Miguel Matos, Gerasimos Potamianos, Petros Maragos: Multi-room speech activity detection using a distributed microphone network in domestic environments. EUSIPCO 2015: 1271-1275
  - [c45] Petros Koutras, Athanasia Zlatintsi, Elias Iosif, Athanasios Katsamanis, Petros Maragos, Alexandros Potamianos: Predicting audio-visual salient events based on visual, audio and text modalities for movie summarization. ICIP 2015: 4361-4365
- 2014
  - [j6] Chi-Chun Lee, Athanasios Katsamanis, Matthew P. Black, Brian R. Baucom, Andrew Christensen, Panayiotis G. Georgiou, Shrikanth S. Narayanan: Computing vocal entrainment: A signal-derived PCA-based quantification scheme with application to affect analysis in married couple interactions. Comput. Speech Lang. 28(2): 518-539 (2014)
  - [c44] Panagiotis Giannoulis, Gerasimos Potamianos, Athanasios Katsamanis, Petros Maragos: Multi-microphone fusion for detection of speech and acoustic events in smart spaces. EUSIPCO 2014: 2375-2379
  - [c43] Antigoni Tsiami, Athanasios Katsamanis, Petros Maragos, Gerasimos Potamianos: Experiments in acoustic source localization using sparse arrays in adverse indoors environments. EUSIPCO 2014: 2390-2394
  - [c42] Petros Koutras, Athanasios Katsamanis, Petros Maragos: Predicting Eyes' Fixations in Movie Videos: Visual Saliency Experiments on a New Eye-Tracking Database. HCI (23) 2014: 183-194
  - [c41] Panagiotis Giannoulis, Antigoni Tsiami, Isidoros Rodomagoulakis, Athanasios Katsamanis, Gerasimos Potamianos, Petros Maragos: The Athena-RC system for speech activity detection and speaker localization in the DIRHA smart home. HSCMA 2014: 167-171
  - [c40] Athanasios Katsamanis, Isidoros Rodomagoulakis, Gerasimos Potamianos, Petros Maragos, Antigoni Tsiami: Robust far-field spoken command recognition for home automation combining adaptation and multichannel processing. ICASSP 2014: 5547-5551
  - [c39] Georgios Pavlakos, Stavros Theodorakis, Vassilis Pitsikalis, Athanasios Katsamanis, Petros Maragos: Kinect-based multimodal gesture recognition using a two-pass fusion scheme. ICIP 2014: 1495-1499
  - [c38] Antigoni Tsiami, Isidoros Rodomagoulakis, Panagiotis Giannoulis, Athanasios Katsamanis, Gerasimos Potamianos, Petros Maragos: ATHENA: a Greek multi-sensory database for home automation control. INTERSPEECH 2014: 1608-1612
  - [c37] Marco Matassoni, Ramón Fernandez Astudillo, Athanasios Katsamanis, Mirco Ravanelli: The DIRHA-GRID corpus: baseline and tools for multi-room distant speech recognition using distributed microphones. INTERSPEECH 2014: 1613-1617
- 2013
  - [j5] Angeliki Metallinou, Athanasios Katsamanis, Shrikanth S. Narayanan: Tracking continuous emotional trends of participants during affective dyadic interactions using body language and speech information. Image Vis. Comput. 31(2): 137-152 (2013)
  - [j4] Matthew P. Black, Athanasios Katsamanis, Brian R. Baucom, Chi-Chun Lee, Adam C. Lammert, Andrew Christensen, Panayiotis G. Georgiou, Shrikanth S. Narayanan: Toward automating a human behavioral coding system for married couples' interactions using speech acoustic features. Speech Commun. 55(1): 1-21 (2013)
  - [c36] Andreas Tsiartas, Theodora Chaspari, Nassos Katsamanis, Prasanta Kumar Ghosh, Ming Li, Maarten Van Segbroeck, Alexandros Potamianos, Shrikanth S. Narayanan: Multi-band long-term signal variability features for robust voice activity detection. INTERSPEECH 2013: 718-722
- 2012
  - [j3] Angeliki Metallinou, Martin Wöllmer, Athanasios Katsamanis, Florian Eyben, Björn W. Schuller, Shrikanth S. Narayanan: Context-Sensitive Learning for Enhanced Audiovisual Emotion Classification. IEEE Trans. Affect. Comput. 3(2): 184-198 (2012)
  - [c35] Chi-Chun Lee, Athanasios Katsamanis, Brian R. Baucom, Panayiotis G. Georgiou, Shrikanth S. Narayanan: Using measures of vocal entrainment to inform outcome-related behaviors in marital conflicts. APSIPA 2012: 1-5
  - [c34] Angeliki Metallinou, Athanasios Katsamanis, Shrikanth S. Narayanan: A hierarchical framework for modeling multimodality and emotional evolution in affective dialogs. ICASSP 2012: 2401-2404
  - [c33] Martin Wöllmer, Angeliki Metallinou, Nassos Katsamanis, Björn W. Schuller, Shrikanth S. Narayanan: Analyzing the memory of BLSTM Neural Networks for enhanced emotion classification in dyadic spoken interactions. ICASSP 2012: 4157-4160
  - [c32] Theodora Chaspari, Emily Mower Provost, Athanasios Katsamanis, Shrikanth S. Narayanan: An acoustic analysis of shared enjoyment in ECA interactions of children with autism. ICASSP 2012: 4485-4488
  - [c31] Chi-Chun Lee, Athanasios Katsamanis, Panayiotis G. Georgiou, Shrikanth S. Narayanan: Based on Isolated Saliency or Causal Integration? Toward a Better Understanding of Human Annotation Process using Multiple Instance Learning and Sequential Probability Ratio Test. INTERSPEECH 2012: 619-622
  - [c30] David R. Traum, Priti Aggarwal, Ron Artstein, Susan Foutz, Jillian Gerten, Athanasios Katsamanis, Anton Leuski, Dan Noren, William R. Swartout: Ada and Grace: Direct Interaction with Museum Visitors. IVA 2012: 245-251
  - [c29] Priti Aggarwal, Ron Artstein, Jillian Gerten, Athanasios Katsamanis, Shrikanth S. Narayanan, Angela Nazarian, David R. Traum: The Twins Corpus of Museum Visitor Questions. LREC 2012: 2355-2361
- 2011
  - [c28] Chi-Chun Lee, Athanasios Katsamanis, Matthew P. Black, Brian R. Baucom, Panayiotis G. Georgiou, Shrikanth S. Narayanan: Affective State Recognition in Married Couples' Interactions Using PCA-Based Vocal Entrainment Measures with Multiple Instance Learning. ACII (2) 2011: 31-41
  - [c27] Athanasios Katsamanis, James Gibson, Matthew P. Black, Shrikanth S. Narayanan: Multiple Instance Learning for Classification of Human Behavior Observations. ACII (1) 2011: 145-154
  - [c26] Angeliki Metallinou, Athanassios Katsamanis, Yun Wang, Shrikanth S. Narayanan: Tracking changes in continuous emotion states using body language and prosodic cues. ICASSP 2011: 2288-2291
  - [c25] Viktor Rozgic, Bo Xiao, Athanasios Katsamanis, Brian R. Baucom, Panayiotis G. Georgiou, Shrikanth S. Narayanan: Estimation of ordinal approach-avoidance labels in dyadic interactions: Ordinal logistic regression approach. ICASSP 2011: 2368-2371
  - [c24] Vikram Ramanarayanan, Athanasios Katsamanis, Shrikanth S. Narayanan: Automatic Data-Driven Learning of Articulatory Primitives from Real-Time MRI Data Using Convolutive NMF with Sparseness Constraints. INTERSPEECH 2011: 61-64
  - [c23] Matthew Black, Panayiotis G. Georgiou, Athanasios Katsamanis, Brian R. Baucom, Shrikanth S. Narayanan: "You made me do it": Classification of Blame in Married Couples' Interactions by Fusing Automatically Derived Speech and Language Information. INTERSPEECH 2011: 89-92
  - [c22] Michael I. Proctor, Adam C. Lammert, Athanasios Katsamanis, Louis M. Goldstein, Christina Hagedorn, Shrikanth S. Narayanan: Direct Estimation of Articulatory Kinematics from Real-Time Magnetic Resonance Image Sequences. INTERSPEECH 2011: 281-284
  - [c21] Shrikanth S. Narayanan, Erik Bresch, Prasanta Kumar Ghosh, Louis Goldstein, Athanasios Katsamanis, Yoon Kim, Adam C. Lammert, Michael I. Proctor, Vikram Ramanarayanan, Yinghua Zhu: A Multimodal Real-Time MRI Articulatory Corpus for Speech Research. INTERSPEECH 2011: 837-840
  - [c20] James Gibson, Athanasios Katsamanis, Matthew P. Black, Shrikanth S. Narayanan: Automatic Identification of Salient Acoustic Instances in Couples' Behavioral Interactions Using Diverse Density Support Vector Machines. INTERSPEECH 2011: 1561-1564
  - [c19] Bo Xiao, Viktor Rozgic, Athanasios Katsamanis, Brian R. Baucom, Panayiotis G. Georgiou, Shrikanth S. Narayanan: Acoustic and Visual Cues of Turn-Taking Dynamics in Dyadic Interactions. INTERSPEECH 2011: 2441-2444
  - [c18] Adam C. Lammert, Michael I. Proctor, Athanasios Katsamanis, Shrikanth S. Narayanan: Morphological Variation in the Adult Vocal Tract: A Modeling Study of its Potential Acoustic Impact. INTERSPEECH 2011: 2813-2816
  - [c17] Athanasios Katsamanis, Erik Bresch, Vikram Ramanarayanan, Shrikanth S. Narayanan: Validating rt-MRI Based Articulatory Representations via Articulatory Recognition. INTERSPEECH 2011: 2841-2844
  - [c16] Chi-Chun Lee, Athanasios Katsamanis, Matthew P. Black, Brian R. Baucom, Panayiotis G. Georgiou, Shrikanth S. Narayanan: An Analysis of PCA-Based Vocal Entrainment Measures in Married Couples' Affective Spoken Interactions. INTERSPEECH 2011: 3101-3104
- 2010
  - [c15] Chi-Chun Lee, Matthew Black, Athanasios Katsamanis, Adam C. Lammert, Brian R. Baucom, Andrew Christensen, Panayiotis G. Georgiou, Shrikanth S. Narayanan: Quantification of prosodic entrainment in affective spontaneous spoken interactions of married couples. INTERSPEECH 2010: 793-796
  - [c14] Michael I. Proctor, Daniel Bone, Athanasios Katsamanis, Shrikanth S. Narayanan: Rapid semi-automatic segmentation of real-time magnetic resonance images for parametric vocal tract analysis. INTERSPEECH 2010: 1576-1579
  - [c13] Erik Bresch, Athanasios Katsamanis, Louis Goldstein, Shrikanth S. Narayanan: Statistical multi-stream modeling of real-time MRI articulatory speech data. INTERSPEECH 2010: 1584-1587
  - [c12] Viktor Rozgic, Bo Xiao, Athanasios Katsamanis, Brian R. Baucom, Panayiotis G. Georgiou, Shrikanth S. Narayanan: A new multichannel multi modal dyadic interaction database. INTERSPEECH 2010: 1982-1985
  - [c11] Matthew Black, Athanasios Katsamanis, Chi-Chun Lee, Adam C. Lammert, Brian R. Baucom, Andrew Christensen, Panayiotis G. Georgiou, Shrikanth S. Narayanan: Automatic classification of married couples' behavior using audio features. INTERSPEECH 2010: 2030-2033
2000 – 2009
- 2009
  - [j2] Athanassios Katsamanis, George Papandreou, Petros Maragos: Face Active Appearance Modeling and Speech Acoustic Information to Recover Articulation. IEEE Trans. Speech Audio Process. 17(3): 411-422 (2009)
  - [j1] George Papandreou, Athanassios Katsamanis, Vassilis Pitsikalis, Petros Maragos: Adaptive Multimodal Fusion by Uncertainty Compensation With Application to Audiovisual Speech Recognition. IEEE Trans. Speech Audio Process. 17(3): 423-435 (2009)
  - [c10] Stavros Theodorakis, Athanassios Katsamanis, Petros Maragos: Product-HMMs for automatic sign language recognition. ICASSP 2009: 1601-1604
  - [c9] Anastasios Roussos, Athanassios Katsamanis, Petros Maragos: Tongue tracking in Ultrasound images with Active Appearance Models. ICIP 2009: 1733-1736
- 2008
  - [c8] Athanasios Katsamanis, Gopal Ananthakrishnan, George Papandreou, Petros Maragos, Olov Engwall: Audiovisual speech inversion by switching dynamical modeling governed by a Hidden Markov process. EUSIPCO 2008: 1-5
  - [c7] Athanassios Katsamanis, George Papandreou, Petros Maragos: Audiovisual-to-articulatory speech inversion using Active Appearance Models for the face and Hidden Markov Models for the dynamics. ICASSP 2008: 2237-2240
  - [c6] Stamatios Lefkimmiatis, Petros Maragos, Athanassios Katsamanis: Multisensor multiband cross-energy tracking for feature extraction and recognition. ICASSP 2008: 4741-4744
  - [p2] Petros Maragos, Patrick Gros, Athanassios Katsamanis, George Papandreou: Cross-Modal Integration for Performance Improving in Multimedia: A Review. Multimodal Processing and Interaction 2008: 1-46
  - [p1] George Papandreou, Athanassios Katsamanis, Vassilis Pitsikalis, Petros Maragos: Adaptive Multimodal Fusion by Uncertainty Compensation with Application to Audio-Visual Speech Recognition. Multimodal Processing and Interaction 2008: 1-15
- 2007
  - [c5] George Papandreou, Athanassios Katsamanis, Vassilis Pitsikalis, Petros Maragos: Multimodal Fusion and Learning with Uncertain Features Applied to Audiovisual Speech Recognition. MMSP 2007: 264-267
  - [c4] Athanassios Katsamanis, George Papandreou, Petros Maragos: Audiovisual-to-Articulatory Speech Inversion Using HMMs. MMSP 2007: 457-460
- 2006
  - [c3] Athanassios Katsamanis, George Papandreou, Vassilis Pitsikalis, Petros Maragos: Multimodal fusion by adaptive compensation for feature uncertainty with application to audiovisual speech recognition. EUSIPCO 2006: 1-5
  - [c2] Vassilis Pitsikalis, Athanassios Katsamanis, George Papandreou, Petros Maragos: Adaptive multimodal fusion by uncertainty compensation. INTERSPEECH 2006
- 2005
  - [c1] Athanassios Katsamanis, Petros Maragos: Advances in statistical estimation and tracking of AM-FM speech components. INTERSPEECH 2005: 1125-1128
last updated on 2024-08-22 19:46 CEST by the dblp team
all metadata released as open data under CC0 1.0 license