Oriol Nieto
2020 – today
- 2024
- [c29] Sreyan Ghosh, Sonal Kumar, Ashish Seth, Chandra Kiran Reddy Evuru, Utkarsh Tyagi, S. Sakshi, Oriol Nieto, Ramani Duraiswami, Dinesh Manocha:
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities. EMNLP 2024: 6288-6313
- [c28] Sreyan Ghosh, Ashish Seth, Sonal Kumar, Utkarsh Tyagi, Chandra Kiran Reddy Evuru, Ramaneswaran S., Sakshi Singh, Oriol Nieto, Ramani Duraiswami, Dinesh Manocha:
CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models. ICLR 2024
- [i16] Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Utkarsh Tyagi, Oriol Nieto, Zeyu Jin, Dinesh Manocha:
VDGD: Mitigating LVLM Hallucinations in Cognitive Prompts by Bridging the Visual Perception Gap. CoRR abs/2405.15683 (2024)
- [i15] Sreyan Ghosh, Sonal Kumar, Ashish Seth, Chandra Kiran Reddy Evuru, Utkarsh Tyagi, Sakshi Singh, Oriol Nieto, Ramani Duraiswami, Dinesh Manocha:
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities. CoRR abs/2406.11768 (2024)
- [i14] Sreyan Ghosh, Sonal Kumar, Chandra Kiran Reddy Evuru, Oriol Nieto, Ramani Duraiswami, Dinesh Manocha:
ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds. CoRR abs/2409.09213 (2024)
- [i13] Ilaria Manco, Justin Salamon, Oriol Nieto:
Augment, Drop & Swap: Improving Diversity in LLM Captions for Efficient Music-Text Representation Learning. CoRR abs/2409.11498 (2024)
- 2023
- [c27] Reuben Tan, Arijit Ray, Andrea Burns, Bryan A. Plummer, Justin Salamon, Oriol Nieto, Bryan Russell, Kate Saenko:
Language-Guided Audio-Visual Source Separation via Trimodal Consistency. CVPR 2023: 10575-10584
- [c26] Ho-Hsiang Wu, Oriol Nieto, Juan Pablo Bello, Justin Salamon:
Audio-Text Models Do Not Yet Leverage Natural Language. ICASSP 2023: 1-5
- [c25] Oriol Nieto, Zeyu Jin, Franck Dernoncourt, Justin Salamon:
Efficient Spoken Language Recognition via Multilabel Classification. INTERSPEECH 2023: 506-510
- [c24] Julia Wilkins, Justin Salamon, Magdalena Fuentes, Juan Pablo Bello, Oriol Nieto:
Bridging High-Quality Audio and Video Via Language for Sound Effects Retrieval from Visual Queries. WASPAA 2023: 1-5
- [i12] Ho-Hsiang Wu, Oriol Nieto, Juan Pablo Bello, Justin Salamon:
Audio-Text Models Do Not Yet Leverage Natural Language. CoRR abs/2303.10667 (2023)
- [i11] Reuben Tan, Arijit Ray, Andrea Burns, Bryan A. Plummer, Justin Salamon, Oriol Nieto, Bryan Russell, Kate Saenko:
Language-Guided Audio-Visual Source Separation via Trimodal Consistency. CoRR abs/2303.16342 (2023)
- [i10] Oriol Nieto, Zeyu Jin, Franck Dernoncourt, Justin Salamon:
Efficient Spoken Language Recognition via Multilabel Classification. CoRR abs/2306.01945 (2023)
- [i9] Julia Wilkins, Justin Salamon, Magdalena Fuentes, Juan Pablo Bello, Oriol Nieto:
Bridging High-Quality Audio and Video via Language for Sound Effects Retrieval from Visual Queries. CoRR abs/2308.09089 (2023)
- [i8] Sreyan Ghosh, Ashish Seth, Sonal Kumar, Utkarsh Tyagi, Chandra Kiran Reddy Evuru, Ramaneswaran S., Sakshi Singh, Oriol Nieto, Ramani Duraiswami, Dinesh Manocha:
CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models. CoRR abs/2310.08753 (2023)
- 2022
- [c23] Nikhil Kandpal, Oriol Nieto, Zeyu Jin:
Music Enhancement via Image Translation and Vocoding. ICASSP 2022: 3124-3128
- [i7] Nikhil Kandpal, Oriol Nieto, Zeyu Jin:
Music Enhancement via Image Translation and Vocoding. CoRR abs/2204.13289 (2022)
- 2021
- [c22] Minz Won, Sergio Oramas, Oriol Nieto, Fabien Gouyon, Xavier Serra:
Multimodal Metric Learning for Tag-Based Music Retrieval. ICASSP 2021: 591-595
- [c21] Justin Salamon, Oriol Nieto, Nicholas J. Bryan:
Deep Embeddings and Section Fusion Improve Music Segmentation. ISMIR 2021: 594-601
- 2020
- [j2] Oriol Nieto, Gautham J. Mysore, Cheng-i Wang, Jordan B. L. Smith, Jan Schlüter, Thomas Grill, Brian McFee:
Audio-Based Music Structure Analysis: Current Trends, Open Challenges, and Applications. Trans. Int. Soc. Music. Inf. Retr. 3(1): 246-263 (2020)
- [c20] Minz Won, Sanghyuk Chun, Oriol Nieto, Xavier Serra:
Data-Driven Harmonic Filters for Audio Representation Learning. ICASSP 2020: 536-540
- [c19] Filip Korzeniowski, Oriol Nieto, Matthew C. McCallum, Minz Won, Sergio Oramas, Erik M. Schmidt:
Mood Classification Using Listening Data. ISMIR 2020: 542-549
- [i6] Filip Korzeniowski, Oriol Nieto, Matthew C. McCallum, Minz Won, Sergio Oramas, Erik M. Schmidt:
Mood Classification Using Listening Data. CoRR abs/2010.11512 (2020)
- [i5] Minz Won, Sergio Oramas, Oriol Nieto, Fabien Gouyon, Xavier Serra:
Multimodal Metric Learning for Tag-based Music Retrieval. CoRR abs/2010.16030 (2020)
2010 – 2019
- 2019
- [c18] Oriol Nieto, Matthew C. McCallum, Matthew E. P. Davies, Andrew Robertson, Adam M. Stark, Eran Egozy:
The Harmonix Set: Beats, Downbeats, and Functional Segment Annotations of Western Popular Music. ISMIR 2019: 565-572
- 2018
- [j1] Sergio Oramas, Francesco Barbieri, Oriol Nieto, Xavier Serra:
Multimodal Deep Learning for Music Genre Classification. Trans. Int. Soc. Music. Inf. Retr. 1(1): 4-21 (2018)
- [c17] Jordi Pons, Oriol Nieto, Matthew Prockup, Erik M. Schmidt, Andreas F. Ehmann, Xavier Serra:
End-to-end Learning for Music Audio Tagging at Scale. ISMIR 2018: 637-644
- [c16] Samaneh Ebrahimi, Hossein Vahabi, Matthew Prockup, Oriol Nieto:
Predicting Audio Advertisement Quality. WSDM 2018: 153-161
- [i4] Samaneh Ebrahimi, Hossein Vahabi, Matthew Prockup, Oriol Nieto:
Predicting Audio Advertisement Quality. CoRR abs/1802.03319 (2018)
- 2017
- [c15] Sergio Oramas, Oriol Nieto, Francesco Barbieri, Xavier Serra:
Multi-Label Music Genre Classification from Audio, Text and Images Using Deep Features. ISMIR 2017: 23-30
- [c14] Sergio Oramas, Oriol Nieto, Mohamed Sordo, Xavier Serra:
A Deep Multimodal Approach for Cold-start Music Recommendation. DLRS@RecSys 2017: 32-37
- [i3] Sergio Oramas, Oriol Nieto, Mohamed Sordo, Xavier Serra:
A Deep Multimodal Approach for Cold-start Music Recommendation. CoRR abs/1706.09739 (2017)
- [i2] Sergio Oramas, Oriol Nieto, Francesco Barbieri, Xavier Serra:
Multi-label Music Genre Classification from Audio, Text, and Images Using Deep Features. CoRR abs/1707.04916 (2017)
- [i1] Jordi Pons, Oriol Nieto, Matthew Prockup, Erik M. Schmidt, Andreas F. Ehmann, Xavier Serra:
End-to-end learning for music audio tagging at scale. CoRR abs/1711.02520 (2017)
- 2016
- [c13] Oriol Nieto, Juan Pablo Bello:
Systematic Exploration of Computational Music Structure Research. ISMIR 2016: 547-553
- 2015
- [c12] Brian McFee, Oriol Nieto, Juan Pablo Bello:
Hierarchical Evaluation of Segment Boundary Detection. ISMIR 2015: 406-412
- [c11] Brian McFee, Colin Raffel, Dawen Liang, Daniel P. W. Ellis, Matt McVicar, Eric Battenberg, Oriol Nieto:
librosa: Audio and Music Signal Analysis in Python. SciPy 2015: 18-24
- 2014
- [c10] Andreu Ballus, Eric Arnau, Oriol Nieto, Frederic Font, Alba Torrents:
Embodying Theoretical Research in Music Cognition: Four Proposals for Theory-Driven Experimentation. CogSci 2014
- [c9] Oriol Nieto, Juan Pablo Bello:
Music segment similarity using 2D-Fourier Magnitude Coefficients. ICASSP 2014: 664-668
- [c8] Oriol Nieto, Morwaread Mary Farbood, Tristan Jehan, Juan Pablo Bello:
Perceptual Analysis of the F-Measure to Evaluate Section Boundaries in Music. ISMIR 2014: 265-270
- [c7] Colin Raffel, Brian McFee, Eric J. Humphrey, Justin Salamon, Oriol Nieto, Dawen Liang, Daniel P. W. Ellis:
MIR_EVAL: A Transparent Implementation of Common MIR Metrics. ISMIR 2014: 367-372
- [c6] Oriol Nieto, Morwaread Mary Farbood:
Identifying Polyphonic Musical Patterns From Audio Recordings Using Music Segmentation Techniques. ISMIR 2014: 411-416
- [c5] Eric J. Humphrey, Justin Salamon, Oriol Nieto, Jon Forsyth, Rachel M. Bittner, Juan Pablo Bello:
JAMS: A JSON Annotated Music Specification for Reproducible MIR Research. ISMIR 2014: 591-596
- 2013
- [c4] Oriol Nieto, Tristan Jehan:
Convex non-negative matrix factorization for automatic music structure identification. ICASSP 2013: 236-240
- [c3] Eric J. Humphrey, Oriol Nieto, Juan Pablo Bello:
Data Driven and Discriminative Projections for Large-Scale Cover Song Identification. ISMIR 2013: 149-154
- [c2] Tae Hong Park, Oriol Nieto:
Fortissimo: Force-Feedback for Mobile Devices. NIME 2013: 291-294
- 2012
- [c1] Oriol Nieto, Eric J. Humphrey, Juan Pablo Bello:
Compressing Music Recordings into Audio Summaries. ISMIR 2012: 313-318
last updated on 2024-11-15 20:37 CET by the dblp team
all metadata released as open data under CC0 1.0 license