


21st MSR 2024: Lisbon, Portugal
- Diomidis Spinellis, Alberto Bacchelli, Eleni Constantinou:
21st IEEE/ACM International Conference on Mining Software Repositories, MSR 2024, Lisbon, Portugal, April 15-16, 2024. ACM 2024, ISBN 979-8-4007-0587-8
- Gwenyth Lincroft, Minsung Cho, Katherine Hough, Mahsa Bazzaz, Jonathan Bell:
Thirty-Three Years of Mathematicians and Software Engineers: A Case Study of Domain Expertise and Participation in Proof Assistant Ecosystems. 1-13
- Can Li, Jingxuan Zhang, Yixuan Tang, Zhuhang Li, Tianyue Sun:
Boosting API Misuse Detection via Integrating API Constraints from Multiple Sources. 14-26
- Ricardo de Sousa Job, Andre Hora:
Availability and Usage of Platform-Specific APIs: A First Empirical Study. 27-31
- Jordan Samhi, Tegawendé F. Bissyandé, Jacques Klein:
AndroLibZoo: A Reliable Dataset of Libraries Based on Software Dependency Analysis. 32-36
- Damien Jaime, Joyce El Haddad, Pascal Poizat:
Goblin: A Framework for Enriching and Querying the Maven Central Dependency Graph. 37-41
- Mahmoud Jahanshahi, Audris Mockus:
Dataset: Copy-based Reuse in Open Source Software. 42-47
- Shane McIntosh:
Mining Our Way Back to Incremental Builds for DevOps Pipelines. 48-49
- Guoliang Zhao, Stefanos Georgiou, Ying Zou, Safwat Hassan, Derek Truong, Toby Corbin:
Enhancing Performance Bug Prediction Using Performance Code Metrics. 50-62
- Xingyu Li, Zheng Zhang, Zhiyun Qian, Trent Jaeger, Chengyu Song:
An Investigation of Patch Porting Practices of the Linux Kernel Ecosystem. 63-74
- Philip Oliver, Jens Dietrich, Craig Anslow, Michael Homer:
CrashJS: A NodeJS Benchmark for Automated Crash Reproduction. 75-87
- Xhulja Shahini, Andreas Metzger, Klaus Pohl:
An Empirical Study on Just-in-time Conformal Defect Prediction. 88-99
- Mahi Begoug, Moataz Chouchen, Ali Ouni, Eman Abdullah AlOmar, Mohamed Wiem Mkaouer:
Fine-Grained Just-In-Time Defect Prediction at the Block Level in Infrastructure-as-Code (IaC). 100-112
- Kaibo Liu, Yudong Han, Yiyang Liu, Jie M. Zhang, Zhenpeng Chen, Federica Sarro, Gang Huang, Yun Ma:
TrickyBugs: A Dataset of Corner-case Bugs in Plausible Programs. 113-117
- André Silva, Nuno Saavedra, Martin Monperrus:
GitBug-Java: A Reproducible Benchmark of Recent Java Bugs. 118-122
- Dirk Beyer, Lars Grunske, Matthias Kettl, Marian Lingsch Rosenfeld, Moeketsi Raselimo:
P3: A Dataset of Partial Program Patches. 123-127
- K. D. Pramod, W. T. N. De Silva, W. U. K. Thabrew, Ridwan Shariffdeen, Sandareka Wickramanayake:
BugsPHP: A dataset for Automated Program Repair in PHP. 128-132
- Zhipeng Liu, Meng Yan, Zhipeng Gao, Dong Li, Xiaohong Zhang, Dan Yang:
AW4C: A Commit-Aware C Dataset for Actionable Warning Identification. 133-137
- Suhaib Mujahid, Diego Elias Costa, Marco Castelluccio:
Predicting the Impact of Crashes Across Release Channels. 138-139
- Sangameshwar Patil, Balaraman Ravindran:
Zero-shot Learning based Alternatives for Class Imbalanced Learning Problem in Enterprise Software Defect Analysis. 140-141
- Liangxuan Wu, Yanjie Zhao, Xinyi Hou, Tianming Liu, Haoyu Wang:
ChatGPT Chats Decoded: Uncovering Prompt Patterns for Superior Solutions in Software Development Lifecycle. 142-146
- Konstantinos Moratis, Themistoklis Diamantopoulos, Dimitrios-Nikitas Nastos, Andreas L. Symeonidis:
Write me this Code: An Analysis of ChatGPT Quality for Producing Source Code. 147-151
- Mohammed Latif Siddiq, Lindsay Roney, Jiahao Zhang, Joanna C. S. Santos:
Quality Assessment of ChatGPT Generated Code and their Use by Developers. 152-156
- Balreet Grewal, Wentao Lu, Sarah Nadi, Cor-Paul Bezemer:
Analyzing Developer Use of ChatGPT Generated Code in Open Source GitHub Projects. 157-161
- Piotr Przymus, Mikolaj Fejzer, Jakub Narebski, Krzysztof Stencel:
How I Learned to Stop Worrying and Love ChatGPT. 162-166
- Kailun Jin, Chung-Yu Wang, Hung Viet Pham, Hadi Hemmati:
Can ChatGPT Support Developers? An Empirical Evaluation of Large Language Models for Code Generation. 167-171
- Rachna Raj, Diego Elias Costa:
The role of library versions in Developer-ChatGPT conversations. 172-176
- Md. Fazle Rabbi, Arifa I. Champa, Minhaz Fahim Zibran, Md. Rakibul Islam:
AI Writes, We Analyze: The ChatGPT Python Code Saga. 177-181
- Arifa I. Champa, Md. Fazle Rabbi, Costain Nachuma, Minhaz F. Zibran:
ChatGPT in Action: Analyzing Its Use in Software Development. 182-186
- Suad Mohamed, Abdullah Parvin, Esteban Parra:
Chatting with AI: Deciphering Developer Conversations with ChatGPT. 187-191
- Yue Zhang, Rachel Meredith, Wilson Reeves, Julia Coriolano, Muhammad Ali Babar, Akond Rahman:
Does Generative AI Generate Smells Related to Container Orchestration?: An Exploratory Study with Kubernetes Manifests. 192-196
- Ertugrul Sagdic, Arda Bayram, Md. Rakibul Islam:
On the Taxonomy of Developers' Discussion Topics with ChatGPT. 197-201
- Eman Abdullah AlOmar, Anushkrishna Venkatakrishnan, Mohamed Wiem Mkaouer, Christian D. Newman, Ali Ouni:
How to Refactor this Code? An Exploratory Study on Developer-ChatGPT Refactoring Conversations. 202-206
- Soham Deo, Divya Hinge, Omkar Sandip Chavan, Yaxuan Olivia Wang, Mohamed Wiem Mkaouer:
Analyzing Developer-ChatGPT Conversations for Software Refactoring: An Exploratory Study. 207-211
- Moataz Chouchen, Narjes Bessghaier, Mahi Begoug, Ali Ouni, Eman Abdullah AlOmar, Mohamed Wiem Mkaouer:
How Do Software Developers Use ChatGPT? An Exploratory Study on GitHub Pull Requests. 212-216
- Joy Krishan Das, Saikat Mondal, Chanchal K. Roy:
Investigating the Utility of ChatGPT in the Issue Tracking System: An Exploratory Study. 217-221
- Saikat Mondal, Suborno Deb Bappon, Chanchal K. Roy:
Enhancing User Interaction in ChatGPT: Characterizing and Consolidating Multiple Prompts for Issue Resolution. 222-226
- Tao Xiao, Christoph Treude, Hideaki Hata, Kenichi Matsumoto:
DevGPT: Studying Developer-ChatGPT Conversations. 227-230
- Giovanni Rosa, Simone Scalabrino, Gregorio Robles, Rocco Oliveto:
Not all Dockerfile Smells are the Same: An Empirical Evaluation of Hadolint Writing Practices by Experts. 231-241
- Anamaria-Roberta Preda, Christoph Mayr-Dorn, Atif Mashkoor, Alexander Egyed:
Supporting High-Level to Low-Level Requirements Coverage Reviewing with Large Language Models. 242-253
- Md. Anaytul Islam, Muhammad Asaduzzaman, Shaowei Wang:
On the Executability of R Markdown Files. 254-264
- Souhaila Serbout, Cesare Pautasso:
APIstic: A Large Collection of OpenAPI Metrics. 265-277
- Hong Yi Lin, Patanamon Thongtanunam, Christoph Treude, Wachiraphan Charoenwet:
Improving Automated Code Reviews: Learning from Experience. 278-283
- Tushar Sharma:
Multi-faceted Code Smell Detection at Scale using DesigniteJava 2.0. 284-288
- Edi Sutoyo, Andrea Capiluppi:
SATDAUG - A Balanced and Augmented Dataset for Detecting Self-Admitted Technical Debt. 289-293
- Mingzhao Liang, Wachiraphan Charoenwet, Patanamon Thongtanunam:
Curated Email-Based Code Reviews Datasets. 294-298
- André C. Hora:
TestDossier: A Dataset of Tested Values Automatically Extracted from Test Execution. 299-303
- Saurabhsingh Rajput, Maria Kechagia, Federica Sarro, Tushar Sharma:
Greenlight: Highlighting TensorFlow APIs Energy Footprint. 304-308
- Kesina Baral, Jack Johnson, Junayed Mahmud, Sabiha Salma, Mattia Fazzini, Julia Rubin, Jeff Offutt, Kevin Moran:
Automating GUI-based Test Oracles for Mobile Apps. 309-321
- Liu Wang, Conghui Zheng, Haoyu Wang, Xiapu Luo, Gareth Tyson, Yi Wang, Shangguang Wang:
Global Prosperity or Local Monopoly? Understanding the Geography of App Popularity. 322-334
- Sabiha Salma, S M Hasan Mansur, Yule Zhang, Kevin Moran:
GuiEvo: Automated Evolution of Mobile Application GUIs. 335-347
- Magdalena Steinböck, Jakob Bleier, Mikka Rainer, Tobias Urban, Christine Utz, Martina Lindorfer:
Comparing Apples to Androids: Discovery, Retrieval, and Matching of iOS and Android Apps for Cross-Platform Analyses. 348-360
- Elliott Wen, Jiaxiang Zhou, Xiapu Luo, Giovanni Russello, Jens Dietrich:
Keep Me Updated: An Empirical Study on Embedded JavaScript Engines in Android Apps. 361-372
- Sahrima Jannat Oishwee, Natalia Stakhanova, Zadia Codabux:
Large Language Model vs. Stack Overflow in Addressing Android Permission Related Challenges. 373-383
- Yasaman Abedini, Mohammad Hadi Hajihosseini, Abbas Heydarnoori:
DATAR: A Dataset for Tracking App Releases. 384-388
- Marco Alecci, Pedro Jesús Ruiz Jiménez, Kevin Allix, Tegawendé F. Bissyandé, Jacques Klein:
AndroZoo: A Retrospective with a Glimpse into the Future. 389-393
- Oseremen Joy Idialu, Noble Saji Mathews, Rungroj Maipradit, Joanne M. Atlee, Meiyappan Nagappan:
Whodunit: Classifying Code as Human Authored or GPT-4 generated- A case study on CodeChef problems. 394-406
- Nafiseh Nikeghbal, Amir Hossein Kargaran, Abbas Heydarnoori:
GIRT-Model: Automated Generation of Issue Report Templates. 407-418
- Ahmed Saeed Alsayed, Hoa Khanh Dam, Chau Nguyen:
MicroRec: Leveraging Large Language Models for Microservice Recommendation. 419-430
- Wenxin Jiang, Jerin Yasmin, Jason Jones, Nicholas Synovic, Jiashen Kuo, Nathaniel Bielanski, Yuan Tian, George K. Thiruvathukal, James C. Davis:
PeaTMOSS: A Dataset and Initial Analysis of Pre-Trained Models in Open-Source Software. 431-443
- Binger Chen, Jacek Golebiowski, Ziawasch Abedjan:
Data Augmentation for Supervised Code Translation Learning. 444-456
- Amir M. Mir, Mehdi Keshani, Sebastian Proksch:
On the Effectiveness of Machine Learning-based Call Graph Pruning: An Empirical Study. 457-468
- Giuseppe Colavito, Filippo Lanubile, Nicole Novielli, Luigi Quaranta:
Leveraging GPT-like LLMs to Automate Issue Labeling. 469-480
- Kei Koyanagi, Dong Wang, Kotaro Noguchi, Masanari Kondo, Alexander Serebrenik, Yasutaka Kamei, Naoyasu Ubayashi:
Exploring the Effect of Multiple Natural Languages on Code Suggestion Using GitHub Copilot. 481-486
- Md. Rakibul Islam, Md. Fazle Rabbi, Youngeun Jo, Arifa I. Champa, Ethan Young, Camden Wilson, Gavin Scott, Minhaz Fahim Zibran:
A Four-Dimension Gold Standard Dataset for Opinion Mining in Software Engineering. 487-491
- Anisha Islam, Kalvin Eng, Abram Hindle:
Opening the Valve on Pure-Data: Usage Patterns and Programming Practices of a Data-Flow Based Visual Programming Language. 492-497
- Daniel Sokolowski, David Spielmann, Guido Salvaneschi:
The PIPr Dataset of Public Infrastructure as Code Programs. 498-503
- Dario Amoroso d'Aragona, Alexander Bakhtin, Xiaozhou Li, Ruoyu Su, Lauren Adams, Ernesto Aponte, Francis Boyle, Patrick Boyle, Rachel Koerner, Joseph Lee, Fangchao Tian, Yuqing Wang, Jesse Nyyssölä, Ernesto Quevedo, Md Shahidur Rahaman, Amr S. Abdelfattah, Mika Mäntylä, Tomás Cerný, Davide Taibi:
A Dataset of Microservices-based Open-Source Projects. 504-509
- Christian Birchler, Cyrill Rohrbach, Timo Kehrer, Sebastiano Panichella:
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars. 510-514
- Ramtin Ehsani, Mia Mohammad Imran, Robert Zita, Kostadin Damevski, Preetha Chatterjee:
Incivility in Open Source Projects: A Comprehensive Annotated Dataset of Locked GitHub Issue Threads. 515-519
- Davi Tabosa, Oton Pinheiro, Lincoln S. Rocha, Windson Viana:
A Dataset of Atoms of Confusion in the Android Open Source Project. 520-524
- Andrea D'Angelo, Claudio Di Sipio, Cristiano Politowski, Riccardo Rubei:
PlayMyData: a curated dataset of multi-platform video games. 525-529
- Margaret-Anne D. Storey:
Questioning the Questions We Ask About the Impact of AI on Software Engineering : MSR 2024 Keynote. 530
- Harshitha Menon, Daniel Nichols, Abhinav Bhatele, Todd Gamblin:
Learning to Predict and Improve Build Successes in Package Ecosystems. 531-542
- Ajiromola Kola-Olawuyi, Nimmi Rashinika Weeraddana, Meiyappan Nagappan:
The Impact of Code Ownership of DevOps Artefacts on the Outcome of DevOps CI Builds. 543-555
- Zhili Zeng, Tao Xiao, Maxime Lamothe, Hideaki Hata, Shane McIntosh:
A Mutation-Guided Assessment of Acceleration Approaches for Continuous Integration: An Empirical Study of YourBase. 556-568
- Nyyti Saarimäki, Sira Vegas, Valentina Lenarduzzi, Davide Taibi, Mikel Robredo:
Cohort Studies for Mining Software Repositories. 569-570
- Rosalia Tufano, Antonio Mastropaolo, Federica Pepe, Ozren Dabic, Massimiliano Di Penta, Gabriele Bavota:
Unveiling ChatGPT's Usage in Open Source Projects: A Mining-based Study. 571-583
- Emna Ksontini, Aycha Abid, Rania Khalsi, Marouane Kessentini:
DRMiner: A Tool For Identifying And Analyzing Refactorings In Dockerfile. 584-594
- Jiaqi Wu, Lingfeng Bao, Xiaohu Yang, Xin Xia, Xing Hu:
A Large-Scale Empirical Study of Open Source License Usage: Practices and Challenges. 595-606
- Joel Castaño, Silverio Martínez-Fernández, Xavier Franch, Justus Bogner:
Analyzing the Evolution and Maintenance of ML Models on Hugging Face. 607-618
- Florian Sihler, Lukas Pietzschmann, Raphael Straub, Matthias Tichy, Andor Diera, Abdelhalim Hafedh Dahou:
On the Anatomy of Real-World R Code for Static Analysis. 619-630
- Huy Nguyen, Christoph Treude, Patanamon Thongtanunam:
Encoding Version History Context for Better Code Representation. 631-636
- Martin Weyssow, Claudio Di Sipio, Davide Di Ruscio, Houari A. Sahraoui:
CodeLL: A Lifelong Learning Dataset to Support the Co-Evolution of Data and Language Models of Code. 637-641
- Daniel Garijo, Miguel Arroyo, Esteban González, Christoph Treude, Nicola Tarocco:
Bidirectional Paper-Repository Tracing in Software Engineering. 642-646
- Mojtaba Mostafavi Ghahfarokhi, Arash Asgari, Mohammad Abolnejadian, Abbas Heydarnoori:
DistilKaggle: A Distilled Dataset of Kaggle Jupyter Notebooks. 647-651
- Sophia Vargas, Georg J. P. Link, JaYoung Lee:
Estimating Usage Of Open Source Projects. 652-653
- Georges Aaron Randrianaina, Djamel Eddine Khelladi, Olivier Zendra, Mathieu Acher:
Options Matter: Documenting and Fixing Non-Reproducible Builds in Highly-Configurable Systems. 654-664
- João Helis Bernardo, Daniel Alencar da Costa, Sérgio Queiroz de Medeiros, Uirá Kulesza:
How do Machine Learning Projects use Continuous Integration Practices? An Empirical Study on GitHub Actions. 665-676
- Guillaume Cardoen, Tom Mens, Alexandre Decan:
A dataset of GitHub Actions workflow histories. 677-681
- Pooya Rostami Mazrae, Alexandre Decan, Tom Mens:
gawd: A Differencing Tool for GitHub Actions Workflows. 682-686
- Natarajan Chidambaram, Tom Mens, Alexandre Decan:
RABBIT: A tool for identifying bot accounts based on their recent GitHub event history. 687-691
- Hassan Onsori Delicheh, Alexandre Decan, Tom Mens:
Quantifying Security Issues in Reusable JavaScript Actions in GitHub Workflows. 692-703
- Nicolás E. Díaz Ferreyra, Mojtaba Shahin, Mansooreh Zahedi, Sodiq Quadri, Riccardo Scandariato:
What Can Self-Admitted Technical Debt Tell Us About Security? A Mixed-Methods Study. 704-715
- Triet Huynh Minh Le, Xiaoning Du, Muhammad Ali Babar:
Are Latent Vulnerabilities Hidden Gems for Software Vulnerability Prediction? An Empirical Study. 716-727
- Nusrat Zahan, Philipp Burckhardt, Mikola Lysenko, Feross Aboukhadijeh, Laurie A. Williams:
MalwareBench: Malware samples are not enough. 728-732
- Simone Scalco, Ranindya Paramitha:
Hash4Patch: A Lightweight Low False Positive Tool for Finding Vulnerability Patch Commits. 733-737
- Chao Ni, Liyu Shen, Xiaohu Yang, Yan Zhu, Shaohua Wang:
MegaVul: A C/C++ Vulnerability Dataset with Comprehensive Code Representations. 738-742
