Better Modeling the Programming World with Code Concept Graphs-augmented Multi-modal Learning

Weyssow, Martin; Sahraoui, Houari; Liu, Bang

doi:10.1145/3510455.3512771

Computer Science > Software Engineering

arXiv:2201.03346 (cs)

[Submitted on 10 Jan 2022 (v1), last revised 21 Feb 2022 (this version, v2)]

Title:Better Modeling the Programming World with Code Concept Graphs-augmented Multi-modal Learning

Authors:Martin Weyssow, Houari Sahraoui, Bang Liu

View PDF

Abstract:The progress made in code modeling has been tremendous in recent years thanks to the design of natural language processing learning approaches based on state-of-the-art model architectures. Nevertheless, we believe that the current state-of-the-art does not focus enough on the full potential that data may bring to a learning process in software engineering. Our vision articulates on the idea of leveraging multi-modal learning approaches to modeling the programming world. In this paper, we investigate one of the underlying idea of our vision whose objective based on concept graphs of identifiers aims at leveraging high-level relationships between domain concepts manipulated through particular language constructs. In particular, we propose to enhance an existing pretrained language model of code by joint-learning it with a graph neural network based on our concept graphs. We conducted a preliminary evaluation that shows gain of effectiveness of the models for code search using a simple joint-learning method and prompts us to further investigate our research vision.

Comments:	4+1 pages
Subjects:	Software Engineering (cs.SE); Information Retrieval (cs.IR); Programming Languages (cs.PL)
Cite as:	arXiv:2201.03346 [cs.SE]
	(or arXiv:2201.03346v2 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2201.03346
Journal reference:	44th International Conference on Software Engineering, 2022, New Ideas and Emerging Results (ICSE-NIER)
Related DOI:	https://doi.org/10.1145/3510455.3512771

Submission history

From: Martin Weyssow [view email]
[v1] Mon, 10 Jan 2022 13:57:26 UTC (1,103 KB)
[v2] Mon, 21 Feb 2022 02:59:39 UTC (1,075 KB)

Computer Science > Software Engineering

Title:Better Modeling the Programming World with Code Concept Graphs-augmented Multi-modal Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:Better Modeling the Programming World with Code Concept Graphs-augmented Multi-modal Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators