UTCS Artificial Intelligence
courses
talks/events
demos
people
projects
publications
software/data
labs
areas
admin
Associating Natural Language Comment and Source Code Entities (2020)
Sheena Panthaplackel
, Milos Gligoric,
Raymond J. Mooney
and Junyi Jessy Li
Comments are an integral part of software development; they are natural language descriptions associated with source code elements. Understanding explicit associations can be useful in improving code comprehensibility and maintaining the consistency between code and comments. As an initial step towards this larger goal, we address the task of associating entities in Javadoc comments with elements in Java source code. We propose an approach for automatically extracting supervised data using revision histories of open source projects and present a manually annotated evaluation dataset for this task. We develop a binary classifier and a sequence labeling model by crafting a rich feature set which encompasses various aspects of code, comments, and the relationships between them. Experiments show that our systems outperform several baselines learning from the proposed supervision.
View:
PDF
Citation:
In
The AAAI Conference on Artificial Intelligence (AAAI)
, February 2020.
Bibtex:
@inproceedings{panthaplackel:aaai20, title={Associating Natural Language Comment and Source Code Entities}, author={Sheena Panthaplackel and Milos Gligoric and Raymond J. Mooney and Junyi Jessy Li}, booktitle={The AAAI Conference on Artificial Intelligence (AAAI)}, month={February}, url="http://www.cs.utexas.edu/users/ai-labpub-view.php?PubID=127775", year={2020} }
Presentation:
Slides (PDF)
Poster
People
Raymond J. Mooney
Faculty
mooney [at] cs utexas edu
Sheena Panthaplackel
Ph.D. Alumni
spantha [at] cs utexas edu
Areas of Interest
Natural Language for Software Engineering
Labs
Machine Learning