SampleFix: Learning to Correct Programs by Sampling Diverse Fixes

Hajipour, Hossein; Bhattacharyya, Apratim; Fritz, Mario

Hajipour, H., Bhattacharyya, A., & Fritz, M. (2019). SampleFix: Learning to Correct Programs by Sampling Diverse Fixes. Retrieved from http://arxiv.org/abs/1906.10502.

Item is Released

Basic

Item Permalink: https://hdl.handle.net/21.11116/0000-0005-7491-4
Version Permalink: https://hdl.handle.net/21.11116/0000-0005-7492-3

Genre: Paper

Files

arXiv:1906.10502.pdf (Preprint), 6MB

File Permalink:
-

Name:
arXiv:1906.10502.pdf

Description:
File downloaded from arXiv at 2020-01-10 08:52

OA-Status:

Visibility:
Private

MIME-Type / Checksum:
application/pdf

Technical Metadata:

Copyright Date:
-

Copyright Info:
-

License:
http://creativecommons.org/licenses/by/4.0/

Creators

Creators:
Hajipour, Hossein¹, Author
Bhattacharyya, Apratim², Author
Fritz, Mario², Author

Affiliations:
¹ Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society, ou_1116547
² External Organizations, ou_persistent22

Content

Free keywords: Computer Science, Software Engineering, cs.SE; Computer Science, Learning, cs.LG; Computer Science, Programming Languages, cs.PL; Statistics, Machine Learning, stat.ML

Abstract: Automatic program correction is an active topic of research, which holds the
potential of dramatically improving productivity of programmers during the
software development process and correctness of software in general. Recent
advances in machine learning, deep learning and NLP have rekindled the hope to
eventually fully automate the process of repairing programs. A key challenge is
ambiguity, as multiple codes -- or fixes -- can implement the same
functionality. In addition, datasets by nature fail to capture the variance
introduced by such ambiguities. Therefore, we propose a deep generative model
to automatically correct programming errors by learning a distribution of
potential fixes. Our model is formulated as a deep conditional variational
autoencoder that samples diverse fixes for the given erroneous programs. In
order to account for ambiguity and inherent lack of representative datasets, we
propose a novel regularizer to encourage the model to generate diverse fixes.
Our evaluations on common programming errors show for the first time the
generation of diverse fixes and strong improvements over the state-of-the-art
approaches by fixing up to 65% of the mistakes.

Details

Language(s): eng - English

Dates: Created: 2019-06-24; Modified: 2019-09-09; Published Online: 2019

Publication Status: Published online

Pages: 13 p.

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: arXiv: 1906.10502
URI: http://arxiv.org/abs/1906.10502
BibTex Citekey: Hajipour_arXiv1906.10502

Degree: -
