Credit: Pixabay/CC0 Public Domain
IMDEA Software researchers Facundo Molina, Juan Manuel Copia, and Alessandra Gorla present FIXCHECK, a new approach to improve patch remediation analysis that combines static analysis, randomized testing, and large-scale language models.
Their innovations are summarized in the paper “Improved Patch Accuracy Analysis with Random Testing and Large-Scale Language Models” and presented at the International Conference on Software Testing, Verification and Validation (ICST 2024). More details are available on the Zenodo server.
Producing patches to fix software defects is an important task in maintaining a software system. Software defects are typically reported through test cases, which reveal undesirable behavior of the software.
In response to these defects, developers must write patches that must be validated before being committed to the code base, ensuring that the tests provided do not expose the defects. However, the patch may not effectively address the underlying bug or may introduce new bugs, resulting in what is known as a bad fix or bad patch.
If these erroneous patches are detected, it can have a significant impact on the time and effort spent by developers on bug fixing, as well as the overall maintenance of the software system.
Automatic Program Repair (APR) provides software developers with tools that can automatically generate patches for buggy programs, but using these tools often results in the discovery of many erroneous patches that fail to fix the bugs.
To address this issue, researchers at IMDEA Software created FIXCHECK, a novel approach that combines static analysis, random testing, and large-scale language models (LLMs) to automatically generate tests that detect bugs in potentially erroneous patches and improve the output of patch correctness analysis.
FIXCHECK employs a two-step process: in the first step, we generate random tests to obtain a large number of test cases, and in the second step, we use a large language model to derive meaningful assertions for each test case.
In addition, FIXCHECK includes a selection and prioritization mechanism that runs new test cases against the patched program and discards or ranks these tests based on their likelihood of revealing bugs in the patch.
“The effectiveness of FIXCHECK in generating test cases that reveal bugs in erroneous patches was evaluated on 160 patches, including both developer-created patches and patches generated by RPA tools,” said Facundo Molina, postdoctoral researcher at the Institute IMDEA Software.
Results show that FIXCHECK can effectively generate bug-detecting tests with high confidence for 62% of developer-created erroneous patches. Furthermore, it complements existing patch remediation evaluation techniques by providing test cases that reveal bugs in up to 50% of erroneous patches identified by state-of-the-art techniques.
FIXCHECK represents a major advancement in the field of software repair and maintenance by providing a robust solution for automating test generation and detecting defects during software maintenance. This approach not only improves the effectiveness of patch verification but also fosters broader adoption of automated program repair methods.
Further information: Facundo Molina et al., “Improved patch accuracy analysis with random testing and large language models (Replication package),” Zenodo (2024). DOI: 10.5281/zenodo.10498173
Courtesy of IMDEA Software Institute
Citation: A New Approach to Improve Automatic Software Remediation by Generating Test Cases (July 23, 2024) Retrieved July 23, 2024 from https://techxplore.com/news/2024-07-approach-automatic-software-generating-cases.html
This document is subject to copyright. It may not be reproduced without written permission, except for fair dealing for the purposes of personal study or research. The content is provided for informational purposes only.