Academic Journal

Incorporating statistical and machine learning techniques into the optimization of correction factors for software development effort estimation.

Bibliographic Details
Title: Incorporating statistical and machine learning techniques into the optimization of correction factors for software development effort estimation.
Authors: Nhung, Ho Le Thi Kim1 (AUTHOR), Van Hai, Vo2 (AUTHOR), Silhavy, Petr3 (AUTHOR), Prokopova, Zdenka3 (AUTHOR), Silhavy, Radek3 (AUTHOR) rsilhavy@utb.cz
Superior Title: Journal of Software: Evolution & Process. May2024, Vol. 36 Issue 5, p1-37. 37p.
Subject Terms: *STATISTICAL learning, *CORRECTION factors, *COMPUTER software development, *MATHEMATICAL optimization, *STANDARD deviations
Abstract: Accurate effort estimation is necessary for efficient management of software development projects, as it relates to human resource management. Ensemble methods, which employ multiple statistical and machine learning techniques, are more robust, reliable, and accurate effort estimation techniques. This study develops a stacking ensemble model based on optimization correction factors by integrating seven statistical and machine learning techniques (K‐nearest neighbor, random forest, support vector regression, multilayer perception, gradient boosting, linear regression, and decision tree). The grid search optimization method is used to obtain valid search ranges and optimal configuration values, allowing more accurate estimation. We conducted experiments to compare the proposed method with related methods, such as use case points‐based single methods, optimization correction factors‐based single methods, and ensemble methods. The estimation accuracies of the methods were evaluated using statistical tests and unbiased performance measures on a total of four datasets, thus demonstrating the effectiveness of the proposed method more clearly. The proposed method successfully maintained its estimation accuracy across the four experimental datasets and gave the best results in terms of the sum of squares errors, mean absolute error, root mean square error, mean balance relative error, mean inverted balance relative error, median of magnitude of relative error, and percentage of prediction (0.25). The p‐value for the t‐test showed that the proposed method is statistically superior to other methods in terms of estimation accuracy. The results show that the proposed method is a comprehensive approach for improving estimation accuracy and minimizing project risks in the early stages of software development. [ABSTRACT FROM AUTHOR]
Copyright of Journal of Software: Evolution & Process is the property of Wiley-Blackwell and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Academic Search Premier
Description
Description not available.