NONLINEAR REGRESSION MODELS FOR ESTIMATING THE DURATION OF SOFTWARE DEVELOPMENT IN JAVA FOR PC BASED ON THE 2021 ISBSG DATA
DOI:
https://doi.org/10.15588/1607-3274-2022-3-14Keywords:
duration, software development, Java, personal computer, nonlinear regression model, normalizing transformation, non-Gaussian data, ISBSG.Abstract
Context. The problem of estimating the duration of software development in Java for personal computers (PC) is important because, first, failed duration estimating is often the main contributor to failed software projects, second, Java is a popular language, and, third, a personal computer is a widespread multi-purpose computer. The object of the study is the process of estimating the duration of software development in Java for PC. The subject of the study is the nonlinear regression models to estimate the duration of software development in Java for PC.
Objective. The goal of the work is to build nonlinear regression models for estimating the duration of software development in Java for PC based on the normalizing transformations and deleting outliers in data to increase the confidence of the estimation in comparison to the ISBSG model for the PC platform.
Method. The models, confidence, and prediction intervals of nonlinear regressions to estimate the duration of software development in Java for PC are constructed based on the normalizing transformations for non-Gaussian data with the help of appropriate techniques. The techniques to build the models, confidence, and prediction intervals of nonlinear regressions are based on normalizing transformations. Also, we apply outlier removal for model construction. In general, the above leads to a reduction of the mean magnitude of relative error, the widths of the confidence, and prediction intervals in comparison to nonlinear models constructed without outlier removal application in the model construction process.
Results. A comparison of the model based on the decimal logarithm transformation with the nonlinear regression models based on the Johnson (for the SB family) and Box-Cox transformations as both univariate and bivariate ones has been performed.
Conclusions. The nonlinear regression model to estimate the duration of software development in Java for PC is constructed based on the decimal logarithm transformation. This model, in comparison with other nonlinear regression models, has smaller widths of the confidence and prediction intervals for effort values that are bigger than 900 person-hours. The prospects for further research may include the application of bivariate normalizing transformations and data sets to construct the nonlinear regression models for estimating the duration of software development in other languages for PC and other platforms, for example, mainframe.
References
Boehm B. W. Software engineering economics. Englewood Cliffs, NJ, Prentice Hall, 1981, 768 p.
Boehm B. W., Abts C., Brown A. W. et al. Software cost estimation with COCOMO II. Upper Saddle River, NJ: Prentice Hall PTR, 2000, 506 p.
Owais M., Ramakishore R. Effort, duration and cost estimation in agile software development, 2016 Ninth International Conference on Contemporary Computing (IC3), 2016, pp. 1–5, DOI: 10.1109/IC3.2016.7880216.
Abran A. Data collection and industry standards: the ISBSG repository, Software Project Estimation: The Fundamentals for Providing High Quality Information to Decision Makers, IEEE, 2015, pp. 161–184, DOI: 10.1002/9781118959312.ch8.
Putnam L. H. A general empirical solution to the macrosoftware sizing and estimating problem, IEEE Transactions on Software Engineering, 1978, Vol. 4, No. 2, July, pp. 345– 361.
Oligny S., Bourque P. , Abran A. , Fournier B. Exploring the relation between effort and duration in software engineering projects, Proceedings of the World Computer Congress, Aug. 2000, P. 175–178.
Prykhodko S. B., Pukhalevich A. V. Developing PC Software Project Duration Model based on Johnson transformation, Proceedings of the 12th International Conference Modern Problems of Radio Engineering, Telecommunications and Computer Science TCSET’2014, Lviv-Slavske, Ukraine. Lviv, Polytechnic National University, 2014, pp. 114–116.
Prykhodko S.B., Pukhalevich A. V. Confidence interval estimation of PC software project duration regression based on Johnson transformation, Radioelectronic and Computer Systems. Kharkiv, 2014, No. 2 (66), pp. 104–107. ISSN: 1814-4225
López-Martín C., Abran A. Neural networks for predicting the duration of new software projects, Journal of Systems and Software, 2015, Vol. 101, pp. 127–135. DOI: 10.1016/J.JSS.2014.12.002
Pospieszny P., Czarnacka-Chrobot B., Kobylinski A. An effective approach for software project effort and duration estimation with machine learning algorithms, Journal of Systems and Software, 2018, pp. 184–196. DOI: 10.1016/J.JSS.2017.11.066
Prykhodko S., Prykhodko N. Mathematical modeling of non-Gaussian dependent random variables by nonlinear regression models based on the multivariate normalizing transformations, Mathematical Modeling and Simulation of Systems : 15th International Scientific-practical Conference MODS’2020. Chernihiv, Ukraine, June 29 – July 01, 2020, selected papers. Springer, Cham, 2021, P. 166–174. (Advances in Intelligent Systems and Computing, Vol. 1265). DOI: 10.1007/978-3-030-58124-4_16
Bates D. M., Watts D. G. Nonlinear regression analysis and its applications. New York, John Wiley & Sons, 1988, 384 p. DOI:10.1002/9780470316757
Seber G.A.F., C. J. Wild Nonlinear regression. New York, John Wiley & Sons, 1989, 768 p. DOI: 10.1002/0471725315
Ryan T. P. Modern regression methods. New York, John Wiley & Sons, 1997, 529 p. DOI: 10.1002/9780470382806
Drapper N. R., Smith H. Applied regression analysis. New York, John Wiley & Sons, 1998, 736 p.
Johnson R. A., Wichern D. W. Applied multivariate statistical analysis, Pearson Prentice Hall, 2007, 800 p.
Chatterjee S., Simonoff J. S. Handbook of regression analysis. New York, John Wiley & Sons, 2013, 236 p. DOI: 10.1002/9781118532843
Prykhodko S., Prykhodko N., Makarova L., et al. Detecting Outliers in Multivariate Non-Gaussian Data on the basis of Normalizing Transformations, Electrical and Computer Engineering : the 2017 IEEE First Ukraine Conference (UKRCON) «Celebrating 25 Years of IEEE Ukraine Section», Kyiv, Ukraine, May 29 – June 2, 2017 : proceedings. Kyiv, IEEE, 2017, pp. 846–849. DOI: 10.1109/UKRCON.2017.8100366
Prykhodko S., Prykhodko N., Makarova L. et al. Application of the Squared Mahalanobis Distance for Detecting Outliers in Multivariate Non-Gaussian Data, Radioelectronics, Telecommunications and Computer Engineering : 14th International Conference on Advanced Trends (TCSET). LvivSlavske, Ukraine, February 20–24, 2018 : proceedings, pp. 962–965. DOI: 10.1109/TCSET.2018.8336353
Prykhodko S., Prykhodko N., Knyrik K. Estimating the efforts of mobile application development in the planning phase using nonlinear regression analysis, Applied Computer Systems, 2020, Vol. 25, No. 2, pp. 172–179. DOI: 10.2478/acss-2020-0019
Mardia K.V. Measures of multivariate skewness and kurtosis with applications, Biometrika, 1970, Vol. 57, pp. 519– 530. DOI: 10.1093/biomet/57.3.519
Mardia K.V. Applications of some measures of multivariate skewness and kurtosis in testing normality and robustness studies, Sankhya: The Indian Journal of Statistics, Series B (1960–2002), 1974, Vol. 36, Issue 2, pp. 115–128.
Campbell S. L., Chancelier J.-P., Nikoukhah R.Modeling and simulation in Scilab/Scicos. Springer, 2005, 313 p.
Foss T., Stensrud E., Kitchenham B., Myrtveit I. A simulation study of the model evaluation criterion MMRE, IEEE Transactions on software engineering, 2003, Vol. 29, Issue 11, pp. 985–995. DOI: 10.1109/TSE.2003.1245300
Port D., Korte M. Comparative studies of the model evaluation criterions MMRE and PRED in software cost estimation research, Empirical Software Engineering and Measurement, the 2nd ACM-IEEE International Symposium ESEM, Kaiserslautern, Germany, October, 2008 : proceedings. New York, ACM, 2008, pp. 51–60
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2022 S. B. Prykhodko, A. V. Pukhalevych, K. S. Prykhodko, L. M. Makarova
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Creative Commons Licensing Notifications in the Copyright Notices
The journal allows the authors to hold the copyright without restrictions and to retain publishing rights without restrictions.
The journal allows readers to read, download, copy, distribute, print, search, or link to the full texts of its articles.
The journal allows to reuse and remixing of its content, in accordance with a Creative Commons license СС BY -SA.
Authors who publish with this journal agree to the following terms:
-
Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License CC BY-SA that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
-
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
-
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.