9 Effort Estimation Data in Software Engineering
It is worth highlighting the case of software effort estimation datasets, which have several peculiarities. First, most effort estimation datasets used in the literature are scattered across research papers, with the exception of a few kept in the PROMISE repository. Mair et al (2005) analysed the datasets available in the field of cost estimation, identifying 65 different datasets in 50 papers.
Second, their size is very small, with the exception of the ISBSG repository discussed previously (of which a small sample is available through PROMISE) and the China dataset with 499 instances.
Third, some can be quite old, collected in contexts and at times that are not applicable to current development environments. Mair et al (2005) also noted that the oldest datasets (COCOMO, Desharnais, Kemerer, and Albrecht and Gaffney) tend to be the most studied ones and among the most relevant. From the artificial intelligence or data mining point of view, effort estimation has mainly been tackled with different types of regression techniques and, more recently, with techniques typically considered under the umbrella of data mining. As the number of examples per dataset increases, other machine learning techniques are also being studied (e.g., Dejaeger et al (2012) report a comparison of several machine learning techniques for effort estimation, with only 5 of the 9 datasets used being publicly available). In general, however, the small number of instances hinders the application of machine learning techniques.
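Because most of these datasets contain only a few dozen projects, studies typically evaluate estimation models with leave-one-out cross-validation rather than a held-out test set. The sketch below illustrates this with a 1-nearest-neighbour predictor in the spirit of estimation by analogy, scored with the mean magnitude of relative error (MMRE); the (size, effort) pairs are invented for illustration and any of the regression or machine learning techniques above would slot in the same way:

```python
# Leave-one-out cross-validation of a 1-nearest-neighbour ("analogy")
# predictor on a toy dataset; the (size, effort) pairs are invented.
projects = [(50, 400), (120, 1100), (300, 3200),
            (500, 6000), (800, 11000), (200, 1900)]

def predict(size, pool):
    # Analogy: reuse the effort of the most similar (closest-size) project.
    nearest = min(pool, key=lambda p: abs(p[0] - size))
    return nearest[1]

mres = []
for i, (size, actual) in enumerate(projects):
    pool = projects[:i] + projects[i + 1:]        # leave one project out
    estimate = predict(size, pool)
    mres.append(abs(actual - estimate) / actual)  # magnitude of relative error

mmre = sum(mres) / len(mres)                      # mean MRE over all folds
print(f"MMRE = {mmre:.2f}")
```

With only a handful of projects, every instance matters, which is why leave-one-out (rather than, say, 10-fold) validation dominates in this literature.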
However, software effort and cost estimation remains one of the main challenges in software engineering and has attracted a great deal of interest from researchers (2007). For example, there are continuing analyses of whether software development follows economies or diseconomies of scale (see Kitchenham (2002)).
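The economies-versus-diseconomies question is usually framed with the power-law model effort = a * size^b, where an exponent b > 1 indicates diseconomies of scale (effort grows faster than size). A minimal sketch of estimating b by ordinary least squares in log-log space, with project figures invented purely for illustration:

```python
import math

# Hypothetical (size in function points, effort in person-hours) pairs,
# invented for illustration; not taken from any real dataset.
projects = [(50, 400), (120, 1100), (300, 3200), (500, 6000), (800, 11000)]

# Fit log(effort) = log(a) + b * log(size) by ordinary least squares.
xs = [math.log(s) for s, _ in projects]
ys = [math.log(e) for _, e in projects]
n = len(xs)
mx, my = sum(xs) / n, sum(ys) / n
b = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
     / sum((x - mx) ** 2 for x in xs))
a = math.exp(my - b * mx)

# b > 1 suggests diseconomies of scale for this toy data.
print(f"effort ~ {a:.2f} * size^{b:.2f}")
```

This is the same functional form used by COCOMO-style models; the debate in the literature concerns whether fitted exponents on real datasets fall above or below 1.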
Table @ref(tab:effEstimation), following Mair et al (2005), lists the main openly available cost/effort datasets in the literature together with their primary reference.
| Reference | Instances | Attributes |
|---|---|---|
| Abran and Robillard (1996) | 21 | 31 |
| Albrecht-Gaffney (1983) | 24 | 7 |
| Bailey and Basili (1981) | 18 | 9 |
| Belady and Lehman (1979) | 33 | |
| Boehm (aka COCOMO Dataset) (1981) | 63 | 43 |
| China dataset | 499 | 18 |
| Desharnais (1988) | 61 | 10 |
| Dolado (1997) | 24 | 7 |
| Hastings and Sajeev (2001) | 8 | 14 |
| Heiat and Heiat (1997) | 35 | 4 |
| Jeffery and Stathis (1996) | 17 | 7 |
| Jorgensen (2004) | 47 | 4 |
| Jorgensen et al. (2003) | 20 | 4 |
| Kemerer (1987) | 15 | 5 |
| Kitchenham (Mermaid 2) (2002) | 30 | 5 |
| Kitchenham et al. (CSC) (2002) | 145 | 9 |
| Kitchenham and Taylor (ICL) (1985) | 10 | 6 |
| Kitchenham and Taylor (BT System X) (1985) | 10 | 3 |
| Kitchenham and Taylor (BT Software Houses) (1985) | 12 | 6 |
| Li et al. (USP05) (2007) | 202 | 16 |
| Mišić and Tešić (1998) | 6 | 16 |
| Maxwell (Dev Effort) (2002) | 63 | 32 |
| Maxwell (Maintenance Eff) (2002) | 67 | 28 |
| Miyazaki et al. (1994) | 47 | 9 |
| Moser et al. (1999) | 37 | 4 |
| Shepperd and Cartwright (2001) | 39 | 3 |
| Shepperd and Schofield (Telecom 1) (1997) | 18 | 5 |
| Shepperd and Schofield (Real-time 1) (1997) | 21 | 4 |
| Schofield (Mermaid) (1998) | 30 | 18 |
| Schofield (Finnish) (1998) | 39 | 30 |
| Schofield (Hughes) (1998) | 33 | 14 |
| Woodfield et al. (1981) | 63 | 8 |
9.1 What Is Missing in Many Effort Datasets
Beyond the number of projects, modern effort-estimation studies should report:
- Context variables: development process, team size, domain, and technology stack.
- Temporal information: project start/end period and release cadence.
- Effort measurement protocol: person-hours vs person-days, inclusion/exclusion rules.
- Feature definitions: exact meaning and units of each predictor.
- Currency normalization (for cost): inflation year and conversion policy.
- Handling of outliers: whether extreme projects were removed or transformed.
- External validity notes: why results may or may not transfer to current agile/DevOps contexts.
Without this metadata, models are difficult to compare and often hard to reuse in practice.
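One lightweight way to make that metadata explicit is to ship a machine-readable record alongside each dataset. The sketch below is our own suggestion of such a schema, mirroring the checklist above; the field names and the example record are hypothetical, not an existing standard:

```python
from dataclasses import dataclass
from typing import Dict, Optional

@dataclass
class EffortDatasetMetadata:
    # Field names mirror the checklist above; they are a suggestion only.
    name: str
    instances: int
    effort_unit: str                     # person-hours vs person-days, inclusion rules
    collection_period: str               # temporal information (start/end period)
    context: str                         # process, team size, domain, technology
    feature_definitions: Dict[str, str]  # exact meaning and units of each predictor
    outlier_policy: str                  # removed, transformed, or kept as-is
    currency_year: Optional[int] = None  # inflation year, only if cost is recorded

# Hypothetical record, for illustration only.
meta = EffortDatasetMetadata(
    name="ToyRepo",
    instances=42,
    effort_unit="person-hours (development only, no management overhead)",
    collection_period="2019-2023",
    context="agile, in-house web development, teams of 3-8",
    feature_definitions={"size": "function points (IFPUG 4.x)"},
    outlier_policy="kept; effort log-transformed",
)
print(meta.name, meta.instances)
```

A record like this travels with the data, so later studies can judge comparability and external validity without digging through the original paper.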