
 
SWR is a statistical technique that allows us to 
build a prediction model representing the 
relationship between independent and dependent 
variables. The estimation model is obtained by 
adding, at each stage, the independent variable with 
the highest association to the dependent variable, 
taking into account all variables currently in the 
model. SWR aims to find the set of independent 
variables that best explains the variation in the 
dependent variable. 
To select the variables to be added in the model a 
Manual SWR (MSWR) can be applied, using the 
technique proposed by Kitchenham (1998). The idea 
underlying this procedure is to select the important 
independent variables, and then to use linear 
regression to obtain the final model.   
CBR is a branch of Artificial Intelligence where 
knowledge of similar past cases is used to solve new 
cases. The idea is to predict the effort of a new 
project by considering similar projects previously 
developed. We applied CBR by employing ANGEL 
(Shepperd and Schofield, 2000). ANGEL 
implements the Euclidean distance as similarity 
function and uses features normalised between 0 and 
1. We used 1, 2, and 3 analogies employing as 
adaptation strategies the mean of k analogies (simple 
average), the inverse distance weighted mean and 
the  inverse rank weighted mean (Shepperd and 
Schofield, 2000). We used the feature subset 
selection of ANGEL in order to let the tool to 
automatically choose, among all the variables, the 
ones to employ as set of key features in the analogy 
based estimation. 
To have a better visual insight on the 
effectiveness of the estimation models, we compared 
the prediction accuracies taking into account both 
the summary statistics and the boxplots of absolute 
residuals, where residuals are calculated as (EFreal – 
EFpred). Boxplots are widely employed in 
exploratory data analysis since they provide a quick 
visual representation to summarize the data, using 
five values: median, upper and lower quartiles, 
minimum and maximum values, and outliers 
(Kitchenham  et al., 2001). In development effort 
estimation, boxplots are used to visually represent 
the amount of the error for a given prediction 
technique. In particular, we used boxplot to 
graphically render the spread of the absolute 
residuals. 
In order to verify whether the estimates obtained 
with TS are characterized by significantly better 
accuracy than the considered benchmarks we 
statistically analyzed the absolute residuals, as 
suggested in (Kitchenham et al., 2001). Since (i) the 
absolute residuals for all the analyzed estimation 
methods were not normally distributed, and (ii) the 
data was naturally paired, we decided to use the 
Wilcoxon test (Royston, 1982). The achieved results 
were intended as statistically significant at = 0.05.  
We performed the empirical analysis by 
exploiting two datasets: the Desharnais (Desharnais, 
1989) dataset, containing 81 observations, and the 
NASA (Bailey and Basili, 1981) dataset, with 18 
observations. Despite of these datasets are quite old, 
they have been widely and recently used to evaluate 
and compare estimation methods (see e.g., (Burgess 
and Lefley, 2001) (Shepperd and Schofield, 2000)). 
As for Desharnais dataset, in our analysis we did not 
consider the length of the code as made in (Burgess 
and Lefley, 2001), and categorical variables (i.e., 
Language and YearEnd). We excluded four projects 
that had missing values, as done by Shepperd and 
Schofield (2000). The NASA dataset consists of two 
independent variables, i.e. Developed Lines (DL) of 
code and Methodology (Me). The descriptive 
statistics of the selected factors for the two datasets 
are shown in Tables 1 and 2. 
Table 1: Descriptive statistics of Desharnais dataset. 
Variable Min  Max  Mean Std.Dev. 
TeamExp 0.00  4.00  2.30  1.33 
ManagerExp 0.00  7.00  2.65  1.52 
Entities 7.00 387.00 120.55 86.11 
Transactions 9.00  886.00  177.47 146.08 
AdjustedFPs 73.00  1127.00  298.01  182.26 
RawFPs 62.00 1116.00 282.39 186.36 
Envergue 5.00  52.00  27.45 10.53 
EFH (m/h)  546.00  23490.00  4903.95  4188.19 
Table 2: Descriptive statistics of NASA dataset. 
Variable Min Max Mean Std. Dev. 
Me 19.00 35.00 27.78 5.39 
DL 2.10 100.80 35.26 35.10 
EFH (m/m)  5.00    138.30  49.47    45.73 
 
We exploited some parameter settings to find 
suitable values for moves and iterations numbers. 
Concerning the number of moves, we executed TS 
using four different values, i.e. 100, 500, 1000, and 
2000. The best results in terms of MMRE and 
Pred(25) were achieved with 1000 moves for 
Desharnais dataset and 100 moves for NASA 
dataset. We also executed the algorithm with 
different numbers of iterations, and the best results 
were achieved using 3000 and 500 iterations on 
Desharnais and NASA, respectively. We did not 
consider number of moves greater than 2000 for 
Desharnais dataset and 100 for NASA dataset since 
we noted a decreasing in the performance. 
Moreover, note that for NASA dataset, having a 
ICEIS 2010 - 12th International Conference on Enterprise Information Systems
238