SlideShare una empresa de Scribd logo
1 de 58
Descargar para leer sin conexión
Building
 Cost Estimation Models
Using Homogeneous Data


             Rahul Premraj
             Saarland University, Germany


             Thomas Zimmermann
             University of Calgary, Canada
software
engineering
  data
Review
                                                                                                                                                      systematic
May, 2007
 316                                                                     IEEE TRANSACTIONS ON SOFTWARE ENGINEERING,             VOL. 33, NO. 5,   MAY 2007




                Cross versus Within-Company Cost
              Estimation Studies: A Systematic Review
                    Barbara A. Kitchenham, Member, IEEE Computer Society, Emilia Mendes, and
                                             Guilherme H. Travassos

            Abstract—The objective of this paper is to determine under what circumstances individual organizations would be able to rely on
            cross-company-based estimation models. We performed a systematic review of studies that compared predictions from cross-
            company models with predictions from within-company models based on analysis of project data. Ten papers compared cross-
            company and within-company estimation models; however, only seven presented independent results. Of those seven, three found
            that cross-company models were not significantly different from within-company models, and four found that cross-company models
            were significantly worse than within-company models. Experimental procedures used by the studies differed making it impossible to
            undertake formal meta-analysis of the results. The main trend distinguishing study results was that studies with small within-company
            data sets (i.e., < 20 projects) that used leave-one-out cross validation all found that the within-company model was significantly
            different (better) from the cross-company model. The results of this review are inconclusive. It is clear that some organizations would
            be ill-served by cross-company models whereas others would benefit. Further studies are needed, but they must be independent (i.e.,
            based on different data bases or at least different single company data sets) and should address specific hypotheses concerning the
            conditions that would favor cross-company or within-company models. In addition, experimenters need to standardize their
            experimental procedures to enable formal meta-analysis, and recommendations are made in Section 3.

            Index Terms—Cost estimation, management, systematic review, software engineering.

                                                                                   Ç

 1      INTRODUCTION

 E         studies of cost estimation models (e.g., [12],                                 These problems motivated the use of cross-company
        ARLY
     [8]) suggested that general-purpose models such as                                models (models built using cross-company data sets, which
 COCOMO [1] and SLIM [24] needed to be calibrated to                                   are data sets containing data from several companies) for
 specific companies before they could be used effectively.                             effort estimation and productivity benchmarking, and,
 Taking this result further and following the proposals made                           subsequently, several studies compared the prediction
 by DeMarco [4], Kok et al. [14] suggested that cost                                   accuracy between cross-company and within-company
 estimation models should be developed only from single-                               models. In 1999, Maxwell et al. [18] analyzed a cross-
 company data. However, three main problems can occur                                  company benchmarking database by comparing the accu-
 when relying on within-company data sets [3], [2]:                                    racy of a within-company cost model with the accuracy of a
                                                                                       cross-company cost model. They claimed that the within-
       1.    The time required to accumulate enough data on
                                                                                       company model was more accurate than the cross-company
             past projects from a single company may be
                                                                                       model, based on the same holdout sample. In the same year,
             prohibitive.
                                                                                       Briand et al. [2] found that cross-company models could be
       2.    By the time the data set is large enough to be of use,
                                                                                       as accurate as within-company models. The following year,
             technologies used by the company may have
                                                                                       Briand et al. [3] reanalyzed the data set employed by
             changed, and older projects may no longer be
                                                                                       Maxwell et al. [18] and concluded that cross-company
             representative of current practices.
                                                                                       models were as good as within-company models. Two
       3.    Care is necessary as data needs to be collected in a
                                                                                       years later, Wieczorek and Ruhe [26] confirmed this same
             consistent manner.
                                                                                       trend using the same data set employed by [2]. Three years
                                                                                       later, Mendes et al. [20] also confirmed the same trend using
 . B.A. Kitchenham is with the School of Computing and Mathematics,                    yet another data set.
   University of Keele, Keele Village, Staffordshire, ST5 5BG, UK.                        These results seemed to contradict the results of the
   E-mail: b.a.kitchenham@cs.keele.ac.uk.
                                                                                       earlier studies and pave the way for improved estimation
 . E. Mendes is with the Computer Science Department, University of
                                                                                       methods for companies that did not have their own project
   Auckland, Private Bag 92019, Auckland, New Zealand.
   E-mail: emilia@cs.auckland.ac.nz.                                                   data. However, other researchers found less encouraging
 . G.H. Travassos is with UFRJ/COPPE, Systems Engineering and
                                                                                       results. Jeffery et al. undertook two studies, both of which
   Computer Science Program, PO Box 68511, 21941-972 Rio de Janeiro—
                                                                                       suggested that within-company models were superior to
   RJ, Brazil. E-mail: ght@cos.ufrj.br.
                                                                                       cross-company models [6], [7]. Two years later, Lefley and
 Manuscript received 6 June 2006; revised 27 Nov. 2006; accepted 2 Jan. 2007;
                                                                                       Shepperd claimed that the within-company model was
 published online 20 Feb. 2007.
 Recommended for acceptance by A. Mockus.                                              more accurate than the cross-company model, using the
 For information on obtaining reprints of this article, please send e-mail to:
                                                                                       same data set employed by Wieczorek and Ruhe [26] and
 tse@computer.org, and reference IEEECS Log Number TSE-0129-0606.
                                                                                       Briand et al. [2]. Finally, a year later Kitchenham and
 Digital Object Identifier no. 10.1109/TSE.2007.1001.
                                                 0098-5589/07/$25.00 ß 2007 IEEE       Published by the IEEE Computer Society
Review
                                                                                                                                                      systematic
May, 2007
                                                                                                                                                             Barbara Kitchenham
 316                                                                     IEEE TRANSACTIONS ON SOFTWARE ENGINEERING,             VOL. 33, NO. 5,   MAY 2007




                Cross versus Within-Company Cost
              Estimation Studies: A Systematic Review
                    Barbara A. Kitchenham, Member, IEEE Computer Society, Emilia Mendes, and
                                             Guilherme H. Travassos

            Abstract—The objective of this paper is to determine under what circumstances individual organizations would be able to rely on
            cross-company-based estimation models. We performed a systematic review of studies that compared predictions from cross-
            company models with predictions from within-company models based on analysis of project data. Ten papers compared cross-
            company and within-company estimation models; however, only seven presented independent results. Of those seven, three found
            that cross-company models were not significantly different from within-company models, and four found that cross-company models
            were significantly worse than within-company models. Experimental procedures used by the studies differed making it impossible to
            undertake formal meta-analysis of the results. The main trend distinguishing study results was that studies with small within-company
            data sets (i.e., < 20 projects) that used leave-one-out cross validation all found that the within-company model was significantly
            different (better) from the cross-company model. The results of this review are inconclusive. It is clear that some organizations would
            be ill-served by cross-company models whereas others would benefit. Further studies are needed, but they must be independent (i.e.,
            based on different data bases or at least different single company data sets) and should address specific hypotheses concerning the



                                                                                                                                                                                  Emilia Mendes
            conditions that would favor cross-company or within-company models. In addition, experimenters need to standardize their
            experimental procedures to enable formal meta-analysis, and recommendations are made in Section 3.

            Index Terms—Cost estimation, management, systematic review, software engineering.

                                                                                   Ç

 1      INTRODUCTION

 E         studies of cost estimation models (e.g., [12],                                 These problems motivated the use of cross-company
        ARLY
     [8]) suggested that general-purpose models such as                                models (models built using cross-company data sets, which
 COCOMO [1] and SLIM [24] needed to be calibrated to                                   are data sets containing data from several companies) for
 specific companies before they could be used effectively.                             effort estimation and productivity benchmarking, and,
 Taking this result further and following the proposals made                           subsequently, several studies compared the prediction
 by DeMarco [4], Kok et al. [14] suggested that cost                                   accuracy between cross-company and within-company
 estimation models should be developed only from single-                               models. In 1999, Maxwell et al. [18] analyzed a cross-
 company data. However, three main problems can occur                                  company benchmarking database by comparing the accu-
 when relying on within-company data sets [3], [2]:                                    racy of a within-company cost model with the accuracy of a
                                                                                       cross-company cost model. They claimed that the within-
       1.    The time required to accumulate enough data on
                                                                                       company model was more accurate than the cross-company
             past projects from a single company may be
                                                                                       model, based on the same holdout sample. In the same year,
             prohibitive.
                                                                                       Briand et al. [2] found that cross-company models could be
       2.    By the time the data set is large enough to be of use,
                                                                                       as accurate as within-company models. The following year,
             technologies used by the company may have


                                                                                                                                                                                                  Guilherme Travassos
                                                                                       Briand et al. [3] reanalyzed the data set employed by
             changed, and older projects may no longer be
                                                                                       Maxwell et al. [18] and concluded that cross-company
             representative of current practices.
                                                                                       models were as good as within-company models. Two
       3.    Care is necessary as data needs to be collected in a
                                                                                       years later, Wieczorek and Ruhe [26] confirmed this same
             consistent manner.
                                                                                       trend using the same data set employed by [2]. Three years
                                                                                       later, Mendes et al. [20] also confirmed the same trend using
 . B.A. Kitchenham is with the School of Computing and Mathematics,                    yet another data set.
   University of Keele, Keele Village, Staffordshire, ST5 5BG, UK.                        These results seemed to contradict the results of the
   E-mail: b.a.kitchenham@cs.keele.ac.uk.
                                                                                       earlier studies and pave the way for improved estimation
 . E. Mendes is with the Computer Science Department, University of
                                                                                       methods for companies that did not have their own project
   Auckland, Private Bag 92019, Auckland, New Zealand.
   E-mail: emilia@cs.auckland.ac.nz.                                                   data. However, other researchers found less encouraging
 . G.H. Travassos is with UFRJ/COPPE, Systems Engineering and
                                                                                       results. Jeffery et al. undertook two studies, both of which
   Computer Science Program, PO Box 68511, 21941-972 Rio de Janeiro—
                                                                                       suggested that within-company models were superior to
   RJ, Brazil. E-mail: ght@cos.ufrj.br.
                                                                                       cross-company models [6], [7]. Two years later, Lefley and
 Manuscript received 6 June 2006; revised 27 Nov. 2006; accepted 2 Jan. 2007;
                                                                                       Shepperd claimed that the within-company model was
 published online 20 Feb. 2007.
 Recommended for acceptance by A. Mockus.                                              more accurate than the cross-company model, using the
 For information on obtaining reprints of this article, please send e-mail to:
                                                                                       same data set employed by Wieczorek and Ruhe [26] and
 tse@computer.org, and reference IEEECS Log Number TSE-0129-0606.
                                                                                       Briand et al. [2]. Finally, a year later Kitchenham and
 Digital Object Identifier no. 10.1109/TSE.2007.1001.
                                                 0098-5589/07/$25.00 ß 2007 IEEE       Published by the IEEE Computer Society
Company            Cross        No Trend
Specific Models   Company Models
Company              Cross           No Trend
Specific Models     Company Models
   (four studies)      (four studies)   (two studies)
Company                 Cross                   No Trend
 Specific Models        Company Models
       (four studies)        (four studies)         (two studies)

Barbara Kitchenham       Emilia Mendes        Katrina Maxwell




                                              Martin Shepperd
Isabella Wieczorek       Lionel Briand
Company                     Cross                       No Trend
 Specific Models            Company Models
       (four studies)            (four studies)             (two studies)

Barbara Kitchenham           Emilia Mendes            Katrina Maxwell




                        2                         2




                                                      Martin Shepperd
Isabella Wieczorek           Lionel Briand




                        2
Company                     Cross                       No Trend
 Specific Models            Company Models
       (four studies)            (four studies)             (two studies)

Barbara Kitchenham           Emilia Mendes            Katrina Maxwell




                                                  1
                        2                         2                         1




                                                      Martin Shepperd
Isabella Wieczorek           Lionel Briand




                        3
                                                  2
                        2
Company                     Cross                       No Trend
 Specific Models            Company Models
       (four studies)            (four studies)             (two studies)

Barbara Kitchenham           Emilia Mendes            Katrina Maxwell




                                                  1                         1
                        2                         2                         1




                                                      Martin Shepperd
Isabella Wieczorek           Lionel Briand




                        3
                                                  2
                        2                                                   1
Erica
meet
she works here
she is a
 metrics
consultant
her job
Boss has a
  Erica’s
new project for her
what are my
options?
Company            Cross
Specific Models   Company Models
Company                              Cross
                     Business
Specific Models                     Company Models
                  Specific Models
why
Business
 Sector?
METRICS, 2005
                  An Empirical Analysis of Software Productivity Over Time

                             Rahul Premraj                                 Martin Shepperd
                      Bournemouth University, UK                       Brunel University, UK
                         rpremraj@bmth.ac.uk                        martin.shepperd@brunel.ac.uk
                          Barbara Kitchenham∗                                  Pekka Forselius
                         National ICT, Australia                              STTF Oy, Finland
                    Barbara.Kitchenham@nicta.com.au                      pekka.forselius@kolumbus.fi


                         Abstract                                  commodities. Unfortunately, for software the notion of out-
                                                                   put is not straightforward. Lines of code are problematic
                                                                   due to issues of layout, differing language and the fact that
OBJECTIVE - the aim is to investigate how software
                                                                   most software engineering activity does not directly involve
project productivity has changed over time. Within this
                                                                   code. An alternative is Function Points (FPs), in its various
overall goal we also compare productivity between differ-
                                                                   flavours, which although subject to some criticism [?] are
ent business sectors and seek to identify major drivers.
                                                                   in quite widespread use and so in a sense represent the least
METHOD - we analysed a data set of more than 600
                                                                   bad alternative. In our analysis the output (or size) measure
projects that have been collected from a number of Finnish
                                                                   collected is Experience Points 2.0 [?], a variant of FPs.
companies since 1978.
RESULTS - overall, we observed a quite pronounced im-
                                                                      Second, productivity is impacted by a very large num-
provement in productivity over the entire time period,
                                                                   ber of factors, many of which are inherently difficult to as-
though, this improvement is less marked since the 1990s.
                                                                   sess, e.g. task difficulty, skill of the project team, ease of
However, the trend is not smooth. We also observed pro-
                                                                   interaction with the customer/client and the level of non-
ductivity variability between company and business sec-
                                                                   functional requirements imposed such as dependability and
tor.
                                                                   performance.
CONCLUSIONS - whilst this data set is not a ran-
dom sample so generalisation is somewhat problematic,
                                                                      Third, there are clear interactions between many of these
we hope that it contributes to an overall body of knowl-
                                                                   factors so for instance, it is easier to be productive if quality
edge about software productivity and thereby facilitates the
                                                                   can be disregarded.
construction of a bigger picture.
Keywords: project management, projects, software produc-
                                                                      Despite these caveats, this paper seeks to analyse soft-
tivity, trend analysis, empirical analysis.
                                                                   ware project productivity trends from 1978-2003 from
                                                                   a data set of more than 600 projects from Finland. The
                                                                   projects are varied in size (6 - 5000+ FPs), business sec-
1. Introduction                                                    tor (e.g. Retail) and type (New Development or Mainte-
                                                                   nance). However, we believe there are sufficient data to
Given the importance and size of the software industry it is       draw some preliminary conclusions.
no surprise that there is a great deal of interest in productiv-
ity trends and in particular whether the industry, as a whole,         The remainder of the paper is organised as follows. The
is improving over time. Obviously this is a complex ques-          next section very briefly reviews some related work includ-
tion for at least three reasons.                                   ing a similar, earlier study by Maxwell and Forselius [?].
    First, productivity is difficult to measure because the tra-    Next we describe the data set used for our analysis. We then
ditional definition, i.e. the ratio of outputs to inputs re-        give the results of our analysis, first overall and then af-
quires that we have objective methods of measuring both            ter splitting the data set into groups of more closely related
                                                                   projects. We conclude with a discussion of the significance
                                                                   of the results and some comments on the actual process of
    Barbara Kitchenham is also with Keele University, UK Bar-
∗
    bara@cs.keele.ac.uk                                            analysing the data.
Business
Specific Models
All Data         Cleaned Data




      788


      395


  0      200      400       600        800



Finnish data set
Regression model
           Effort = αSize   β
  Effort




          Size
Test Sets
Test Sets



       companies
Test Sets



       companies
       ABCDE
Research Objectives
1. To develop company-specific cost models for
   comparisons against other models.

   Training Data              Testing Data
Research Objectives
I1. To develop cross-company cost models to
    compare against company-specific cost models.

   Training Data               Testing Data
Research Objectives
I1I. To develop business-specific models to compare
     their accuracy against company-specific and
     cross-company cost models.
   Training Data               Testing Data
Research Objectives
IV. To develop business-specific cost models to
    determine if they can be used by companies
    from other business sectors.
   Training Data               Testing Data
Pred (50)       Pred (25)




Pred (50)
Pred (25)                               better

            1.00   25.75        50.50       75.25      100.00
Pred (50)       Pred (25)




Pred (50)
Pred (25)                                       better

                    1.00   25.75        50.50       75.25      100.00

                                    MdMRE           MMRE



for comparability

MdMRE
MMRE                                                        better

                    1.00   25.75        50.50       75.25      100.00
Training            Testing

Company-Specific
Cost Models
        better                                               better
        Pred50        Pred25                 MdMRE              MMRE

A                                    A


B                                    B


C                                    C


D                                    D


E                                    E

    0    25      50    75      100       0   25        50        75       100
Training            Testing

Cross-Company
Cost Models
        better                                               better
        Pred50        Pred25                 MdMRE              MMRE

A                                    A


B                                    B


C                                    C


D                                    D


E                                    E

    0    25      50    75      100       0   25        50        75       100
Training            Testing

Business-Specific
Cost Models
         better                                               better
         Pred50        Pred25                 MdMRE              MMRE

A                                     A


B                                     B


C                                     C


D                                     D


 E                                    E

     0    25      50    75      100       0   25        50        75       100
Training       Testing

Cross-Business
Cost Models

   • Projects from some sectors could be used
     to predict for projects from other sectors.
   • For example, Retail sector projects could
     predict with high accuracy (Pred50 > 50%).
   • But projects from sectors are best used to
     predict for themselves.
Threats to Validity




                Picture: Mike, Delfini Group
Threats to Validity
Threats to Validity

           • Projects originated from Finland only.
external
Threats to Validity

           • Projects originated from Finland only.
external


           • Data cleaning removed nearly half the
internal
             projects.
           • Only used Size as independent variable.
Conclusions
Conclusions
    Company                     Cross                       No Trend
 Specific Models            Company Models
       (four studies)            (four studies)             (two studies)

Barbara Kitchenham           Emilia Mendes            Katrina Maxwell




                                                  1                         1
                        2                         2                         1




                                                      Martin Shepperd
Isabella Wieczorek           Lionel Briand




                        3
                                                  2
                        2                                                   1
Conclusions
    Company                     Cross                       No Trend
                                                                                what are my
 Specific Models            Company Models
       (four studies)            (four studies)             (two studies)

                                                                                options?
Barbara Kitchenham           Emilia Mendes            Katrina Maxwell




                                                  1                         1
                        2                         2                         1




                                                      Martin Shepperd
Isabella Wieczorek           Lionel Briand




                        3
                                                  2
                        2                                                   1
Conclusions
    Company                     Cross                       No Trend
                                                                                                                        what are my
 Specific Models            Company Models
       (four studies)            (four studies)             (two studies)

                                                                                                                        options?
Barbara Kitchenham           Emilia Mendes            Katrina Maxwell




                                                  1                         1
                        2                         2                         1




                                                      Martin Shepperd
Isabella Wieczorek           Lionel Briand




                        3
                                                  2
                        2                                                   1




                                                                      Company                              Cross
                                                                                        Business
                                                                   Specific Models                     Company Models
                                                                                     Specific Models
Conclusions
Conclusions
• No model performed consistently well
  across all experiments.
Conclusions
• No model performed consistently well
  across all experiments.
• Business-specific models performed
  comparably to company-specific models.
Conclusions
• No model performed consistently well
  across all experiments.
• Business-specific models performed
  comparably to company-specific models.
• Business-specific models performed better
  than cross-company models.
Conclusions
• No model performed consistently well
  across all experiments.
• Business-specific models performed
  comparably to company-specific models.
• Business-specific models performed better
  than cross-company models.
• Reducing heterogeneity in data may
  increase their applicability to problems.
Conclusions
• No model performed consistently well
  across all experiments.
• Business-specific models performed
  comparably to company-specific models.
• Business-specific models performed better
  than cross-company models.
• Reducing heterogeneity in data may
  increase their applicability to problems.
• ... and lead to better prediction models.
Open Questions
Open Questions

• Can we use other algorithms such as
  decision trees and statistical clustering?
Open Questions

• Can we use other algorithms such as
  decision trees and statistical clustering?
• What are the commonalities amongst
  projects?
Open Questions

• Can we use other algorithms such as
  decision trees and statistical clustering?
• What are the commonalities amongst
  projects?
• Does heterogeneity in data sets impact
  other software engineering areas?
Open Questions

• Can we use other algorithms such as
  decision trees and statistical clustering?

        Thank you!
• What are the commonalities amongst
  projects?
• Does heterogeneity in data sets impact
  other software engineering areas?

Más contenido relacionado

Destacado

Building estimation methods
Building estimation methodsBuilding estimation methods
Building estimation methodsSimarpreet Singh
 
Estimation and costing of a commercial building
Estimation and costing of a commercial buildingEstimation and costing of a commercial building
Estimation and costing of a commercial buildinglckr117
 
Cost Modeling for Purchasing - A Fundamental Skill
Cost Modeling for Purchasing - A Fundamental SkillCost Modeling for Purchasing - A Fundamental Skill
Cost Modeling for Purchasing - A Fundamental SkillBill Kohnen
 
SWS15: Winter School - Inclusive Access & Wayfinding for Built Environments a...
SWS15: Winter School - Inclusive Access & Wayfinding for Built Environments a...SWS15: Winter School - Inclusive Access & Wayfinding for Built Environments a...
SWS15: Winter School - Inclusive Access & Wayfinding for Built Environments a...CEPT University
 
SWS15: Winter School - Moving A Megapolis: Istanbul
SWS15: Winter School - Moving A Megapolis: IstanbulSWS15: Winter School - Moving A Megapolis: Istanbul
SWS15: Winter School - Moving A Megapolis: IstanbulCEPT University
 
Politike odprtega dostopa do raziskovalnih podatkov v Sloveniji in Načrt ravn...
Politike odprtega dostopa do raziskovalnih podatkov v Sloveniji in Načrt ravn...Politike odprtega dostopa do raziskovalnih podatkov v Sloveniji in Načrt ravn...
Politike odprtega dostopa do raziskovalnih podatkov v Sloveniji in Načrt ravn...Arhiv družboslovnih podatkov
 
Combined Multi-Annual Analysis, Estimation, and Trends Assessment Method for ...
Combined Multi-Annual Analysis, Estimation, and Trends Assessment Method for ...Combined Multi-Annual Analysis, Estimation, and Trends Assessment Method for ...
Combined Multi-Annual Analysis, Estimation, and Trends Assessment Method for ...Iulian Popa
 
20161015 taiyaba rashid sociology_contemporary culture_the metropolitan exper...
20161015 taiyaba rashid sociology_contemporary culture_the metropolitan exper...20161015 taiyaba rashid sociology_contemporary culture_the metropolitan exper...
20161015 taiyaba rashid sociology_contemporary culture_the metropolitan exper...Taiyaba Rashid
 
Critical Estimation Concepts Chapter3
Critical Estimation Concepts Chapter3Critical Estimation Concepts Chapter3
Critical Estimation Concepts Chapter3guestc87ab49
 
Production And Cost In The Short Run
Production And Cost In The Short RunProduction And Cost In The Short Run
Production And Cost In The Short Runmandalina landy
 
CEPT University 10th convocation 2016 - Procedure
CEPT University 10th convocation 2016 - ProcedureCEPT University 10th convocation 2016 - Procedure
CEPT University 10th convocation 2016 - ProcedureCEPT University
 
Basics of estimation
Basics of estimation Basics of estimation
Basics of estimation ddscte
 
An introduction to agile estimation and release planning
An introduction to agile estimation and release planningAn introduction to agile estimation and release planning
An introduction to agile estimation and release planningJames Whitehead
 

Destacado (19)

Residential Apartment Building
Residential Apartment BuildingResidential Apartment Building
Residential Apartment Building
 
Building estimation methods
Building estimation methodsBuilding estimation methods
Building estimation methods
 
Estimation and costing of a commercial building
Estimation and costing of a commercial buildingEstimation and costing of a commercial building
Estimation and costing of a commercial building
 
Cost Modeling for Purchasing - A Fundamental Skill
Cost Modeling for Purchasing - A Fundamental SkillCost Modeling for Purchasing - A Fundamental Skill
Cost Modeling for Purchasing - A Fundamental Skill
 
Cohen Egaf7
Cohen Egaf7Cohen Egaf7
Cohen Egaf7
 
SWS15: Winter School - Inclusive Access & Wayfinding for Built Environments a...
SWS15: Winter School - Inclusive Access & Wayfinding for Built Environments a...SWS15: Winter School - Inclusive Access & Wayfinding for Built Environments a...
SWS15: Winter School - Inclusive Access & Wayfinding for Built Environments a...
 
SWS15: Winter School - Moving A Megapolis: Istanbul
SWS15: Winter School - Moving A Megapolis: IstanbulSWS15: Winter School - Moving A Megapolis: Istanbul
SWS15: Winter School - Moving A Megapolis: Istanbul
 
Janwani - LAP Intro
Janwani - LAP IntroJanwani - LAP Intro
Janwani - LAP Intro
 
Politike odprtega dostopa do raziskovalnih podatkov v Sloveniji in Načrt ravn...
Politike odprtega dostopa do raziskovalnih podatkov v Sloveniji in Načrt ravn...Politike odprtega dostopa do raziskovalnih podatkov v Sloveniji in Načrt ravn...
Politike odprtega dostopa do raziskovalnih podatkov v Sloveniji in Načrt ravn...
 
Combined Multi-Annual Analysis, Estimation, and Trends Assessment Method for ...
Combined Multi-Annual Analysis, Estimation, and Trends Assessment Method for ...Combined Multi-Annual Analysis, Estimation, and Trends Assessment Method for ...
Combined Multi-Annual Analysis, Estimation, and Trends Assessment Method for ...
 
Formula cultural foundation
Formula cultural foundationFormula cultural foundation
Formula cultural foundation
 
20161015 taiyaba rashid sociology_contemporary culture_the metropolitan exper...
20161015 taiyaba rashid sociology_contemporary culture_the metropolitan exper...20161015 taiyaba rashid sociology_contemporary culture_the metropolitan exper...
20161015 taiyaba rashid sociology_contemporary culture_the metropolitan exper...
 
Participatory Budgeting
Participatory BudgetingParticipatory Budgeting
Participatory Budgeting
 
Critical Estimation Concepts Chapter3
Critical Estimation Concepts Chapter3Critical Estimation Concepts Chapter3
Critical Estimation Concepts Chapter3
 
Production And Cost In The Short Run
Production And Cost In The Short RunProduction And Cost In The Short Run
Production And Cost In The Short Run
 
Learning Contemporary Techniques In Teaching Practices:Blackboard and Edmodo
Learning Contemporary Techniques In Teaching Practices:Blackboard and EdmodoLearning Contemporary Techniques In Teaching Practices:Blackboard and Edmodo
Learning Contemporary Techniques In Teaching Practices:Blackboard and Edmodo
 
CEPT University 10th convocation 2016 - Procedure
CEPT University 10th convocation 2016 - ProcedureCEPT University 10th convocation 2016 - Procedure
CEPT University 10th convocation 2016 - Procedure
 
Basics of estimation
Basics of estimation Basics of estimation
Basics of estimation
 
An introduction to agile estimation and release planning
An introduction to agile estimation and release planningAn introduction to agile estimation and release planning
An introduction to agile estimation and release planning
 

Similar a Building Cost Estimation Models using Homogeneous Data

Cloud Computing Research Developments and Future Directions
Cloud Computing Research Developments and Future DirectionsCloud Computing Research Developments and Future Directions
Cloud Computing Research Developments and Future DirectionsIRJET Journal
 
A DATA EXTRACTION ALGORITHM FROM OPEN SOURCE SOFTWARE PROJECT REPOSITORIES FO...
A DATA EXTRACTION ALGORITHM FROM OPEN SOURCE SOFTWARE PROJECT REPOSITORIES FO...A DATA EXTRACTION ALGORITHM FROM OPEN SOURCE SOFTWARE PROJECT REPOSITORIES FO...
A DATA EXTRACTION ALGORITHM FROM OPEN SOURCE SOFTWARE PROJECT REPOSITORIES FO...ijseajournal
 
A DATA EXTRACTION ALGORITHM FROM OPEN SOURCE SOFTWARE PROJECT REPOSITORIES FO...
A DATA EXTRACTION ALGORITHM FROM OPEN SOURCE SOFTWARE PROJECT REPOSITORIES FO...A DATA EXTRACTION ALGORITHM FROM OPEN SOURCE SOFTWARE PROJECT REPOSITORIES FO...
A DATA EXTRACTION ALGORITHM FROM OPEN SOURCE SOFTWARE PROJECT REPOSITORIES FO...ijseajournal
 
Facilitating big-data management in modern business and organizations using c...
Facilitating big-data management in modern business and organizations using c...Facilitating big-data management in modern business and organizations using c...
Facilitating big-data management in modern business and organizations using c...kioa1
 
ADVANCES IN HIGHER EDUCATIONAL RESOURCE SHARING AND CLOUD SERVICES FOR KSA
ADVANCES IN HIGHER EDUCATIONAL RESOURCE SHARING AND CLOUD SERVICES FOR KSAADVANCES IN HIGHER EDUCATIONAL RESOURCE SHARING AND CLOUD SERVICES FOR KSA
ADVANCES IN HIGHER EDUCATIONAL RESOURCE SHARING AND CLOUD SERVICES FOR KSAIJCSES Journal
 
CHAPTER TWO13Literature Review041
CHAPTER TWO13Literature Review041CHAPTER TWO13Literature Review041
CHAPTER TWO13Literature Review041JinElias52
 
A systematic mapping study of performance analysis and modelling of cloud sys...
A systematic mapping study of performance analysis and modelling of cloud sys...A systematic mapping study of performance analysis and modelling of cloud sys...
A systematic mapping study of performance analysis and modelling of cloud sys...IJECEIAES
 
Classifier Model using Artificial Neural Network
Classifier Model using Artificial Neural NetworkClassifier Model using Artificial Neural Network
Classifier Model using Artificial Neural NetworkAI Publications
 
Annotation Approach for Document with Recommendation
Annotation Approach for Document with Recommendation Annotation Approach for Document with Recommendation
Annotation Approach for Document with Recommendation ijmpict
 
A systems engineering methodology for wide area network selection
A systems engineering methodology for wide area network selectionA systems engineering methodology for wide area network selection
A systems engineering methodology for wide area network selectionAlexander Decker
 
An Advanced Analysis Of Cloud Computing Concepts Based On The Computer Scienc...
An Advanced Analysis Of Cloud Computing Concepts Based On The Computer Scienc...An Advanced Analysis Of Cloud Computing Concepts Based On The Computer Scienc...
An Advanced Analysis Of Cloud Computing Concepts Based On The Computer Scienc...Michele Thomas
 
Demand-driven Gaussian window optimization for executing preferred population...
Demand-driven Gaussian window optimization for executing preferred population...Demand-driven Gaussian window optimization for executing preferred population...
Demand-driven Gaussian window optimization for executing preferred population...IJECEIAES
 
An Analysis of the Existing Frameworks in Cloud Computing Adoption and Introd...
An Analysis of the Existing Frameworks in Cloud Computing Adoption and Introd...An Analysis of the Existing Frameworks in Cloud Computing Adoption and Introd...
An Analysis of the Existing Frameworks in Cloud Computing Adoption and Introd...IJERA Editor
 
factorization methods
factorization methodsfactorization methods
factorization methodsShaina Raza
 
Data Mining Framework for Network Intrusion Detection using Efficient Techniques
Data Mining Framework for Network Intrusion Detection using Efficient TechniquesData Mining Framework for Network Intrusion Detection using Efficient Techniques
Data Mining Framework for Network Intrusion Detection using Efficient TechniquesIJAEMSJORNAL
 
IMPROVEMENT OF ENERGY EFFICIENCY IN CLOUD COMPUTING BY LOAD BALANCING ALGORITHM
IMPROVEMENT OF ENERGY EFFICIENCY IN CLOUD COMPUTING BY LOAD BALANCING ALGORITHMIMPROVEMENT OF ENERGY EFFICIENCY IN CLOUD COMPUTING BY LOAD BALANCING ALGORITHM
IMPROVEMENT OF ENERGY EFFICIENCY IN CLOUD COMPUTING BY LOAD BALANCING ALGORITHMAssociate Professor in VSB Coimbatore
 
1Chapter Two Literature ReviewStudents NameName of the
1Chapter Two Literature ReviewStudents NameName of the1Chapter Two Literature ReviewStudents NameName of the
1Chapter Two Literature ReviewStudents NameName of theEttaBenton28
 

Similar a Building Cost Estimation Models using Homogeneous Data (20)

Cloud Computing Research Developments and Future Directions
Cloud Computing Research Developments and Future DirectionsCloud Computing Research Developments and Future Directions
Cloud Computing Research Developments and Future Directions
 
A DATA EXTRACTION ALGORITHM FROM OPEN SOURCE SOFTWARE PROJECT REPOSITORIES FO...
A DATA EXTRACTION ALGORITHM FROM OPEN SOURCE SOFTWARE PROJECT REPOSITORIES FO...A DATA EXTRACTION ALGORITHM FROM OPEN SOURCE SOFTWARE PROJECT REPOSITORIES FO...
A DATA EXTRACTION ALGORITHM FROM OPEN SOURCE SOFTWARE PROJECT REPOSITORIES FO...
 
A DATA EXTRACTION ALGORITHM FROM OPEN SOURCE SOFTWARE PROJECT REPOSITORIES FO...
A DATA EXTRACTION ALGORITHM FROM OPEN SOURCE SOFTWARE PROJECT REPOSITORIES FO...A DATA EXTRACTION ALGORITHM FROM OPEN SOURCE SOFTWARE PROJECT REPOSITORIES FO...
A DATA EXTRACTION ALGORITHM FROM OPEN SOURCE SOFTWARE PROJECT REPOSITORIES FO...
 
Facilitating big-data management in modern business and organizations using c...
Facilitating big-data management in modern business and organizations using c...Facilitating big-data management in modern business and organizations using c...
Facilitating big-data management in modern business and organizations using c...
 
ADVANCES IN HIGHER EDUCATIONAL RESOURCE SHARING AND CLOUD SERVICES FOR KSA
ADVANCES IN HIGHER EDUCATIONAL RESOURCE SHARING AND CLOUD SERVICES FOR KSAADVANCES IN HIGHER EDUCATIONAL RESOURCE SHARING AND CLOUD SERVICES FOR KSA
ADVANCES IN HIGHER EDUCATIONAL RESOURCE SHARING AND CLOUD SERVICES FOR KSA
 
CHAPTER TWO13Literature Review041
CHAPTER TWO13Literature Review041CHAPTER TWO13Literature Review041
CHAPTER TWO13Literature Review041
 
A systematic mapping study of performance analysis and modelling of cloud sys...
A systematic mapping study of performance analysis and modelling of cloud sys...A systematic mapping study of performance analysis and modelling of cloud sys...
A systematic mapping study of performance analysis and modelling of cloud sys...
 
Classifier Model using Artificial Neural Network
Classifier Model using Artificial Neural NetworkClassifier Model using Artificial Neural Network
Classifier Model using Artificial Neural Network
 
Annotation Approach for Document with Recommendation
Annotation Approach for Document with Recommendation Annotation Approach for Document with Recommendation
Annotation Approach for Document with Recommendation
 
A systems engineering methodology for wide area network selection
A systems engineering methodology for wide area network selectionA systems engineering methodology for wide area network selection
A systems engineering methodology for wide area network selection
 
Journals analysis ppt
Journals analysis pptJournals analysis ppt
Journals analysis ppt
 
2008.11560v2.pdf
2008.11560v2.pdf2008.11560v2.pdf
2008.11560v2.pdf
 
An Advanced Analysis Of Cloud Computing Concepts Based On The Computer Scienc...
An Advanced Analysis Of Cloud Computing Concepts Based On The Computer Scienc...An Advanced Analysis Of Cloud Computing Concepts Based On The Computer Scienc...
An Advanced Analysis Of Cloud Computing Concepts Based On The Computer Scienc...
 
Demand-driven Gaussian window optimization for executing preferred population...
Demand-driven Gaussian window optimization for executing preferred population...Demand-driven Gaussian window optimization for executing preferred population...
Demand-driven Gaussian window optimization for executing preferred population...
 
An Analysis of the Existing Frameworks in Cloud Computing Adoption and Introd...
An Analysis of the Existing Frameworks in Cloud Computing Adoption and Introd...An Analysis of the Existing Frameworks in Cloud Computing Adoption and Introd...
An Analysis of the Existing Frameworks in Cloud Computing Adoption and Introd...
 
Paper review
Paper reviewPaper review
Paper review
 
factorization methods
factorization methodsfactorization methods
factorization methods
 
Data Mining Framework for Network Intrusion Detection using Efficient Techniques
Data Mining Framework for Network Intrusion Detection using Efficient TechniquesData Mining Framework for Network Intrusion Detection using Efficient Techniques
Data Mining Framework for Network Intrusion Detection using Efficient Techniques
 
IMPROVEMENT OF ENERGY EFFICIENCY IN CLOUD COMPUTING BY LOAD BALANCING ALGORITHM
IMPROVEMENT OF ENERGY EFFICIENCY IN CLOUD COMPUTING BY LOAD BALANCING ALGORITHMIMPROVEMENT OF ENERGY EFFICIENCY IN CLOUD COMPUTING BY LOAD BALANCING ALGORITHM
IMPROVEMENT OF ENERGY EFFICIENCY IN CLOUD COMPUTING BY LOAD BALANCING ALGORITHM
 
1Chapter Two Literature ReviewStudents NameName of the
1Chapter Two Literature ReviewStudents NameName of the1Chapter Two Literature ReviewStudents NameName of the
1Chapter Two Literature ReviewStudents NameName of the
 

Más de Rahul Premraj

Approximating Change Sets at Philips Healthcare: A Case Study
Approximating Change Sets at Philips Healthcare: A Case StudyApproximating Change Sets at Philips Healthcare: A Case Study
Approximating Change Sets at Philips Healthcare: A Case StudyRahul Premraj
 
An Empirical Analysis of Software Productivity Over Time
An Empirical Analysis of Software Productivity Over TimeAn Empirical Analysis of Software Productivity Over Time
An Empirical Analysis of Software Productivity Over TimeRahul Premraj
 
How Developer Communication Frequency Relates to Bug Introducing Changes
How Developer Communication Frequency Relates to Bug Introducing ChangesHow Developer Communication Frequency Relates to Bug Introducing Changes
How Developer Communication Frequency Relates to Bug Introducing ChangesRahul Premraj
 
Improving Bug Tracking Systems
Improving Bug Tracking SystemsImproving Bug Tracking Systems
Improving Bug Tracking SystemsRahul Premraj
 
What makes a good bug report?
What makes a good bug report?What makes a good bug report?
What makes a good bug report?Rahul Premraj
 
Predicting Software Metrics at Design Time
Predicting Software Metrics at Design TimePredicting Software Metrics at Design Time
Predicting Software Metrics at Design TimeRahul Premraj
 
On the Treatment of Bug Reports in Open-Source Projects
On the Treatment of Bug Reports in Open-Source ProjectsOn the Treatment of Bug Reports in Open-Source Projects
On the Treatment of Bug Reports in Open-Source ProjectsRahul Premraj
 

Más de Rahul Premraj (7)

Approximating Change Sets at Philips Healthcare: A Case Study
Approximating Change Sets at Philips Healthcare: A Case StudyApproximating Change Sets at Philips Healthcare: A Case Study
Approximating Change Sets at Philips Healthcare: A Case Study
 
An Empirical Analysis of Software Productivity Over Time
An Empirical Analysis of Software Productivity Over TimeAn Empirical Analysis of Software Productivity Over Time
An Empirical Analysis of Software Productivity Over Time
 
How Developer Communication Frequency Relates to Bug Introducing Changes
How Developer Communication Frequency Relates to Bug Introducing ChangesHow Developer Communication Frequency Relates to Bug Introducing Changes
How Developer Communication Frequency Relates to Bug Introducing Changes
 
Improving Bug Tracking Systems
Improving Bug Tracking SystemsImproving Bug Tracking Systems
Improving Bug Tracking Systems
 
What makes a good bug report?
What makes a good bug report?What makes a good bug report?
What makes a good bug report?
 
Predicting Software Metrics at Design Time
Predicting Software Metrics at Design TimePredicting Software Metrics at Design Time
Predicting Software Metrics at Design Time
 
On the Treatment of Bug Reports in Open-Source Projects
On the Treatment of Bug Reports in Open-Source ProjectsOn the Treatment of Bug Reports in Open-Source Projects
On the Treatment of Bug Reports in Open-Source Projects
 

Último

From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 

Último (20)

From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 

Building Cost Estimation Models using Homogeneous Data

  • 1. Building Cost Estimation Models Using Homogeneous Data Rahul Premraj Saarland University, Germany Thomas Zimmermann University of Calgary, Canada
  • 3. Review systematic May, 2007 316 IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, VOL. 33, NO. 5, MAY 2007 Cross versus Within-Company Cost Estimation Studies: A Systematic Review Barbara A. Kitchenham, Member, IEEE Computer Society, Emilia Mendes, and Guilherme H. Travassos Abstract—The objective of this paper is to determine under what circumstances individual organizations would be able to rely on cross-company-based estimation models. We performed a systematic review of studies that compared predictions from cross- company models with predictions from within-company models based on analysis of project data. Ten papers compared cross- company and within-company estimation models; however, only seven presented independent results. Of those seven, three found that cross-company models were not significantly different from within-company models, and four found that cross-company models were significantly worse than within-company models. Experimental procedures used by the studies differed making it impossible to undertake formal meta-analysis of the results. The main trend distinguishing study results was that studies with small within-company data sets (i.e., < 20 projects) that used leave-one-out cross validation all found that the within-company model was significantly different (better) from the cross-company model. The results of this review are inconclusive. It is clear that some organizations would be ill-served by cross-company models whereas others would benefit. Further studies are needed, but they must be independent (i.e., based on different data bases or at least different single company data sets) and should address specific hypotheses concerning the conditions that would favor cross-company or within-company models. In addition, experimenters need to standardize their experimental procedures to enable formal meta-analysis, and recommendations are made in Section 3. Index Terms—Cost estimation, management, systematic review, software engineering. Ç 1 INTRODUCTION E studies of cost estimation models (e.g., [12], These problems motivated the use of cross-company ARLY [8]) suggested that general-purpose models such as models (models built using cross-company data sets, which COCOMO [1] and SLIM [24] needed to be calibrated to are data sets containing data from several companies) for specific companies before they could be used effectively. effort estimation and productivity benchmarking, and, Taking this result further and following the proposals made subsequently, several studies compared the prediction by DeMarco [4], Kok et al. [14] suggested that cost accuracy between cross-company and within-company estimation models should be developed only from single- models. In 1999, Maxwell et al. [18] analyzed a cross- company data. However, three main problems can occur company benchmarking database by comparing the accu- when relying on within-company data sets [3], [2]: racy of a within-company cost model with the accuracy of a cross-company cost model. They claimed that the within- 1. The time required to accumulate enough data on company model was more accurate than the cross-company past projects from a single company may be model, based on the same holdout sample. In the same year, prohibitive. Briand et al. [2] found that cross-company models could be 2. By the time the data set is large enough to be of use, as accurate as within-company models. The following year, technologies used by the company may have Briand et al. [3] reanalyzed the data set employed by changed, and older projects may no longer be Maxwell et al. [18] and concluded that cross-company representative of current practices. models were as good as within-company models. Two 3. Care is necessary as data needs to be collected in a years later, Wieczorek and Ruhe [26] confirmed this same consistent manner. trend using the same data set employed by [2]. Three years later, Mendes et al. [20] also confirmed the same trend using . B.A. Kitchenham is with the School of Computing and Mathematics, yet another data set. University of Keele, Keele Village, Staffordshire, ST5 5BG, UK. These results seemed to contradict the results of the E-mail: b.a.kitchenham@cs.keele.ac.uk. earlier studies and pave the way for improved estimation . E. Mendes is with the Computer Science Department, University of methods for companies that did not have their own project Auckland, Private Bag 92019, Auckland, New Zealand. E-mail: emilia@cs.auckland.ac.nz. data. However, other researchers found less encouraging . G.H. Travassos is with UFRJ/COPPE, Systems Engineering and results. Jeffery et al. undertook two studies, both of which Computer Science Program, PO Box 68511, 21941-972 Rio de Janeiro— suggested that within-company models were superior to RJ, Brazil. E-mail: ght@cos.ufrj.br. cross-company models [6], [7]. Two years later, Lefley and Manuscript received 6 June 2006; revised 27 Nov. 2006; accepted 2 Jan. 2007; Shepperd claimed that the within-company model was published online 20 Feb. 2007. Recommended for acceptance by A. Mockus. more accurate than the cross-company model, using the For information on obtaining reprints of this article, please send e-mail to: same data set employed by Wieczorek and Ruhe [26] and tse@computer.org, and reference IEEECS Log Number TSE-0129-0606. Briand et al. [2]. Finally, a year later Kitchenham and Digital Object Identifier no. 10.1109/TSE.2007.1001. 0098-5589/07/$25.00 ß 2007 IEEE Published by the IEEE Computer Society
  • 4. Review systematic May, 2007 Barbara Kitchenham 316 IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, VOL. 33, NO. 5, MAY 2007 Cross versus Within-Company Cost Estimation Studies: A Systematic Review Barbara A. Kitchenham, Member, IEEE Computer Society, Emilia Mendes, and Guilherme H. Travassos Abstract—The objective of this paper is to determine under what circumstances individual organizations would be able to rely on cross-company-based estimation models. We performed a systematic review of studies that compared predictions from cross- company models with predictions from within-company models based on analysis of project data. Ten papers compared cross- company and within-company estimation models; however, only seven presented independent results. Of those seven, three found that cross-company models were not significantly different from within-company models, and four found that cross-company models were significantly worse than within-company models. Experimental procedures used by the studies differed making it impossible to undertake formal meta-analysis of the results. The main trend distinguishing study results was that studies with small within-company data sets (i.e., < 20 projects) that used leave-one-out cross validation all found that the within-company model was significantly different (better) from the cross-company model. The results of this review are inconclusive. It is clear that some organizations would be ill-served by cross-company models whereas others would benefit. Further studies are needed, but they must be independent (i.e., based on different data bases or at least different single company data sets) and should address specific hypotheses concerning the Emilia Mendes conditions that would favor cross-company or within-company models. In addition, experimenters need to standardize their experimental procedures to enable formal meta-analysis, and recommendations are made in Section 3. Index Terms—Cost estimation, management, systematic review, software engineering. Ç 1 INTRODUCTION E studies of cost estimation models (e.g., [12], These problems motivated the use of cross-company ARLY [8]) suggested that general-purpose models such as models (models built using cross-company data sets, which COCOMO [1] and SLIM [24] needed to be calibrated to are data sets containing data from several companies) for specific companies before they could be used effectively. effort estimation and productivity benchmarking, and, Taking this result further and following the proposals made subsequently, several studies compared the prediction by DeMarco [4], Kok et al. [14] suggested that cost accuracy between cross-company and within-company estimation models should be developed only from single- models. In 1999, Maxwell et al. [18] analyzed a cross- company data. However, three main problems can occur company benchmarking database by comparing the accu- when relying on within-company data sets [3], [2]: racy of a within-company cost model with the accuracy of a cross-company cost model. They claimed that the within- 1. The time required to accumulate enough data on company model was more accurate than the cross-company past projects from a single company may be model, based on the same holdout sample. In the same year, prohibitive. Briand et al. [2] found that cross-company models could be 2. By the time the data set is large enough to be of use, as accurate as within-company models. The following year, technologies used by the company may have Guilherme Travassos Briand et al. [3] reanalyzed the data set employed by changed, and older projects may no longer be Maxwell et al. [18] and concluded that cross-company representative of current practices. models were as good as within-company models. Two 3. Care is necessary as data needs to be collected in a years later, Wieczorek and Ruhe [26] confirmed this same consistent manner. trend using the same data set employed by [2]. Three years later, Mendes et al. [20] also confirmed the same trend using . B.A. Kitchenham is with the School of Computing and Mathematics, yet another data set. University of Keele, Keele Village, Staffordshire, ST5 5BG, UK. These results seemed to contradict the results of the E-mail: b.a.kitchenham@cs.keele.ac.uk. earlier studies and pave the way for improved estimation . E. Mendes is with the Computer Science Department, University of methods for companies that did not have their own project Auckland, Private Bag 92019, Auckland, New Zealand. E-mail: emilia@cs.auckland.ac.nz. data. However, other researchers found less encouraging . G.H. Travassos is with UFRJ/COPPE, Systems Engineering and results. Jeffery et al. undertook two studies, both of which Computer Science Program, PO Box 68511, 21941-972 Rio de Janeiro— suggested that within-company models were superior to RJ, Brazil. E-mail: ght@cos.ufrj.br. cross-company models [6], [7]. Two years later, Lefley and Manuscript received 6 June 2006; revised 27 Nov. 2006; accepted 2 Jan. 2007; Shepperd claimed that the within-company model was published online 20 Feb. 2007. Recommended for acceptance by A. Mockus. more accurate than the cross-company model, using the For information on obtaining reprints of this article, please send e-mail to: same data set employed by Wieczorek and Ruhe [26] and tse@computer.org, and reference IEEECS Log Number TSE-0129-0606. Briand et al. [2]. Finally, a year later Kitchenham and Digital Object Identifier no. 10.1109/TSE.2007.1001. 0098-5589/07/$25.00 ß 2007 IEEE Published by the IEEE Computer Society
  • 5. Company Cross No Trend Specific Models Company Models
  • 6. Company Cross No Trend Specific Models Company Models (four studies) (four studies) (two studies)
  • 7. Company Cross No Trend Specific Models Company Models (four studies) (four studies) (two studies) Barbara Kitchenham Emilia Mendes Katrina Maxwell Martin Shepperd Isabella Wieczorek Lionel Briand
  • 8. Company Cross No Trend Specific Models Company Models (four studies) (four studies) (two studies) Barbara Kitchenham Emilia Mendes Katrina Maxwell 2 2 Martin Shepperd Isabella Wieczorek Lionel Briand 2
  • 9. Company Cross No Trend Specific Models Company Models (four studies) (four studies) (two studies) Barbara Kitchenham Emilia Mendes Katrina Maxwell 1 2 2 1 Martin Shepperd Isabella Wieczorek Lionel Briand 3 2 2
  • 10. Company Cross No Trend Specific Models Company Models (four studies) (four studies) (two studies) Barbara Kitchenham Emilia Mendes Katrina Maxwell 1 1 2 2 1 Martin Shepperd Isabella Wieczorek Lionel Briand 3 2 2 1
  • 13. she is a metrics consultant
  • 15. Boss has a Erica’s new project for her
  • 17. Company Cross Specific Models Company Models
  • 18. Company Cross Business Specific Models Company Models Specific Models
  • 19.
  • 21. METRICS, 2005 An Empirical Analysis of Software Productivity Over Time Rahul Premraj Martin Shepperd Bournemouth University, UK Brunel University, UK rpremraj@bmth.ac.uk martin.shepperd@brunel.ac.uk Barbara Kitchenham∗ Pekka Forselius National ICT, Australia STTF Oy, Finland Barbara.Kitchenham@nicta.com.au pekka.forselius@kolumbus.fi Abstract commodities. Unfortunately, for software the notion of out- put is not straightforward. Lines of code are problematic due to issues of layout, differing language and the fact that OBJECTIVE - the aim is to investigate how software most software engineering activity does not directly involve project productivity has changed over time. Within this code. An alternative is Function Points (FPs), in its various overall goal we also compare productivity between differ- flavours, which although subject to some criticism [?] are ent business sectors and seek to identify major drivers. in quite widespread use and so in a sense represent the least METHOD - we analysed a data set of more than 600 bad alternative. In our analysis the output (or size) measure projects that have been collected from a number of Finnish collected is Experience Points 2.0 [?], a variant of FPs. companies since 1978. RESULTS - overall, we observed a quite pronounced im- Second, productivity is impacted by a very large num- provement in productivity over the entire time period, ber of factors, many of which are inherently difficult to as- though, this improvement is less marked since the 1990s. sess, e.g. task difficulty, skill of the project team, ease of However, the trend is not smooth. We also observed pro- interaction with the customer/client and the level of non- ductivity variability between company and business sec- functional requirements imposed such as dependability and tor. performance. CONCLUSIONS - whilst this data set is not a ran- dom sample so generalisation is somewhat problematic, Third, there are clear interactions between many of these we hope that it contributes to an overall body of knowl- factors so for instance, it is easier to be productive if quality edge about software productivity and thereby facilitates the can be disregarded. construction of a bigger picture. Keywords: project management, projects, software produc- Despite these caveats, this paper seeks to analyse soft- tivity, trend analysis, empirical analysis. ware project productivity trends from 1978-2003 from a data set of more than 600 projects from Finland. The projects are varied in size (6 - 5000+ FPs), business sec- 1. Introduction tor (e.g. Retail) and type (New Development or Mainte- nance). However, we believe there are sufficient data to Given the importance and size of the software industry it is draw some preliminary conclusions. no surprise that there is a great deal of interest in productiv- ity trends and in particular whether the industry, as a whole, The remainder of the paper is organised as follows. The is improving over time. Obviously this is a complex ques- next section very briefly reviews some related work includ- tion for at least three reasons. ing a similar, earlier study by Maxwell and Forselius [?]. First, productivity is difficult to measure because the tra- Next we describe the data set used for our analysis. We then ditional definition, i.e. the ratio of outputs to inputs re- give the results of our analysis, first overall and then af- quires that we have objective methods of measuring both ter splitting the data set into groups of more closely related projects. We conclude with a discussion of the significance of the results and some comments on the actual process of Barbara Kitchenham is also with Keele University, UK Bar- ∗ bara@cs.keele.ac.uk analysing the data.
  • 22.
  • 24. All Data Cleaned Data 788 395 0 200 400 600 800 Finnish data set
  • 25. Regression model Effort = αSize β Effort Size
  • 27. Test Sets companies
  • 28. Test Sets companies ABCDE
  • 29. Research Objectives 1. To develop company-specific cost models for comparisons against other models. Training Data Testing Data
  • 30. Research Objectives I1. To develop cross-company cost models to compare against company-specific cost models. Training Data Testing Data
  • 31. Research Objectives I1I. To develop business-specific models to compare their accuracy against company-specific and cross-company cost models. Training Data Testing Data
  • 32. Research Objectives IV. To develop business-specific cost models to determine if they can be used by companies from other business sectors. Training Data Testing Data
  • 33.
  • 34. Pred (50) Pred (25) Pred (50) Pred (25) better 1.00 25.75 50.50 75.25 100.00
  • 35. Pred (50) Pred (25) Pred (50) Pred (25) better 1.00 25.75 50.50 75.25 100.00 MdMRE MMRE for comparability MdMRE MMRE better 1.00 25.75 50.50 75.25 100.00
  • 36. Training Testing Company-Specific Cost Models better better Pred50 Pred25 MdMRE MMRE A A B B C C D D E E 0 25 50 75 100 0 25 50 75 100
  • 37. Training Testing Cross-Company Cost Models better better Pred50 Pred25 MdMRE MMRE A A B B C C D D E E 0 25 50 75 100 0 25 50 75 100
  • 38. Training Testing Business-Specific Cost Models better better Pred50 Pred25 MdMRE MMRE A A B B C C D D E E 0 25 50 75 100 0 25 50 75 100
  • 39. Training Testing Cross-Business Cost Models • Projects from some sectors could be used to predict for projects from other sectors. • For example, Retail sector projects could predict with high accuracy (Pred50 > 50%). • But projects from sectors are best used to predict for themselves.
  • 40. Threats to Validity Picture: Mike, Delfini Group
  • 42. Threats to Validity • Projects originated from Finland only. external
  • 43. Threats to Validity • Projects originated from Finland only. external • Data cleaning removed nearly half the internal projects. • Only used Size as independent variable.
  • 45. Conclusions Company Cross No Trend Specific Models Company Models (four studies) (four studies) (two studies) Barbara Kitchenham Emilia Mendes Katrina Maxwell 1 1 2 2 1 Martin Shepperd Isabella Wieczorek Lionel Briand 3 2 2 1
  • 46. Conclusions Company Cross No Trend what are my Specific Models Company Models (four studies) (four studies) (two studies) options? Barbara Kitchenham Emilia Mendes Katrina Maxwell 1 1 2 2 1 Martin Shepperd Isabella Wieczorek Lionel Briand 3 2 2 1
  • 47. Conclusions Company Cross No Trend what are my Specific Models Company Models (four studies) (four studies) (two studies) options? Barbara Kitchenham Emilia Mendes Katrina Maxwell 1 1 2 2 1 Martin Shepperd Isabella Wieczorek Lionel Briand 3 2 2 1 Company Cross Business Specific Models Company Models Specific Models
  • 49. Conclusions • No model performed consistently well across all experiments.
  • 50. Conclusions • No model performed consistently well across all experiments. • Business-specific models performed comparably to company-specific models.
  • 51. Conclusions • No model performed consistently well across all experiments. • Business-specific models performed comparably to company-specific models. • Business-specific models performed better than cross-company models.
  • 52. Conclusions • No model performed consistently well across all experiments. • Business-specific models performed comparably to company-specific models. • Business-specific models performed better than cross-company models. • Reducing heterogeneity in data may increase their applicability to problems.
  • 53. Conclusions • No model performed consistently well across all experiments. • Business-specific models performed comparably to company-specific models. • Business-specific models performed better than cross-company models. • Reducing heterogeneity in data may increase their applicability to problems. • ... and lead to better prediction models.
  • 55. Open Questions • Can we use other algorithms such as decision trees and statistical clustering?
  • 56. Open Questions • Can we use other algorithms such as decision trees and statistical clustering? • What are the commonalities amongst projects?
  • 57. Open Questions • Can we use other algorithms such as decision trees and statistical clustering? • What are the commonalities amongst projects? • Does heterogeneity in data sets impact other software engineering areas?
  • 58. Open Questions • Can we use other algorithms such as decision trees and statistical clustering? Thank you! • What are the commonalities amongst projects? • Does heterogeneity in data sets impact other software engineering areas?