Full Paper
Proc. of Int. Conf. on Advances in Computer Science and Application 2013

Estimation of Functional Size of a Data Warehouse
System using COSMIC FSM Method
Avinash Samuel1, Anil Kumar Pandey2, Vivek Kumar Sharma2

1Department of Computer Science & Engineering, Invertis University, Bareilly, India
Avinash_12141@yahoo.co.in
2Department of Computer Science & Engineering, Invertis University, Bareilly, India
{ Anipandey, Sharma.vivek109}@gmail.com

Abstract: It is not easy to measure the functional size of a
Data Warehouse System. A Data Warehouse System is not a
traditional system, and it cannot be easily measured using a
conventional FSM (Functional Size Measurement) method. In this
paper we show, with the help of a case study, how to measure the
functional size of a Data Warehouse System using the COSMIC
FSM method. We explore the use of COSMIC in sizing
Data Warehouse Systems.

Keywords: Functional Size, Data Warehouse, COSMIC
FSM.

I. INTRODUCTION
In today's market it has become a necessity to create
a sustainable competitive advantage against competitors by
creating a system with which current operations can be
easily tracked and future strategies can be predicted. Therefore,
one of the trends in the market at the moment is the growing
interest in the development of large data warehouses.
It is very hard to estimate the effort and resources
required to build a data warehouse system in the early phases
of development. One of the methods proposed by IFPUG
(International Function Point Users Group) is FPA (Function
Point Analysis). Function points use the data movements
in the system to measure the end-user requirements. Therefore
we can derive an early estimate of the functional size
of the system before any code has been written [4]. If
development productivity figures are known, the cost of
developing new software can be estimated early enough to
make direct comparisons with the cost of buying a software
package, or of simply using a non-technical solution [4].
The most widely applied method of measuring functional size is
Function Point Analysis (FPA). Another ISO-certified method
is COSMIC (Common Software Measurement International
Consortium) FSM. In recent years, frameworks have been
described for measuring data warehouse applications
with FSM. COSMIC is a better way of measuring the functional
size of such software because of its capability to measure
software in different layers, and because the size of an
individual function is not cut off by a maximum size per
function, as it is in FPA [2].
In this paper, we use the COSMIC FSM method to measure
the functional size of a data warehouse system in order to
illustrate this concept.

II. RELATED WORK
Functional Size Measurement is a new and emerging field,
but many of its foundations were laid by Allan Albrecht in
1978 [10]. He was a pioneer and can be regarded as the father of
Functional Size Measurement. He proposed the FPA method
maintained by IFPUG, which uses Internal Logical Files,
External Interface Files, External Inputs, External Outputs and
External Inquiries to measure the functional size of the software.
The Mark II FPA (MK II FPA) method was proposed by Charles
Symons at Nolan Norton in 1984. It uses Input Data Elements,
Entity References and Output Data Elements as the Base
Functional Components (BFCs) to measure the functional size of
the system. The COSMIC FSM method was created by an
international consortium of industry subject matter experts and
academics from 19 countries in 1997. It uses data movements
(Read, Write, Entry, and Exit) to measure the functional size
[9]. The COSMIC FSM method is capable of measuring
software that has a layered architecture. COSMIC has released
a guideline for measuring the functional size of a Data
Warehouse System, but even with this guideline available
it is quite complex to measure the functional size of
a data warehouse system. Not much work has been done in this
field, and it is not yet clear how to measure the functional size
of a data warehouse system.

© 2013 ACEEE
DOI: 03.LSCS.2013.3.553

III. WHAT IS A DATA WAREHOUSE?
A data warehouse is a subject-oriented, integrated, time-variant,
non-volatile collection of data in support of
management's decision-making process [6].
Note: The data warehouse is always a physically separate store
of data transformed from the application data found
in the operational environment.
Fig 1 shows the architecture of a data
warehouse along with its components. We will now take a
look at the components and their functions.

Fig 1: Architecture of Data Warehouse

A. Operational data sources
These are the operational systems which store the transactions
of the business. They are located outside the data warehouse,
as the data warehouse has no control over the content and
the format of their data. The data in these systems is
stored in many formats, ranging from flat files to hierarchical
and relational databases.

B. Data Staging Area
Its function is restricted to extracting, cleaning, matching
and loading data from multiple sources. It is explicitly
off limits to the end users, i.e. the end user has no access to
and no control over the data staging area. It does not support
query or presentation services. A data-cleansing tool is used
to process data, for example to resolve name and address
misspellings.

C. Extract Transform Load (ETL) Processes
ETL processes are used to extract data from data
sources, cleanse the data, perform data transformations,
load the target data warehouse and then load the
data marts. The ETL processes are also used to generate and
maintain a central metadata repository and to support the data
warehouse [1, 10].

D. Data Warehouse Database
It is a relational data structure that is optimized for
distribution. It collects and stores integrated sets of historical,
non-volatile data from multiple operational systems and feeds
them to one or more data marts [1].

E. Data Marts
Data marts can be viewed as an extension of the data
warehouse. The data comes from multiple data sources
and is integrated before entering the Data Warehouse
System. The data marts contain subject-specific information
supporting the requirements of the end users in individual
business units. Data marts can provide rapid response to
end-user requests if most queries are directed to precomputed,
aggregated data stored in the data mart [1].

F. Metadata management
Metadata is not the actual data, but rather information
that addresses a number of data characteristics such as names
and definitions of data elements, where the data comes from,
how it is collected, and what transformations it goes through
before being stored in the data warehouse [1, 2]. Also,
meaningful metadata identifies important relationships among
data elements that are critical to using and interpreting the
data available through the end-user query tools.

G. Business Intelligence (End User functionality)
Before the end users can access the data, the data is stored
in the business intelligence layer. Here, the data can be
visualized as cubes or multidimensional data. There are
numerous ways for users to extract the data from the data
marts or from the data warehouse. OLAP tools analyze the
data and try to find correlations and meaningful patterns in a
fully automated way [1, 2].

IV. FUNCTIONAL MEASUREMENT DEFINITIONS
A. Functional Size
It is the size of a system or software as viewed from a logical,
non-technical point of view. It is more significant to the user
than a physical or technical size such as Lines of Code.
This size should be shared between the users and developers of
the given system [2].

B. COSMIC FSM Method
COSMIC is a superset of functional metrics, which
provides wider applicability than the IFPUG method. Its key
concept is the possibility of viewing the measured system
as a set of linked layers (different levels of conceptual
abstraction of the system functions) [2].

V. COSMIC PRINCIPLES
Of the complete set of requirements, COSMIC measures only the
functional user requirements (FURs). The Base
Functional Components (BFCs) must be identified within
the set of functional user requirements, and these BFCs must
be measured [4, 9]. The basic diagram for BFCs is shown in
Fig 2.

Fig 2: Base Functional Requirements

Base Functional Components in COSMIC are data
movement types, which are identified per functional process
type. The underlying principles are:
1. Software is activated by input and produces output,
or a result, that is of use to the user.
2. Software processes pieces of data, which are
materialized as data groups, each of which is a subset of
an object of interest (OOI). A data group may
consist of one or more data attributes.

The FURs are to be examined and the various functional
processes identified. Each of these functional
processes consists of a number of sub-processes (BFCs),
which are called data movements (with included data
manipulations).
COSMIC recognizes four types of data movement
sub-processes: Entry, Exit, Read and Write [4, 9]. The data
movements and their data groups are illustrated in Fig 3.

Fig 3: Data Groups in COSMIC FSM Method [3,4]

COSMIC provides the layering concept, which allows the
measurer to functionally partition the software into different
layers, to make sure that all functional processes operate on
the same level of abstraction. COSMIC also allows the
software residing within one layer to be partitioned into peer
components, if these components are developed with different
technologies, or if they are implemented on different
processors [1]. A view of this is presented in Fig 4.

Fig 4: Division of Software into its peer components
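
The counting rule implied by these principles can be sketched in Python. This is only an illustration under stated assumptions: the names `DataMovement`, `crosses_boundary` and `functional_size` are hypothetical, and the sketch assumes the usual COSMIC convention that each data movement contributes 1 CFP to the size of its functional process.

```python
from enum import Enum


class DataMovement(Enum):
    """The four COSMIC data movement types (hypothetical helper enum)."""
    ENTRY = "E"  # data group moves from a functional user into the process
    EXIT = "X"   # data group moves from the process to a functional user
    READ = "R"   # data group moves from persistent storage into the process
    WRITE = "W"  # data group moves from the process to persistent storage


def crosses_boundary(movement: DataMovement) -> bool:
    """Entries and Exits cross the boundary to a functional user;
    Reads and Writes go to or from persistent storage."""
    return movement in (DataMovement.ENTRY, DataMovement.EXIT)


def functional_size(movements: list) -> int:
    """Size of one functional process in CFP, assuming 1 CFP per movement."""
    return len(movements)


# Illustrative process: triggered by an Entry, reads stored data,
# and reports back through an Exit.
enquiry = [DataMovement.ENTRY, DataMovement.READ, DataMovement.EXIT]
print(functional_size(enquiry), "CFP")
```

Keeping the movement type separate from the count makes it easy to report a per-type breakdown later without changing how the size is computed.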

VI. CASE STUDY
To explain the functional size measurement procedure for
a Data Warehouse System, we consider the data warehouse of
an organization, containing its employees' information. As the
COSMIC method is capable of measuring the functional size of
peer components, we will take a look at the components one by one.

A. Staging Area
Fig 5 illustrates the initial flow
of data from the operational data sources to the staging area.

Fig 5: Functional Processes within the Staging Area [1]

Data Movements: We consider a simple functional process in an ETL
(Extraction, Transformation & Loading) Staging Area (SA)
tool that must move data about a single OOI-type (where E =
Entry, R = Read, W = Write and X = Exit). The data movements
between the operational data sources, the staging area and
the metadata are shown in Fig 6.

Fig 6: Data Movement within Staging Area Component

Table I illustrates the data movements and their
descriptions within the staging area.

TABLE I: TOTAL DATA MOVEMENTS OF STAGING AREA COMPONENTS
Data movement   Description
E               To start the functional process (e.g. a clock tick, if a batch process)
X               To the metadata management tool, to obtain the transformation rules for this OOI
E               From the metadata management tool, with the required metadata
R               Of the operational data source
W               Of the transformed data to the staging area
X               Error/confirmation messages
Total           6 CFP

B. Data Warehouse Component
The second component is the data warehouse component.
Fig 7 illustrates the flow of data from the staging area to the data
warehouse component.

Fig 7: Functional Processes within Data Warehouse Component [1]

Data Movements: A simple functional process of the ETL
data warehouse tool that extracts, transforms and loads data
describing a single OOI-type would have the data movements
shown in Fig 8.

Fig 8: Data Movements within Data Warehouse Components

Table II illustrates the data movements and their
descriptions within the data warehouse component.

TABLE II: TOTAL DATA MOVEMENTS OF DATA WAREHOUSE COMPONENT
Data movement   Description
E               To start the functional process (e.g. a clock tick, if a batch process)
X               To the metadata management tool, to obtain the transformation rules for this OOI
E               From the metadata management tool, with the required metadata
R               Of the staging area files
W               Of the transformed data to the data warehouse database
X               Error/confirmation messages
Total           6 CFP
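
The tally for the staging area process in Table I can be reproduced with a few lines of Python. This is a sketch only: the list `STAGING_AREA` and the function `size_in_cfp` are hypothetical names, the descriptions are paraphrased from Table I, and the sketch assumes 1 CFP per data movement.

```python
from collections import Counter

# Data movements of the staging area process, transcribed from Table I.
# Each tuple is (movement type, paraphrased description).
STAGING_AREA = [
    ("E", "start the functional process (e.g. a clock tick)"),
    ("X", "request transformation rules from the metadata management tool"),
    ("E", "receive the required metadata from the metadata management tool"),
    ("R", "read the operational data source"),
    ("W", "write the transformed data to the staging area"),
    ("X", "error/confirmation messages"),
]


def size_in_cfp(movements):
    """Return (total CFP, per-type counts), assuming 1 CFP per data movement."""
    per_type = Counter(kind for kind, _ in movements)
    return sum(per_type.values()), dict(per_type)


total, breakdown = size_in_cfp(STAGING_AREA)
print(total, breakdown)
```

The total comes to 6 CFP, in line with Table I. The data warehouse component in Table II has the same sequence of movement types, so the same tally applies to it.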

C. Data Mart Component
The third component is the data mart component. Fig 9
shows the data flow from the data warehouse component to the
data mart component.

Fig 9: Functional Processes within Data Mart Component [1]

Data Movements: In the ETL data mart tools we find the
functional processes that feed the data marts from the data
describing the OOIs that are stored in the data warehouse
component. In the data mart databases, the data is stored
in a dimensional way, that is, in star schemas consisting of
'dimension tables' and 'fact tables' [10]. Fig 10 illustrates
the data movements within the data mart component.

Fig 10: Data Movements within Data Mart Component

Table III illustrates the data movements and their
descriptions within the data mart component.

TABLE III: TOTAL DATA MOVEMENTS OF DATA MART COMPONENT
Data movement   Description
E               To start the functional process
R               Read from dimension table of the employee (Employee Output)
R               Read from dimension table of the employee (Employee Personal Information)
R               Read from dimension table of the employee
W               Of the employee's data
X               Error/confirmation messages
Total           6 CFP

D. Business Intelligence Components
The fourth component is the business intelligence
component; it generates reports and provides them to
the end user. Fig 11 illustrates the flow of data from the data
mart component to the business intelligence component, which
processes the requests of the end users and provides them
with generated reports.

Fig 11: Functional Process within the Business Intelligence
Components [1]

Data Movements: There are only three data movements in
this component, as listed in Table IV. The end user sends the
enquiry or query, which is counted as 1 E; the data required
to answer the query is read from persistent storage,
1 R; and the results are returned to the end user in the form of
a report, 1 X.

TABLE IV: TOTAL DATA MOVEMENTS OF BUSINESS INTELLIGENCE COMPONENTS
Data movement   Description
E               To start the enquiry
R               Reads the data from the persistent storage
X               Returns the result
Total           3 CFP

E. Metadata Management Component
The metadata administrator has a number of functional
processes at his disposal, with which he can create new
metadata rules, maintain existing rules or delete metadata rules
[10]. User profiles, access privilege files, data processing rules
and usage statistics can be considered as OOIs [1, 7].
Business metadata, which resemble data dictionaries, may
contain data on historical aspects, data on a data owner, etc.
[8].
It is good practice in the analysis and design of business
application software to check the required stages of the
lifecycle of every object of interest (OOI) for which persistent
data are held, because each possible transition from one stage
to another (in UML terms a 'state transition') should
correspond to a functional process. This rule is summarized
by the acronym 'CRUD', where C = Create, R = Read, U =
Update and D = Delete (sometimes known as 'CRUDL', where
L = List). Data about every object of interest must be created,
is invariably read, and will usually be updated and deleted,
and maybe listed [5].
The data movements that are generally encountered
within the metadata management component are shown in
Table V, which lists the composite relationship types, i.e. the
composite data movements made up of one or more basic
data movements.

VII. RESULTS
Table VI lists the different components of the data
warehouse along with their functional sizes and the total
functional size of the system.
Summing up the functional sizes of all the peer components,
we find that the functional size of our data warehouse
comes to 39 CFP.
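
The summation over peer components described in the Results section can be sketched as follows. The dictionary `COMPONENT_SIZES` and the function `total_size` are hypothetical names introduced for illustration; the values are the component sizes reported in Table VI.

```python
# Component sizes in CFP, as reported for the case study in Table VI.
COMPONENT_SIZES = {
    "Staging Area Component": 6,
    "Data Warehouse Component": 6,
    "Data Mart Component": 6,
    "Business Intelligence Component": 3,
    "Metadata Management Component": 18,
}


def total_size(sizes):
    """Total functional size: the sum of the peer component sizes, in CFP."""
    return sum(sizes.values())


print(total_size(COMPONENT_SIZES), "CFP")
```

Because each peer component is sized separately, a change to one component only requires updating its own entry before re-summing.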
TABLE V: DATA MOVEMENTS OF COMPOSITE RELATIONSHIP TYPES
Composite relationship type   Data movement types
Creates                       1 E, 1 W
Views                         1 E, 1 R, 1 X
Lists                         1 E, 1 R, 1 X
Changes or Uses               1 E, 1 R, 1 X, 1 W
Deletes                       1 E, 1 W
Archives                      1 E, 1 W
Updates                       1 E, 1 W
Reads                         1 R
Total                         18 CFP

TABLE VI: TOTAL FUNCTIONAL SIZE OF THE DATA WAREHOUSE
Component name                    Calculated functional size of component
Staging Area Component            6 CFP
Data Warehouse Component          6 CFP
Data Mart Component               6 CFP
Business Intelligence Component   3 CFP
Metadata Management Component     18 CFP
Total                             39 CFP

VIII. CONCLUSION
Sizing the data warehouse will help warehouse
administrators to allocate resources and predict
the effort that will be needed. The various data warehouse
software components can be measured separately using the
COSMIC method because of its layered sizing feature [4].
Using the COSMIC FSM method we have successfully
measured the functional size of our data warehouse.
Developers who need to design a similar
data warehouse can use this information to manage
resources, cut down costs, manage manpower efficiently
and complete the project on schedule.
Future work may include developing a new hybrid FSM
method that combines features of both the COSMIC
FSM and MK II FPA methods. For instance, the sensitivity
to small changes to requirements of the MK II FPA method is
high (it detects changes of single data element types and single
entity references), whereas the sensitivity of COSMIC FSM to
small changes to requirements is moderate (it detects changes
to single data groups). Likewise, the smallest feasible
enhancement that can be measured using the MK II FPA method
is 0.26 fp, whereas COSMIC FSM can only handle a smallest
feasible enhancement of 1 CFP [11].
Also, a new FSM method could be proposed that includes
the following characteristics, which the current FSM methods
are unable to cope with: measuring corrective maintenance
(fixes), measuring perfective maintenance (refactoring
for improved performance), measuring algorithmic complexity
and measuring reuse of code.

REFERENCES
[1] Van Heeringen, H., "Measuring the functional size of a data
warehouse application using the COSMIC FFP method",
Software Measurement European Forum Conference, Rome,
Italy, May 2006.
[2] Santillo, L., "Size & Estimation of data warehouse systems", in
FESMA DASMA 2001 conference proceedings, Heidelberg
(Germany), May 2001.
[3] "International Software Benchmarking Standards Group
database, version 9", January 2008.
[4] "The COSMIC functional size measurement method, version
3.0: Measurement Manual", September 2007.
[5] "The COSMIC functional size measurement method, version
3.0: Guideline for sizing business application software", Version
1.1, May 2008.
[6] Inmon, W.H., "What is a Data Warehouse?", Prism, Volume 1,
Number 1, 1995.
[7] Inmon, W.H., "Metadata in a Data Warehouse: A
Statement of Vision", White Paper, Pine Cone Systems,
Colorado, December 2005.
[8] Chaudhuri, S. and Dayal, U., "An Overview of Data Warehousing
and OLAP Technology", ACM SIGMOD Record, vol. 26 (1),
1997, pp. 65-74.
[9] Sachdeva, S., "Meta data architecture for data warehousing", DM
Review Magazine, April 1998.
[10] "The COSMIC Method v3.0: Guideline for Sizing Data
Warehouse Software", 2009.
[11] "MK II Function Point Analysis Counting Practices Manual
v1.3.1", 1998.

Student login on Anyboli platform.helpin
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 

Estimation of Functional Size of a Data Warehouse System using COSMIC FSM Method

  • 1. Full Paper. Proc. of Int. Conf. on Advances in Computer Science and Application 2013. © 2013 ACEEE DOI: 03.LSCS.2013.3.553

Estimation of Functional Size of a Data Warehouse System using COSMIC FSM Method

Avinash Samuel1, Anil Kumar Pandey2, Vivek Kumar Sharma2
1Department of Computer Science & Engineering, Invertis University, Bareilly, India. Avinash_12141@yahoo.co.in
2Department of Computer Science & Engineering, Invertis University, Bareilly, India. { Anipandey, Sharma.vivek109}@gmail.com

Abstract: It is not easy to measure the functional size of a data warehouse system. A data warehouse system is not a traditional system, but it can be measured using an FSM (Functional Size Measurement) method. In this paper we show, with the help of a case study, how to measure the functional size of a data warehouse system using the COSMIC FSM method. We explore the use of COSMIC in sizing data warehouse systems.

Keywords: Functional Size, Data Warehouse, COSMIC FSM.

I. INTRODUCTION

In today's market it has become a necessity to create a sustainable competitive advantage by building systems with which current operations can be easily tracked and future strategies predicted. One of the current market trends is therefore the growing interest in the development of large data warehouses. It is very hard to estimate the effort and resources required to build a data warehouse system in the early phases of development. One of the methods proposed by IFPUG (International Function Point Users Group) is FPA (Function Point Analysis). Function Point Analysis uses the data movements in the system to measure the end-user requirements. We can therefore derive an early estimate of the functional size of the system before any code has been written [4]. If development productivity figures are known, the cost of developing new software can be estimated early enough to make direct comparisons with the cost of buying a software package, or of simply using a non-technical solution [4].

The most widely applied method of measuring functional size is Function Point Analysis (FPA). Another ISO-certified method is COSMIC (Common Software Measurement International Consortium) FSM. Recently, frameworks have been described for measuring data warehouse applications with FSM. COSMIC is a better way of measuring the functional size of such software because it can measure the software in different layers and because the size of an individual function is not cut off by a maximum size per function, as it is in FPA [2]. In this paper we measure the functional size of a data warehouse system using the COSMIC FSM method to illustrate this concept.

II. RELATED WORK

Functional Size Measurement is a new and emerging field, but many of its foundations were laid by Allan Albrecht in 1978 [10]. He was a pioneer and can be regarded as the father of Functional Size Measurement. He proposed the FPA method, now known as IFPUG FPA, which uses Internal Logical Files, External Interface Files, External Inputs, External Outputs and External Inquiries to measure the functional size of software. The Mark II FPA (MK II FPA) method was proposed by Charles Symons at Nolan Norton in 1984. It uses Input Data Elements, Entity References and Output Data Elements as the Base Functional Components (BFCs) to measure the functional size of a system. The COSMIC FSM method was created in 1997 by an international consortium of industry subject matter experts and academics from 19 countries. It uses data movements (Read, Write, Entry and Exit) to measure functional size [9]. The COSMIC FSM method is capable of measuring software that has a layered architecture. COSMIC has released a guideline for measuring the functional size of data warehouse systems, but even with the guideline available the measurement is quite complex. Little work has been done in this field, and it is not yet clear how the functional size of a data warehouse system should be measured.

III. WHAT IS A DATA WAREHOUSE?

A data warehouse is a subject-oriented, integrated, time-variant, non-volatile collection of data in support of management's decision-making process [6].

Note: The data warehouse is always a physically separate store of data transformed from the application data found in the operational environment.

Fig 1 shows the architecture of a data warehouse along with its components. We now look at the components and their functions.

A. Operational data sources
These are the operational systems that store the transactions of the business. They are located outside the data warehouse, as the data warehouse has no control over the content and format of their data. The data in these systems is stored in many formats, from flat files to hierarchical and relational databases.
  • 2.

Fig 1: Architecture of Data Warehouse

B. Data Staging Area
Its function is restricted to extracting, cleaning, matching and loading data from multiple sources. It is explicitly off limits to end users, i.e. the end user has no access to or control over the data staging area. It does not support query or presentation services. A data-cleansing tool is used to process the data, for example to resolve name and address misspellings.

C. Extract Transform Load (ETL) Processes
ETL processes are used to extract data from the data sources, cleanse the data, perform data transformations, and load the target data warehouse and subsequently the data marts. The ETL processes are also used to generate and maintain a central metadata repository and to support the data warehouse [1, 10].

D. Data Warehouse Database
It is a relational data structure that is optimized for distribution. It collects and stores integrated sets of historical, non-volatile data from multiple operational systems and feeds them to one or more data marts [1].

E. Data Marts
Data marts can be viewed as an extension of the data warehouse. The data comes from multiple data sources and is integrated before entering the data warehouse system. The data marts contain subject-specific information supporting the requirements of the end users in individual business units. Data marts can provide rapid response to end-user requests if most queries are directed to precomputed, aggregated data stored in the data mart [1].

F. Metadata management
Metadata is not the actual data but information about a number of data characteristics, such as the names and definitions of data elements, where the data comes from, how it is collected, and what transformations it goes through before being stored in the data warehouse [1, 2]. Meaningful metadata also identifies important relationships among data elements that are critical to using and interpreting the data available through the end-user query tools.

G. Business Intelligence (End User functionality)
Before end users can access the data, it is stored in the business intelligence layer. Here the data can be visualized as cubes or multidimensional data. There are numerous ways for users to extract data from the data marts or from the data warehouse. OLAP tools analyze the data and try to find correlations and meaningful patterns in a fully automated way [1, 2].

IV. FUNCTIONAL MEASUREMENT DEFINITIONS

A. Functional Size
It is the size of a system or piece of software as viewed from a logical, non-technical point of view. It is more significant to the user than a physical or technical size such as lines of code. This size should be shared between the users and the developers of the given system [2].

B. COSMIC FSM Method
COSMIC is a superset of functional metrics that provides wider applicability than the IFPUG method. Its key concept is the possibility of viewing the measured system as different linked layers (different levels of conceptual abstraction of the system functions) [2]. The Functional User Requirements (FURs) are examined and the functional processes are identified. Each functional process consists of a number of sub-processes (BFCs), which are called data movements (with included data manipulations). COSMIC recognizes four types of data movement sub-process: Entry, Exit, Read and Write [4, 9].

  • 3.

V. COSMIC PRINCIPLES

In COSMIC, out of the complete set of requirements, only the functional user requirements are measured. The Base Functional Components (BFCs) must be identified within the set of functional user requirements, and these BFCs must be measured [4, 9]. The basic diagram for BFCs is shown in Fig 2. The Base Functional Components in COSMIC are data movement types, which are identified per functional process type. The underlying principles are:
1. Software is activated by input and produces output, or a result, that is of use to the user.
2. Software processes pieces of data, which are materialized as data groups, each of which is a subset of an object of interest (OOI). A data group may consist of one or more data attributes.

Fig 2: Base Functional Requirements

The different data movements and their data groups are illustrated in Fig 3.

Fig 3: Data Groups in COSMIC FSM Method [3,4]

COSMIC provides the layering concept, which allows the measurer to partition the software functionally into different layers, to make sure that all functional processes operate at the same level of abstraction. COSMIC also allows the software residing within one layer to be partitioned into peer components if these components are developed with different technologies or implemented on different processors [1]. A view of this is presented in Fig 4.

Fig 4: Division of Software into its peer components
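The counting rule behind the data movement principle can be sketched in a few lines of Python. This is only an illustration under the usual COSMIC convention that each identified data movement contributes 1 CFP; the names `DataMovement`, `functional_size` and `movement_profile`, and the sample enquiry process, are hypothetical and not taken from the paper.

```python
from collections import Counter
from enum import Enum


class DataMovement(Enum):
    """The four COSMIC data movement types."""
    ENTRY = "E"
    EXIT = "X"
    READ = "R"
    WRITE = "W"


def functional_size(movements):
    """Size of one functional process in CFP: one CFP per data movement."""
    return len(movements)


def movement_profile(movements):
    """Count how many movements of each type the process contains."""
    counts = Counter(m.value for m in movements)
    return {code: counts.get(code, 0) for code in ("E", "X", "R", "W")}


# Hypothetical enquiry process: receive a query, read stored data, return a report.
enquiry = [DataMovement.ENTRY, DataMovement.READ, DataMovement.EXIT]

print(functional_size(enquiry))
print(movement_profile(enquiry))
```

Keeping the movements as an explicit list, rather than a bare count, makes it easy to show which movement types drive the size of a process.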
  • 4.

VI. CASE STUDY

To explain the functional size measurement procedure for a data warehouse system, we consider the data warehouse of an organization containing its employees' information. As the COSMIC method is capable of measuring the functional size of peer components, we look at the components one by one.

A. Staging Area
Fig 5 illustrates the initial flow of data from the operational data sources to the staging area.

Fig 5: Functional Processes within the Staging Area [1]

Data Movements: A simple functional process in an ETL (Extraction, Transformation & Loading) staging area (SA) tool that must move data about a single OOI-type has the data movements shown in Fig 6 (where E = Entry, R = Read, W = Write and X = Exit), which take place between the operational data sources, the staging area and the metadata.

Fig 6: Data Movement within Staging Area Component

Table I lists the data movements within the staging area and their descriptions.

TABLE I: TOTAL DATA MOVEMENTS OF STAGING AREA COMPONENTS
E      To start the functional process (e.g. a clock tick, if a batch process)
X      To the metadata management tool, to obtain the transformation rules for this OOI
E      From the metadata management tool, with the required metadata
R      Of the operational data source
W      Of the transformed data to the staging area
X      Error/confirmation messages
Total  6 CFP

B. Data Warehouse Component
The second component is the data warehouse component. Fig 7 illustrates the flow of data from the staging area to the data warehouse component.

Fig 7: Functional Processes within Data Warehouse Component [1]

Data Movements: A simple functional process of the ETL data warehouse tool that extracts, transforms and loads data describing a single OOI-type has the data movements shown in Fig 8.

Fig 8: Data Movements within Data Warehouse Components

Table II lists the data movements within the data warehouse component and their descriptions.

TABLE II: TOTAL DATA MOVEMENTS OF DATA WAREHOUSE COMPONENT
E      To start the functional process (e.g. a clock tick, if a batch process)
X      To the metadata management tool, to obtain the transformation rules for this OOI
E      From the metadata management tool, with the required metadata
R      Of the staging area files
W      Of the transformed data to the data warehouse database
X      Error/confirmation messages
Total  6 CFP

  • 5.

C. Data Mart Component
The third component is the data mart component. Fig 9 shows the data flow from the data warehouse component to the data mart component.

Fig 9: Functional Processes within Data Mart Component [1]

Data Movements: In the ETL data mart tools we find the functional processes that feed the data marts from the data describing the OOIs that are stored in the data warehouse component. In the data mart databases the data is stored in a dimensional way, in star schemas, which contain both 'dimension tables' and 'fact tables' [10]. Fig 10 illustrates the data movements within the data mart component.

Fig 10: Data Movements within Data Mart Component

Table III lists the data movements within the data mart component and their descriptions.

TABLE III: TOTAL DATA MOVEMENTS OF DATA MART COMPONENT
E      To start the functional process
R      Read from dimension table of the employee (Employee Output)
R      Read from dimension table of the employee (Employee Personal Information)
R      Read from dimension table of the employee
W      Of the employee's data
X      Error/confirmation messages
Total  6 CFP

D. Business Intelligence Components
The fourth component is the business intelligence component, which generates reports and provides them to the end user. Fig 11 illustrates the flow of data from the data mart component to the business intelligence component, which processes the requests of the end users and provides them with the generated reports.

Fig 11: Functional Process within the Business Intelligence Components [1]

Data Movements: There are only three data movements in this component, as listed in Table IV. The end user sends the enquiry or query, which is counted as 1 E; the data needed to answer the query is gathered from persistent storage, 1 R; and the results are returned to the end user in the form of a report, 1 X.

TABLE IV: TOTAL DATA MOVEMENTS OF BUSINESS INTELLIGENCE COMPONENTS
E      To start the enquiry
R      Reads the data from the persistent storage
X      Returns the result
Total  3 CFP

E. Metadata Management Component
The metadata administrator has a number of functional processes at his disposal, with which he can create new metadata rules, maintain existing rules or delete metadata rules [10]. User profiles, access privilege files, data processing rules and usage statistics can be considered OOIs [1, 7]. Business metadata, which resembles a data dictionary, may contain data on historical aspects, data on a data owner, etc. [8]. It is good practice in the analysis and design of business application software to check the required stages of the lifecycle of every object of interest (OOI) for which persistent data are held, because each possible transition from one stage to another (in UML terms a 'state transition') should correspond to a functional process. This rule is summarized by the acronym 'CRUD', where C = Create, R = Read, U = Update and D = Delete (sometimes known as 'CRUDL', where L = List). Data about every object of interest must be created, is invariably read, and will usually be updated and deleted, and may be listed [5]. The data movements generally encountered within the metadata management component are shown in Table V, which lists the relationship types, i.e. the composite data movements made up of one or more basic data movements.

VII. RESULTS

Table VI lists the components of the data warehouse along with their functional sizes and the total functional size of the system. Summing the functional sizes of all the peer components, we find that the functional size of our data warehouse comes to 39 CFP.
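The total reported in the results can be checked with a short Python sketch. The component sizes are the ones stated in the paper's tables; the dictionary and function names are illustrative assumptions, and the per-component share is an extra view that the paper itself does not report.

```python
# Functional size of each peer component in CFP, as stated in the paper's tables.
component_sizes_cfp = {
    "Staging Area Component": 6,
    "Data Warehouse Component": 6,
    "Data Mart Component": 6,
    "Business Intelligence Component": 3,
    "Metadata Management Component": 18,
}


def total_size(sizes):
    """Total functional size: the sum of the peer component sizes."""
    return sum(sizes.values())


def size_shares(sizes):
    """Fraction of the total size contributed by each component."""
    total = total_size(sizes)
    return {name: cfp / total for name, cfp in sizes.items()}


total = total_size(component_sizes_cfp)
print(f"Total functional size: {total} CFP")
for name, share in size_shares(component_sizes_cfp).items():
    print(f"{name}: {share:.1%}")
```

Because the peer components are measured separately, the same structure can be reused to size a similar warehouse by changing only the per-component figures.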
  • 6.

TABLE V: DATA MOVEMENTS OF COMPOSITE RELATIONSHIP TYPES
Composite relationship type    Data movement types
Creates                        1 E, 1 W
Views                          1 E, 1 R, 1 X
Lists                          1 E, 1 R, 1 X
Changes or Uses                1 E, 1 R, 1 X, 1 W
Deletes                        1 E, 1 W
Archives                       1 E, 1 W
Updates                        1 E, 1 W
Reads                          1 R
Total                          18 CFP

TABLE VI: TOTAL FUNCTIONAL SIZE OF THE DATA WAREHOUSE
Component name                     Calculated functional size of component
Staging Area Component             6 CFP
Data Warehouse Component           6 CFP
Data Mart Component                6 CFP
Business Intelligence Component    3 CFP
Metadata Management Component      18 CFP
Total                              39 CFP

VIII. CONCLUSION

Sizing the data warehouse helps warehouse administrators to allocate resources and predict the effort that will be needed. The various data warehouse software components can be measured separately using the COSMIC method, thanks to its layered sizing feature [4]. Using the COSMIC FSM method we have successfully measured the functional size of our data warehouse. If developers need to design a similar data warehouse, they can use this information to manage resources, cut costs, manage manpower efficiently and complete the project on schedule.

Future work may include developing a new hybrid FSM method that combines features of the COSMIC FSM and MK II FPA methods. For instance, the sensitivity of the MK II FPA method to small changes in requirements is high (it detects changes to single data element types and single entity references), whereas that of COSMIC FSM is moderate (it detects changes to single data groups), and the smallest feasible enhancement that can be measured with the MK II FPA method is 0.26 fp, whereas COSMIC FSM can handle a smallest feasible enhancement of 1 fp only [11]. A new FSM method could also be proposed with the following characteristics, which current FSM methods are unable to cope with: measuring corrective maintenance (fixes), measuring perfective maintenance (refactoring for improved performance), measuring algorithmic complexity, and measuring reuse of code.

REFERENCES

[1] Van Heeringen, H., "Measuring the functional size of a data warehouse application using the COSMIC FFP method", Software Measurement European Forum Conference, Rome, Italy, May 2006.
[2] Santillo, L., "Size & Estimation of data warehouse systems", in FESMA DASMA 2001 conference proceedings, Heidelberg, Germany, May 2001.
[3] "International Software Benchmarking Standards Group database, version 9", January 2008.
[4] "The COSMIC functional size measurement method, version 3.0: Measurement Manual", September 2007.
[5] "The COSMIC functional size measurement method, version 3.0: Guideline for sizing business application software", Version 1.1, May 2008.
[6] Inmon, W.H., "What is a Data Warehouse?", Prism, Volume 1, Number 1, 1995.
[7] Inmon, W.H., "Metadata in a Data in a Data Warehouse: A Statement of Vision", White Paper, Pine Cone Systems, Colorado, December 2005.
[8] Chaudhuri, S. and Dayal, U., "An Overview of Data Warehousing and OLAP Technology", ACM Sigmod Record, vol. 26 (1), 1997, pp. 65-74.
[9] Sachdeva, S., "Meta data architecture for data warehousing", DM Review Magazine, April 1998.
[10] "The COSMIC Method v3.0: Guideline for Sizing Data Warehouse Software", 2009.
[11] "MK II Function Point Analysis Counting Practices Manual v1.3.1", 1998.