SlideShare una empresa de Scribd logo
1 de 26
Descargar para leer sin conexión
Automating the Formalization of Product Comparison Matrices 
Guillaume Bécan, Nicolas Sannier, Mathieu Acher, Olivier Barais, Arnaud Blouin, Benoit Baudry
Product lines everywhere 
Automating the Formalization of Product Comparison Matrices 
- 2
Product Comparison Matrices (PCMs) 
Automating the Formalization of Product Comparison Matrices 
- 3
Services on top of PCMs 
Automating the Formalization of Product Comparison Matrices 
- 4 
Edit 
Compare 
Visualize 
Filter 
Rank 
Merge 
Configure 
Multi-objective optimization
Problem 
Automating the Formalization of Product Comparison Matrices 
- 5 
Edit 
Compare 
Visualize 
… 
Information is: 
•Uncontrolled 
•Heterogeneous 
•Ambiguous 
[Sannier et al, ASE 2013] 
| [[Acer Inc.|Acer]] 
| [[Acer beTouch E110|beTouch E110]] 
| {{dts|format=dmy|2010|2|15}} 
| 1.5 
| [[320x240|320x240 QVGA]] 
| {{convert|2.8|in|mm|abbr=on}} 
| Touch, accelerometer 
| 
* [[GSM]]/​GPRS/​[[Enhanced Data Rates for GSM Evolution|EDGE]] 
* [[Universal Mobile Telecommunications System|UMTS]] 850 1900 
* CSD
Problem 
Automating the Formalization of Product Comparison Matrices 
- 6 
Common 
language 
Transformation 
Edit 
Compare 
Visualize 
… 
•How to formalize data contained in natural language PCMs? 
•How to automate the formalization of PCMs? 
•What tools and services can be built on top of this formalization?
Contributions 
Automating the Formalization of Product Comparison Matrices 
- 7 
1.Design of a metamodel for product comparison matrices 
2.Automated techniques for formalizing raw data into formalized product comparison matrix model 
3.Evaluation on 30,000+ cells from Wikipedia
Metamodeling driven by (lots of) data 
Automating the Formalization of Product Comparison Matrices 
- 8 
Working on the metamodel since February 2013 
300+ PCMs – 300,000 cells 
Numerous domains 
Manual review of 50 PCMs (thousands of cells) 
Statistics on all PCMs 
Analysis of Wikipedia syntax for tables 
Automated transformation of all PCMs to PCM models
PCM metamodel 
Automating the Formalization of Product Comparison Matrices 
- 9
PCM metamodel 
Automating the Formalization of Product Comparison Matrices 
- 10 
Structure of a PCM
PCM metamodel 
Automating the Formalization of Product Comparison Matrices 
- 11 
Feature/Product oriented
Automating the Formalization of Product Comparison Matrices 
- 12 
Formalized interpretation of a cell 
Data types: Boolean, Integer, Real 
Special values: Unknown, Empty, Inconsistent, Partial 
PCM metamodel 
row string 
formalized integer
Contributions 
Automating the Formalization of Product Comparison Matrices 
- 13 
1.Design of a metamodel for product comparison matrices 
2.Automated techniques for formalizing raw data into formalized product comparison matrix model 
3.Evaluation on 30,000+ cells from Wikipedia
Approach 
Automating the Formalization of Product Comparison Matrices 
- 14 
Parsing: transform a PCM artefact in a PCM model 
PCM 
PCM model 
parsing 
preprocessing 
extracting 
information 
exploiting 
PCM model 
PCM model 
PCM metamodel 
S 
E 
R 
V 
I 
C 
E 
S 
| [[Acer Inc.|Acer]] 
| [[Acer beTouch E110|beTouch E110]] 
| {{dts|format=dmy|2010|2|15}} 
| 1.5 
| [[320x240|320x240 QVGA]] 
| {{convert|2.8|in|mm|abbr=on}} 
| Touch, accelerometer 
| 
* [[GSM]]/​GPRS/​[[Enhanced Data Rates for GSM Evolution|EDGE]] 
* [[Universal Mobile Telecommunications System|UMTS]] 850 1900 
* CSD 
Enable the development of a generic formalization process
Approach 
Automating the Formalization of Product Comparison Matrices 
- 15 
Preprocessing: 
Contributors cannot be trusted: missing cells, headers everywhere 
We have to normalize the matrix and identify headers 
Default strategy: first line and first column are headers 
PCM 
PCM model 
parsing 
preprocessing 
extracting 
information 
exploiting 
PCM model 
PCM model 
PCM metamodel 
S 
E 
R 
V 
I 
C 
E 
S
Approach 
Automating the Formalization of Product Comparison Matrices 
- 16 
Extracting information: 
•Identify features and products 
•Interpret cells based on a set of syntactic rules (regex) 
PCM 
PCM model 
parsing 
preprocessing 
extracting 
information 
exploiting 
PCM model 
PCM model 
PCM metamodel 
S 
E 
R 
V 
I 
C 
E 
S 
List of rules: … "d+" => Integer … 
match 
Integer(100) 
Same process as the metamodel for creating the rules
Contributions 
Automating the Formalization of Product Comparison Matrices 
- 17 
1.Design of a metamodel for product comparison matrices 
2.Automated techniques for formalizing raw data into formalized product comparison matrix model 
3.Evaluation on 30,000+ cells from Wikipedia
Evaluation 
Automating the Formalization of Product Comparison Matrices 
- 18 
Experimental settings: 
•75 PCMs from Wikipedia 
•Headers specified manually 
•Automated extraction of information 
PCM 
PCM model 
parsing 
preprocessing 
extracting 
information 
PCM model 
PCM model 
PCM metamodel 
exploiting 
S 
E 
R 
V 
I 
C 
E 
S 
RQ1 
RQ2 
RQ3
Evaluation 
Automating the Formalization of Product Comparison Matrices 
- 19 
Task: check interpretation of each cell (30,000+) 
•Validate 
•Correct it with existing concept 
•Correct it with a new concept 
•I don’t know / there is no interpretation 
20 evaluators 
Online editor
Evaluation 
Automating the Formalization of Product Comparison Matrices 
- 20 
Metrics: 
•Number of valid cells 
•Number of cells corrected with concepts from the metamodel 
•Number of cells corrected with new concepts 
•List of new concepts
Evaluation 
Automating the Formalization of Product Comparison Matrices 
- 21 
RQ1: To what extent can PCMs be formalized? 
93.11% of the cells are valid 
2.61% are corrected with concepts from the metamodel 
4.28% are invalid and the evaluators proposed a new concept 
•Dates 
•Dimensions and units 
•Versions 
Solution: 
•Add corresponding data types to the metamodel 
•Create new rules for interpreting cells
Evaluation 
Automating the Formalization of Product Comparison Matrices 
- 22 
RQ2: To what extent can the formalization be automated? 
93,11% of the cells are correctly formalized 
Formalization errors may arise from 4 main areas: 
•Overlapping concepts (e.g. what does an empty cell mean?) 
•Missing concepts (e.g. dates, versions…) 
•Missing interpretation rules 
•Bad rules
Evaluation 
Automating the Formalization of Product Comparison Matrices 
- 23 
RQ3: What services can be built on top of formalized PCMs? 
Editing and formalizing PCMs Warnings during edition (inconsistent cells) Filtering capabilities Translate PCMs to variability models 
The metamodel provides 
•Feature/product oriented perspective 
•Clear semantics
Results of the evaluation 
Automating the Formalization of Product Comparison Matrices 
- 24 
We now have a common language for PCMs 
•validated by humans 
•validated by transformation 
•validated by the editor 
A large proportion of the formalization can be automated 
BUT human is necessary 
Good news: the editor can help formalizing the data
Future work 
Automating the Formalization of Product Comparison Matrices 
- 26 
Universal editor 
Support large datasets 
Community of PCM contributors 
Synchronization with Wikipedia
Questions? 
Automating the Formalization of Product Comparison Matrices 
- 27 
PCM 
PCM model 
parsing 
preprocessing 
extracting 
information 
exploiting 
PCM model 
PCM model 
PCM metamodel 
S 
E 
R 
V 
I 
C 
E 
S

Más contenido relacionado

La actualidad más candente

CIS110 Computer Programming Design Chapter (7)
CIS110 Computer Programming Design Chapter  (7)CIS110 Computer Programming Design Chapter  (7)
CIS110 Computer Programming Design Chapter (7)Dr. Ahmed Al Zaidy
 
Programming Logic and Design: Working with Data
Programming Logic and Design: Working with DataProgramming Logic and Design: Working with Data
Programming Logic and Design: Working with DataNicole Ryan
 
Machine learning with scikitlearn
Machine learning with scikitlearnMachine learning with scikitlearn
Machine learning with scikitlearnPratap Dangeti
 
CIS110 Computer Programming Design Chapter (10)
CIS110 Computer Programming Design Chapter  (10)CIS110 Computer Programming Design Chapter  (10)
CIS110 Computer Programming Design Chapter (10)Dr. Ahmed Al Zaidy
 
CIS110 Computer Programming Design Chapter (13)
CIS110 Computer Programming Design Chapter  (13)CIS110 Computer Programming Design Chapter  (13)
CIS110 Computer Programming Design Chapter (13)Dr. Ahmed Al Zaidy
 
Building largescalepredictionsystemv1
Building largescalepredictionsystemv1Building largescalepredictionsystemv1
Building largescalepredictionsystemv1arthi v
 
Logic Formulation 3
Logic Formulation 3Logic Formulation 3
Logic Formulation 3deathful
 
Feature Selection Techniques for Software Fault Prediction (Summary)
Feature Selection Techniques for Software Fault Prediction (Summary)Feature Selection Techniques for Software Fault Prediction (Summary)
Feature Selection Techniques for Software Fault Prediction (Summary)SungdoGu
 
Chapter 12 Lecture: GUI Programming, Multithreading, and Animation
Chapter 12 Lecture: GUI Programming, Multithreading, and AnimationChapter 12 Lecture: GUI Programming, Multithreading, and Animation
Chapter 12 Lecture: GUI Programming, Multithreading, and AnimationNicole Ryan
 
Decision Support Analyss for Software Effort Estimation by Analogy
Decision Support Analyss for Software Effort Estimation by AnalogyDecision Support Analyss for Software Effort Estimation by Analogy
Decision Support Analyss for Software Effort Estimation by AnalogyTim Menzies
 
Decision table
Decision tableDecision table
Decision tablejeebala
 
Circuit analysis i with matlab computing and simulink sim powersystems modeling
Circuit analysis i with matlab computing and simulink sim powersystems modelingCircuit analysis i with matlab computing and simulink sim powersystems modeling
Circuit analysis i with matlab computing and simulink sim powersystems modelingIndra S Wahyudi
 
Pycon2015 scope
Pycon2015 scopePycon2015 scope
Pycon2015 scopearthi v
 

La actualidad más candente (13)

CIS110 Computer Programming Design Chapter (7)
CIS110 Computer Programming Design Chapter  (7)CIS110 Computer Programming Design Chapter  (7)
CIS110 Computer Programming Design Chapter (7)
 
Programming Logic and Design: Working with Data
Programming Logic and Design: Working with DataProgramming Logic and Design: Working with Data
Programming Logic and Design: Working with Data
 
Machine learning with scikitlearn
Machine learning with scikitlearnMachine learning with scikitlearn
Machine learning with scikitlearn
 
CIS110 Computer Programming Design Chapter (10)
CIS110 Computer Programming Design Chapter  (10)CIS110 Computer Programming Design Chapter  (10)
CIS110 Computer Programming Design Chapter (10)
 
CIS110 Computer Programming Design Chapter (13)
CIS110 Computer Programming Design Chapter  (13)CIS110 Computer Programming Design Chapter  (13)
CIS110 Computer Programming Design Chapter (13)
 
Building largescalepredictionsystemv1
Building largescalepredictionsystemv1Building largescalepredictionsystemv1
Building largescalepredictionsystemv1
 
Logic Formulation 3
Logic Formulation 3Logic Formulation 3
Logic Formulation 3
 
Feature Selection Techniques for Software Fault Prediction (Summary)
Feature Selection Techniques for Software Fault Prediction (Summary)Feature Selection Techniques for Software Fault Prediction (Summary)
Feature Selection Techniques for Software Fault Prediction (Summary)
 
Chapter 12 Lecture: GUI Programming, Multithreading, and Animation
Chapter 12 Lecture: GUI Programming, Multithreading, and AnimationChapter 12 Lecture: GUI Programming, Multithreading, and Animation
Chapter 12 Lecture: GUI Programming, Multithreading, and Animation
 
Decision Support Analyss for Software Effort Estimation by Analogy
Decision Support Analyss for Software Effort Estimation by AnalogyDecision Support Analyss for Software Effort Estimation by Analogy
Decision Support Analyss for Software Effort Estimation by Analogy
 
Decision table
Decision tableDecision table
Decision table
 
Circuit analysis i with matlab computing and simulink sim powersystems modeling
Circuit analysis i with matlab computing and simulink sim powersystems modelingCircuit analysis i with matlab computing and simulink sim powersystems modeling
Circuit analysis i with matlab computing and simulink sim powersystems modeling
 
Pycon2015 scope
Pycon2015 scopePycon2015 scope
Pycon2015 scope
 

Similar a Automating the Formalization of Product Comparison Matrices

ES2022-Minh-Nguyen-ShapingTestsIntoModelsForAutomatedTCGeneration.pdf
ES2022-Minh-Nguyen-ShapingTestsIntoModelsForAutomatedTCGeneration.pdfES2022-Minh-Nguyen-ShapingTestsIntoModelsForAutomatedTCGeneration.pdf
ES2022-Minh-Nguyen-ShapingTestsIntoModelsForAutomatedTCGeneration.pdfMinh Nguyen
 
CAD/CAM -PPT UNIT NO-I
CAD/CAM -PPT UNIT NO-ICAD/CAM -PPT UNIT NO-I
CAD/CAM -PPT UNIT NO-IMalothHeeralal
 
20220914-MBT-Experiences-SB1-final.pptx
20220914-MBT-Experiences-SB1-final.pptx20220914-MBT-Experiences-SB1-final.pptx
20220914-MBT-Experiences-SB1-final.pptxMinh Nguyen
 
Modelon Modelica executable requirements Ansys Conference 2016
Modelon Modelica executable requirements Ansys Conference 2016Modelon Modelica executable requirements Ansys Conference 2016
Modelon Modelica executable requirements Ansys Conference 2016Modelon
 
Automotive engineering design - Model Based Design
Automotive engineering design - Model Based DesignAutomotive engineering design - Model Based Design
Automotive engineering design - Model Based DesignVinayagam Mariappan
 
Presentation Verification & Validation
Presentation Verification & ValidationPresentation Verification & Validation
Presentation Verification & ValidationElmar Selbach
 
Feedback Strategy for Closed-loop Inspection Based on STEP-NC
Feedback Strategy for Closed-loop Inspection Based on STEP-NCFeedback Strategy for Closed-loop Inspection Based on STEP-NC
Feedback Strategy for Closed-loop Inspection Based on STEP-NCCristhian Riaño Jaimes
 
An Integrated Simulation Tool Framework for Process Data Management
An Integrated Simulation Tool Framework for Process Data ManagementAn Integrated Simulation Tool Framework for Process Data Management
An Integrated Simulation Tool Framework for Process Data ManagementCognizant
 
Guiding through a typical Machine Learning Pipeline
Guiding through a typical Machine Learning PipelineGuiding through a typical Machine Learning Pipeline
Guiding through a typical Machine Learning PipelineMichael Gerke
 
Customer choice probabilities
Customer choice probabilitiesCustomer choice probabilities
Customer choice probabilitiesAllan D. Butler
 
QDM WEB System -- Connect your suppliers and global enterprise
QDM WEB System -- Connect your suppliers and global enterpriseQDM WEB System -- Connect your suppliers and global enterprise
QDM WEB System -- Connect your suppliers and global enterpriseBenjamin Reese
 
Intro to LV in 3 Hours for Control and Sim 8_5.pptx
Intro to LV in 3 Hours for Control and Sim 8_5.pptxIntro to LV in 3 Hours for Control and Sim 8_5.pptx
Intro to LV in 3 Hours for Control and Sim 8_5.pptxDeepakJangid87
 
Icon solutions presentation - Pure Hybrid Cloud Event, 11th September London
Icon solutions presentation - Pure Hybrid Cloud Event, 11th September LondonIcon solutions presentation - Pure Hybrid Cloud Event, 11th September London
Icon solutions presentation - Pure Hybrid Cloud Event, 11th September LondonIBM Systems UKI
 
Modeling and Testing Dovetail in MagicDraw
Modeling and Testing Dovetail in MagicDrawModeling and Testing Dovetail in MagicDraw
Modeling and Testing Dovetail in MagicDrawGregory Solovey
 
Model-Based Design For Motor Control Development
Model-Based Design For Motor Control DevelopmentModel-Based Design For Motor Control Development
Model-Based Design For Motor Control DevelopmentThe Hartford
 
B Kindilien Finding Efficiency In Mach 120408
B Kindilien Finding Efficiency In Mach 120408B Kindilien Finding Efficiency In Mach 120408
B Kindilien Finding Efficiency In Mach 120408jgIpotiwon
 

Similar a Automating the Formalization of Product Comparison Matrices (20)

ES2022-Minh-Nguyen-ShapingTestsIntoModelsForAutomatedTCGeneration.pdf
ES2022-Minh-Nguyen-ShapingTestsIntoModelsForAutomatedTCGeneration.pdfES2022-Minh-Nguyen-ShapingTestsIntoModelsForAutomatedTCGeneration.pdf
ES2022-Minh-Nguyen-ShapingTestsIntoModelsForAutomatedTCGeneration.pdf
 
CAD/CAM -PPT UNIT NO-I
CAD/CAM -PPT UNIT NO-ICAD/CAM -PPT UNIT NO-I
CAD/CAM -PPT UNIT NO-I
 
20220914-MBT-Experiences-SB1-final.pptx
20220914-MBT-Experiences-SB1-final.pptx20220914-MBT-Experiences-SB1-final.pptx
20220914-MBT-Experiences-SB1-final.pptx
 
Modelon Modelica executable requirements Ansys Conference 2016
Modelon Modelica executable requirements Ansys Conference 2016Modelon Modelica executable requirements Ansys Conference 2016
Modelon Modelica executable requirements Ansys Conference 2016
 
CIM unit-1
CIM  unit-1CIM  unit-1
CIM unit-1
 
Automotive engineering design - Model Based Design
Automotive engineering design - Model Based DesignAutomotive engineering design - Model Based Design
Automotive engineering design - Model Based Design
 
Presentation Verification & Validation
Presentation Verification & ValidationPresentation Verification & Validation
Presentation Verification & Validation
 
Feedback Strategy for Closed-loop Inspection Based on STEP-NC
Feedback Strategy for Closed-loop Inspection Based on STEP-NCFeedback Strategy for Closed-loop Inspection Based on STEP-NC
Feedback Strategy for Closed-loop Inspection Based on STEP-NC
 
An Integrated Simulation Tool Framework for Process Data Management
An Integrated Simulation Tool Framework for Process Data ManagementAn Integrated Simulation Tool Framework for Process Data Management
An Integrated Simulation Tool Framework for Process Data Management
 
Guiding through a typical Machine Learning Pipeline
Guiding through a typical Machine Learning PipelineGuiding through a typical Machine Learning Pipeline
Guiding through a typical Machine Learning Pipeline
 
Critical parameter management
Critical parameter managementCritical parameter management
Critical parameter management
 
Customer choice probabilities
Customer choice probabilitiesCustomer choice probabilities
Customer choice probabilities
 
QDM WEB System -- Connect your suppliers and global enterprise
QDM WEB System -- Connect your suppliers and global enterpriseQDM WEB System -- Connect your suppliers and global enterprise
QDM WEB System -- Connect your suppliers and global enterprise
 
Intro to LV in 3 Hours for Control and Sim 8_5.pptx
Intro to LV in 3 Hours for Control and Sim 8_5.pptxIntro to LV in 3 Hours for Control and Sim 8_5.pptx
Intro to LV in 3 Hours for Control and Sim 8_5.pptx
 
Icon solutions presentation - Pure Hybrid Cloud Event, 11th September London
Icon solutions presentation - Pure Hybrid Cloud Event, 11th September LondonIcon solutions presentation - Pure Hybrid Cloud Event, 11th September London
Icon solutions presentation - Pure Hybrid Cloud Event, 11th September London
 
Syllabus cim
Syllabus cimSyllabus cim
Syllabus cim
 
Modeling and Testing Dovetail in MagicDraw
Modeling and Testing Dovetail in MagicDrawModeling and Testing Dovetail in MagicDraw
Modeling and Testing Dovetail in MagicDraw
 
Model-Based Design For Motor Control Development
Model-Based Design For Motor Control DevelopmentModel-Based Design For Motor Control Development
Model-Based Design For Motor Control Development
 
B Kindilien Finding Efficiency In Mach 120408
B Kindilien Finding Efficiency In Mach 120408B Kindilien Finding Efficiency In Mach 120408
B Kindilien Finding Efficiency In Mach 120408
 
Final
FinalFinal
Final
 

Último

Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...jana861314
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencySheetal Arora
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Broad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxBroad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxjana861314
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 

Último (20)

Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Broad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxBroad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptx
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 

Automating the Formalization of Product Comparison Matrices

  • 1. Automating the Formalization of Product Comparison Matrices Guillaume Bécan, Nicolas Sannier, Mathieu Acher, Olivier Barais, Arnaud Blouin, Benoit Baudry
  • 2. Product lines everywhere Automating the Formalization of Product Comparison Matrices - 2
  • 3. Product Comparison Matrices (PCMs) Automating the Formalization of Product Comparison Matrices - 3
  • 4. Services on top of PCMs Automating the Formalization of Product Comparison Matrices - 4 Edit Compare Visualize Filter Rank Merge Configure Multi-objective optimization
  • 5. Problem Automating the Formalization of Product Comparison Matrices - 5 Edit Compare Visualize … Information is: •Uncontrolled •Heterogeneous •Ambiguous [Sannier et al, ASE 2013] | [[Acer Inc.|Acer]] | [[Acer beTouch E110|beTouch E110]] | {{dts|format=dmy|2010|2|15}} | 1.5 | [[320x240|320x240 QVGA]] | {{convert|2.8|in|mm|abbr=on}} | Touch, accelerometer | * [[GSM]]/​GPRS/​[[Enhanced Data Rates for GSM Evolution|EDGE]] * [[Universal Mobile Telecommunications System|UMTS]] 850 1900 * CSD
  • 6. Problem Automating the Formalization of Product Comparison Matrices - 6 Common language Transformation Edit Compare Visualize … •How to formalize data contained in natural language PCMs? •How to automate the formalization of PCMs? •What tools and services can be built on top of this formalization?
  • 7. Contributions Automating the Formalization of Product Comparison Matrices - 7 1.Design of a metamodel for product comparison matrices 2.Automated techniques for formalizing raw data into formalized product comparison matrix model 3.Evaluation on 30,000+ cells from Wikipedia
  • 8. Metamodeling driven by (lots of) data Automating the Formalization of Product Comparison Matrices - 8 Working on the metamodel since February 2013 300+ PCMs – 300,000 cells Numerous domains Manual review of 50 PCMs (thousands of cells) Statistics on all PCMs Analysis of Wikipedia syntax for tables Automated transformation of all PCMs to PCM models
  • 9. PCM metamodel Automating the Formalization of Product Comparison Matrices - 9
  • 10. PCM metamodel Automating the Formalization of Product Comparison Matrices - 10 Structure of a PCM
  • 11. PCM metamodel Automating the Formalization of Product Comparison Matrices - 11 Feature/Product oriented
  • 12. Automating the Formalization of Product Comparison Matrices - 12 Formalized interpretation of a cell Data types: Boolean, Integer, Real Special values: Unknown, Empty, Inconsistent, Partial PCM metamodel row string formalized integer
  • 13. Contributions Automating the Formalization of Product Comparison Matrices - 13 1.Design of a metamodel for product comparison matrices 2.Automated techniques for formalizing raw data into formalized product comparison matrix model 3.Evaluation on 30,000+ cells from Wikipedia
  • 14. Approach Automating the Formalization of Product Comparison Matrices - 14 Parsing: transform a PCM artefact in a PCM model PCM PCM model parsing preprocessing extracting information exploiting PCM model PCM model PCM metamodel S E R V I C E S | [[Acer Inc.|Acer]] | [[Acer beTouch E110|beTouch E110]] | {{dts|format=dmy|2010|2|15}} | 1.5 | [[320x240|320x240 QVGA]] | {{convert|2.8|in|mm|abbr=on}} | Touch, accelerometer | * [[GSM]]/​GPRS/​[[Enhanced Data Rates for GSM Evolution|EDGE]] * [[Universal Mobile Telecommunications System|UMTS]] 850 1900 * CSD Enable the development of a generic formalization process
  • 15. Approach Automating the Formalization of Product Comparison Matrices - 15 Preprocessing: Contributors cannot be trusted: missing cells, headers everywhere We have to normalize the matrix and identify headers Default strategy: first line and first column are headers PCM PCM model parsing preprocessing extracting information exploiting PCM model PCM model PCM metamodel S E R V I C E S
  • 16. Approach Automating the Formalization of Product Comparison Matrices - 16 Extracting information: •Identify features and products •Interpret cells based on a set of syntactic rules (regex) PCM PCM model parsing preprocessing extracting information exploiting PCM model PCM model PCM metamodel S E R V I C E S List of rules: … "d+" => Integer … match Integer(100) Same process as the metamodel for creating the rules
  • 17. Contributions Automating the Formalization of Product Comparison Matrices - 17 1.Design of a metamodel for product comparison matrices 2.Automated techniques for formalizing raw data into formalized product comparison matrix model 3.Evaluation on 30,000+ cells from Wikipedia
  • 18. Evaluation Automating the Formalization of Product Comparison Matrices - 18 Experimental settings: •75 PCMs from Wikipedia •Headers specified manually •Automated extraction of information PCM PCM model parsing preprocessing extracting information PCM model PCM model PCM metamodel exploiting S E R V I C E S RQ1 RQ2 RQ3
  • 19. Evaluation Automating the Formalization of Product Comparison Matrices - 19 Task: check interpretation of each cell (30,000+) •Validate •Correct it with existing concept •Correct it with a new concept •I don’t know / there is no interpretation 20 evaluators Online editor
  • 20. Evaluation Automating the Formalization of Product Comparison Matrices - 20 Metrics: •Number of valid cells •Number of cells corrected with concepts from the metamodel •Number of cells corrected with new concepts •List of new concepts
  • 21. Evaluation Automating the Formalization of Product Comparison Matrices - 21 RQ1: To what extent can PCMs be formalized? 93.11% of the cells are valid 2.61% are corrected with concepts from the metamodel 4.28% are invalid and the evaluators proposed a new concept •Dates •Dimensions and units •Versions Solution: •Add corresponding data types to the metamodel •Create new rules for interpreting cells
  • 22. Evaluation Automating the Formalization of Product Comparison Matrices - 22 RQ2: To what extent can the formalization be automated? 93,11% of the cells are correctly formalized Formalization errors may arise from 4 main areas: •Overlapping concepts (e.g. what does an empty cell mean?) •Missing concepts (e.g. dates, versions…) •Missing interpretation rules •Bad rules
  • 23. Evaluation Automating the Formalization of Product Comparison Matrices - 23 RQ3: What services can be built on top of formalized PCMs? Editing and formalizing PCMs Warnings during edition (inconsistent cells) Filtering capabilities Translate PCMs to variability models The metamodel provides •Feature/product oriented perspective •Clear semantics
  • 24. Results of the evaluation Automating the Formalization of Product Comparison Matrices - 24 We now have a common language for PCMs •validated by humans •validated by transformation •validated by the editor A large proportion of the formalization can be automated BUT human is necessary Good news: the editor can help formalizing the data
  • 25. Future work Automating the Formalization of Product Comparison Matrices - 26 Universal editor Support large datasets Community of PCM contributors Synchronization with Wikipedia
  • 26. Questions? Automating the Formalization of Product Comparison Matrices - 27 PCM PCM model parsing preprocessing extracting information exploiting PCM model PCM model PCM metamodel S E R V I C E S