SlideShare una empresa de Scribd logo
1 de 34
Descargar para leer sin conexión
Improving Decision-Making Support
 by Linking Database results to Simulations

             Gio Wiederhold
              Stanford University
                   July 2011



                                    Gio Wiederhold SimQL 1
Problem            : Mismatch
Database Technology should support Decision-Making

• What does database technology do?
     o   Databases provide information about past events
             » Consistent
             » Reliable
             » Fast

• What does a decision-maker do?
     o   Guess how decisions will affect the future
             » Multiple possibilities
             » Uncertainty
             » Slow, manual, multiple tools
                                                  Gio Wiederhold SimQL 2
 8/17/2012                        Gio: SimQL
Information Systems should also
                 Project into the Future




past                      now                          future
                                            time

Support of decision-making requires dealing with the future ,
   as well the past
• Databases deal well with the past
• Sensors can provide current status
• Spreadsheets, simulations deal with the likely futures
Information systems should be able to combine all three
                                                    Gio Wiederhold SimQL 3
8/17/2012                   Gio: SimQL
Decision-making (DM)

 Analyze Alternatives
 • Current Capabilities
 • Future Expectations
 • Planning for them
                                       now              future

 Process tasks:
 •   List resources
 •   Enumerate alternatives
 •   Prune alternative
 •   Compare alternatives
8/17/2012                 Gio: SimQL         Gio Wiederhold SimQL 4
Current Processes

                    • Data conversion to files for spreadsheets.
                       • Model building and testing by analysts
                          • Planning for likely future scenarios
                             • Recording expected results       .
•   Data collection              • Comparing many scenarios .
•   Data validation                  • Finding the best plans .
•   Data integration                   • Advising the actual .
•   Information selection                        decision maker
•   Data reduction & summarization
•   File generation for analysts
                                                  Gio Wiederhold SimQL 5
     8/17/2012               Gio: SimQL
Progress in Data Integration

Information Integration has progressed in supporting
  Decision Making
1.          Integrate data from distributed sources
       o      Issues: inconsistency of scope and timing
2.          Capture new relationships
       o      Often requires expert inter-domain knowledge

3.          Include current sensor data
       o      Select streaming data
4.      include predictions about future courses
       ******* A new, potentially major topic *******
                                                          Gio Wiederhold SimQL 6
8/17/2012                       Gio: SimQL
DM support is disjoint
                                does not interoperate



            Planning Science




                                     extensions to move
                 Distribution        to networked support
                                     are also disjoint
8/17/2012              Gio: SimQL           Gio Wiederhold SimQL 7
Current state of DM Support
     past                        now                                  future

            organized support                     disjointed support
                             x17 @qbfera
                             ffga 67 .78 jjkl,a
                             nsnd nn 23.5a                 Intuition +
     Data integration                             • Spreadsheets
                                                  • Resource allocations
                                                  • Explicit simulations
         Databases
                                                  various point assessments
distributed, heterogeneous


                 Past                                       future          time
                                                                 Gio Wiederhold SimQL 8
     8/17/2012                    Gio: SimQL
Prediction Requires Tools




                                   E-mail this book,
                                     Alfred Knopf, 1997



8/17/2012             Gio: SimQL      Gio Wiederhold SimQL 9
Requirements for DM
• Ubiquitous access to simulations
               of a wide variety of types
• Rapid response to parameter changes
   o Access to up-to-date facts
   o May need High-Performance recomputation

• Model, scenario, and choice retention
   o Analysts’ planning to be reused
        » But updatable

                                          Gio Wiederhold SimQL 10
  8/17/2012               Gio: SimQL
How to merge 2 disciplines
   • Databases
            o High-level languages
                 » Data descriptions
            o   Drive detailed processes
            o   Intentional
   • Simulations & spreadsheets
            o   High-level languages
                 » Model desriptions
            o Parameter driven
            o Extensional
                                             Gio Wiederhold SimQL 11
8/17/2012                       Gio: SimQL
Integration concept
• Enable intentional simulation access
   o Follow database model
            » Similar to data description
    o       Provide interfaces
            »To support needed processes

Create         SimQL similar to SQL
            schema & links to access procedures

                                            Gio Wiederhold SimQL 12
8/17/2012                  Gio: SimQL
Transform Data to Information
Database                             oo                middle-




                                     -)
 Design                                            management
                    Schema        SQL user
Data                                         Reports




                                                              :-(
 Collection

Model                                value-added
              :-)


 Design                                services

Data-driven                                    decision-makers
 Modeling
                                                   Plans
                    o o


 8/17/2012                   Gio: SimQL            Gio Wiederhold SimQL 13
Language implementation
Stanford Experiment uses an existing SQL parser:
1. Replace the SELECT verb with ESTIMATE;
2. Remove the UPDATE statement. Nothing persists
3. Replace CREATE DATABASE with CREATE MODEL;
4. Add to the CREATE attributes IN, OUT, and INOUT;
5. Add a REGISTER statement to identify resources;
6. Replaced SQL’s functions code generators that access
   stored data with functions that deliver the
   a. Query IN parameters to various simulations
   b. Collect the data specified as OUT parameters
   c. Return the result.
                                          Gio Wiederhold SimQL 14
 8/17/2012             Gio: SimQL
Examples
  SQL:
  SELECT Temperature, Cloudcover, Windspeed,
   Winddirection FROM
  WeatherDB WHERE Date = `yesterday' AND
   Location = `ORD'.

  SimQL:
  ESTIMATE Temperature, Cloudcover, Windspeed,
    Winddirection FROM
  WeatherSimulation WHERE Date = `tomorrow' AND
    Location = `ORD'.
                                       Gio Wiederhold SimQL 15
8/17/2012            Gio: SimQL
Available Functions
1. Continously executing: weather prediction
    o SimQL result reports best match samples

2. Execution specific to query: Spreadsheet what-if assessment
    o may require HPC power for adequate response

3. Past simulations collect results in a base: materials
    o performs inter- or extra-polations to match query parameters

4. Combinations, i.e., 2. + 3.: top layer simulation using stored
   partial lower level results: weapon performance in new setting
5. Human-in-the-loop: Wrapper for Amazon’s Mechanical Turk
Note
• A simulation service program can be written in any language
• A simulation service must be compliant to the interface spec.
                                                     Gio Wiederhold SimQL 16
 8/17/2012                   Gio: SimQL
System Concept Layout




                                 Gio Wiederhold SimQL 17
8/17/2012           Gio: SimQL
Interfaces enable integration:
            SimQL to access Simulations




     past            now           future
                                              time

           Databases,                 Simulations,
       accessed via SQL or
      XML, CORBA compliant       accessed via SimQL and
            wrappers               compliant wrappers
                             Msg
                           systems,
                           sensors
                                                     Gio Wiederhold SimQL 18
8/17/2012                     Gio: SimQL
Current State of SimQL research
                             GUI
                                          collect language
                                            requirements

                       Test Application


             wrapper       wrapper              wrapper




        Spreadsheets      Weather             Engineering
                                                   Gio Wiederhold SimQL 19
8/17/2012                 Gio: SimQL
Stanford Experiment Models

               Logistics
              Application                       Manufacturing
                                                 Application

  SimQL access                                             SimQL access
                        SimQL access
                                                SQL access
                            wrapper                            wrapper
            wrapper                   wrapper




                       Weather                      Test     Engineering
    Spreadsheets                                    Data
                       (short-, long-term)                    Gio Wiederhold SimQL 20
8/17/2012                        Gio: SimQL
More to be done
• Stanford experiment only produced point results.
• A decision maker would estimate multiple scenarios
     1.      Collect results identified with parameters
     2.      Provide search functions to compare results
             1. Consider time lines for result synchronization
     3.      Support pruning of low-value results
     4.      Deliver only high-value results to decision-maker



                                                          Gio Wiederhold SimQL 21
 8/17/2012                        Gio: SimQL
Use of Simulation Results
                      0.6      0.3    0.2
                                      0.1   0.07
                                0.5         0.03
                                      0.5   0.5       0.3

                                      0.1              0.2
            time
                      0.4       0.2   0.1   0.1          prob

       Simulation results can be composed for
           alternative Courses-of-actions
       Composition should include computation
         and recomputation of likelihoods
       Likelihoods change as now moves forwards
         and eliminates earlier alternatives.
                                                   Gio Wiederhold SimQL 22
8/17/2012               Gio: SimQL
Estimates have probabilities
      •     p=30% chance of rain
      •     Flight p=91% likely to arrive with 15 min of ETA
      •     Interest rate p=50% same, p=25% 1% higher, … .
      •     Employee p=50% returns to work in a week, … .
      •     Project p=10% completed in time, …
      •     Spreadsheets can compute alternative values
            with such data provided by the model builder,
            not the SimQL user.



                                                  Gio Wiederhold SimQL 23
8/17/2012                     Gio: SimQL
The branches can be labeled with probabilities,
           then assessed using the outcome with values
                                                                                prob

                       value

                                                         0.1 100   0.3                 1000
  Next period alternatives
                                   1200            0.4                                 2000
                                           0.5
and subsequent periods                                              600
                             0.6                            0.1                        5000
                                     66
                     1266                        0.1 0.3 1100 0.2
                                                                500
                                                                                       1000
                                    134      0.2
                                             0.3             200     200                    0
                                                                  0.1
                     -1086                                     -420
                                                            0.07       0              -6000
                             0.4   -1220
                                           0.2
                                                            -820 0.13
                                                                    -400              -3000
                                                                                   Values
  past               now                           future
                                                            time
                                                                         Gio Wiederhold SimQL 24
     8/17/2012                     Gio: SimQL
Integrating data & planning support will make
         our data reusable and much more valuable

                                           A Pruned Bush
 Re-assess as time                                        100    ?                      ?
 marches forward !                                 1200           600
                                                                                    1000

                                       1266 ?                                       2000
                                                          1100   500
                                                     66                             5000
                                                           200    200
                                                                                    1000
                                                                     0
                                                                                          0
            past                           now                    future
                                                                                  time
                                                        Spreadsheets,
                   Databases, . . .                   other simulations,


8/17/2012
                                           Msgs
                                      Gio: SimQL
                                                                         Gio Wiederhold SimQL 25

                                          sensors
Even the present needs SimQL
                                                   point-in-time for
      last recorded observations                     situational
                                                     assessment




                               simple simulations
                               to extrapolate data

       past                                    now                             future
                                                                  time

                                        Is the delivery truck in X?
Not all data are current:           • Is the right stuff on the truck?
                                          • Will the crew be at X?
                            • Will the forces be ready to accept delivery?


8/17/2012                             Gio: SimQL                             Gio Wiederhold SimQL 26
Use of Simulation Results




Simulation results can be composed for
    Alternative Courses-of-actions
Composition should be seamless, elegant,
 with computation and recomputation of
 likelihoods
Results change as now moves forwards and
 eliminates earlier alternatives.
                                  Gio Wiederhold SimQL 27
Summary
Databases                               Simulations should
• serve clients via SQL by              • serve clients via SimQL by
    Sharing a Model (The Schema)
                                              Sharing a Model (research q.)
    A query language over the model           A query language over the model
the SQL interface enables               a SimQL interface enables
• independence of                       • independence of
    application development                   application development
    DBMS technology development               simulation technology develop’t
    reuse of infrastructure                   reuse of infrastructure
Today                                   Objective
• most new systems use a                • build information systems
  DBMS for data storage                   combining DBMS, Simulations
    even with less performance,               even with less performance,
    inability to handle all problems,         inability to handle all problems,
    but enough of them well enough.           but enough of them . . .
                                                                 Gio Wiederhold SimQL 28
  8/17/2012                      Gio: SimQL
Further research questions

• How to move seamlessly from the past to the future?
• How can multiple futures be managed (indexed)?
• How can multiple futures be compared, selected?
• How should joint uncertainty be computed?
• How can the NOW point be moved automatically?



                                           Gio Wiederhold SimQL 29
 8/17/2012             Gio: SimQL
Future information systems
Combine data from the past, with current data,
 knowledge, and predictions into the future

                                      oo
                         o o

                   o o




                                           Assessment of the
                                           values of alternative
                                           possible outcomes



                                                  Gio Wiederhold SimQL 30
8/17/2012                Gio: SimQL
SimQL research questions

• How little of the model needs to be exposed?
• How can defaults be set rationally?
• How should expected execution cost be reported?
• How should uncertainty be reported?
• Are there differences among application areas that
  require different language structures?
• Are there differences among application areas that
  require different language features?
• How will the language interface support effective
  partitioning and distribution?
                                           Gio Wiederhold SimQL 31
8/17/2012              Gio: SimQL
Moving to a Service Paradigm

• Server is an independent contractor, defines service
• Client selects service, and specifies parameters
• Server’s success depends on value provided
•       Some form of payment received for services

             x,y




 Databases are a current example.
 Simulations have the same potential.
 8/17/2012                  Gio: SimQL      Gio Wiederhold SimQL 32
Summary of SimQL
            A new service for Decision Making:
            • follows database paradigm
              – ( by about 25 years )
            • coherence in prediction
              – displacement of ad-hoc practices
            • seamless information integration
              – single paradigm for decision makers
            • simulation industry infrastructure
              – investment has a potential market
              – should follows database industry model:
               Interfaces promote new industries
8/17/2012                 Gio: SimQL               Gio Wiederhold SimQL 33
Publications
    Gio Wiederhold: "Information Systems that Really
       Support Decision-making"; 11th International
       Symposium on Methodologies for Intelligent Systems
       (ISMIS), Warsaw Poland, June 1999, in Ras & Skowron
       Foundations for Intelligent Systems, Springer LNAI
       1609, pages 56-66
    Gio Wiederhold and Rushan Jiang: “Augmenting
       Information Systems with Access to Predictive Tools”;
    http://infolab.stanford.edu/pub/gio/2000/VLDB2000-1.htm

    The specifics of the language as implemented are at
    http://www-db.stanford.edu/LIC/SimQL.html



                                                    Gio Wiederhold SimQL 34
8/17/2012                   Gio: SimQL

Más contenido relacionado

Destacado (11)

Cs207 2
Cs207 2Cs207 2
Cs207 2
 
Cs207 6
Cs207 6Cs207 6
Cs207 6
 
Cs207 7
Cs207 7Cs207 7
Cs207 7
 
Cs207 7
Cs207 7Cs207 7
Cs207 7
 
Camon
CamonCamon
Camon
 
Camon
CamonCamon
Camon
 
Quantifying the future
Quantifying the futureQuantifying the future
Quantifying the future
 
Cs207 5
Cs207 5Cs207 5
Cs207 5
 
Cs207 6
Cs207 6Cs207 6
Cs207 6
 
Cs207 1
Cs207 1Cs207 1
Cs207 1
 
Software economics+ssitc13 tutorial
Software economics+ssitc13 tutorialSoftware economics+ssitc13 tutorial
Software economics+ssitc13 tutorial
 

Similar a Quantifying thefuture

Graph Data Science at Scale
Graph Data Science at ScaleGraph Data Science at Scale
Graph Data Science at ScaleNeo4j
 
Towards Lensfield
Towards LensfieldTowards Lensfield
Towards LensfieldJim Downing
 
Machine Learning Infrastructure
Machine Learning InfrastructureMachine Learning Infrastructure
Machine Learning InfrastructureSigOpt
 
Conceptional Data Vault
Conceptional Data VaultConceptional Data Vault
Conceptional Data VaultTorsten Glunde
 
Mortgage Data for Machine Learning Algorithms
Mortgage Data for Machine Learning AlgorithmsMortgage Data for Machine Learning Algorithms
Mortgage Data for Machine Learning AlgorithmsAnne Klieve
 
STING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
STING: Spatio-Temporal Interaction Networks and Graphs for Intel PlatformsSTING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
STING: Spatio-Temporal Interaction Networks and Graphs for Intel PlatformsJason Riedy
 
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUME
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUMETHE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUME
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUMEGigaom
 
Lessons Learned Replatforming A Large Machine Learning Application To Apache ...
Lessons Learned Replatforming A Large Machine Learning Application To Apache ...Lessons Learned Replatforming A Large Machine Learning Application To Apache ...
Lessons Learned Replatforming A Large Machine Learning Application To Apache ...Databricks
 
Transforming Innovation: Digital Twin for the Win!
Transforming Innovation: Digital Twin for the Win!Transforming Innovation: Digital Twin for the Win!
Transforming Innovation: Digital Twin for the Win!Neo4j
 
Polyglot Persistence and Database Deployment by Sandeep Khuperkar CTO and Dir...
Polyglot Persistence and Database Deployment by Sandeep Khuperkar CTO and Dir...Polyglot Persistence and Database Deployment by Sandeep Khuperkar CTO and Dir...
Polyglot Persistence and Database Deployment by Sandeep Khuperkar CTO and Dir...Ashnikbiz
 
Ultime Novità di Prodotto Neo4j
Ultime Novità di Prodotto Neo4j Ultime Novità di Prodotto Neo4j
Ultime Novità di Prodotto Neo4j Neo4j
 
An Active and Hybrid Storage System for Data-intensive Applications
An Active and Hybrid Storage System for Data-intensive ApplicationsAn Active and Hybrid Storage System for Data-intensive Applications
An Active and Hybrid Storage System for Data-intensive ApplicationsXiao Qin
 
Amsterdam - The Neo4j Graph Data Platform Today & Tomorrow
Amsterdam - The Neo4j Graph Data Platform Today & TomorrowAmsterdam - The Neo4j Graph Data Platform Today & Tomorrow
Amsterdam - The Neo4j Graph Data Platform Today & TomorrowNeo4j
 
Neo4j – The Fastest Path to Scalable Real-Time Analytics
Neo4j – The Fastest Path to Scalable Real-Time AnalyticsNeo4j – The Fastest Path to Scalable Real-Time Analytics
Neo4j – The Fastest Path to Scalable Real-Time AnalyticsNeo4j
 
Visualizing big data in the browser using spark
Visualizing big data in the browser using sparkVisualizing big data in the browser using spark
Visualizing big data in the browser using sparkDatabricks
 
Introduction to Graph databases and Neo4j (by Stefan Armbruster)
Introduction to Graph databases and Neo4j (by Stefan Armbruster)Introduction to Graph databases and Neo4j (by Stefan Armbruster)
Introduction to Graph databases and Neo4j (by Stefan Armbruster)barcelonajug
 

Similar a Quantifying thefuture (20)

Spark at Zillow
Spark at ZillowSpark at Zillow
Spark at Zillow
 
Graph Data Science at Scale
Graph Data Science at ScaleGraph Data Science at Scale
Graph Data Science at Scale
 
Towards Lensfield
Towards LensfieldTowards Lensfield
Towards Lensfield
 
Machine Learning Infrastructure
Machine Learning InfrastructureMachine Learning Infrastructure
Machine Learning Infrastructure
 
Conceptional Data Vault
Conceptional Data VaultConceptional Data Vault
Conceptional Data Vault
 
Mortgage Data for Machine Learning Algorithms
Mortgage Data for Machine Learning AlgorithmsMortgage Data for Machine Learning Algorithms
Mortgage Data for Machine Learning Algorithms
 
Making Sense of Remote Sensing
Making Sense of Remote SensingMaking Sense of Remote Sensing
Making Sense of Remote Sensing
 
STING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
STING: Spatio-Temporal Interaction Networks and Graphs for Intel PlatformsSTING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
STING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
 
Bigdata-Intro.pptx
Bigdata-Intro.pptxBigdata-Intro.pptx
Bigdata-Intro.pptx
 
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUME
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUMETHE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUME
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUME
 
Lessons Learned Replatforming A Large Machine Learning Application To Apache ...
Lessons Learned Replatforming A Large Machine Learning Application To Apache ...Lessons Learned Replatforming A Large Machine Learning Application To Apache ...
Lessons Learned Replatforming A Large Machine Learning Application To Apache ...
 
Transforming Innovation: Digital Twin for the Win!
Transforming Innovation: Digital Twin for the Win!Transforming Innovation: Digital Twin for the Win!
Transforming Innovation: Digital Twin for the Win!
 
Polyglot Persistence and Database Deployment by Sandeep Khuperkar CTO and Dir...
Polyglot Persistence and Database Deployment by Sandeep Khuperkar CTO and Dir...Polyglot Persistence and Database Deployment by Sandeep Khuperkar CTO and Dir...
Polyglot Persistence and Database Deployment by Sandeep Khuperkar CTO and Dir...
 
Ultime Novità di Prodotto Neo4j
Ultime Novità di Prodotto Neo4j Ultime Novità di Prodotto Neo4j
Ultime Novità di Prodotto Neo4j
 
An Active and Hybrid Storage System for Data-intensive Applications
An Active and Hybrid Storage System for Data-intensive ApplicationsAn Active and Hybrid Storage System for Data-intensive Applications
An Active and Hybrid Storage System for Data-intensive Applications
 
Developing Digital Twins
Developing Digital TwinsDeveloping Digital Twins
Developing Digital Twins
 
Amsterdam - The Neo4j Graph Data Platform Today & Tomorrow
Amsterdam - The Neo4j Graph Data Platform Today & TomorrowAmsterdam - The Neo4j Graph Data Platform Today & Tomorrow
Amsterdam - The Neo4j Graph Data Platform Today & Tomorrow
 
Neo4j – The Fastest Path to Scalable Real-Time Analytics
Neo4j – The Fastest Path to Scalable Real-Time AnalyticsNeo4j – The Fastest Path to Scalable Real-Time Analytics
Neo4j – The Fastest Path to Scalable Real-Time Analytics
 
Visualizing big data in the browser using spark
Visualizing big data in the browser using sparkVisualizing big data in the browser using spark
Visualizing big data in the browser using spark
 
Introduction to Graph databases and Neo4j (by Stefan Armbruster)
Introduction to Graph databases and Neo4j (by Stefan Armbruster)Introduction to Graph databases and Neo4j (by Stefan Armbruster)
Introduction to Graph databases and Neo4j (by Stefan Armbruster)
 

Más de Gio Wiederhold (8)

Software economics+ssitc13 tutorial
Software economics+ssitc13 tutorialSoftware economics+ssitc13 tutorial
Software economics+ssitc13 tutorial
 
Cs207 8
Cs207 8Cs207 8
Cs207 8
 
Cs207 9
Cs207 9Cs207 9
Cs207 9
 
Cs207 4
Cs207 4Cs207 4
Cs207 4
 
Cs207 2
Cs207 2Cs207 2
Cs207 2
 
I3master
I3masterI3master
I3master
 
Cs207 9
Cs207 9Cs207 9
Cs207 9
 
Cs207 1
Cs207 1Cs207 1
Cs207 1
 

Último

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 

Último (20)

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 

Quantifying thefuture

  • 1. Improving Decision-Making Support by Linking Database results to Simulations Gio Wiederhold Stanford University July 2011 Gio Wiederhold SimQL 1
  • 2. Problem : Mismatch Database Technology should support Decision-Making • What does database technology do? o Databases provide information about past events » Consistent » Reliable » Fast • What does a decision-maker do? o Guess how decisions will affect the future » Multiple possibilities » Uncertainty » Slow, manual, multiple tools Gio Wiederhold SimQL 2 8/17/2012 Gio: SimQL
  • 3. Information Systems should also Project into the Future past now future time Support of decision-making requires dealing with the future , as well the past • Databases deal well with the past • Sensors can provide current status • Spreadsheets, simulations deal with the likely futures Information systems should be able to combine all three Gio Wiederhold SimQL 3 8/17/2012 Gio: SimQL
  • 4. Decision-making (DM) Analyze Alternatives • Current Capabilities • Future Expectations • Planning for them now future Process tasks: • List resources • Enumerate alternatives • Prune alternative • Compare alternatives 8/17/2012 Gio: SimQL Gio Wiederhold SimQL 4
  • 5. Current Processes • Data conversion to files for spreadsheets. • Model building and testing by analysts • Planning for likely future scenarios • Recording expected results . • Data collection • Comparing many scenarios . • Data validation • Finding the best plans . • Data integration • Advising the actual . • Information selection decision maker • Data reduction & summarization • File generation for analysts Gio Wiederhold SimQL 5 8/17/2012 Gio: SimQL
  • 6. Progress in Data Integration Information Integration has progressed in supporting Decision Making 1. Integrate data from distributed sources o Issues: inconsistency of scope and timing 2. Capture new relationships o Often requires expert inter-domain knowledge 3. Include current sensor data o Select streaming data 4. include predictions about future courses ******* A new, potentially major topic ******* Gio Wiederhold SimQL 6 8/17/2012 Gio: SimQL
  • 7. DM support is disjoint does not interoperate Planning Science extensions to move Distribution to networked support are also disjoint 8/17/2012 Gio: SimQL Gio Wiederhold SimQL 7
  • 8. Current state of DM Support past now future organized support disjointed support x17 @qbfera ffga 67 .78 jjkl,a nsnd nn 23.5a Intuition + Data integration • Spreadsheets • Resource allocations • Explicit simulations Databases various point assessments distributed, heterogeneous Past future time Gio Wiederhold SimQL 8 8/17/2012 Gio: SimQL
  • 9. Prediction Requires Tools E-mail this book, Alfred Knopf, 1997 8/17/2012 Gio: SimQL Gio Wiederhold SimQL 9
  • 10. Requirements for DM • Ubiquitous access to simulations of a wide variety of types • Rapid response to parameter changes o Access to up-to-date facts o May need High-Performance recomputation • Model, scenario, and choice retention o Analysts’ planning to be reused » But updatable Gio Wiederhold SimQL 10 8/17/2012 Gio: SimQL
  • 11. How to merge 2 disciplines • Databases o High-level languages » Data descriptions o Drive detailed processes o Intentional • Simulations & spreadsheets o High-level languages » Model desriptions o Parameter driven o Extensional Gio Wiederhold SimQL 11 8/17/2012 Gio: SimQL
  • 12. Integration concept • Enable intentional simulation access o Follow database model » Similar to data description o Provide interfaces »To support needed processes Create SimQL similar to SQL schema & links to access procedures Gio Wiederhold SimQL 12 8/17/2012 Gio: SimQL
  • 13. Transform Data to Information Database oo middle- -) Design management Schema SQL user Data Reports :-( Collection Model value-added :-) Design services Data-driven decision-makers Modeling Plans o o 8/17/2012 Gio: SimQL Gio Wiederhold SimQL 13
  • 14. Language implementation Stanford Experiment uses an existing SQL parser: 1. Replace the SELECT verb with ESTIMATE; 2. Remove the UPDATE statement. Nothing persists 3. Replace CREATE DATABASE with CREATE MODEL; 4. Add to the CREATE attributes IN, OUT, and INOUT; 5. Add a REGISTER statement to identify resources; 6. Replaced SQL’s functions code generators that access stored data with functions that deliver the a. Query IN parameters to various simulations b. Collect the data specified as OUT parameters c. Return the result. Gio Wiederhold SimQL 14 8/17/2012 Gio: SimQL
  • 15. Examples SQL: SELECT Temperature, Cloudcover, Windspeed, Winddirection FROM WeatherDB WHERE Date = `yesterday' AND Location = `ORD'. SimQL: ESTIMATE Temperature, Cloudcover, Windspeed, Winddirection FROM WeatherSimulation WHERE Date = `tomorrow' AND Location = `ORD'. Gio Wiederhold SimQL 15 8/17/2012 Gio: SimQL
  • 16. Available Functions 1. Continously executing: weather prediction o SimQL result reports best match samples 2. Execution specific to query: Spreadsheet what-if assessment o may require HPC power for adequate response 3. Past simulations collect results in a base: materials o performs inter- or extra-polations to match query parameters 4. Combinations, i.e., 2. + 3.: top layer simulation using stored partial lower level results: weapon performance in new setting 5. Human-in-the-loop: Wrapper for Amazon’s Mechanical Turk Note • A simulation service program can be written in any language • A simulation service must be compliant to the interface spec. Gio Wiederhold SimQL 16 8/17/2012 Gio: SimQL
  • 17. System Concept Layout Gio Wiederhold SimQL 17 8/17/2012 Gio: SimQL
  • 18. Interfaces enable integration: SimQL to access Simulations past now future time Databases, Simulations, accessed via SQL or XML, CORBA compliant accessed via SimQL and wrappers compliant wrappers Msg systems, sensors Gio Wiederhold SimQL 18 8/17/2012 Gio: SimQL
  • 19. Current State of SimQL research GUI collect language requirements Test Application wrapper wrapper wrapper Spreadsheets Weather Engineering Gio Wiederhold SimQL 19 8/17/2012 Gio: SimQL
  • 20. Stanford Experiment Models Logistics Application Manufacturing Application SimQL access SimQL access SimQL access SQL access wrapper wrapper wrapper wrapper Weather Test Engineering Spreadsheets Data (short-, long-term) Gio Wiederhold SimQL 20 8/17/2012 Gio: SimQL
  • 21. More to be done • Stanford experiment only produced point results. • A decision maker would estimate multiple scenarios 1. Collect results identified with parameters 2. Provide search functions to compare results 1. Consider time lines for result synchronization 3. Support pruning of low-value results 4. Deliver only high-value results to decision-maker Gio Wiederhold SimQL 21 8/17/2012 Gio: SimQL
  • 22. Use of Simulation Results 0.6 0.3 0.2 0.1 0.07 0.5 0.03 0.5 0.5 0.3 0.1 0.2 time 0.4 0.2 0.1 0.1 prob Simulation results can be composed for alternative Courses-of-actions Composition should include computation and recomputation of likelihoods Likelihoods change as now moves forwards and eliminates earlier alternatives. Gio Wiederhold SimQL 22 8/17/2012 Gio: SimQL
  • 23. Estimates have probabilities • p=30% chance of rain • Flight p=91% likely to arrive with 15 min of ETA • Interest rate p=50% same, p=25% 1% higher, … . • Employee p=50% returns to work in a week, … . • Project p=10% completed in time, … • Spreadsheets can compute alternative values with such data provided by the model builder, not the SimQL user. Gio Wiederhold SimQL 23 8/17/2012 Gio: SimQL
  • 24. The branches can be labeled with probabilities, then assessed using the outcome with values prob value 0.1 100 0.3 1000 Next period alternatives 1200 0.4 2000 0.5 and subsequent periods 600 0.6 0.1 5000 66 1266 0.1 0.3 1100 0.2 500 1000 134 0.2 0.3 200 200 0 0.1 -1086 -420 0.07 0 -6000 0.4 -1220 0.2 -820 0.13 -400 -3000 Values past now future time Gio Wiederhold SimQL 24 8/17/2012 Gio: SimQL
  • 25. Integrating data & planning support will make our data reusable and much more valuable A Pruned Bush Re-assess as time 100 ? ? marches forward ! 1200 600 1000 1266 ? 2000 1100 500 66 5000 200 200 1000 0 0 past now future time Spreadsheets, Databases, . . . other simulations, 8/17/2012 Msgs Gio: SimQL Gio Wiederhold SimQL 25 sensors
  • 26. Even the present needs SimQL point-in-time for last recorded observations situational assessment simple simulations to extrapolate data past now future time Is the delivery truck in X? Not all data are current: • Is the right stuff on the truck? • Will the crew be at X? • Will the forces be ready to accept delivery? 8/17/2012 Gio: SimQL Gio Wiederhold SimQL 26
  • 27. Use of Simulation Results Simulation results can be composed for Alternative Courses-of-actions Composition should be seamless, elegant, with computation and recomputation of likelihoods Results change as now moves forwards and eliminates earlier alternatives. Gio Wiederhold SimQL 27
  • 28. Summary Databases Simulations should • serve clients via SQL by • serve clients via SimQL by Sharing a Model (The Schema) Sharing a Model (research q.) A query language over the model A query language over the model the SQL interface enables a SimQL interface enables • independence of • independence of application development application development DBMS technology development simulation technology develop’t reuse of infrastructure reuse of infrastructure Today Objective • most new systems use a • build information systems DBMS for data storage combining DBMS, Simulations even with less performance, even with less performance, inability to handle all problems, inability to handle all problems, but enough of them well enough. but enough of them . . . Gio Wiederhold SimQL 28 8/17/2012 Gio: SimQL
  • 29. Further research questions • How to move seamlessly from the past to the future? • How can multiple futures be managed (indexed)? • How can multiple futures be compared, selected? • How should joint uncertainty be computed? • How can the NOW point be moved automatically? Gio Wiederhold SimQL 29 8/17/2012 Gio: SimQL
  • 30. Future information systems Combine data from the past, with current data, knowledge, and predictions into the future oo o o o o Assessment of the values of alternative possible outcomes Gio Wiederhold SimQL 30 8/17/2012 Gio: SimQL
  • 31. SimQL research questions • How little of the model needs to be exposed? • How can defaults be set rationally? • How should expected execution cost be reported? • How should uncertainty be reported? • Are there differences among application areas that require different language structures? • Are there differences among application areas that require different language features? • How will the language interface support effective partitioning and distribution? Gio Wiederhold SimQL 31 8/17/2012 Gio: SimQL
  • 32. Moving to a Service Paradigm • Server is an independent contractor, defines service • Client selects service, and specifies parameters • Server’s success depends on value provided • Some form of payment received for services x,y Databases are a current example. Simulations have the same potential. 8/17/2012 Gio: SimQL Gio Wiederhold SimQL 32
  • 33. Summary of SimQL A new service for Decision Making: • follows database paradigm – ( by about 25 years ) • coherence in prediction – displacement of ad-hoc practices • seamless information integration – single paradigm for decision makers • simulation industry infrastructure – investment has a potential market – should follows database industry model: Interfaces promote new industries 8/17/2012 Gio: SimQL Gio Wiederhold SimQL 33
  • 34. Publications Gio Wiederhold: "Information Systems that Really Support Decision-making"; 11th International Symposium on Methodologies for Intelligent Systems (ISMIS), Warsaw Poland, June 1999, in Ras & Skowron Foundations for Intelligent Systems, Springer LNAI 1609, pages 56-66 Gio Wiederhold and Rushan Jiang: “Augmenting Information Systems with Access to Predictive Tools”; http://infolab.stanford.edu/pub/gio/2000/VLDB2000-1.htm The specifics of the language as implemented are at http://www-db.stanford.edu/LIC/SimQL.html Gio Wiederhold SimQL 34 8/17/2012 Gio: SimQL