SlideShare una empresa de Scribd logo
1 de 21
FIRST
 European research for web information extraction
and analysis for supporting financial decision making
               ABI Lab Forum 2012
          Tomás Pariente Lobo – Atos Spain
Motivation



 Vision



Innovation



  Tools
Why FIRST? - Motivations

The most reliable data sources today…




…also have their weakness!
They do not consider unstructured data, rumors, market
sentiments, etc.
  3
Why FIRST? - Motivations

Example: Apple iPhone 1 Announcement on 2007-01-09




       Stock prices were skyrocketing after the announcement.
        However, the announcement could be sensed before…




  4
Why FIRST? - Motivations

Example: Market surveillance via FIRST (the Google news case)
   September 2008: Google news announced “United Airlines bankruptcy”.
   Within 12 minutes  stock price decreased 75%  wiped out US $ 1bn.




   The “news” was actually 6 years old…
 Plausibility checking will help in identifying hoaxes: consistence with regulatory news
  and other sources.
    5
Why FIRST? – Motivations
    A growing universe of unstructured data




          … how to separate the wheat
               from the chaff ?

6
Motivation



 Vision



Innovation



  Tools
FIRST Project

          European-funded research project



                 Project facts

Running from October 2010 until
  September 2013
9 partners
More than 30 people
Preliminary results available
More to come...
  Stay tuned (http://project-first.eu/)

8
Who is behind FIRST?

Industrial partners




  Academic/Research




      SMEs
FIRST Vision




                              Vision
         is to make available the relevant information
             of the entire financial information space
     (including unreliable, unstructured, sentiment sources)
             to the decision maker in near-real time
                         in an automated way
10
FIRST Vision
      Financial
      Resources
     Structured


                                            AUTOMATION




                              Acquisition    Processing   Analysis   Decision
                                                                     support


 Unstructured
Blog, analysis, bulletin boards…
Unreliable, poor quality,
noisy… 11
Motivation



 Vision



Innovation



  Tools
Mining the Web for financial texts

           Data Acquisition pipeline: Web mining



                                     Natural Language
                                  preprocessing and entity
                                         extraction




   Streaming




                  Cleaning

                                          Financial terms,
                                             Companies,
                                            Intruments …
Data acquisition after one year

 Some numbers
      176 Web sites
      2,671 RSS sources
      ~40,000 documents per day
      >5,000,000 documents by end of 2011
       o And growing



Essential for future evaluation and analysis


 14
Analysing sentiments in Web texts

    The Analytical Pipeline: Identify, extract, classify, aggregate
  Document
                    SENTIMENT             Document with      SENTIMENT
    with                                                                                 Aggregated
                 CLASSIFICATION            sentiment       AGGREGATION
    basic                                                                                sentiments
                 per object and feature    sentences      per object and feature
 annotations



                                           Indicators
Object




                                                                                   Positive sentiment


                                                                               Sentiment
                                                                               Sentences

         15
Supporting the decision making process

The Decision Support techniques: Analysis and visualization


                   Machine
                   Learning
  FIRST           Techniques                          Outputs:
Acquisition &
 Analytical                                          Forecasts of
                                                volatility or returns,
 Pipelines         Qualitative    Forecasting    Alert on pump and
                   Modeling         Models              dump,
                                                Reputation change
                                                  of a counterpart
                                                       Signals,
 Knowledge                                              Charts,
   Base                                             Topic Spaces,
                                                    Topic Trends,
                  Visualization
                                                       Reports
                  Techniques                               …
    16
Glassbox model




Sentiment
               Drill down
                            Document



Objects                     sentences

Features




 17
Motivation



 Vision



Innovation



  Tools
The three FIRST use cases &
     their relevance for the industry
         Market Surveillance
       Capital markets compliance can be automated today using structured data, but
        the automation does not take unstructured data into account
       FIRST will
          make use of large volumes of unstructured data into financial compliance;
          develop automated techniques to better detect market abuse/insider
           trading..


         Reputational Risk Management
       No off-the-shelf solutions or methodologies for reputational risk management.
       FIRST will
         provide a sustainable tool for reputational risk monitoring;
         contribute to break new ground in this field of dramatically high impact in FSI.



         Retail Brokerage
       Today, mainly based on quantitative analysis and key figures.
       FIRST will
         use unstructured data to leverage both information for private investors and
            sophisticated tools for professional users.

19
20
     Stay tuned (http://project-first.eu/)
Acknowledgement
The research leading to these results has received funding from the
     European Community's Seventh Framework Programme
       (FP7/2007-2013) under grant agreement n°257928.


                         THANKS

Más contenido relacionado

Destacado

Silicon Valley Marketo User Group Meeting February 29, 2012
Silicon Valley Marketo User Group Meeting February 29, 2012Silicon Valley Marketo User Group Meeting February 29, 2012
Silicon Valley Marketo User Group Meeting February 29, 2012ryanvong
 
Exchange Auditing in the Enterprise
Exchange Auditing in the EnterpriseExchange Auditing in the Enterprise
Exchange Auditing in the EnterpriseNetwrix Corporation
 
Evaluation question 2
Evaluation question 2Evaluation question 2
Evaluation question 2aliceaustin1
 
INSTALACJA Zielonka
INSTALACJA ZielonkaINSTALACJA Zielonka
INSTALACJA ZielonkasalonyVi
 
Shir hallel lakaas - שיר הלל לכעס
Shir hallel lakaas - שיר הלל לכעסShir hallel lakaas - שיר הלל לכעס
Shir hallel lakaas - שיר הלל לכעסNili Glazer
 
DiretoBrasil Creating Value
DiretoBrasil Creating ValueDiretoBrasil Creating Value
DiretoBrasil Creating ValueDiretoBrasil
 
2 how to-build_document_management_system
2 how to-build_document_management_system2 how to-build_document_management_system
2 how to-build_document_management_systemKichiemon Adachi
 
Resource2
Resource2Resource2
Resource2grosi
 
TUBE FEEDING FOR DYSMOTILITY CONSUMERS
TUBE FEEDING FOR  DYSMOTILITY CONSUMERSTUBE FEEDING FOR  DYSMOTILITY CONSUMERS
TUBE FEEDING FOR DYSMOTILITY CONSUMERSmitoaction
 
GETTING THROUGH THE DAY WITH MITO: Treatments, Supplements and Humor
GETTING THROUGH THE DAY WITH MITO: Treatments, Supplements and HumorGETTING THROUGH THE DAY WITH MITO: Treatments, Supplements and Humor
GETTING THROUGH THE DAY WITH MITO: Treatments, Supplements and Humormitoaction
 
The inaugural Professional Diversity Network Diversity Jobs Index and Report
The inaugural Professional Diversity Network Diversity Jobs Index and ReportThe inaugural Professional Diversity Network Diversity Jobs Index and Report
The inaugural Professional Diversity Network Diversity Jobs Index and ReportDaniel Sullivan
 
Leap pad explorer features 1
Leap pad explorer features 1Leap pad explorer features 1
Leap pad explorer features 1cvhaslacker0
 
Commercial areas design_guidelines
Commercial areas design_guidelinesCommercial areas design_guidelines
Commercial areas design_guidelinesPalak Acharya
 
Hh5eko lagunak
Hh5eko lagunakHh5eko lagunak
Hh5eko lagunakELIZALDE
 

Destacado (19)

Silicon Valley Marketo User Group Meeting February 29, 2012
Silicon Valley Marketo User Group Meeting February 29, 2012Silicon Valley Marketo User Group Meeting February 29, 2012
Silicon Valley Marketo User Group Meeting February 29, 2012
 
Exchange Auditing in the Enterprise
Exchange Auditing in the EnterpriseExchange Auditing in the Enterprise
Exchange Auditing in the Enterprise
 
Evaluation question 2
Evaluation question 2Evaluation question 2
Evaluation question 2
 
INSTALACJA Zielonka
INSTALACJA ZielonkaINSTALACJA Zielonka
INSTALACJA Zielonka
 
Shir hallel lakaas - שיר הלל לכעס
Shir hallel lakaas - שיר הלל לכעסShir hallel lakaas - שיר הלל לכעס
Shir hallel lakaas - שיר הלל לכעס
 
DiretoBrasil Creating Value
DiretoBrasil Creating ValueDiretoBrasil Creating Value
DiretoBrasil Creating Value
 
2 how to-build_document_management_system
2 how to-build_document_management_system2 how to-build_document_management_system
2 how to-build_document_management_system
 
Xrs9965 70 g
Xrs9965 70 gXrs9965 70 g
Xrs9965 70 g
 
Resource2
Resource2Resource2
Resource2
 
TUBE FEEDING FOR DYSMOTILITY CONSUMERS
TUBE FEEDING FOR  DYSMOTILITY CONSUMERSTUBE FEEDING FOR  DYSMOTILITY CONSUMERS
TUBE FEEDING FOR DYSMOTILITY CONSUMERS
 
GETTING THROUGH THE DAY WITH MITO: Treatments, Supplements and Humor
GETTING THROUGH THE DAY WITH MITO: Treatments, Supplements and HumorGETTING THROUGH THE DAY WITH MITO: Treatments, Supplements and Humor
GETTING THROUGH THE DAY WITH MITO: Treatments, Supplements and Humor
 
Ibf 8-risk-return
Ibf 8-risk-returnIbf 8-risk-return
Ibf 8-risk-return
 
The inaugural Professional Diversity Network Diversity Jobs Index and Report
The inaugural Professional Diversity Network Diversity Jobs Index and ReportThe inaugural Professional Diversity Network Diversity Jobs Index and Report
The inaugural Professional Diversity Network Diversity Jobs Index and Report
 
IHOTE FESTA
IHOTE FESTAIHOTE FESTA
IHOTE FESTA
 
Leap pad explorer features 1
Leap pad explorer features 1Leap pad explorer features 1
Leap pad explorer features 1
 
Commercial areas design_guidelines
Commercial areas design_guidelinesCommercial areas design_guidelines
Commercial areas design_guidelines
 
Presentacion sintesis
Presentacion sintesisPresentacion sintesis
Presentacion sintesis
 
Hh5eko lagunak
Hh5eko lagunakHh5eko lagunak
Hh5eko lagunak
 
Hgth2
Hgth2Hgth2
Hgth2
 

Similar a First european research for web information extraction and analysis for supporting financial decision making v0.4

The New Normal: Predictive Power on the Front Lines
The New Normal: Predictive Power on the Front LinesThe New Normal: Predictive Power on the Front Lines
The New Normal: Predictive Power on the Front LinesInside Analysis
 
Predictive analytics km chicago
Predictive analytics km chicagoPredictive analytics km chicago
Predictive analytics km chicagoKM Chicago
 
Customer Insights Summit Toronto 2012
Customer Insights Summit Toronto 2012Customer Insights Summit Toronto 2012
Customer Insights Summit Toronto 2012Fabiana Pereira
 
Developing an Analytical Mindset – Becoming an Analytical Competitor
Developing an Analytical Mindset – Becoming an Analytical CompetitorDeveloping an Analytical Mindset – Becoming an Analytical Competitor
Developing an Analytical Mindset – Becoming an Analytical CompetitorSAS Asia Pacific
 
Think Big Analytics AWS for Financial Services
Think Big Analytics AWS for Financial ServicesThink Big Analytics AWS for Financial Services
Think Big Analytics AWS for Financial ServicesAmazon Web Services
 
Mesh Labs Introduction June 2012
Mesh Labs Introduction June 2012Mesh Labs Introduction June 2012
Mesh Labs Introduction June 2012Umesh Ramalingachar
 
Smart analytics - Data Visualisation and Predictive Analytics solutions for F...
Smart analytics - Data Visualisation and Predictive Analytics solutions for F...Smart analytics - Data Visualisation and Predictive Analytics solutions for F...
Smart analytics - Data Visualisation and Predictive Analytics solutions for F...Srini Bezwada
 
Open Source for Enterprise Search: Breaking Down the Barriers to Information
Open Source for Enterprise Search: Breaking Down the Barriers to InformationOpen Source for Enterprise Search: Breaking Down the Barriers to Information
Open Source for Enterprise Search: Breaking Down the Barriers to InformationLucidworks (Archived)
 
Prepping the Analytics organization for Artificial Intelligence evolution
Prepping the Analytics organization for Artificial Intelligence evolutionPrepping the Analytics organization for Artificial Intelligence evolution
Prepping the Analytics organization for Artificial Intelligence evolutionRamkumar Ravichandran
 
Iotx futures research_futures_trends_2011
Iotx futures research_futures_trends_2011Iotx futures research_futures_trends_2011
Iotx futures research_futures_trends_2011Andy Hunter
 
Marshall Sponder - Social Media Monitoring Analytics - Measure13
Marshall Sponder - Social Media Monitoring Analytics - Measure13Marshall Sponder - Social Media Monitoring Analytics - Measure13
Marshall Sponder - Social Media Monitoring Analytics - Measure13Our Social Times
 
[En] Spotter Solutions and Applications 2013
[En] Spotter Solutions and Applications 2013[En] Spotter Solutions and Applications 2013
[En] Spotter Solutions and Applications 2013Celine Molina
 
XMANAI Technical Project Overview
XMANAI Technical Project OverviewXMANAI Technical Project Overview
XMANAI Technical Project OverviewXMANAI
 
20121108 sntmnt data_sciencenl
20121108 sntmnt data_sciencenl20121108 sntmnt data_sciencenl
20121108 sntmnt data_sciencenldatasciencenl
 
Hispanic Digital and Print Media Conference 2012 - Oscar Padilla
Hispanic Digital and Print Media Conference 2012 - Oscar PadillaHispanic Digital and Print Media Conference 2012 - Oscar Padilla
Hispanic Digital and Print Media Conference 2012 - Oscar PadillaPortada
 
Analytics for fundraisers
Analytics for fundraisersAnalytics for fundraisers
Analytics for fundraisersSrini Bezwada
 
How is Watson Changing the Future of the Automative Industry?
How is Watson Changing the Future of the Automative Industry?How is Watson Changing the Future of the Automative Industry?
How is Watson Changing the Future of the Automative Industry?IBM Watson
 
Big data and Analytics
Big data and AnalyticsBig data and Analytics
Big data and AnalyticsKevin Magee
 
Big dataforcf os1_23_12_final
Big dataforcf os1_23_12_finalBig dataforcf os1_23_12_final
Big dataforcf os1_23_12_finalBurrPilgerMayer
 

Similar a First european research for web information extraction and analysis for supporting financial decision making v0.4 (20)

The New Normal: Predictive Power on the Front Lines
The New Normal: Predictive Power on the Front LinesThe New Normal: Predictive Power on the Front Lines
The New Normal: Predictive Power on the Front Lines
 
Predictive analytics km chicago
Predictive analytics km chicagoPredictive analytics km chicago
Predictive analytics km chicago
 
Customer Insights Summit Toronto 2012
Customer Insights Summit Toronto 2012Customer Insights Summit Toronto 2012
Customer Insights Summit Toronto 2012
 
Developing an Analytical Mindset – Becoming an Analytical Competitor
Developing an Analytical Mindset – Becoming an Analytical CompetitorDeveloping an Analytical Mindset – Becoming an Analytical Competitor
Developing an Analytical Mindset – Becoming an Analytical Competitor
 
Think Big Analytics AWS for Financial Services
Think Big Analytics AWS for Financial ServicesThink Big Analytics AWS for Financial Services
Think Big Analytics AWS for Financial Services
 
Mesh Labs Introduction June 2012
Mesh Labs Introduction June 2012Mesh Labs Introduction June 2012
Mesh Labs Introduction June 2012
 
Smart analytics - Data Visualisation and Predictive Analytics solutions for F...
Smart analytics - Data Visualisation and Predictive Analytics solutions for F...Smart analytics - Data Visualisation and Predictive Analytics solutions for F...
Smart analytics - Data Visualisation and Predictive Analytics solutions for F...
 
Open Source for Enterprise Search: Breaking Down the Barriers to Information
Open Source for Enterprise Search: Breaking Down the Barriers to InformationOpen Source for Enterprise Search: Breaking Down the Barriers to Information
Open Source for Enterprise Search: Breaking Down the Barriers to Information
 
Prepping the Analytics organization for Artificial Intelligence evolution
Prepping the Analytics organization for Artificial Intelligence evolutionPrepping the Analytics organization for Artificial Intelligence evolution
Prepping the Analytics organization for Artificial Intelligence evolution
 
Iotx futures research_futures_trends_2011
Iotx futures research_futures_trends_2011Iotx futures research_futures_trends_2011
Iotx futures research_futures_trends_2011
 
Marshall Sponder - Social Media Monitoring Analytics - Measure13
Marshall Sponder - Social Media Monitoring Analytics - Measure13Marshall Sponder - Social Media Monitoring Analytics - Measure13
Marshall Sponder - Social Media Monitoring Analytics - Measure13
 
[En] Spotter Solutions and Applications 2013
[En] Spotter Solutions and Applications 2013[En] Spotter Solutions and Applications 2013
[En] Spotter Solutions and Applications 2013
 
XMANAI Technical Project Overview
XMANAI Technical Project OverviewXMANAI Technical Project Overview
XMANAI Technical Project Overview
 
20121108 sntmnt data_sciencenl
20121108 sntmnt data_sciencenl20121108 sntmnt data_sciencenl
20121108 sntmnt data_sciencenl
 
Hispanic Digital and Print Media Conference 2012 - Oscar Padilla
Hispanic Digital and Print Media Conference 2012 - Oscar PadillaHispanic Digital and Print Media Conference 2012 - Oscar Padilla
Hispanic Digital and Print Media Conference 2012 - Oscar Padilla
 
Analytics for fundraisers
Analytics for fundraisersAnalytics for fundraisers
Analytics for fundraisers
 
How is Watson Changing the Future of the Automative Industry?
How is Watson Changing the Future of the Automative Industry?How is Watson Changing the Future of the Automative Industry?
How is Watson Changing the Future of the Automative Industry?
 
Big data and Analytics
Big data and AnalyticsBig data and Analytics
Big data and Analytics
 
IBM Stream au Hadoop User Group
IBM Stream au Hadoop User GroupIBM Stream au Hadoop User Group
IBM Stream au Hadoop User Group
 
Big dataforcf os1_23_12_final
Big dataforcf os1_23_12_finalBig dataforcf os1_23_12_final
Big dataforcf os1_23_12_final
 

Último

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 

Último (20)

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 

First european research for web information extraction and analysis for supporting financial decision making v0.4

  • 1. FIRST European research for web information extraction and analysis for supporting financial decision making ABI Lab Forum 2012 Tomás Pariente Lobo – Atos Spain
  • 3. Why FIRST? - Motivations The most reliable data sources today… …also have their weakness! They do not consider unstructured data, rumors, market sentiments, etc. 3
  • 4. Why FIRST? - Motivations Example: Apple iPhone 1 Announcement on 2007-01-09  Stock prices were skyrocketing after the announcement. However, the announcement could be sensed before… 4
  • 5. Why FIRST? - Motivations Example: Market surveillance via FIRST (the Google news case)  September 2008: Google news announced “United Airlines bankruptcy”.  Within 12 minutes  stock price decreased 75%  wiped out US $ 1bn.  The “news” was actually 6 years old…  Plausibility checking will help in identifying hoaxes: consistence with regulatory news and other sources. 5
  • 6. Why FIRST? – Motivations A growing universe of unstructured data … how to separate the wheat from the chaff ? 6
  • 8. FIRST Project European-funded research project Project facts Running from October 2010 until September 2013 9 partners More than 30 people Preliminary results available More to come... Stay tuned (http://project-first.eu/) 8
  • 9. Who is behind FIRST? Industrial partners Academic/Research SMEs
  • 10. FIRST Vision Vision is to make available the relevant information of the entire financial information space (including unreliable, unstructured, sentiment sources) to the decision maker in near-real time in an automated way 10
  • 11. FIRST Vision Financial Resources Structured AUTOMATION Acquisition Processing Analysis Decision support Unstructured Blog, analysis, bulletin boards… Unreliable, poor quality, noisy… 11
  • 13. Mining the Web for financial texts Data Acquisition pipeline: Web mining Natural Language preprocessing and entity extraction Streaming Cleaning Financial terms, Companies, Intruments …
  • 14. Data acquisition after one year Some numbers 176 Web sites 2,671 RSS sources ~40,000 documents per day >5,000,000 documents by end of 2011 o And growing Essential for future evaluation and analysis 14
  • 15. Analysing sentiments in Web texts The Analytical Pipeline: Identify, extract, classify, aggregate Document SENTIMENT Document with SENTIMENT with Aggregated CLASSIFICATION sentiment AGGREGATION basic sentiments per object and feature sentences per object and feature annotations Indicators Object Positive sentiment Sentiment Sentences 15
  • 16. Supporting the decision making process The Decision Support techniques: Analysis and visualization Machine Learning FIRST Techniques Outputs: Acquisition & Analytical Forecasts of volatility or returns, Pipelines Qualitative Forecasting Alert on pump and Modeling Models dump, Reputation change of a counterpart Signals, Knowledge Charts, Base Topic Spaces, Topic Trends, Visualization Reports Techniques … 16
  • 17. Glassbox model Sentiment Drill down Document Objects sentences Features 17
  • 19. The three FIRST use cases & their relevance for the industry Market Surveillance  Capital markets compliance can be automated today using structured data, but the automation does not take unstructured data into account  FIRST will  make use of large volumes of unstructured data into financial compliance;  develop automated techniques to better detect market abuse/insider trading.. Reputational Risk Management  No off-the-shelf solutions or methodologies for reputational risk management.  FIRST will  provide a sustainable tool for reputational risk monitoring;  contribute to break new ground in this field of dramatically high impact in FSI. Retail Brokerage  Today, mainly based on quantitative analysis and key figures.  FIRST will  use unstructured data to leverage both information for private investors and sophisticated tools for professional users. 19
  • 20. 20 Stay tuned (http://project-first.eu/)
  • 21. Acknowledgement The research leading to these results has received funding from the European Community's Seventh Framework Programme (FP7/2007-2013) under grant agreement n°257928. THANKS

Notas del editor

  1. Explain the events captured: Greek crisis, sovereing debt crises. EU central bank loans, Italina prime minister change…
  2. Stress and explain (orally) that the analysis is object and feature (eg price, volatility, reputation) specific. Features can be identified by explicit mentions or by indicators that refer to specific features and specific types of objects (eg stocks). The ones in the example are fundamental micro indicators, indicating price change.