SlideShare una empresa de Scribd logo
1 de 24
Enterprise Java Tools & Techniques
                                 Data Aggregation & Grids in SOA
                       SiliconIndia – Java Conference – October 15th, 2011
                                            Hyderabad



                                                             SUNILA GOLLAPUDI
                                                                   Technology Specialist




-----------------------------------------------------------------------------------------------------------------------------------------------------------------------
© Copyright 2010                                                                                                                               October 15th, 2011
AGENDA
    • Broadridge
             –    A Quick note on Our Business
    • Broader Picture – The Financial Service Industry
             –    Key Trends, Requirements and Challenges
    • Outlining the Problem Context and Solution Areas
    • A Solution that “Just Works” …
             – Service Oriented Architecture & Data Services
                      •   Scope, Architecture, Solutions & Options

             – Data Aggregation/Composition & Data Federation/Virtualization
                      •   Scope, Architecture, Solutions & Options

             – Data Availability & Reliability/Quality
                      •   Scope, Architecture, Solutions & Options

             – Scalability in large volume context
                      •   Scope, Architecture, Solutions & Options

    • Q&A
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------
© Copyright 2010                                                       October 15th, 2011
Broadridge
                                                 in annual revenues as of FY 2010
                                                   in annual pre-tax earnings
                                              of experience in securities processing
     We are an                      business
  Industry Leader
                                   Senior management team averages 15 years of
                                    tenure
                                   Ranked #1 in Black Book of Outsourcing:
                                    Brokerage Processing Providers Survey
                                       #1 in 14 of 18 categories surveyed vs. 52
                                         other providers
                                                                                                                 Securities                  Investor
                                                                                                                Processing                   Communications
                                   Pure             and Outsourcing service provider
                                                                                                                  Since 1962                 Since 1989
                                   Components of our securities processing solutions
                                    used by 8-of-the-top-10 U.S. broker-dealers

   We provide
                                                                                                                                Business
  Mission-Critical                                                                                                               Process
    Solutions                                                                                                                  Outsourcing
                                                       annually for every broker/dealer
                                    in the U.S.                                                                                 Since 2004




                                   Securities processing capabilities for more than 50
Our offerings are                   countries
broad and flexible                 Solutions ranging from hosted service bureau, to
                                    customized BPO supporting full outsourcing
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------
© Copyright 2010                                                       October 15th, 2011
Financial Service Industry
                                         (Key Trends, Requirements & Challenges)

    •      Mergers & Acquisitions
             –    Financial Industry is most dynamic and M&A that enhance market position or add offerings is
                  frequent
             –    Challenge is to federate the Duplicate Data
    •      Compliance Reporting
             –    Compliance to changing Regulatory norms is one of the biggest needs in Financial Service
                  Industry.
             –    As internal systems are optimized for operations & not compliance, integrating the data from
                  these systems often proves to be expensive
             –    Challenge is to virtualize data across these operational systems
    •      Risk Management
             –    Financial institutions must continuously monitor exposure to a range of risks
             –    Aggregating a single view of institution-wide risk in real-time is the requirement
             –    Challenge is DATA: Data Quality, Data Availability and Data Access
    •      Reference Data Sharing
             –     Reference data facilitates rapid, error-free execution of financial transactions and analysis across multiple
                   geographical markets
             – Challenge is single virtual source for reference data and enables federation of additional master and
                   operational data to complete a financial transaction or analysis
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------
© Copyright 2010                                                       October 15th, 2011
Outlining the Problem Context

    • Technology Agnostic way of Accessing Data
             Data Services and Service Oriented Architecture

    • Centralized Data Access
             Data Aggregation/Composition & Date Federation/ Virtualization
    • Data Reliability & Availability – Seamless access to Real-time Data
             Data Cleansing, Clustered Caching, Transparent Data Partitioning with
             Failover and Fail backing
    • Enormous Data Loads and Transaction Volumes
             Parallel Programming, MapReduce Techniques, NoSQL
             Options, & the Data GRID
           Let’s look at various Technology Options, Tools and Framework
                                     Alternatives ?
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------
© Copyright 2010                                                       October 15th, 2011
An Integration Platform
                            Data Services & Service Oriented Architecture


    • Service Oriented Architecture
             – An Architecture Style that provides a Technology agnostic way
               of Integrating Disparate Applications within an Enterprise by
               improving Reuse and eliminating duplication of Application Logic
             – Componentization & Servicization is what typically happens
               here.
             – Types of Components, in the order of increasing consolidation:
                      • Data Services that provide access to data without a need for worrying
                        the data representation or storage
                      • Business Services that contain business logic for specific, well-
                        defined tasks and perform business transactions via data services
                      • Business Processes that coordinate multiple business services within
                        the context of a workflow.

-----------------------------------------------------------------------------------------------------------------------------------------------------------------------
© Copyright 2010                                                       October 15th, 2011
An Integration Platform
                            Data Services & Service Oriented Architecture
                                                                                  • Implementation Options for
                                                                                    Service Oriented Data Access:
                                                                                         The heart of any enterprise application is
                                                                                         data. Applications provide the ability to
                                                                                         view, sort, filter, edit, create, and delete
                                                                                         data
                                                                                          – Expose a Database as a Service
                                                                                              (above an ORM Layer)
                                                                                          – Expose a Stored Procedure as a
                                                                                              Service
                                                                                          – Expose a DAO as a Service
                                                                                          – Wrap an existing business object (EJB
Data services provide agility and reuse                                                       or POJO) with a web service
 Data Services are essentially the SOA-                                                         The term “Service” here is corresponds
 equivalent of the Data Access Object pattern                                                   to a Web Service


-----------------------------------------------------------------------------------------------------------------------------------------------------------------------
© Copyright 2010                                                       October 15th, 2011
Composite Applications



Tools & Framework Options:
                                                        Business Processes
1.      Any Java Application /
        Web Server
2.      Web Service Engine                            Business Services
        like Axis / Metro
3.      Specific Data Service
        Frameworks like
        1.    WSO2 Data
              Services with
                                                      Data Services
              any Application
              Server
        2.    Composite
              Software Data
              Services etc …
        3.    JBoss                                    Data Stores
              Enterprise Data
              Services
              Platform


                                                           Are we missing something ?
                                         Yes, What about the issue of Data Duplication?
 -----------------------------------------------------------------------------------------------------------------------------------------------------------------------
© Copyright 2010                                                        October 15th, 2011
Data Integration Techniques
• Going beyond a Simple Data Access …
• Data Integration Patterns:
   • Data Federation Pattern (also referred as Virtualization)


                Data Virtualization
              (using a Data Federation Server)




   • Data Aggregation Pattern (also referred as Consolidation)


                   Target Database



                       Apply/Load
                    Process/Transform
                      Gather/Extract
Data Services using Federation Technique

    •      Context
            – Need for unified view of data that often involves the integration of
              a bewildering array of disparate backend sources, and services
    •      Value
             –    Transparency of underlying heterogeneity
             –    Time-to-market advantage
             –    Reduced Development & maintenance costs
             –    Performance Advantage
             –    Reusability Advantage
             –    Improved Governance
    •      Scope
             – Effectively join and process information from heterogeneous sources
             – Receive a query, transform it using complex query optimizing algorithms,
               creating a series of sub-operations that are invoked on the base system
             – Finally assembling the results and returning to the requesting system.

-----------------------------------------------------------------------------------------------------------------------------------------------------------------------
© Copyright 2010                                                       October 15th, 2011
The Data Federation Pattern


                                                                                                                 The functionality of the data
                                                                                                                 federation pattern can be
                                                                                                                 implemented using either
                                                                                                                 database-related technologies
                                                                                                                 such as optimizer or
                                                                                                                 compensation, or by home-
                                                                                                                 grown applications. Due to the
                                                                                                                 complexity of query optimization
                                                                                                                 over heterogeneous sources, it
                                                                                                                 is an industry best practice to
                                                                                                                 use a data federation
                                                                                                                 implementation that leverages
                                                                                                                 query optimization technology
                                                                                                                 as provided by most database
                                                                                                                 management systems




-----------------------------------------------------------------------------------------------------------------------------------------------------------------------
© Copyright 2010                                                       October 15th, 2011
Data Services Using Consolidation Techniques

    •      Context                                                                       •      Scope
             – Integrate Information from Sources                                                 •      Phase 1: Data Aggregation
               that are highly Heterogeneous in                                                          Server gathers / extracts data
               nature                                                                                    from the data sources
             – Need for extensive transformation                                                  •      Phase 2: Source Data is
               logic to resolve data conflicts                                                           integrated and transformed to
             – Need for Data event publishing                                                            conform to target model
                                                                                                  •      Phase 3: Apply the transformed
    •      Value
                                                                                                         data to the target data store
             –    Transparency
             –    Reusability                                                                                         Target Database
             –    Improved Governance
             –    Additionally, Single version of                                                                    3. Apply/Load
                  truth – high quality that involves                                                                 2. Process/Transform
                                                                                                                     1. Gather/Extract
                  resolving of data conflicts



-----------------------------------------------------------------------------------------------------------------------------------------------------------------------
© Copyright 2010                                                       October 15th, 2011
Data Federation Tools / Frameworks
                                  Composite Applications




    Business Processes                                                                                              •      IBM
                                                                                                                             –     Websphere
                                                                                                                                   Information Integrator
                                                                                                                             –     Websphere
  Business Services                                                                                                                Information Integrator
                                                                                                                                   Classic Federation
                                                                                                                             –     Websphere
                                                                                                                                   Information Services
                                                                                                                                   Director
  Data Services
                                                                                                                             –     Websphere DataStage
                                                                                                                    •      Composite Software
                                                                                                                           Data Federation
                                                                                                                           Service
                                       Data Integration Layer
                                                                                                                    •      Progress Software
   Data Stores                                                                                                             ObjectStore
                                                                                                                    •      Oracle Data Integrator


-----------------------------------------------------------------------------------------------------------------------------------------------------------------------
© Copyright 2010                                                       October 15th, 2011
Data Availability & Reliability

    • SOA Demands High Data Reliability and Availability
    •      Solutions / Options to achieve reliability & availability
             – Reliability Techniques
                      • Data Cleansing Pattern
                      • Master Data Management
             – Availability Techniques
                      •   Clustered Caching
                      •   State Management Through Virtualization
                      •   Transparent Data Partitioning
                      •   Failover and Failback
                      •   Insulation from Failures in Other Services


-----------------------------------------------------------------------------------------------------------------------------------------------------------------------
© Copyright 2010                                                       October 15th, 2011
Data Reliability Option: Data Cleansing Pattern

    • Scope:
             – Parsing of input data and association to standard and fine-grained
               elements
             – Standardization of data
             – Matching and de-duplication of data entries
             – Survivorship of the correct information




-----------------------------------------------------------------------------------------------------------------------------------------------------------------------
© Copyright 2010                                                       October 15th, 2011
Clustered Caching ensures Availability and thus
                                reliability
        •      In-memory data management solution is what is required
        •      It makes sharing and managing data in a cluster as simple as on a single
               server. It accomplishes this by coordinating updates to the data by using
               clusterwide concurrency control, replicating and distributing data
               modifications across the cluster by using the highest-performing clustered
               protocol available, and delivering notifications of data modifications to any
               servers that request them.
                                   Data Services




                                  Cache
                                                                        Data Integration Layer
                                    Data Stores



-----------------------------------------------------------------------------------------------------------------------------------------------------------------------
© Copyright 2010                                                       October 15th, 2011
Data Availability & Reliability : Other Techniques


    •      State Management Through Virtualization
    •      Failover and Failback
    •      Insulation from Failures in Other Service
    •      Transparent Data Partitioning Achieves Continuous Availability
           and Reliability

    Tool Options
    1. IBM Websphere Quality Stage
    2. Many MDM Vendors, IBM, Oracle, Informatica etc …
    3. Oracle Coherence
    4. MemCached
    5. EHCache (Open Source)
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------
© Copyright 2010                                                       October 15th, 2011
Performance
                            (Scalability,            Throughput, and Points of Congestion )
    • Bottlenecks
             –    Shared intermediary services - Such services perform common integration tasks such as
                  data transformation, content based routing, and filtering.
             –    The services themselves - That is, application code exposed as a service and invoked by
                  other services on the network, whether directly or through an orchestration engine.
             –    SOA infrastructure operations
                  In most cases, the scalability bottlenecks across these SOA components are caused when
                  disk I/O, memory, or CPU saturation levels are reached.
    • Need
             –    Large Datasets
             –    Enormous Loads & Large Transaction Volumes handled Without Compromise

    • Solution
             – Data Store alternatives like NoSQL with Parallel Processing /
               Programming MapReduce Techniques and
             – Data Grids


-----------------------------------------------------------------------------------------------------------------------------------------------------------------------
© Copyright 2010                                                       October 15th, 2011
NoSQL – Not Only SQL
•     What’s wrong this RDBMSs ?                                                               •      NoSQL Usecases
        – Well, actually nothing… it just has                                                           •      Logging/Archiving
          limitations                                                                                   •      Social Computing Insight
                 •    They use table-based normalization approach
                                                                                                        •      External Data Feed Integration
                 •    They allow versioning
                 •    Performance falls off as RDBMS normalizes data                                    •      Front-ens order processing
                      as the data grows.                                                                       systems
•     NoSQL Databases                                                                                   •      Real-time stats & analytics
        –     Non Relational
        –     Designed for distributed data stores for large
              data needs
        –     Doesn’t require fixed table schemes
        –     (Usually) avoids join operations
        –     Can Scale Horizontally – allows adding more
              nodes to storage systems
        –      Types
                 •    Key-Value Stores
                 •    Column Family Stores
                 •    Document Databases
                 •    Graph Databases
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------
© Copyright 2010                                                       October 15th, 2011
Teamed with MapReduce Technique
•     What’s MapReduce ?
        – Restricted Parallel Programming model for large clusters (User implements Map()
          and Reduce())
        – Parallel Computing Framework
                 • Libraries take care of EVERYTHING else
                          –    Parallelization
                          –    Fault Tolerance
                          –    Data Distribution
                          –    Load Balancing
        – Map and Reduce (borrowed from Lisp)
                 • Map() – Processes a Key-Value Pair that produces immediate Key-Value Pairs
                 • Reduce() – Marge all intermediate values associated with the same key




-----------------------------------------------------------------------------------------------------------------------------------------------------------------------
© Copyright 2010                                                       October 15th, 2011
MapReduce Execution Model




-----------------------------------------------------------------------------------------------------------------------------------------------------------------------
© Copyright 2010                                                       October 15th, 2011
SOA with Data Grid Option
                                                (Service Oriented Computing)
• An SOA grid transparently solves many of the difficult
  problems related to achieving high availability, reliability,
  scalability, and performance in a distributed environment




-----------------------------------------------------------------------------------------------------------------------------------------------------------------------
© Copyright 2010                                                       October 15th, 2011
Tools/Products/Frameworks
    • NoSQL Options
             –    HBase
             –    Cassandra
             –    CoucheDB
             –    MongoDB
    • MapReduce Options
             – Hadoop
             – Amazon Elastic MapReduce
             – GreenPlum MapReduce
    • Grid Options
             – IBM Globus
             – Sun Grid Engine
             – Oracle Coherence
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------
© Copyright 2010                                                       October 15th, 2011
Q&A




     • Sunila.Gollapudi@broadridge.com

-----------------------------------------------------------------------------------------------------------------------------------------------------------------------
© Copyright 2010                                                       October 15th, 2011

Más contenido relacionado

La actualidad más candente

Managed Services and Outsourcing in Telecoms
Managed Services and Outsourcing in TelecomsManaged Services and Outsourcing in Telecoms
Managed Services and Outsourcing in TelecomsAlan Quayle
 
E Payables Aberdeen Report Aug 2010
E Payables Aberdeen Report Aug 2010E Payables Aberdeen Report Aug 2010
E Payables Aberdeen Report Aug 2010nsprau
 
One Bank, One Architecture
One Bank, One ArchitectureOne Bank, One Architecture
One Bank, One ArchitecturePhi Jack
 
White Paper - The Business Case For Business Intelligence
White Paper -  The Business Case For Business IntelligenceWhite Paper -  The Business Case For Business Intelligence
White Paper - The Business Case For Business IntelligenceDavid Walker
 
Newgen Insurance Spectrum
Newgen Insurance SpectrumNewgen Insurance Spectrum
Newgen Insurance SpectrumRahul Bhatia
 
IT Shared Services Costing
IT Shared Services CostingIT Shared Services Costing
IT Shared Services CostingMiguel Garcia
 
2009 05 08 Updated Cth Preso (6)
2009 05 08 Updated Cth Preso (6)2009 05 08 Updated Cth Preso (6)
2009 05 08 Updated Cth Preso (6)C.T. Hellmuth
 
Sample - Data Warehouse Requirements
Sample -  Data Warehouse RequirementsSample -  Data Warehouse Requirements
Sample - Data Warehouse RequirementsDavid Walker
 
Accounts Payable Optimization Survey sponsored by Canon Business Process Serv...
Accounts Payable Optimization Survey sponsored by Canon Business Process Serv...Accounts Payable Optimization Survey sponsored by Canon Business Process Serv...
Accounts Payable Optimization Survey sponsored by Canon Business Process Serv...marketing2015
 
Enterprise Architecture J.P Morgan Chase
Enterprise Architecture J.P Morgan ChaseEnterprise Architecture J.P Morgan Chase
Enterprise Architecture J.P Morgan ChaseHampus Ahlqvist
 
Capturing Data Requirements
Capturing Data RequirementsCapturing Data Requirements
Capturing Data Requirementsmcomtraining
 
Service catalogue
Service catalogueService catalogue
Service cataloguekanturek
 
Core banking transformation_measuring_the_value
Core banking transformation_measuring_the_valueCore banking transformation_measuring_the_value
Core banking transformation_measuring_the_valueNidal Bashaireh
 
Outsourcing
OutsourcingOutsourcing
OutsourcingRay Luis
 
The First 100 Days for a New CIO - Using the Innovation Value Institute IT Ca...
The First 100 Days for a New CIO - Using the Innovation Value Institute IT Ca...The First 100 Days for a New CIO - Using the Innovation Value Institute IT Ca...
The First 100 Days for a New CIO - Using the Innovation Value Institute IT Ca...Alan McSweeney
 

La actualidad más candente (20)

Managed Services and Outsourcing in Telecoms
Managed Services and Outsourcing in TelecomsManaged Services and Outsourcing in Telecoms
Managed Services and Outsourcing in Telecoms
 
E Payables Aberdeen Report Aug 2010
E Payables Aberdeen Report Aug 2010E Payables Aberdeen Report Aug 2010
E Payables Aberdeen Report Aug 2010
 
One Bank, One Architecture
One Bank, One ArchitectureOne Bank, One Architecture
One Bank, One Architecture
 
White Paper - The Business Case For Business Intelligence
White Paper -  The Business Case For Business IntelligenceWhite Paper -  The Business Case For Business Intelligence
White Paper - The Business Case For Business Intelligence
 
Lean Information Technology Webinar
Lean Information Technology WebinarLean Information Technology Webinar
Lean Information Technology Webinar
 
Newgen Insurance Spectrum
Newgen Insurance SpectrumNewgen Insurance Spectrum
Newgen Insurance Spectrum
 
BI Business Requirements - A Framework For Business Analysts
BI Business Requirements -  A Framework For Business AnalystsBI Business Requirements -  A Framework For Business Analysts
BI Business Requirements - A Framework For Business Analysts
 
IT Shared Services Costing
IT Shared Services CostingIT Shared Services Costing
IT Shared Services Costing
 
2009 05 08 Updated Cth Preso (6)
2009 05 08 Updated Cth Preso (6)2009 05 08 Updated Cth Preso (6)
2009 05 08 Updated Cth Preso (6)
 
Sample - Data Warehouse Requirements
Sample -  Data Warehouse RequirementsSample -  Data Warehouse Requirements
Sample - Data Warehouse Requirements
 
Accounts Payable Optimization Survey sponsored by Canon Business Process Serv...
Accounts Payable Optimization Survey sponsored by Canon Business Process Serv...Accounts Payable Optimization Survey sponsored by Canon Business Process Serv...
Accounts Payable Optimization Survey sponsored by Canon Business Process Serv...
 
Enterprise Architecture J.P Morgan Chase
Enterprise Architecture J.P Morgan ChaseEnterprise Architecture J.P Morgan Chase
Enterprise Architecture J.P Morgan Chase
 
Capturing Data Requirements
Capturing Data RequirementsCapturing Data Requirements
Capturing Data Requirements
 
Mohammad Elagha CV
Mohammad Elagha CVMohammad Elagha CV
Mohammad Elagha CV
 
Service catalogue
Service catalogueService catalogue
Service catalogue
 
Core banking transformation_measuring_the_value
Core banking transformation_measuring_the_valueCore banking transformation_measuring_the_value
Core banking transformation_measuring_the_value
 
Outsourcing
OutsourcingOutsourcing
Outsourcing
 
The First 100 Days for a New CIO - Using the Innovation Value Institute IT Ca...
The First 100 Days for a New CIO - Using the Innovation Value Institute IT Ca...The First 100 Days for a New CIO - Using the Innovation Value Institute IT Ca...
The First 100 Days for a New CIO - Using the Innovation Value Institute IT Ca...
 
Mi fin financial product suite
Mi fin financial product suiteMi fin financial product suite
Mi fin financial product suite
 
Bpr
BprBpr
Bpr
 

Similar a Sunila Silicon India Java Conference Session7 Enterprise Java Tool And Techniques Data Aggregation And Grids In Soa

Automotive IT Strategy: IT Sourcing
Automotive IT Strategy: IT SourcingAutomotive IT Strategy: IT Sourcing
Automotive IT Strategy: IT SourcingDeloitte Deutschland
 
Jon A Cohn - CTO / VP / Sr Director - joncohn@comcast.net
Jon A Cohn - CTO / VP / Sr Director - joncohn@comcast.netJon A Cohn - CTO / VP / Sr Director - joncohn@comcast.net
Jon A Cohn - CTO / VP / Sr Director - joncohn@comcast.netJon Cohn
 
AdvisorAssist Presentation: Cloud Computing and Compliance For RIAs
AdvisorAssist Presentation:  Cloud Computing and Compliance For RIAsAdvisorAssist Presentation:  Cloud Computing and Compliance For RIAs
AdvisorAssist Presentation: Cloud Computing and Compliance For RIAsAdvisorAssist, LLC
 
Backoffice Au Pk HTD
Backoffice Au Pk HTDBackoffice Au Pk HTD
Backoffice Au Pk HTDHujaj Khan
 
Dianne Wyllie, Brocade: Transforming the Annual Planning Process
Dianne Wyllie, Brocade: Transforming the Annual Planning ProcessDianne Wyllie, Brocade: Transforming the Annual Planning Process
Dianne Wyllie, Brocade: Transforming the Annual Planning ProcessUMT
 
Key Considerations While Rolling Out Denodo Platform
Key Considerations While Rolling Out Denodo PlatformKey Considerations While Rolling Out Denodo Platform
Key Considerations While Rolling Out Denodo PlatformDenodo
 
Att consulting external deck
Att consulting external deckAtt consulting external deck
Att consulting external deckEric Sineath
 
ADM Target Operating Models
ADM Target Operating ModelsADM Target Operating Models
ADM Target Operating ModelsSteven Hall
 
Sustainable Architectures
Sustainable Architectures Sustainable Architectures
Sustainable Architectures MrsAlways RigHt
 
[Webinar] - Using RPA to Accelerate the Benefits from Shared Services
[Webinar] - Using RPA to Accelerate the Benefits from Shared Services[Webinar] - Using RPA to Accelerate the Benefits from Shared Services
[Webinar] - Using RPA to Accelerate the Benefits from Shared ServicesJK Tech
 
Managed Services Balanced Scorecard Presentation By Sourcing Gurus
Managed Services Balanced Scorecard Presentation By Sourcing GurusManaged Services Balanced Scorecard Presentation By Sourcing Gurus
Managed Services Balanced Scorecard Presentation By Sourcing GurusSystems Plus Solutions
 
How to Improve Performance with Next-Gen Sales Enablement Technology in Finan...
How to Improve Performance with Next-Gen Sales Enablement Technology in Finan...How to Improve Performance with Next-Gen Sales Enablement Technology in Finan...
How to Improve Performance with Next-Gen Sales Enablement Technology in Finan...Perficient, Inc.
 
Esouag r12 presentation
Esouag r12 presentationEsouag r12 presentation
Esouag r12 presentationIshtiaq Khan
 
Pentaho technical whitepaper-1-6
Pentaho technical whitepaper-1-6Pentaho technical whitepaper-1-6
Pentaho technical whitepaper-1-6skonda
 
090119 Enabling Strategic Sourcing
090119 Enabling Strategic Sourcing090119 Enabling Strategic Sourcing
090119 Enabling Strategic SourcingHan Driessen
 
Crafting Your Accounting Innovation Strategy
Crafting Your Accounting Innovation StrategyCrafting Your Accounting Innovation Strategy
Crafting Your Accounting Innovation StrategyAggregage
 
About Sovereign Business Integration Group
About Sovereign Business Integration GroupAbout Sovereign Business Integration Group
About Sovereign Business Integration Grouppujad
 

Similar a Sunila Silicon India Java Conference Session7 Enterprise Java Tool And Techniques Data Aggregation And Grids In Soa (20)

Automotive IT Strategy 2021
Automotive IT Strategy 2021Automotive IT Strategy 2021
Automotive IT Strategy 2021
 
Automotive IT Strategy: IT Sourcing
Automotive IT Strategy: IT SourcingAutomotive IT Strategy: IT Sourcing
Automotive IT Strategy: IT Sourcing
 
Jon A Cohn - CTO / VP / Sr Director - joncohn@comcast.net
Jon A Cohn - CTO / VP / Sr Director - joncohn@comcast.netJon A Cohn - CTO / VP / Sr Director - joncohn@comcast.net
Jon A Cohn - CTO / VP / Sr Director - joncohn@comcast.net
 
AdvisorAssist Presentation: Cloud Computing and Compliance For RIAs
AdvisorAssist Presentation:  Cloud Computing and Compliance For RIAsAdvisorAssist Presentation:  Cloud Computing and Compliance For RIAs
AdvisorAssist Presentation: Cloud Computing and Compliance For RIAs
 
Backoffice Au Pk HTD
Backoffice Au Pk HTDBackoffice Au Pk HTD
Backoffice Au Pk HTD
 
Dianne Wyllie, Brocade: Transforming the Annual Planning Process
Dianne Wyllie, Brocade: Transforming the Annual Planning ProcessDianne Wyllie, Brocade: Transforming the Annual Planning Process
Dianne Wyllie, Brocade: Transforming the Annual Planning Process
 
Key Considerations While Rolling Out Denodo Platform
Key Considerations While Rolling Out Denodo PlatformKey Considerations While Rolling Out Denodo Platform
Key Considerations While Rolling Out Denodo Platform
 
Att consulting external deck
Att consulting external deckAtt consulting external deck
Att consulting external deck
 
ADM Target Operating Models
ADM Target Operating ModelsADM Target Operating Models
ADM Target Operating Models
 
Sustainable Architectures
Sustainable Architectures Sustainable Architectures
Sustainable Architectures
 
How to Win Friends and Save Money
How to Win Friends and Save MoneyHow to Win Friends and Save Money
How to Win Friends and Save Money
 
[Webinar] - Using RPA to Accelerate the Benefits from Shared Services
[Webinar] - Using RPA to Accelerate the Benefits from Shared Services[Webinar] - Using RPA to Accelerate the Benefits from Shared Services
[Webinar] - Using RPA to Accelerate the Benefits from Shared Services
 
Mindshare company presentation rev 9.0
Mindshare company presentation rev 9.0 Mindshare company presentation rev 9.0
Mindshare company presentation rev 9.0
 
Managed Services Balanced Scorecard Presentation By Sourcing Gurus
Managed Services Balanced Scorecard Presentation By Sourcing GurusManaged Services Balanced Scorecard Presentation By Sourcing Gurus
Managed Services Balanced Scorecard Presentation By Sourcing Gurus
 
How to Improve Performance with Next-Gen Sales Enablement Technology in Finan...
How to Improve Performance with Next-Gen Sales Enablement Technology in Finan...How to Improve Performance with Next-Gen Sales Enablement Technology in Finan...
How to Improve Performance with Next-Gen Sales Enablement Technology in Finan...
 
Esouag r12 presentation
Esouag r12 presentationEsouag r12 presentation
Esouag r12 presentation
 
Pentaho technical whitepaper-1-6
Pentaho technical whitepaper-1-6Pentaho technical whitepaper-1-6
Pentaho technical whitepaper-1-6
 
090119 Enabling Strategic Sourcing
090119 Enabling Strategic Sourcing090119 Enabling Strategic Sourcing
090119 Enabling Strategic Sourcing
 
Crafting Your Accounting Innovation Strategy
Crafting Your Accounting Innovation StrategyCrafting Your Accounting Innovation Strategy
Crafting Your Accounting Innovation Strategy
 
About Sovereign Business Integration Group
About Sovereign Business Integration GroupAbout Sovereign Business Integration Group
About Sovereign Business Integration Group
 

Sunila Silicon India Java Conference Session7 Enterprise Java Tool And Techniques Data Aggregation And Grids In Soa

  • 1. Enterprise Java Tools & Techniques Data Aggregation & Grids in SOA SiliconIndia – Java Conference – October 15th, 2011 Hyderabad SUNILA GOLLAPUDI Technology Specialist ----------------------------------------------------------------------------------------------------------------------------------------------------------------------- © Copyright 2010 October 15th, 2011
  • 2. AGENDA • Broadridge – A Quick note on Our Business • Broader Picture – The Financial Service Industry – Key Trends, Requirements and Challenges • Outlining the Problem Context and Solution Areas • A Solution that “Just Works” … – Service Oriented Architecture & Data Services • Scope, Architecture, Solutions & Options – Data Aggregation/Composition & Data Federation/Virtualization • Scope, Architecture, Solutions & Options – Data Availability & Reliability/Quality • Scope, Architecture, Solutions & Options – Scalability in large volume context • Scope, Architecture, Solutions & Options • Q&A ----------------------------------------------------------------------------------------------------------------------------------------------------------------------- © Copyright 2010 October 15th, 2011
  • 3. Broadridge in annual revenues as of FY 2010 in annual pre-tax earnings of experience in securities processing We are an business Industry Leader  Senior management team averages 15 years of tenure  Ranked #1 in Black Book of Outsourcing: Brokerage Processing Providers Survey  #1 in 14 of 18 categories surveyed vs. 52 other providers Securities Investor Processing Communications  Pure and Outsourcing service provider Since 1962 Since 1989  Components of our securities processing solutions used by 8-of-the-top-10 U.S. broker-dealers We provide Business Mission-Critical Process Solutions Outsourcing annually for every broker/dealer in the U.S. Since 2004  Securities processing capabilities for more than 50 Our offerings are countries broad and flexible  Solutions ranging from hosted service bureau, to customized BPO supporting full outsourcing ----------------------------------------------------------------------------------------------------------------------------------------------------------------------- © Copyright 2010 October 15th, 2011
  • 4. Financial Service Industry (Key Trends, Requirements & Challenges) • Mergers & Acquisitions – Financial Industry is most dynamic and M&A that enhance market position or add offerings is frequent – Challenge is to federate the Duplicate Data • Compliance Reporting – Compliance to changing Regulatory norms is one of the biggest needs in Financial Service Industry. – As internal systems are optimized for operations & not compliance, integrating the data from these systems often proves to be expensive – Challenge is to virtualize data across these operational systems • Risk Management – Financial institutions must continuously monitor exposure to a range of risks – Aggregating a single view of institution-wide risk in real-time is the requirement – Challenge is DATA: Data Quality, Data Availability and Data Access • Reference Data Sharing – Reference data facilitates rapid, error-free execution of financial transactions and analysis across multiple geographical markets – Challenge is single virtual source for reference data and enables federation of additional master and operational data to complete a financial transaction or analysis ----------------------------------------------------------------------------------------------------------------------------------------------------------------------- © Copyright 2010 October 15th, 2011
  • 5. Outlining the Problem Context • Technology Agnostic way of Accessing Data Data Services and Service Oriented Architecture • Centralized Data Access Data Aggregation/Composition & Date Federation/ Virtualization • Data Reliability & Availability – Seamless access to Real-time Data Data Cleansing, Clustered Caching, Transparent Data Partitioning with Failover and Fail backing • Enormous Data Loads and Transaction Volumes Parallel Programming, MapReduce Techniques, NoSQL Options, & the Data GRID Let’s look at various Technology Options, Tools and Framework Alternatives ? ----------------------------------------------------------------------------------------------------------------------------------------------------------------------- © Copyright 2010 October 15th, 2011
  • 6. An Integration Platform Data Services & Service Oriented Architecture • Service Oriented Architecture – An Architecture Style that provides a Technology agnostic way of Integrating Disparate Applications within an Enterprise by improving Reuse and eliminating duplication of Application Logic – Componentization & Servicization is what typically happens here. – Types of Components, in the order of increasing consolidation: • Data Services that provide access to data without a need for worrying the data representation or storage • Business Services that contain business logic for specific, well- defined tasks and perform business transactions via data services • Business Processes that coordinate multiple business services within the context of a workflow. ----------------------------------------------------------------------------------------------------------------------------------------------------------------------- © Copyright 2010 October 15th, 2011
  • 7. An Integration Platform Data Services & Service Oriented Architecture • Implementation Options for Service Oriented Data Access: The heart of any enterprise application is data. Applications provide the ability to view, sort, filter, edit, create, and delete data – Expose a Database as a Service (above an ORM Layer) – Expose a Stored Procedure as a Service – Expose a DAO as a Service – Wrap an existing business object (EJB Data services provide agility and reuse or POJO) with a web service Data Services are essentially the SOA- The term “Service” here is corresponds equivalent of the Data Access Object pattern to a Web Service ----------------------------------------------------------------------------------------------------------------------------------------------------------------------- © Copyright 2010 October 15th, 2011
  • 8. Composite Applications Tools & Framework Options: Business Processes 1. Any Java Application / Web Server 2. Web Service Engine Business Services like Axis / Metro 3. Specific Data Service Frameworks like 1. WSO2 Data Services with Data Services any Application Server 2. Composite Software Data Services etc … 3. JBoss Data Stores Enterprise Data Services Platform Are we missing something ? Yes, What about the issue of Data Duplication? ----------------------------------------------------------------------------------------------------------------------------------------------------------------------- © Copyright 2010 October 15th, 2011
  • 9. Data Integration Techniques • Going beyond a Simple Data Access … • Data Integration Patterns: • Data Federation Pattern (also referred as Virtualization) Data Virtualization (using a Data Federation Server) • Data Aggregation Pattern (also referred as Consolidation) Target Database Apply/Load Process/Transform Gather/Extract
  • 10. Data Services using Federation Technique • Context – Need for unified view of data that often involves the integration of a bewildering array of disparate backend sources, and services • Value – Transparency of underlying heterogeneity – Time-to-market advantage – Reduced Development & maintenance costs – Performance Advantage – Reusability Advantage – Improved Governance • Scope – Effectively join and process information from heterogeneous sources – Receive a query, transform it using complex query optimizing algorithms, creating a series of sub-operations that are invoked on the base system – Finally assembling the results and returning to the requesting system. ----------------------------------------------------------------------------------------------------------------------------------------------------------------------- © Copyright 2010 October 15th, 2011
  • 11. The Data Federation Pattern The functionality of the data federation pattern can be implemented using either database-related technologies such as optimizer or compensation, or by home- grown applications. Due to the complexity of query optimization over heterogeneous sources, it is an industry best practice to use a data federation implementation that leverages query optimization technology as provided by most database management systems ----------------------------------------------------------------------------------------------------------------------------------------------------------------------- © Copyright 2010 October 15th, 2011
  • 12. Data Services Using Consolidation Techniques • Context • Scope – Integrate Information from Sources • Phase 1: Data Aggregation that are highly Heterogeneous in Server gathers / extracts data nature from the data sources – Need for extensive transformation • Phase 2: Source Data is logic to resolve data conflicts integrated and transformed to – Need for Data event publishing conform to target model • Phase 3: Apply the transformed • Value data to the target data store – Transparency – Reusability Target Database – Improved Governance – Additionally, Single version of 3. Apply/Load truth – high quality that involves 2. Process/Transform 1. Gather/Extract resolving of data conflicts ----------------------------------------------------------------------------------------------------------------------------------------------------------------------- © Copyright 2010 October 15th, 2011
  • 13. Data Federation Tools / Frameworks Composite Applications Business Processes • IBM – Websphere Information Integrator – Websphere Business Services Information Integrator Classic Federation – Websphere Information Services Director Data Services – Websphere DataStage • Composite Software Data Federation Service Data Integration Layer • Progress Software Data Stores ObjectStore • Oracle Data Integrator ----------------------------------------------------------------------------------------------------------------------------------------------------------------------- © Copyright 2010 October 15th, 2011
  • 14. Data Availability & Reliability • SOA Demands High Data Reliability and Availability • Solutions / Options to achieve reliability & availability – Reliability Techniques • Data Cleansing Pattern • Master Data Management – Availability Techniques • Clustered Caching • State Management Through Virtualization • Transparent Data Partitioning • Failover and Failback • Insulation from Failures in Other Services ----------------------------------------------------------------------------------------------------------------------------------------------------------------------- © Copyright 2010 October 15th, 2011
  • 15. Data Reliability Option: Data Cleansing Pattern • Scope: – Parsing of input data and association to standard and fine-grained elements – Standardization of data – Matching and de-duplication of data entries – Survivorship of the correct information ----------------------------------------------------------------------------------------------------------------------------------------------------------------------- © Copyright 2010 October 15th, 2011
  • 16. Clustered Caching ensures Availability and thus reliability • In-memory data management solution is what is required • It makes sharing and managing data in a cluster as simple as on a single server. It accomplishes this by coordinating updates to the data by using clusterwide concurrency control, replicating and distributing data modifications across the cluster by using the highest-performing clustered protocol available, and delivering notifications of data modifications to any servers that request them. Data Services Cache Data Integration Layer Data Stores ----------------------------------------------------------------------------------------------------------------------------------------------------------------------- © Copyright 2010 October 15th, 2011
  • 17. Data Availability & Reliability : Other Techniques • State Management Through Virtualization • Failover and Failback • Insulation from Failures in Other Service • Transparent Data Partitioning Achieves Continuous Availability and Reliability Tool Options 1. IBM Websphere Quality Stage 2. Many MDM Vendors, IBM, Oracle, Informatica etc … 3. Oracle Coherence 4. MemCached 5. EHCache (Open Source) ----------------------------------------------------------------------------------------------------------------------------------------------------------------------- © Copyright 2010 October 15th, 2011
  • 18. Performance (Scalability, Throughput, and Points of Congestion ) • Bottlenecks – Shared intermediary services - Such services perform common integration tasks such as data transformation, content based routing, and filtering. – The services themselves - That is, application code exposed as a service and invoked by other services on the network, whether directly or through an orchestration engine. – SOA infrastructure operations In most cases, the scalability bottlenecks across these SOA components are caused when disk I/O, memory, or CPU saturation levels are reached. • Need – Large Datasets – Enormous Loads & Large Transaction Volumes handled Without Compromise • Solution – Data Store alternatives like NoSQL with Parallel Processing / Programming MapReduce Techniques and – Data Grids ----------------------------------------------------------------------------------------------------------------------------------------------------------------------- © Copyright 2010 October 15th, 2011
  • 19. NoSQL – Not Only SQL • What’s wrong this RDBMSs ? • NoSQL Usecases – Well, actually nothing… it just has • Logging/Archiving limitations • Social Computing Insight • They use table-based normalization approach • External Data Feed Integration • They allow versioning • Performance falls off as RDBMS normalizes data • Front-ens order processing as the data grows. systems • NoSQL Databases • Real-time stats & analytics – Non Relational – Designed for distributed data stores for large data needs – Doesn’t require fixed table schemes – (Usually) avoids join operations – Can Scale Horizontally – allows adding more nodes to storage systems – Types • Key-Value Stores • Column Family Stores • Document Databases • Graph Databases ----------------------------------------------------------------------------------------------------------------------------------------------------------------------- © Copyright 2010 October 15th, 2011
  • 20. Teamed with MapReduce Technique • What’s MapReduce ? – Restricted Parallel Programming model for large clusters (User implements Map() and Reduce()) – Parallel Computing Framework • Libraries take care of EVERYTHING else – Parallelization – Fault Tolerance – Data Distribution – Load Balancing – Map and Reduce (borrowed from Lisp) • Map() – Processes a Key-Value Pair that produces immediate Key-Value Pairs • Reduce() – Marge all intermediate values associated with the same key ----------------------------------------------------------------------------------------------------------------------------------------------------------------------- © Copyright 2010 October 15th, 2011
  • 22. SOA with Data Grid Option (Service Oriented Computing) • An SOA grid transparently solves many of the difficult problems related to achieving high availability, reliability, scalability, and performance in a distributed environment ----------------------------------------------------------------------------------------------------------------------------------------------------------------------- © Copyright 2010 October 15th, 2011
  • 23. Tools/Products/Frameworks • NoSQL Options – HBase – Cassandra – CoucheDB – MongoDB • MapReduce Options – Hadoop – Amazon Elastic MapReduce – GreenPlum MapReduce • Grid Options – IBM Globus – Sun Grid Engine – Oracle Coherence ----------------------------------------------------------------------------------------------------------------------------------------------------------------------- © Copyright 2010 October 15th, 2011
  • 24. Q&A • Sunila.Gollapudi@broadridge.com ----------------------------------------------------------------------------------------------------------------------------------------------------------------------- © Copyright 2010 October 15th, 2011