SlideShare una empresa de Scribd logo
1 de 8
Descargar para leer sin conexión
View Point




                     Use of Big Data Technologies in Capital Markets
                  - Ruchi Verma, Sathyan R Mani



                  Abstract
                  Data is growing at a tremendous rate with an increase in digital universe from 281 Exabyte’s (year 2007) to 1,800 Exabyte’s (year
                  2011). However the increase in data is, in itself, a minor problem, but the increase in percentage of unstructured data in the overall
                  data volume is what is concerning all, including Wall Street.
                  Capital market firms have been transforming organically to handle the big data over years. This article discusses the key
                  transformations that capital market firms are undergoing to handle big data, drivers for use of big data technology in capital markets
                  and relevant uses cases.
                  The article further outlines the future big data technologies to handle big data complexity and responsiveness – a sense & response
                  system, a federated database and in memory analytics.




             www.infosys.com
Data is growing at a tremendous rate with an overall increase in digital universe with petabytes of data flowing from
 OVERVIEW
              messaging, emails, open source surveys, blogs, Twitters, Facebook and so on. According to studies conducted by top market
              research firms, the big data market is expected to grow from US$3.2 billion in 2010 to US$16.9 billion in 2015. The next big
              challenge for capital markets will be the velocity of data (value of extremely large and diverse data sets), considering the
              frequency of data generation and delivery in financial services industries like investment banking (highest stored data per
              firm), and the need to be still analysed in near real-time. Responding to these trends, capital market firms are setting aside
              a bigger pie of IT spend on big data technologies.

              According to a report by a major analyst firm, using big data analytics in the field of customer intelligence in the investment
              banking sector in UK has reaped benefits of £554 million and it is expected to go up to £5,275 million by the end of 2017.
              Investment banks in UK are spending 4% of their gross operating surplus on big data R&D.

              Capital market firms are also seeking competitive advantages by exploring the plethora of information from all possible
              sources and by transforming organically to handle the big data. The key technological transformations taking place in
              capital market to handle Big Data are –




    Scale-out storage                        The massive amount of stored and generated data means the scope for scale-up is reduced
    infrastructure that grows                and the focus is more on “scaling out” which means use of shared pools of storage devices
    with data                                that deliver more capacity with same operational cost. Scale-out storage is the key to
                                             managing Big Data.




    Convergence of data                      A unified analytics platform that helps to collaborate between the structured and un-structured
    sources                                  data. The increase in the unstructured data means more data to analyse. The applications that
                                             are used to analyse the Wall Street data (capital markets data) must be capable of collaboration
                                             and correlation. Big Data is just bits (data that makes no sense) until its value is unlocked.




                                             NASDAQ and NYSE Euronext stock exchanges have implemented various data-warehouse
    Innovative applications                  applications like IBM Netezza and EMC Greenplum to store big data. These tools/applications
                                             store the unstructured data got from various sources, handle the data and then process the
                                             data to make more sense out of the unstructured data.




                                             The traditional RDBMS (Relational Database Modelling Systems) cannot handle big data and
    Re-modelling of data                     hence there is a shift towards specialized, non-relational databases like Hadoop. Microsoft
    models to make better,                   SQL, which was the standard for querying data out of RDBMS systems, is now being replaced
    faster and less risky                    by Map Reduce, which is the distributed querying and data processing engine used to
    decisions                                extract data from big datasets hosted on clusters in any typical Hadoop implementation.
                                             The systems which have both, the structured and unstructured data are currently operated
                                             in mutually exclusive mode without interoperability. Vendors are trying to bridge the
                                             connectivity gap between RDBMS and the Hadoop systems



    Use of cloud storage                     The usage of shared pools via cloud storage is increasing the need for virtual infrastructure
                                             products.


2 | Infosys
Following are the key business drivers for using Big Data Technologies
                                                 in Capital Market
Capital Market - Drivers for Big                 Changing regulatory landscape
Data Technologies                                Financial services at large are seeing lots of regulatory make over in last
Data management has been a big challenge         few years. Capital Market firms are facing this new regulation landscape
                                                 and are gearing up for meeting the upcoming regulations (Basel II, EMIR,
for capital markets firms for decades. The
                                                 Dodd Frank, MiFID etc.)
financial industry has collectively spent
billions of dollars over the years trying to     Changing trading strategies
create appropriate datasets. Data sharing        Capital markets have evolved from simple strategies, like 1980s-paired
across institutions continues to be a            models, to the intricate gaming strategies of today. Trading strategies
challenge, as each business unit prefers         have started including unstructured data (an example is Twitter-based
calculating using its own set of data, making    trading strategies). In 2012, analysts expect to see an increased growth
enterprise-wide data analysis tremendously       in the use of unstructured data in trading strategies and alpha discovery
difficult. The sources of unstructured data      (Returns generation from various trades).
from which data needs to be collected,           Increasing data volume
analysed and stored are increasing. For          Market data volumes is growing really big. For years, capital markets
instance, traders who are interested in          firms have used advanced technology to store historical market data
particular companies or industries news,         for their quants. However with the growing data sources(social media,
and advanced trading operations have             Internet, mobility), the need for big data storage will become more
developed tools to analyse news in real time     profound
and hence help them make efficient trading       Increasing complexities of processes
decisions. Now, with news coming in via
                                                 There is an increase in data due to the increase in the complexities of the
additional channels such as audio, video or      underlying business process
even through social media (Twitter), some
firms are trying to come up with new ways
                                                 Stringent timeliness
to analyse all of this data. Firms are also      In this tough economic environment, it is very important to find value in
analysing data from different sources such       the data and be as fast as possible. Real-time execution is been seen to
                                                 take the physical extremes by collocated high-frequency trading.
as documents, data from surveys, “click”
data from websites, web search traffic data      Need for handling speed, agility and control
and more. As terabytes grow into petabytes,      The speed required for retrieving the right data, manipulating it and
traditional relational database structures       using the analysis to gain control of the market dynamics is one of the
are having a difficult time in storing and       biggest business drivers.
handling the data.                               Visibility of data
Technologies play an important role in           Since front and back offices are physically more distributed but more
storage, analytics and processing of Big Data.   closely integrated from a business perspective, there is a need to adapt
In-memory tools can likely be deployed           to changes which allows everyone to see what they need to see.
in four main areas of the business: front        Need for integration of siloes systems
office, credit valuation adjustment, market
                                                 In order to understand the risks an investment bank is accumulating,
risk management and client reporting. The        it is important to connect the data from various disparate systems. The
current economic environment necessitates        investment bank will not have an accurate view of potential risks unless
the business drivers for any investments.        they have a system in place which connects the risks and the analytical
                                                 tools. This in turn means that they have to have a central platform for
                                                 risk calculation.




                                                                                                                  Infosys | 3
Capital Market – Use cases of Big Data Technologies
Capital market firms are using big data (Unstructured data) primarily in five key areas –


                                              Data Storage for Historical Trading, Internal Data Management Challenge and overall control on
 FINANCIAL DATA MANAGEMENT                    reference Data (On-demand data mining to dig into meta-data to deconstruct/reconstruct data
     AND REFERENCE DATA                       models, etc.). It can be very tough in maintaining (storing, handling and processing) the data from
                                              various asset classes coming from various vendors.
        MANAGEMENT




                                              Includes preparation for regulations like Dodd Frank, Solvency II, EMIR, audits etc.
               REGULATION



                                              Includes fraud mitigation, anti-money laundering (AML), Know Your Customer (KYC), rogue trading,
              RISK ANALYTICS                  on-demand enterprise risk management, etc.



                                              Includes Analytics for High Frequency Trading, Predictive Analytics, Pre-trade decision-support
        TRADING ANALYTICS                     analytics, including sentiment measurement and temporal/bi-temporal analytics etc.




                                              In enterprise-level monitoring and reporting, it’s often hard to match and reconcile trades from
                                              various systems built on different symbology standards — usually resulting in invalid, duplicated
              DATA TAGGING                    and missed trades. Data tagging can easily identify trades and events such as corporate actions and
                                              enable regulators detect stress signs early.




Capital market firms are primarily using the following big data technologies –


                                                                                                              Massively parallel
                                 Data grids                         Compute grids
                                                                                                                processors

                           Which use distributed                  Which offer a way of                        Which involves the
                          caching to manage large                parallelizing processes                 coordinated processing of a
                          volumes of data across a               across multiple servers,               programme between multiple
                             network of servers                 handling capacity/failure               independent computers, each
                                                                issues and orchestrating                with its own operating system
                                                                   tasks across the grid                          and memory




           In-memory                                                                            Specialized
                                                       ‘NoSQL’                                                                          Hadoop
            databases                                                                           databases

      Which are databases                       Which are ‘shell’ relational                   Which contains                  This is a tool used to
      that store data in the                database management systems                         the necessary                 query the unstructured
      main memory rather                     that don’t use Structured Query                architecture to store              data which is a major
     than on a disk, as is the              Language, are more simple than                    the unstructured                    part of big data
      case with traditional                 traditional databases and whose                  data. Ex: IBM Viper                      analytics.
            databases                      tables are compatible with a wide                DB2 database, EMC                  Ex: EMC Map Reduce,
                                               range of external platforms                       Greenplum                          IBM Netezza




4 | Infosys
Other Use Cases –

Firms                       Big Data Use Cases                                                           Areas

                            An investment firm with assets of over US$1 trillion and operations in
Investment Bank             approximately 50 countries uses big data technology to deliver Reference     Reference Data Management
                            Data to the Murex trading platform and other downstream systems.


                            An investment firm with assets of over US$1 trillion and operations in
Investment Bank             approximately 50 countries uses big data to manage risk exposure through     Risk Analytics
                            real-time communication across bond, futures and credits trading


                            An investment bank shifted the risk management and P&L towards a real-
US Investment Bank          time environment. Big Data technologies were leveraged to help the firm to   Regulation
                            gather all relevant data into one place

                            An investment bank used Big Data analytics to track performance
European Investment Bank                                                                                 Risk Analytics & Regulation
                            monitoring, risk analytics and reporting

                            An investment bank used Big Data technologies to generate on-demand
Asian Investment Bank       performance metrics for risk measures across multiple global trading         Trading Analytics
                            businesses.


European Investment         An investment manager used Big Data technology to gather relevant details
                                                                                                        Compliance
Manager                     so as to respond as a witness to a litigation action against a prime broker


                            Investment manager used Big Data technology to centralize data and
US Investment Manager       applications to apply governance policies and mitigate risk of damages       Risk Analytics & Regulation
                            from litigation discovery

                            A major global exchange used Big Data technology to provide global
                            market participants with on-demand access to data and data-mining            Trading Analytics & Risk
Global Exchange
                            tools for trading, analytics and risk management in a cloud-based/hosted     Analytics
                            environment

                            A US regulator used Big Data technology to create a searchable library of
US Regulator                research, econometric and other information generated by the regulator’s     Regulation
                            activities


                            A major buy-side firm uses Big Data technologies for market surveillance, an
Buy Side firm                                                                                            Regulation
                            activity requiring processing of vast quantities of market information


                            Fiduciary management – a new area of interest in which asset managers
                                                                                                         Fiduciary Management –
Asset Manager               outsource management of their portfolios to third-party administrators in
                                                                                                         Emerging area
                            order to benefit from economies of scale.


                            An investment bank use big-data techniques to handle and manage
                            petabytes of data for regulatory compliance and advanced analytics. The
Regulatory compliance and
                            bank used technology from Hadoop, an open source framework that              Regulation & Risk Analytics
advanced analytics
                            supports data-intensive distributed computing, which allow data to be
                            crunched over a distributed network of computers.



                                                                                                                             Infosys | 5
Capital Market – Future Big Data Technologies
Buy side firms, sell side firms, stock exchanges and also regulators are all looking at implementing big data technologies to a greater extent.
However as the amount of data is increasing, responding to data demands and handling data complexity will be the two key challenges faced
by capital market institutions in handling Big Data.



   Evolution of “Sense & Response System” to handle responsiveness

    Although capital markets have previously been focused on high velocity market data, it is now moving toward unstructured data due
    to changing trading dynamics. Unstructured data these days primarily consists of daily stock feeds, Twitter chats and blogs, a reality
    that would have been considered science fiction even a few years ago. Converting the unstructured information into machine-readable
    data is the key to gaining an edge in Wall Street trading. This data is then used by algorithmic traders to produce some alpha (returns
    from the market) from the news. These untapped sources can help traders gain a competitive edge in trading. 20-30 years back, a trader
    could capture some Alpha (Returns) in a stock even after a week of a company-specific or industry-specific event having occurred.
    Today, that cushion is gone and stock returns move in milliseconds. A system that examines blogs and Twitter posts is part of a larger,
    smart platform, known as a “sense and response system”. Wall Street has to look forward to make Sense & Response System a part of its
    day-to-day processing. Below figure depicts an example of how an Investment Strategy can be determined using big data technology,
    Sense & Response System (Figure 1).




                                  Happy Twitter Tweets

                                     Positive/
                                      Calm                                                       News Feeds
                                      Mood
                                                         Buy Stocks

                     Sense and
                Response System
                                                             3 days Later
                                                                                                    Blogs          Algorithmic       Trading
                                                                                                                     Engine         Decisions




                             Sad/Unhappy Twitter Tweets                                          Twitter Posts


                                    Negative/
                                     Anxious
                                                         Sell Stocks

                     Sense and                                                                    Stock Data
                Response System
                                                             3 days Later




    Use of the output from sense and response system as a parameter to the Algorithmic trading black box not only gives capital market firms
    a competitive advantage but also helps in timely trading decisions for better alpha.




6 | Infosys
Evolution of federated database approach for handling complexity of data

Complexity of storage: shared storage and parallel processing are two key aspects to handle complex data especially when data is in
structured and unstructured format. The structured data is stored in Oracle Real Application Cluster (RAC) node 1 and the unstructured
data in RAC node 2. The RAC is a key to providing shared storage that is used in cloud computing. Taking into consideration the concept
of parallel processing, databases like Teradata and Greenplum have multiple buckets for storing data, which helps in accessing the
structured and unstructured data in parallel and thereby improving access response times. Using the above proposed mix of shared
storage and parallel processing, complex big data can be handled proactively.



Evolution of in memory analytics

The increasing amount of data being made available to investment banks, coupled with the need to enter the capital markets as early
as possible and desire to conduct real-time analytics to reduce latency is driving the trend towards in-memory analytics. In-memory
software can improve the speed of data analytics by a factor of 14,000. Hence a request that previously took 5 hours can now be
accomplished in just one second. The increase in speed is gained by putting data analytics into a CPU, into ‘memory’, as opposed to
being carried out on disk. Queries sent to on-disk databases take longer because mechanical interaction take place, which is removed
when query is, conducted in-memory.




                         Conclusion
                         Big Data technology implementation comes with its own cost. In spite of this, capital market firms are seen
                         heavily investing in these technologies not only to stay competitive but also to ensure investors earn optimal
                         ROI from their investments. The 3 V’s of data (variety, volume & velocity) play an extremely important role in
                         capital market. Firms, who are still thinking of investing in Big Data technologies, should gear up soon before
                         it becomes too late to remain competitive.




                         References
                         •	 A Team Group Survey – Big Data Solutions in Capital Market
                         •	www.bigdata.com/blog/
                         • 	 www.emc.com/microsites/bigdata/index.html




                                                                                                                               Infosys | 7
About Infosys
Infosys partners with global enterprises to drive their innovation-led growth.
That's why Forbes ranked Infosys 19 among the top 100 most innovative
companies. As a leading provider of next-generation consulting, technology
and outsourcing solutions, Infosys helps clients in more than 30 countries
realize their goals. Visit www.infosys.com and see how Infosys (NASDAQ: INFY),
with its 150,000+ people, is Building Tomorrow's Enterprise® today.

For more information, contact askus@infosys.com                                                                                                                                       www.infosys.com
© 2012 Infosys Limited, Bangalore, India. Infosys believes the information in this publication is accurate as of its publication date; such information is subject to change without notice. Infosys acknowledges
the proprietary rights of the trademarks and product names of other companies mentioned in this document.

Más contenido relacionado

La actualidad más candente

Business intelligence ppt
Business intelligence pptBusiness intelligence ppt
Business intelligence pptsujithkylm007
 
DTCC Securities - Life Cycle " [LcoaS.pdf]
DTCC Securities - Life Cycle " [LcoaS.pdf]DTCC Securities - Life Cycle " [LcoaS.pdf]
DTCC Securities - Life Cycle " [LcoaS.pdf]Ashishkumar Lakshmanan
 
Introduction to Business Intelligence
Introduction to Business IntelligenceIntroduction to Business Intelligence
Introduction to Business IntelligenceAlmog Ramrajkar
 
Exploring Blockchain Technology, Risks, and Emerging Trends
Exploring Blockchain Technology, Risks, and Emerging TrendsExploring Blockchain Technology, Risks, and Emerging Trends
Exploring Blockchain Technology, Risks, and Emerging TrendsAmazon Web Services
 
RapidMiner: Introduction To Rapid Miner
RapidMiner: Introduction To Rapid MinerRapidMiner: Introduction To Rapid Miner
RapidMiner: Introduction To Rapid MinerDataminingTools Inc
 
Crypto Wallet Types Explained
Crypto Wallet Types ExplainedCrypto Wallet Types Explained
Crypto Wallet Types Explained101 Blockchains
 
Applications of Big Data Analytics in Businesses
Applications of Big Data Analytics in BusinessesApplications of Big Data Analytics in Businesses
Applications of Big Data Analytics in BusinessesT.S. Lim
 
Overview of the Investment Banking Industry
Overview of the Investment Banking IndustryOverview of the Investment Banking Industry
Overview of the Investment Banking IndustryInvestment Banking Jobs
 
Blockchain Essentials and Blockchain on Azure
Blockchain Essentials and Blockchain on AzureBlockchain Essentials and Blockchain on Azure
Blockchain Essentials and Blockchain on AzureNuri Cankaya
 
Algo trading with machine learning ppt
Algo trading with machine learning pptAlgo trading with machine learning ppt
Algo trading with machine learning pptDeb prakash ganguly
 
"Build Effective Risk Management on Top of Your Trading Strategy" by Danielle...
"Build Effective Risk Management on Top of Your Trading Strategy" by Danielle..."Build Effective Risk Management on Top of Your Trading Strategy" by Danielle...
"Build Effective Risk Management on Top of Your Trading Strategy" by Danielle...Quantopian
 
Data analytics introduction
Data analytics introductionData analytics introduction
Data analytics introductionamiyadash
 
Advantages and disadvantages of permissionless blockchain
Advantages and disadvantages of permissionless blockchainAdvantages and disadvantages of permissionless blockchain
Advantages and disadvantages of permissionless blockchainBlockchain Council
 

La actualidad más candente (20)

Algorithmic Trading
Algorithmic TradingAlgorithmic Trading
Algorithmic Trading
 
Business intelligence ppt
Business intelligence pptBusiness intelligence ppt
Business intelligence ppt
 
DTCC Securities - Life Cycle " [LcoaS.pdf]
DTCC Securities - Life Cycle " [LcoaS.pdf]DTCC Securities - Life Cycle " [LcoaS.pdf]
DTCC Securities - Life Cycle " [LcoaS.pdf]
 
BLOCKCHAIN TECHNOLOGY
BLOCKCHAIN TECHNOLOGYBLOCKCHAIN TECHNOLOGY
BLOCKCHAIN TECHNOLOGY
 
Introduction to Business Intelligence
Introduction to Business IntelligenceIntroduction to Business Intelligence
Introduction to Business Intelligence
 
Exploring Blockchain Technology, Risks, and Emerging Trends
Exploring Blockchain Technology, Risks, and Emerging TrendsExploring Blockchain Technology, Risks, and Emerging Trends
Exploring Blockchain Technology, Risks, and Emerging Trends
 
RapidMiner: Introduction To Rapid Miner
RapidMiner: Introduction To Rapid MinerRapidMiner: Introduction To Rapid Miner
RapidMiner: Introduction To Rapid Miner
 
Data analytics
Data analyticsData analytics
Data analytics
 
Crypto Wallet Types Explained
Crypto Wallet Types ExplainedCrypto Wallet Types Explained
Crypto Wallet Types Explained
 
Stock exchange
Stock exchangeStock exchange
Stock exchange
 
Applications of Big Data Analytics in Businesses
Applications of Big Data Analytics in BusinessesApplications of Big Data Analytics in Businesses
Applications of Big Data Analytics in Businesses
 
Overview of the Investment Banking Industry
Overview of the Investment Banking IndustryOverview of the Investment Banking Industry
Overview of the Investment Banking Industry
 
Blockchain Technology
Blockchain TechnologyBlockchain Technology
Blockchain Technology
 
Blockchain Essentials and Blockchain on Azure
Blockchain Essentials and Blockchain on AzureBlockchain Essentials and Blockchain on Azure
Blockchain Essentials and Blockchain on Azure
 
Data Warehouse
Data Warehouse Data Warehouse
Data Warehouse
 
Algo trading with machine learning ppt
Algo trading with machine learning pptAlgo trading with machine learning ppt
Algo trading with machine learning ppt
 
"Build Effective Risk Management on Top of Your Trading Strategy" by Danielle...
"Build Effective Risk Management on Top of Your Trading Strategy" by Danielle..."Build Effective Risk Management on Top of Your Trading Strategy" by Danielle...
"Build Effective Risk Management on Top of Your Trading Strategy" by Danielle...
 
Data analytics introduction
Data analytics introductionData analytics introduction
Data analytics introduction
 
Cryptocurrency
CryptocurrencyCryptocurrency
Cryptocurrency
 
Advantages and disadvantages of permissionless blockchain
Advantages and disadvantages of permissionless blockchainAdvantages and disadvantages of permissionless blockchain
Advantages and disadvantages of permissionless blockchain
 

Destacado

Big Data Expo 2015 - Data Science Innovation Privacy Considerations
Big Data Expo 2015 - Data Science Innovation Privacy ConsiderationsBig Data Expo 2015 - Data Science Innovation Privacy Considerations
Big Data Expo 2015 - Data Science Innovation Privacy ConsiderationsBigDataExpo
 
Data Science Connect, July 22nd 2014 @IBM Innovation Center Zurich
Data Science Connect, July 22nd 2014 @IBM Innovation Center ZurichData Science Connect, July 22nd 2014 @IBM Innovation Center Zurich
Data Science Connect, July 22nd 2014 @IBM Innovation Center ZurichRomeo Kienzler
 
The Complete Guide to Capital Markets for Quantitative Professionals - Summary
The Complete Guide to Capital Markets for Quantitative Professionals - SummaryThe Complete Guide to Capital Markets for Quantitative Professionals - Summary
The Complete Guide to Capital Markets for Quantitative Professionals - Summaryradhap
 
Data Science at Atlassian: 
The transition towards a data-driven organisation
Data Science at Atlassian: 
The transition towards a data-driven organisationData Science at Atlassian: 
The transition towards a data-driven organisation
Data Science at Atlassian: 
The transition towards a data-driven organisationIlias Flaounas
 
UNICEF Innovation: Innovation Lab Do-It-Yourself Guide
UNICEF Innovation: Innovation Lab Do-It-Yourself GuideUNICEF Innovation: Innovation Lab Do-It-Yourself Guide
UNICEF Innovation: Innovation Lab Do-It-Yourself GuideChristopher Fabian
 
The Epidemiology of Innovation
The Epidemiology of InnovationThe Epidemiology of Innovation
The Epidemiology of InnovationTim Stock
 
Big Data and Data Science @ BNL - D. Morgagni & L. Dell'Anna
Big Data and Data Science @ BNL - D. Morgagni & L. Dell'AnnaBig Data and Data Science @ BNL - D. Morgagni & L. Dell'Anna
Big Data and Data Science @ BNL - D. Morgagni & L. Dell'AnnaData Driven Innovation
 
Open Innovation
Open Innovation Open Innovation
Open Innovation Alar Kolk
 
Understand Innovation in 5 Minutes
Understand Innovation in 5 MinutesUnderstand Innovation in 5 Minutes
Understand Innovation in 5 MinutesGordon Graham
 
Booz Allen Field Guide to Data Science
Booz Allen Field Guide to Data Science Booz Allen Field Guide to Data Science
Booz Allen Field Guide to Data Science Booz Allen Hamilton
 

Destacado (16)

Big Data Expo 2015 - Data Science Innovation Privacy Considerations
Big Data Expo 2015 - Data Science Innovation Privacy ConsiderationsBig Data Expo 2015 - Data Science Innovation Privacy Considerations
Big Data Expo 2015 - Data Science Innovation Privacy Considerations
 
Data Science Connect, July 22nd 2014 @IBM Innovation Center Zurich
Data Science Connect, July 22nd 2014 @IBM Innovation Center ZurichData Science Connect, July 22nd 2014 @IBM Innovation Center Zurich
Data Science Connect, July 22nd 2014 @IBM Innovation Center Zurich
 
Data juice
Data juiceData juice
Data juice
 
The DATALAB - building a world-class innovation centre in data science
The DATALAB - building a world-class innovation centre in data scienceThe DATALAB - building a world-class innovation centre in data science
The DATALAB - building a world-class innovation centre in data science
 
Big data Summit
Big data SummitBig data Summit
Big data Summit
 
The Complete Guide to Capital Markets for Quantitative Professionals - Summary
The Complete Guide to Capital Markets for Quantitative Professionals - SummaryThe Complete Guide to Capital Markets for Quantitative Professionals - Summary
The Complete Guide to Capital Markets for Quantitative Professionals - Summary
 
Data Science at Atlassian: 
The transition towards a data-driven organisation
Data Science at Atlassian: 
The transition towards a data-driven organisationData Science at Atlassian: 
The transition towards a data-driven organisation
Data Science at Atlassian: 
The transition towards a data-driven organisation
 
UNICEF Innovation: Innovation Lab Do-It-Yourself Guide
UNICEF Innovation: Innovation Lab Do-It-Yourself GuideUNICEF Innovation: Innovation Lab Do-It-Yourself Guide
UNICEF Innovation: Innovation Lab Do-It-Yourself Guide
 
The Epidemiology of Innovation
The Epidemiology of InnovationThe Epidemiology of Innovation
The Epidemiology of Innovation
 
Big Data and Data Science @ BNL - D. Morgagni & L. Dell'Anna
Big Data and Data Science @ BNL - D. Morgagni & L. Dell'AnnaBig Data and Data Science @ BNL - D. Morgagni & L. Dell'Anna
Big Data and Data Science @ BNL - D. Morgagni & L. Dell'Anna
 
Open Innovation
Open Innovation Open Innovation
Open Innovation
 
Innovation can be Trained
Innovation can be TrainedInnovation can be Trained
Innovation can be Trained
 
Understand Innovation in 5 Minutes
Understand Innovation in 5 MinutesUnderstand Innovation in 5 Minutes
Understand Innovation in 5 Minutes
 
Innovation Strategy
Innovation StrategyInnovation Strategy
Innovation Strategy
 
Booz Allen Field Guide to Data Science
Booz Allen Field Guide to Data Science Booz Allen Field Guide to Data Science
Booz Allen Field Guide to Data Science
 
What is Big Data?
What is Big Data?What is Big Data?
What is Big Data?
 

Similar a Use of big data technologies in capital markets

Big Data: Beyond the Hype - Why Big Data Matters to You
Big Data: Beyond the Hype - Why Big Data Matters to YouBig Data: Beyond the Hype - Why Big Data Matters to You
Big Data: Beyond the Hype - Why Big Data Matters to YouDATAVERSITY
 
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANC...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS  IN KNOWLEDGE MANAGEMENT FOR ENHANC...LEVERAGING CLOUD BASED BIG DATA ANALYTICS  IN KNOWLEDGE MANAGEMENT FOR ENHANC...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANC...ijdpsjournal
 
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...ijdpsjournal
 
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...ijdpsjournal
 
Idc big data whitepaper_final
Idc big data whitepaper_finalIdc big data whitepaper_final
Idc big data whitepaper_finalOsman Circi
 
BIG DATA & DATA ANALYTICS
BIG  DATA & DATA  ANALYTICSBIG  DATA & DATA  ANALYTICS
BIG DATA & DATA ANALYTICSNAGARAJAGIDDE
 
Ab cs of big data
Ab cs of big dataAb cs of big data
Ab cs of big dataDigimark
 
Impact of big data on DCMI market
Impact of big data on DCMI marketImpact of big data on DCMI market
Impact of big data on DCMI marketMohsin Baig
 
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...IT Support Engineer
 
Big data document (basic concepts,3vs,Bigdata vs Smalldata,importance,storage...
Big data document (basic concepts,3vs,Bigdata vs Smalldata,importance,storage...Big data document (basic concepts,3vs,Bigdata vs Smalldata,importance,storage...
Big data document (basic concepts,3vs,Bigdata vs Smalldata,importance,storage...Taniya Fansupkar
 
Big data seminor
Big data seminorBig data seminor
Big data seminorberasrujana
 
Big Data Analytics Research Report
Big Data Analytics Research ReportBig Data Analytics Research Report
Big Data Analytics Research ReportIla Group
 
Getting down to business on Big Data analytics
Getting down to business on Big Data analyticsGetting down to business on Big Data analytics
Getting down to business on Big Data analyticsThe Marketing Distillery
 

Similar a Use of big data technologies in capital markets (20)

Big Data: Beyond the Hype - Why Big Data Matters to You
Big Data: Beyond the Hype - Why Big Data Matters to YouBig Data: Beyond the Hype - Why Big Data Matters to You
Big Data: Beyond the Hype - Why Big Data Matters to You
 
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANC...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS  IN KNOWLEDGE MANAGEMENT FOR ENHANC...LEVERAGING CLOUD BASED BIG DATA ANALYTICS  IN KNOWLEDGE MANAGEMENT FOR ENHANC...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANC...
 
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
 
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
 
Idc big data whitepaper_final
Idc big data whitepaper_finalIdc big data whitepaper_final
Idc big data whitepaper_final
 
Big data Analytics
Big data Analytics Big data Analytics
Big data Analytics
 
1
11
1
 
The ABCs of Big Data
The ABCs of Big DataThe ABCs of Big Data
The ABCs of Big Data
 
BIG DATA & DATA ANALYTICS
BIG  DATA & DATA  ANALYTICSBIG  DATA & DATA  ANALYTICS
BIG DATA & DATA ANALYTICS
 
Using Big Data Smarter Decision Making
Using Big Data Smarter Decision MakingUsing Big Data Smarter Decision Making
Using Big Data Smarter Decision Making
 
Ab cs of big data
Ab cs of big dataAb cs of big data
Ab cs of big data
 
Unit-1 introduction to Big data.pdf
Unit-1 introduction to Big data.pdfUnit-1 introduction to Big data.pdf
Unit-1 introduction to Big data.pdf
 
Unit-1 introduction to Big data.pdf
Unit-1 introduction to Big data.pdfUnit-1 introduction to Big data.pdf
Unit-1 introduction to Big data.pdf
 
Impact of big data on DCMI market
Impact of big data on DCMI marketImpact of big data on DCMI market
Impact of big data on DCMI market
 
Unit III.pdf
Unit III.pdfUnit III.pdf
Unit III.pdf
 
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...
 
Big data document (basic concepts,3vs,Bigdata vs Smalldata,importance,storage...
Big data document (basic concepts,3vs,Bigdata vs Smalldata,importance,storage...Big data document (basic concepts,3vs,Bigdata vs Smalldata,importance,storage...
Big data document (basic concepts,3vs,Bigdata vs Smalldata,importance,storage...
 
Big data seminor
Big data seminorBig data seminor
Big data seminor
 
Big Data Analytics Research Report
Big Data Analytics Research ReportBig Data Analytics Research Report
Big Data Analytics Research Report
 
Getting down to business on Big Data analytics
Getting down to business on Big Data analyticsGetting down to business on Big Data analytics
Getting down to business on Big Data analytics
 

Más de Infosys

Demystifying Machine Learning for Manufacturing: Data Science for all
Demystifying Machine Learning for Manufacturing: Data Science for allDemystifying Machine Learning for Manufacturing: Data Science for all
Demystifying Machine Learning for Manufacturing: Data Science for allInfosys
 
Digital Outlook: Healthcare Industry
Digital Outlook: Healthcare IndustryDigital Outlook: Healthcare Industry
Digital Outlook: Healthcare IndustryInfosys
 
5 tips to make your mainframe as fit as you
5 tips to make your mainframe as fit as you5 tips to make your mainframe as fit as you
5 tips to make your mainframe as fit as youInfosys
 
Mainframe modernization powered by AI
Mainframe modernization powered by AIMainframe modernization powered by AI
Mainframe modernization powered by AIInfosys
 
Human Amplification In The Enterprise - Resources and Utilities
Human Amplification In The Enterprise - Resources and UtilitiesHuman Amplification In The Enterprise - Resources and Utilities
Human Amplification In The Enterprise - Resources and UtilitiesInfosys
 
Human Amplification In The Enterprise - Telecom and Communication
Human Amplification In The Enterprise - Telecom and CommunicationHuman Amplification In The Enterprise - Telecom and Communication
Human Amplification In The Enterprise - Telecom and CommunicationInfosys
 
Human Amplification In The Enterprise - Retail and CPG
Human Amplification In The Enterprise - Retail and CPGHuman Amplification In The Enterprise - Retail and CPG
Human Amplification In The Enterprise - Retail and CPGInfosys
 
Human Amplification In The Enterprise - Manufacturing and High-tech
Human Amplification In The Enterprise - Manufacturing and High-techHuman Amplification In The Enterprise - Manufacturing and High-tech
Human Amplification In The Enterprise - Manufacturing and High-techInfosys
 
Human amplification in the enterprise - Automation. Innovation. Learning.
Human amplification in the enterprise - Automation. Innovation. Learning.Human amplification in the enterprise - Automation. Innovation. Learning.
Human amplification in the enterprise - Automation. Innovation. Learning.Infosys
 
Human Amplification In The Enterprise - Healthcare and Life Sciences
Human Amplification In The Enterprise - Healthcare and Life SciencesHuman Amplification In The Enterprise - Healthcare and Life Sciences
Human Amplification In The Enterprise - Healthcare and Life SciencesInfosys
 
Human Amplification In The Enterprise - Banking and Insurance
Human Amplification In The Enterprise - Banking and InsuranceHuman Amplification In The Enterprise - Banking and Insurance
Human Amplification In The Enterprise - Banking and InsuranceInfosys
 
Mainframe modernization powered by AI
Mainframe modernization powered by AIMainframe modernization powered by AI
Mainframe modernization powered by AIInfosys
 
Reimagining the future of IT Infrastructure
Reimagining the future of IT InfrastructureReimagining the future of IT Infrastructure
Reimagining the future of IT InfrastructureInfosys
 
Infosys Amplifying Human Potential
Infosys Amplifying Human PotentialInfosys Amplifying Human Potential
Infosys Amplifying Human PotentialInfosys
 
Snapshots from Infosys Confluence 2016
Snapshots from Infosys Confluence 2016Snapshots from Infosys Confluence 2016
Snapshots from Infosys Confluence 2016Infosys
 
Be Digital. Be More.
Be Digital. Be More.Be Digital. Be More.
Be Digital. Be More.Infosys
 
Being Digital
Being DigitalBeing Digital
Being DigitalInfosys
 
Disruptive forces in digital payments
Disruptive forces in digital paymentsDisruptive forces in digital payments
Disruptive forces in digital paymentsInfosys
 
Infosys 'Go Green' Initiative
Infosys 'Go Green' InitiativeInfosys 'Go Green' Initiative
Infosys 'Go Green' InitiativeInfosys
 
Serving the perfect Information Cocktail
Serving the perfect Information CocktailServing the perfect Information Cocktail
Serving the perfect Information CocktailInfosys
 

Más de Infosys (20)

Demystifying Machine Learning for Manufacturing: Data Science for all
Demystifying Machine Learning for Manufacturing: Data Science for allDemystifying Machine Learning for Manufacturing: Data Science for all
Demystifying Machine Learning for Manufacturing: Data Science for all
 
Digital Outlook: Healthcare Industry
Digital Outlook: Healthcare IndustryDigital Outlook: Healthcare Industry
Digital Outlook: Healthcare Industry
 
5 tips to make your mainframe as fit as you
5 tips to make your mainframe as fit as you5 tips to make your mainframe as fit as you
5 tips to make your mainframe as fit as you
 
Mainframe modernization powered by AI
Mainframe modernization powered by AIMainframe modernization powered by AI
Mainframe modernization powered by AI
 
Human Amplification In The Enterprise - Resources and Utilities
Human Amplification In The Enterprise - Resources and UtilitiesHuman Amplification In The Enterprise - Resources and Utilities
Human Amplification In The Enterprise - Resources and Utilities
 
Human Amplification In The Enterprise - Telecom and Communication
Human Amplification In The Enterprise - Telecom and CommunicationHuman Amplification In The Enterprise - Telecom and Communication
Human Amplification In The Enterprise - Telecom and Communication
 
Human Amplification In The Enterprise - Retail and CPG
Human Amplification In The Enterprise - Retail and CPGHuman Amplification In The Enterprise - Retail and CPG
Human Amplification In The Enterprise - Retail and CPG
 
Human Amplification In The Enterprise - Manufacturing and High-tech
Human Amplification In The Enterprise - Manufacturing and High-techHuman Amplification In The Enterprise - Manufacturing and High-tech
Human Amplification In The Enterprise - Manufacturing and High-tech
 
Human amplification in the enterprise - Automation. Innovation. Learning.
Human amplification in the enterprise - Automation. Innovation. Learning.Human amplification in the enterprise - Automation. Innovation. Learning.
Human amplification in the enterprise - Automation. Innovation. Learning.
 
Human Amplification In The Enterprise - Healthcare and Life Sciences
Human Amplification In The Enterprise - Healthcare and Life SciencesHuman Amplification In The Enterprise - Healthcare and Life Sciences
Human Amplification In The Enterprise - Healthcare and Life Sciences
 
Human Amplification In The Enterprise - Banking and Insurance
Human Amplification In The Enterprise - Banking and InsuranceHuman Amplification In The Enterprise - Banking and Insurance
Human Amplification In The Enterprise - Banking and Insurance
 
Mainframe modernization powered by AI
Mainframe modernization powered by AIMainframe modernization powered by AI
Mainframe modernization powered by AI
 
Reimagining the future of IT Infrastructure
Reimagining the future of IT InfrastructureReimagining the future of IT Infrastructure
Reimagining the future of IT Infrastructure
 
Infosys Amplifying Human Potential
Infosys Amplifying Human PotentialInfosys Amplifying Human Potential
Infosys Amplifying Human Potential
 
Snapshots from Infosys Confluence 2016
Snapshots from Infosys Confluence 2016Snapshots from Infosys Confluence 2016
Snapshots from Infosys Confluence 2016
 
Be Digital. Be More.
Be Digital. Be More.Be Digital. Be More.
Be Digital. Be More.
 
Being Digital
Being DigitalBeing Digital
Being Digital
 
Disruptive forces in digital payments
Disruptive forces in digital paymentsDisruptive forces in digital payments
Disruptive forces in digital payments
 
Infosys 'Go Green' Initiative
Infosys 'Go Green' InitiativeInfosys 'Go Green' Initiative
Infosys 'Go Green' Initiative
 
Serving the perfect Information Cocktail
Serving the perfect Information CocktailServing the perfect Information Cocktail
Serving the perfect Information Cocktail
 

Use of big data technologies in capital markets

  • 1. View Point Use of Big Data Technologies in Capital Markets - Ruchi Verma, Sathyan R Mani Abstract Data is growing at a tremendous rate with an increase in digital universe from 281 Exabyte’s (year 2007) to 1,800 Exabyte’s (year 2011). However the increase in data is, in itself, a minor problem, but the increase in percentage of unstructured data in the overall data volume is what is concerning all, including Wall Street. Capital market firms have been transforming organically to handle the big data over years. This article discusses the key transformations that capital market firms are undergoing to handle big data, drivers for use of big data technology in capital markets and relevant uses cases. The article further outlines the future big data technologies to handle big data complexity and responsiveness – a sense & response system, a federated database and in memory analytics. www.infosys.com
  • 2. Data is growing at a tremendous rate with an overall increase in digital universe with petabytes of data flowing from OVERVIEW messaging, emails, open source surveys, blogs, Twitters, Facebook and so on. According to studies conducted by top market research firms, the big data market is expected to grow from US$3.2 billion in 2010 to US$16.9 billion in 2015. The next big challenge for capital markets will be the velocity of data (value of extremely large and diverse data sets), considering the frequency of data generation and delivery in financial services industries like investment banking (highest stored data per firm), and the need to be still analysed in near real-time. Responding to these trends, capital market firms are setting aside a bigger pie of IT spend on big data technologies. According to a report by a major analyst firm, using big data analytics in the field of customer intelligence in the investment banking sector in UK has reaped benefits of £554 million and it is expected to go up to £5,275 million by the end of 2017. Investment banks in UK are spending 4% of their gross operating surplus on big data R&D. Capital market firms are also seeking competitive advantages by exploring the plethora of information from all possible sources and by transforming organically to handle the big data. The key technological transformations taking place in capital market to handle Big Data are – Scale-out storage The massive amount of stored and generated data means the scope for scale-up is reduced infrastructure that grows and the focus is more on “scaling out” which means use of shared pools of storage devices with data that deliver more capacity with same operational cost. Scale-out storage is the key to managing Big Data. Convergence of data A unified analytics platform that helps to collaborate between the structured and un-structured sources data. The increase in the unstructured data means more data to analyse. The applications that are used to analyse the Wall Street data (capital markets data) must be capable of collaboration and correlation. Big Data is just bits (data that makes no sense) until its value is unlocked. NASDAQ and NYSE Euronext stock exchanges have implemented various data-warehouse Innovative applications applications like IBM Netezza and EMC Greenplum to store big data. These tools/applications store the unstructured data got from various sources, handle the data and then process the data to make more sense out of the unstructured data. The traditional RDBMS (Relational Database Modelling Systems) cannot handle big data and Re-modelling of data hence there is a shift towards specialized, non-relational databases like Hadoop. Microsoft models to make better, SQL, which was the standard for querying data out of RDBMS systems, is now being replaced faster and less risky by Map Reduce, which is the distributed querying and data processing engine used to decisions extract data from big datasets hosted on clusters in any typical Hadoop implementation. The systems which have both, the structured and unstructured data are currently operated in mutually exclusive mode without interoperability. Vendors are trying to bridge the connectivity gap between RDBMS and the Hadoop systems Use of cloud storage The usage of shared pools via cloud storage is increasing the need for virtual infrastructure products. 2 | Infosys
  • 3. Following are the key business drivers for using Big Data Technologies in Capital Market Capital Market - Drivers for Big Changing regulatory landscape Data Technologies Financial services at large are seeing lots of regulatory make over in last Data management has been a big challenge few years. Capital Market firms are facing this new regulation landscape and are gearing up for meeting the upcoming regulations (Basel II, EMIR, for capital markets firms for decades. The Dodd Frank, MiFID etc.) financial industry has collectively spent billions of dollars over the years trying to Changing trading strategies create appropriate datasets. Data sharing Capital markets have evolved from simple strategies, like 1980s-paired across institutions continues to be a models, to the intricate gaming strategies of today. Trading strategies challenge, as each business unit prefers have started including unstructured data (an example is Twitter-based calculating using its own set of data, making trading strategies). In 2012, analysts expect to see an increased growth enterprise-wide data analysis tremendously in the use of unstructured data in trading strategies and alpha discovery difficult. The sources of unstructured data (Returns generation from various trades). from which data needs to be collected, Increasing data volume analysed and stored are increasing. For Market data volumes is growing really big. For years, capital markets instance, traders who are interested in firms have used advanced technology to store historical market data particular companies or industries news, for their quants. However with the growing data sources(social media, and advanced trading operations have Internet, mobility), the need for big data storage will become more developed tools to analyse news in real time profound and hence help them make efficient trading Increasing complexities of processes decisions. Now, with news coming in via There is an increase in data due to the increase in the complexities of the additional channels such as audio, video or underlying business process even through social media (Twitter), some firms are trying to come up with new ways Stringent timeliness to analyse all of this data. Firms are also In this tough economic environment, it is very important to find value in analysing data from different sources such the data and be as fast as possible. Real-time execution is been seen to take the physical extremes by collocated high-frequency trading. as documents, data from surveys, “click” data from websites, web search traffic data Need for handling speed, agility and control and more. As terabytes grow into petabytes, The speed required for retrieving the right data, manipulating it and traditional relational database structures using the analysis to gain control of the market dynamics is one of the are having a difficult time in storing and biggest business drivers. handling the data. Visibility of data Technologies play an important role in Since front and back offices are physically more distributed but more storage, analytics and processing of Big Data. closely integrated from a business perspective, there is a need to adapt In-memory tools can likely be deployed to changes which allows everyone to see what they need to see. in four main areas of the business: front Need for integration of siloes systems office, credit valuation adjustment, market In order to understand the risks an investment bank is accumulating, risk management and client reporting. The it is important to connect the data from various disparate systems. The current economic environment necessitates investment bank will not have an accurate view of potential risks unless the business drivers for any investments. they have a system in place which connects the risks and the analytical tools. This in turn means that they have to have a central platform for risk calculation. Infosys | 3
  • 4. Capital Market – Use cases of Big Data Technologies Capital market firms are using big data (Unstructured data) primarily in five key areas – Data Storage for Historical Trading, Internal Data Management Challenge and overall control on FINANCIAL DATA MANAGEMENT reference Data (On-demand data mining to dig into meta-data to deconstruct/reconstruct data AND REFERENCE DATA models, etc.). It can be very tough in maintaining (storing, handling and processing) the data from various asset classes coming from various vendors. MANAGEMENT Includes preparation for regulations like Dodd Frank, Solvency II, EMIR, audits etc. REGULATION Includes fraud mitigation, anti-money laundering (AML), Know Your Customer (KYC), rogue trading, RISK ANALYTICS on-demand enterprise risk management, etc. Includes Analytics for High Frequency Trading, Predictive Analytics, Pre-trade decision-support TRADING ANALYTICS analytics, including sentiment measurement and temporal/bi-temporal analytics etc. In enterprise-level monitoring and reporting, it’s often hard to match and reconcile trades from various systems built on different symbology standards — usually resulting in invalid, duplicated DATA TAGGING and missed trades. Data tagging can easily identify trades and events such as corporate actions and enable regulators detect stress signs early. Capital market firms are primarily using the following big data technologies – Massively parallel Data grids Compute grids processors Which use distributed Which offer a way of Which involves the caching to manage large parallelizing processes coordinated processing of a volumes of data across a across multiple servers, programme between multiple network of servers handling capacity/failure independent computers, each issues and orchestrating with its own operating system tasks across the grid and memory In-memory Specialized ‘NoSQL’ Hadoop databases databases Which are databases Which are ‘shell’ relational Which contains This is a tool used to that store data in the database management systems the necessary query the unstructured main memory rather that don’t use Structured Query architecture to store data which is a major than on a disk, as is the Language, are more simple than the unstructured part of big data case with traditional traditional databases and whose data. Ex: IBM Viper analytics. databases tables are compatible with a wide DB2 database, EMC Ex: EMC Map Reduce, range of external platforms Greenplum IBM Netezza 4 | Infosys
  • 5. Other Use Cases – Firms Big Data Use Cases Areas An investment firm with assets of over US$1 trillion and operations in Investment Bank approximately 50 countries uses big data technology to deliver Reference Reference Data Management Data to the Murex trading platform and other downstream systems. An investment firm with assets of over US$1 trillion and operations in Investment Bank approximately 50 countries uses big data to manage risk exposure through Risk Analytics real-time communication across bond, futures and credits trading An investment bank shifted the risk management and P&L towards a real- US Investment Bank time environment. Big Data technologies were leveraged to help the firm to Regulation gather all relevant data into one place An investment bank used Big Data analytics to track performance European Investment Bank Risk Analytics & Regulation monitoring, risk analytics and reporting An investment bank used Big Data technologies to generate on-demand Asian Investment Bank performance metrics for risk measures across multiple global trading Trading Analytics businesses. European Investment An investment manager used Big Data technology to gather relevant details Compliance Manager so as to respond as a witness to a litigation action against a prime broker Investment manager used Big Data technology to centralize data and US Investment Manager applications to apply governance policies and mitigate risk of damages Risk Analytics & Regulation from litigation discovery A major global exchange used Big Data technology to provide global market participants with on-demand access to data and data-mining Trading Analytics & Risk Global Exchange tools for trading, analytics and risk management in a cloud-based/hosted Analytics environment A US regulator used Big Data technology to create a searchable library of US Regulator research, econometric and other information generated by the regulator’s Regulation activities A major buy-side firm uses Big Data technologies for market surveillance, an Buy Side firm Regulation activity requiring processing of vast quantities of market information Fiduciary management – a new area of interest in which asset managers Fiduciary Management – Asset Manager outsource management of their portfolios to third-party administrators in Emerging area order to benefit from economies of scale. An investment bank use big-data techniques to handle and manage petabytes of data for regulatory compliance and advanced analytics. The Regulatory compliance and bank used technology from Hadoop, an open source framework that Regulation & Risk Analytics advanced analytics supports data-intensive distributed computing, which allow data to be crunched over a distributed network of computers. Infosys | 5
  • 6. Capital Market – Future Big Data Technologies Buy side firms, sell side firms, stock exchanges and also regulators are all looking at implementing big data technologies to a greater extent. However as the amount of data is increasing, responding to data demands and handling data complexity will be the two key challenges faced by capital market institutions in handling Big Data. Evolution of “Sense & Response System” to handle responsiveness Although capital markets have previously been focused on high velocity market data, it is now moving toward unstructured data due to changing trading dynamics. Unstructured data these days primarily consists of daily stock feeds, Twitter chats and blogs, a reality that would have been considered science fiction even a few years ago. Converting the unstructured information into machine-readable data is the key to gaining an edge in Wall Street trading. This data is then used by algorithmic traders to produce some alpha (returns from the market) from the news. These untapped sources can help traders gain a competitive edge in trading. 20-30 years back, a trader could capture some Alpha (Returns) in a stock even after a week of a company-specific or industry-specific event having occurred. Today, that cushion is gone and stock returns move in milliseconds. A system that examines blogs and Twitter posts is part of a larger, smart platform, known as a “sense and response system”. Wall Street has to look forward to make Sense & Response System a part of its day-to-day processing. Below figure depicts an example of how an Investment Strategy can be determined using big data technology, Sense & Response System (Figure 1). Happy Twitter Tweets Positive/ Calm News Feeds Mood Buy Stocks Sense and Response System 3 days Later Blogs Algorithmic Trading Engine Decisions Sad/Unhappy Twitter Tweets Twitter Posts Negative/ Anxious Sell Stocks Sense and Stock Data Response System 3 days Later Use of the output from sense and response system as a parameter to the Algorithmic trading black box not only gives capital market firms a competitive advantage but also helps in timely trading decisions for better alpha. 6 | Infosys
  • 7. Evolution of federated database approach for handling complexity of data Complexity of storage: shared storage and parallel processing are two key aspects to handle complex data especially when data is in structured and unstructured format. The structured data is stored in Oracle Real Application Cluster (RAC) node 1 and the unstructured data in RAC node 2. The RAC is a key to providing shared storage that is used in cloud computing. Taking into consideration the concept of parallel processing, databases like Teradata and Greenplum have multiple buckets for storing data, which helps in accessing the structured and unstructured data in parallel and thereby improving access response times. Using the above proposed mix of shared storage and parallel processing, complex big data can be handled proactively. Evolution of in memory analytics The increasing amount of data being made available to investment banks, coupled with the need to enter the capital markets as early as possible and desire to conduct real-time analytics to reduce latency is driving the trend towards in-memory analytics. In-memory software can improve the speed of data analytics by a factor of 14,000. Hence a request that previously took 5 hours can now be accomplished in just one second. The increase in speed is gained by putting data analytics into a CPU, into ‘memory’, as opposed to being carried out on disk. Queries sent to on-disk databases take longer because mechanical interaction take place, which is removed when query is, conducted in-memory. Conclusion Big Data technology implementation comes with its own cost. In spite of this, capital market firms are seen heavily investing in these technologies not only to stay competitive but also to ensure investors earn optimal ROI from their investments. The 3 V’s of data (variety, volume & velocity) play an extremely important role in capital market. Firms, who are still thinking of investing in Big Data technologies, should gear up soon before it becomes too late to remain competitive. References • A Team Group Survey – Big Data Solutions in Capital Market • www.bigdata.com/blog/ • www.emc.com/microsites/bigdata/index.html Infosys | 7
  • 8. About Infosys Infosys partners with global enterprises to drive their innovation-led growth. That's why Forbes ranked Infosys 19 among the top 100 most innovative companies. As a leading provider of next-generation consulting, technology and outsourcing solutions, Infosys helps clients in more than 30 countries realize their goals. Visit www.infosys.com and see how Infosys (NASDAQ: INFY), with its 150,000+ people, is Building Tomorrow's Enterprise® today. For more information, contact askus@infosys.com www.infosys.com © 2012 Infosys Limited, Bangalore, India. Infosys believes the information in this publication is accurate as of its publication date; such information is subject to change without notice. Infosys acknowledges the proprietary rights of the trademarks and product names of other companies mentioned in this document.