SlideShare a Scribd company logo
1 of 12
IDT Partners
         www.idtpartners.com




Big Data for Investment
Research Management
Discover how IDT Partners helps Financial
Services, Market Research, and Investment
Management firms turn big data into
actionable research insights.
Executive Summary

IDT Partners was retained by a financial client to investigate a legacy Research
Management System (RMS) and provide its team with a more robust research
dataset. The client’s investment research process includes supply chain [1]
research and ongoing investment portfolio monitoring. The research information
is collected and aggregated through proprietary supply chain data and by
conducting channel checks. [2]

The IDT Partners’ team worked with the client to understand the business
function dependencies on the legacy RMS, explored what business value potential
system improvements could bring, and identified what functionality, such as
reporting, researchers and executive management needed. A proposal offering
options for a new system featuring improved data management, business process
automation, and an intuitive user interface was put forth by IDT Partners and
approved by the client.

IDT Partners Solution
The IDT Partners’ team worked with the client to develop and deploy a new
system. The new system featured a much needed intuitive User Interface (UI),
improved Workflows, Business Process Automation (BPA), Supply Chain Trend
Spotting Tools, Dashboards, Reporting, and many other improvements.
Additionally, dozens of new data sources were added and a scalable Data
Warehouse was created to store and manage historical data.



IDT Partners   New York, NY   www.idtpartners.com   info@idtpartners.com   Page 2 of 12
IDT Partners increased the client’s research dataset eightyfold, from 50GB to over
4TB, while decreasing the data processing time from ten hours to less than two
hours. The IDT Partner’s solution has provided the client’s research team with a
technology-enabled competitive edge by improving the client’s operational
efficiency by 20%, significantly improving its business processes management
and automation capabilities, and using big data processing to provide actionable
research insights needed to support its investment research process.


IDT Partners Solution Highlights


       •       8,000% Larger Dataset

       •       1,000% Increase in number of Data Sources

       •       500% Faster Data Processing and Up-to-date Information

       •       100% ROI within 12 months

       •       20% Increase in Operational Efficiency

       •       Automated Trend Spotting & Monitoring with Big Data

       •       Business Process Management & Automation (BPM, BPA)

       •       Consistent Data with Automated Data De-duplication

       •       Intuitive and Streamlined User Interface (UI)

       •       Highly Customized Automated Reports

       •       Executive Summary Dashboards


IDT Partners    New York, NY   www.idtpartners.com   info@idtpartners.com   Page 3 of 12
Solution Overview:
Research Management System (RMS)

The following is an overview of a Research Management System (RMS) built by
IDT Partners for a financial client. The system facilitates investment research,
provides actionable insights, and helps support investment decisions by analyzing
the supply chain [1] for companies in the client’s investment portfolio.


System Architecture

A proprietary framework was used to build the RMS as a Service-Oriented
Architecture (SOA). This has allowed for an increased flexibility in tool choice and
made it easy to exchange information with client’s internal systems (Business
Intelligence, Compliance, Vendor Management, and Billing) via XML-based Web
Services.


Data Sources
IDT Partners assisted the client with data vendor selection, custom data
collection, data aggregation, data processing, and Master Data Management
(MDM) strategy for the following multi-terabyte datasets.

       Supply Chain Data                              Industry Contacts
       Company Data                                   Supply Chain Taxonomies
       Product Data                                   Social Media
       Company Taxonomies                             International Trade Data
       News                                           Custom Research
       Market Data                                    Proprietary Datasets


IDT Partners   New York, NY   www.idtpartners.com       info@idtpartners.com   Page 4 of 12
The following diagram provides an overview of the data management process as
it relates to extracting research insights from raw data sources.




                                   Data Source Management


                                 Data Ingestion & Integration


                            Master Data Management (MDM)




                                                          Market Data                   Industry
                                                                                        Contacts

            Supply Chain                   Company
               Data                       Taxonomies


                           Product Data


             Company                                                    Supply Chain
               Data                                                     Taxonomies


                              News                                                     Social Media


                                          International
                                           Trade Data

                                                                                         Custom
                                                                                        Research

                                                                        Proprietary
                                                                         Datasets
Data Warehousing (DW)


                                Custom Data Processing


                        Analytics & Business Intelligence (BI)


                       Research Management System (RMS)


                        Business Process Automation (BPA)


                              Actionable Research Insights



Big Data

Hadoop and MapReduce technology was used to convert multi-terabyte supply-
chain-related datasets into actionable research data and Business Intelligence
(BI) tools used by client to help facilitate investment research and decisions. The
solution analyzes over a billion supply chain data points on ongoing basis to
provide insights into historical supply chain changes.

The Hadoop infrastructure breaks up large datasets into "chunks" and
coordinates the processing of the data out into the distributed, clustered
environment. The Hadoop infrastructure included the following core components:
HDFS containers, MapReduce processing infrastructure, Hive for immutable table
data store, and HBase for mutable table data store.


IDT Partners   New York, NY     www.idtpartners.com   info@idtpartners.com   Page 6 of 12
The following diagram illustrates a simplified version of a data workflow that
utilizes big data processing technology to provide the RMS with actionable data
via an integrated BI platform.


                                                          Data Sources
                     Dozens of Proprietary and Commercial Supply Chain-related Data Sources (multi-terabyte)




          Data Ingestion                    Data Processing                       Hadoop Compute Cluster

                                                                                  Master 1    Map
                 Scheduler                      Classification
                                                                                  Master 2
               Task Repository                 Deduplication
                                                                                  Slave 1
                  Admin UI                     Normalization
                                                                                  Slave 2     Map


                                                                                  …
                                                                                  Slave n     Map



                                                                                                           Reduce

                                     Research Dataset


                      Current                 Custom
                      Dataset               Trends DW
                                                                                             BI Platform




                                     Research Management System




IDT Partners          New York, NY          www.idtpartners.com             info@idtpartners.com               Page 7 of 12
Key Technologies
The following are some of the key technologies used for this solution.

       PHP                        MapReduce                 Linux                  Lucene
       LAMP                       HDFS                      Apache                 Solr
       JAVA                       HBase                     MongoDB                ETL
       jQuery                     Pig                       MSSQL                  SSIS
       Hadoop                     Hive                      MarkLogic              Pentaho




User Interface (UI)
The solution leveraged a combination of Rich Internet Application (RIA)
technology and intuitive User Interface (UI) design to simplify the supply chain
navigation process, seamlessly manage the underlying research processes, and to
provide a way to easily review large quantities of data using data visualization and
dashboards.




IDT Partners     New York, NY        www.idtpartners.com        info@idtpartners.com          Page 8 of 12
The following are some of the UI features of the RMS.

     Innovative Data Visualization interfaces
     Intuitive upstream and downstream supply chain navigation with rollover
        information popups.
     Analysis of companies within each node in the supply chain.
     Supplier, vendor, manufacturer, customer, and distributor information.
     Competitor information for companies within the supply chain.
     Comparison matrix for any two entities in the supply chain.
     Supply chain company filter with research-focused criteria.
     Supply chain search using hundreds of criteria including historical trends.
     Sharing of screens, graphs, charts, dashboards, and research feedback.

Automated Trend Spotting & Monitoring
IDT Partners developed a data warehouse and tools to keep track of historical
supply chain changes to help identify supply chain trends. Automated trend
spotting and monitoring automated and streamlined what had previously been a
highly time consuming and error-prone manual process. This has proved to be
invaluable for the client’s research team.



Reporting

The solution features dozens of highly customized reports leveraging a much
larger dataset and the newly built data warehouse.




IDT Partners   New York, NY   www.idtpartners.com   info@idtpartners.com   Page 9 of 12
Some of the features include Custom Dashboards, Trend
and Portfolio Monitoring Alerts, dozens of Highly
Customized Reports, and Data Export (Excel, PDF, Email).




Definitions
The following are definitions for some domain-specific terminology referenced in
this white paper.



        [1] Supply Chain

               A supply chain is a system of organizations, people, technology,
               activities, information and resources involved in moving a product or
               service from supplier to customer. Supply chain activities transform
               natural resources, raw materials and components into a finished
               product that is delivered to the end customer. The following is an
               overview of an electronics manufacturing supply chain.

                         Manufacturing                                  Marketing




                     R&D             Design           Assembly    Distribution      Sales




               The following simplified supply chain diagram is an example of how
               NVIDIA, a global technology company and GPU manufacturer, supplies




IDT Partners       New York, NY          www.idtpartners.com     info@idtpartners.com       Page 10 of 12
directly to the customer, to PC vendors, and retailers. Each point in the
               supply chain may include hundreds of companies across many tiers.


                                                             Direct PC
                Semiconductor
                                                             Vendors
                  Materials


                                    Component
                                                                                          Customers
                                     Makers /
                                   Semiconduct
                                   or Foundries

                                                            Indirect PC            Resellers /
                Semiconductor                                 Vendors              Retailers
                Manufacturing
                  Equipment

                                                            Distributors




        [2] Channel Check

               In financial analysis, a channel check is third-party research on a
               company's business based on collecting information from the
               distribution channels of the company. It may be conducted in order to
               value the company or to perform due diligence in various contexts.
               Industries where channel checks are more often conducted include
               retail, technology, commodities, etc. Channel checks can give insights
               complementary to balance sheet analysis, such as distributor and
               retailer attitudes towards a product and its competitors, seasonal and
               geographic variation, and inventory levels (notably channel stuffing).




IDT Partners        New York, NY      www.idtpartners.com          info@idtpartners.com          Page 11 of 12
About IDT Partners
IDT Partners is a New York City based technology solutions and web application
development firm specializing in enterprise-level web application development,
custom product development, and technology solutions that help customers
accelerate growth, capitalize on new market opportunities, and optimize
operational efficiency by leveraging the latest technology.




                Contact IDT Partners at info@idtpartners.com




IDT Partners   New York, NY   www.idtpartners.com   info@idtpartners.com   Page 12 of 12

More Related Content

What's hot

Metadata Use Cases You Can Use
Metadata Use Cases You Can UseMetadata Use Cases You Can Use
Metadata Use Cases You Can Usedmurph4
 
Storage Characteristics Of Call Data Records In Column Store Databases
Storage Characteristics Of Call Data Records In Column Store DatabasesStorage Characteristics Of Call Data Records In Column Store Databases
Storage Characteristics Of Call Data Records In Column Store DatabasesDavid Walker
 
TUW - 184.742 Evaluating Data Concerns for DaaS
TUW - 184.742 Evaluating Data Concerns for DaaSTUW - 184.742 Evaluating Data Concerns for DaaS
TUW - 184.742 Evaluating Data Concerns for DaaSHong-Linh Truong
 
Tech Talk SQL Server 2012 Business Intelligence
Tech Talk SQL Server 2012 Business IntelligenceTech Talk SQL Server 2012 Business Intelligence
Tech Talk SQL Server 2012 Business IntelligenceRay Cochrane
 
Architecting a-big-data-platform-for-analytics 24606569
Architecting a-big-data-platform-for-analytics 24606569Architecting a-big-data-platform-for-analytics 24606569
Architecting a-big-data-platform-for-analytics 24606569Kun Le
 
Best practices for building and deploying predictive models over big data pre...
Best practices for building and deploying predictive models over big data pre...Best practices for building and deploying predictive models over big data pre...
Best practices for building and deploying predictive models over big data pre...Kun Le
 
Big Data Analytics in a Heterogeneous World - Joydeep Das of Sybase
Big Data Analytics in a Heterogeneous World - Joydeep Das of SybaseBig Data Analytics in a Heterogeneous World - Joydeep Das of Sybase
Big Data Analytics in a Heterogeneous World - Joydeep Das of SybaseBigDataCloud
 
Intel Cloud summit: Big Data by Nick Knupffer
Intel Cloud summit: Big Data by Nick KnupfferIntel Cloud summit: Big Data by Nick Knupffer
Intel Cloud summit: Big Data by Nick KnupfferIntelAPAC
 
Next Generation BI: current state and changing product assumptions
Next Generation BI: current state and changing product assumptionsNext Generation BI: current state and changing product assumptions
Next Generation BI: current state and changing product assumptionsmark madsen
 
White Paper - Data Warehouse Governance
White Paper -  Data Warehouse GovernanceWhite Paper -  Data Warehouse Governance
White Paper - Data Warehouse GovernanceDavid Walker
 
W 1&2 introduction to omt-ii
W 1&2 introduction to omt-iiW 1&2 introduction to omt-ii
W 1&2 introduction to omt-iiMajid Orakzai
 
Enterprise Master Data Architecture: Design Decisions and Options
Enterprise Master Data Architecture: Design Decisions and OptionsEnterprise Master Data Architecture: Design Decisions and Options
Enterprise Master Data Architecture: Design Decisions and OptionsBoris Otto
 
NASA Facilities GIS
NASA Facilities GISNASA Facilities GIS
NASA Facilities GISrjinterr
 
Papyrus Digital Asset Management
Papyrus Digital Asset ManagementPapyrus Digital Asset Management
Papyrus Digital Asset Managementmjpucher
 
Teradata Overview
Teradata OverviewTeradata Overview
Teradata OverviewTeradata
 
IBM Smarter Water Keynote, IWA Montreal September 2010
IBM Smarter Water Keynote, IWA Montreal September 2010IBM Smarter Water Keynote, IWA Montreal September 2010
IBM Smarter Water Keynote, IWA Montreal September 2010Cameron Brooks
 

What's hot (20)

Metadata Use Cases You Can Use
Metadata Use Cases You Can UseMetadata Use Cases You Can Use
Metadata Use Cases You Can Use
 
Storage Characteristics Of Call Data Records In Column Store Databases
Storage Characteristics Of Call Data Records In Column Store DatabasesStorage Characteristics Of Call Data Records In Column Store Databases
Storage Characteristics Of Call Data Records In Column Store Databases
 
Gic2011 aula3-ingles
Gic2011 aula3-inglesGic2011 aula3-ingles
Gic2011 aula3-ingles
 
TUW - 184.742 Evaluating Data Concerns for DaaS
TUW - 184.742 Evaluating Data Concerns for DaaSTUW - 184.742 Evaluating Data Concerns for DaaS
TUW - 184.742 Evaluating Data Concerns for DaaS
 
Tech Talk SQL Server 2012 Business Intelligence
Tech Talk SQL Server 2012 Business IntelligenceTech Talk SQL Server 2012 Business Intelligence
Tech Talk SQL Server 2012 Business Intelligence
 
Architecting a-big-data-platform-for-analytics 24606569
Architecting a-big-data-platform-for-analytics 24606569Architecting a-big-data-platform-for-analytics 24606569
Architecting a-big-data-platform-for-analytics 24606569
 
Best practices for building and deploying predictive models over big data pre...
Best practices for building and deploying predictive models over big data pre...Best practices for building and deploying predictive models over big data pre...
Best practices for building and deploying predictive models over big data pre...
 
Data ware house
Data ware houseData ware house
Data ware house
 
Big Data Analytics in a Heterogeneous World - Joydeep Das of Sybase
Big Data Analytics in a Heterogeneous World - Joydeep Das of SybaseBig Data Analytics in a Heterogeneous World - Joydeep Das of Sybase
Big Data Analytics in a Heterogeneous World - Joydeep Das of Sybase
 
Intel Cloud summit: Big Data by Nick Knupffer
Intel Cloud summit: Big Data by Nick KnupfferIntel Cloud summit: Big Data by Nick Knupffer
Intel Cloud summit: Big Data by Nick Knupffer
 
Mis 3
Mis 3Mis 3
Mis 3
 
Next Generation BI: current state and changing product assumptions
Next Generation BI: current state and changing product assumptionsNext Generation BI: current state and changing product assumptions
Next Generation BI: current state and changing product assumptions
 
White Paper - Data Warehouse Governance
White Paper -  Data Warehouse GovernanceWhite Paper -  Data Warehouse Governance
White Paper - Data Warehouse Governance
 
Slideshare ga
Slideshare gaSlideshare ga
Slideshare ga
 
W 1&2 introduction to omt-ii
W 1&2 introduction to omt-iiW 1&2 introduction to omt-ii
W 1&2 introduction to omt-ii
 
Enterprise Master Data Architecture: Design Decisions and Options
Enterprise Master Data Architecture: Design Decisions and OptionsEnterprise Master Data Architecture: Design Decisions and Options
Enterprise Master Data Architecture: Design Decisions and Options
 
NASA Facilities GIS
NASA Facilities GISNASA Facilities GIS
NASA Facilities GIS
 
Papyrus Digital Asset Management
Papyrus Digital Asset ManagementPapyrus Digital Asset Management
Papyrus Digital Asset Management
 
Teradata Overview
Teradata OverviewTeradata Overview
Teradata Overview
 
IBM Smarter Water Keynote, IWA Montreal September 2010
IBM Smarter Water Keynote, IWA Montreal September 2010IBM Smarter Water Keynote, IWA Montreal September 2010
IBM Smarter Water Keynote, IWA Montreal September 2010
 

Similar to Big Data For Investment Research Management

Anexinet Big Data Solutions
Anexinet Big Data SolutionsAnexinet Big Data Solutions
Anexinet Big Data SolutionsMark Kromer
 
Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831Cana Ko
 
Informatica World 2006 - MDM Data Quality
Informatica World 2006 - MDM Data QualityInformatica World 2006 - MDM Data Quality
Informatica World 2006 - MDM Data QualityDatabase Architechs
 
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendIntroducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendCaserta
 
data resource management
 data resource management data resource management
data resource managementsoodsurbhi123
 
Intel Cloud Summit: Big Data
Intel Cloud Summit: Big DataIntel Cloud Summit: Big Data
Intel Cloud Summit: Big DataIntelAPAC
 
Investigative Analytics- What's in a Data Scientists Toolbox
Investigative Analytics- What's in a Data Scientists ToolboxInvestigative Analytics- What's in a Data Scientists Toolbox
Investigative Analytics- What's in a Data Scientists ToolboxData Science London
 
EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data
EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data
EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data European Data Forum
 
Virtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & BénéficesVirtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & BénéficesDenodo
 
Teradata Big Data London Seminar
Teradata Big Data London SeminarTeradata Big Data London Seminar
Teradata Big Data London SeminarHortonworks
 
Martin Wildberger Presentation
Martin Wildberger PresentationMartin Wildberger Presentation
Martin Wildberger PresentationMauricio Godoy
 
Data Mining
Data MiningData Mining
Data Miningswami920
 
Big data? No. Big Decisions are What You Want
Big data? No. Big Decisions are What You WantBig data? No. Big Decisions are What You Want
Big data? No. Big Decisions are What You WantStuart Miniman
 
Big Data and Implications on Platform Architecture
Big Data and Implications on Platform ArchitectureBig Data and Implications on Platform Architecture
Big Data and Implications on Platform ArchitectureOdinot Stanislas
 
Magic quadrant for data warehouse database management systems
Magic quadrant for data warehouse database management systems Magic quadrant for data warehouse database management systems
Magic quadrant for data warehouse database management systems divjeev
 

Similar to Big Data For Investment Research Management (20)

Anexinet Big Data Solutions
Anexinet Big Data SolutionsAnexinet Big Data Solutions
Anexinet Big Data Solutions
 
Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831
 
Informatica World 2006 - MDM Data Quality
Informatica World 2006 - MDM Data QualityInformatica World 2006 - MDM Data Quality
Informatica World 2006 - MDM Data Quality
 
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendIntroducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
 
Data mining
Data miningData mining
Data mining
 
data resource management
 data resource management data resource management
data resource management
 
Intel Cloud Summit: Big Data
Intel Cloud Summit: Big DataIntel Cloud Summit: Big Data
Intel Cloud Summit: Big Data
 
Investigative Analytics- What's in a Data Scientists Toolbox
Investigative Analytics- What's in a Data Scientists ToolboxInvestigative Analytics- What's in a Data Scientists Toolbox
Investigative Analytics- What's in a Data Scientists Toolbox
 
EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data
EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data
EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data
 
Virtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & BénéficesVirtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & Bénéfices
 
Teradata Big Data London Seminar
Teradata Big Data London SeminarTeradata Big Data London Seminar
Teradata Big Data London Seminar
 
Barak regev
Barak regevBarak regev
Barak regev
 
Martin Wildberger Presentation
Martin Wildberger PresentationMartin Wildberger Presentation
Martin Wildberger Presentation
 
Data Mining
Data MiningData Mining
Data Mining
 
Big data use cases
Big data use casesBig data use cases
Big data use cases
 
Enterprise Services Solutions
Enterprise Services SolutionsEnterprise Services Solutions
Enterprise Services Solutions
 
Big data? No. Big Decisions are What You Want
Big data? No. Big Decisions are What You WantBig data? No. Big Decisions are What You Want
Big data? No. Big Decisions are What You Want
 
Big Data and Implications on Platform Architecture
Big Data and Implications on Platform ArchitectureBig Data and Implications on Platform Architecture
Big Data and Implications on Platform Architecture
 
Magic quadrant for data warehouse database management systems
Magic quadrant for data warehouse database management systems Magic quadrant for data warehouse database management systems
Magic quadrant for data warehouse database management systems
 
Big Data przt.pptx
Big Data przt.pptxBig Data przt.pptx
Big Data przt.pptx
 

Big Data For Investment Research Management

  • 1. IDT Partners www.idtpartners.com Big Data for Investment Research Management Discover how IDT Partners helps Financial Services, Market Research, and Investment Management firms turn big data into actionable research insights.
  • 2. Executive Summary IDT Partners was retained by a financial client to investigate a legacy Research Management System (RMS) and provide its team with a more robust research dataset. The client’s investment research process includes supply chain [1] research and ongoing investment portfolio monitoring. The research information is collected and aggregated through proprietary supply chain data and by conducting channel checks. [2] The IDT Partners’ team worked with the client to understand the business function dependencies on the legacy RMS, explored what business value potential system improvements could bring, and identified what functionality, such as reporting, researchers and executive management needed. A proposal offering options for a new system featuring improved data management, business process automation, and an intuitive user interface was put forth by IDT Partners and approved by the client. IDT Partners Solution The IDT Partners’ team worked with the client to develop and deploy a new system. The new system featured a much needed intuitive User Interface (UI), improved Workflows, Business Process Automation (BPA), Supply Chain Trend Spotting Tools, Dashboards, Reporting, and many other improvements. Additionally, dozens of new data sources were added and a scalable Data Warehouse was created to store and manage historical data. IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 2 of 12
  • 3. IDT Partners increased the client’s research dataset eightyfold, from 50GB to over 4TB, while decreasing the data processing time from ten hours to less than two hours. The IDT Partner’s solution has provided the client’s research team with a technology-enabled competitive edge by improving the client’s operational efficiency by 20%, significantly improving its business processes management and automation capabilities, and using big data processing to provide actionable research insights needed to support its investment research process. IDT Partners Solution Highlights • 8,000% Larger Dataset • 1,000% Increase in number of Data Sources • 500% Faster Data Processing and Up-to-date Information • 100% ROI within 12 months • 20% Increase in Operational Efficiency • Automated Trend Spotting & Monitoring with Big Data • Business Process Management & Automation (BPM, BPA) • Consistent Data with Automated Data De-duplication • Intuitive and Streamlined User Interface (UI) • Highly Customized Automated Reports • Executive Summary Dashboards IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 3 of 12
  • 4. Solution Overview: Research Management System (RMS) The following is an overview of a Research Management System (RMS) built by IDT Partners for a financial client. The system facilitates investment research, provides actionable insights, and helps support investment decisions by analyzing the supply chain [1] for companies in the client’s investment portfolio. System Architecture A proprietary framework was used to build the RMS as a Service-Oriented Architecture (SOA). This has allowed for an increased flexibility in tool choice and made it easy to exchange information with client’s internal systems (Business Intelligence, Compliance, Vendor Management, and Billing) via XML-based Web Services. Data Sources IDT Partners assisted the client with data vendor selection, custom data collection, data aggregation, data processing, and Master Data Management (MDM) strategy for the following multi-terabyte datasets.  Supply Chain Data  Industry Contacts  Company Data  Supply Chain Taxonomies  Product Data  Social Media  Company Taxonomies  International Trade Data  News  Custom Research  Market Data  Proprietary Datasets IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 4 of 12
  • 5. The following diagram provides an overview of the data management process as it relates to extracting research insights from raw data sources. Data Source Management Data Ingestion & Integration Master Data Management (MDM) Market Data Industry Contacts Supply Chain Company Data Taxonomies Product Data Company Supply Chain Data Taxonomies News Social Media International Trade Data Custom Research Proprietary Datasets
  • 6. Data Warehousing (DW) Custom Data Processing Analytics & Business Intelligence (BI) Research Management System (RMS) Business Process Automation (BPA) Actionable Research Insights Big Data Hadoop and MapReduce technology was used to convert multi-terabyte supply- chain-related datasets into actionable research data and Business Intelligence (BI) tools used by client to help facilitate investment research and decisions. The solution analyzes over a billion supply chain data points on ongoing basis to provide insights into historical supply chain changes. The Hadoop infrastructure breaks up large datasets into "chunks" and coordinates the processing of the data out into the distributed, clustered environment. The Hadoop infrastructure included the following core components: HDFS containers, MapReduce processing infrastructure, Hive for immutable table data store, and HBase for mutable table data store. IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 6 of 12
  • 7. The following diagram illustrates a simplified version of a data workflow that utilizes big data processing technology to provide the RMS with actionable data via an integrated BI platform. Data Sources Dozens of Proprietary and Commercial Supply Chain-related Data Sources (multi-terabyte) Data Ingestion Data Processing Hadoop Compute Cluster Master 1 Map Scheduler Classification Master 2 Task Repository Deduplication Slave 1 Admin UI Normalization Slave 2 Map … Slave n Map Reduce Research Dataset Current Custom Dataset Trends DW BI Platform Research Management System IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 7 of 12
  • 8. Key Technologies The following are some of the key technologies used for this solution.  PHP  MapReduce  Linux  Lucene  LAMP  HDFS  Apache  Solr  JAVA  HBase  MongoDB  ETL  jQuery  Pig  MSSQL  SSIS  Hadoop  Hive  MarkLogic  Pentaho User Interface (UI) The solution leveraged a combination of Rich Internet Application (RIA) technology and intuitive User Interface (UI) design to simplify the supply chain navigation process, seamlessly manage the underlying research processes, and to provide a way to easily review large quantities of data using data visualization and dashboards. IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 8 of 12
  • 9. The following are some of the UI features of the RMS.  Innovative Data Visualization interfaces  Intuitive upstream and downstream supply chain navigation with rollover information popups.  Analysis of companies within each node in the supply chain.  Supplier, vendor, manufacturer, customer, and distributor information.  Competitor information for companies within the supply chain.  Comparison matrix for any two entities in the supply chain.  Supply chain company filter with research-focused criteria.  Supply chain search using hundreds of criteria including historical trends.  Sharing of screens, graphs, charts, dashboards, and research feedback. Automated Trend Spotting & Monitoring IDT Partners developed a data warehouse and tools to keep track of historical supply chain changes to help identify supply chain trends. Automated trend spotting and monitoring automated and streamlined what had previously been a highly time consuming and error-prone manual process. This has proved to be invaluable for the client’s research team. Reporting The solution features dozens of highly customized reports leveraging a much larger dataset and the newly built data warehouse. IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 9 of 12
  • 10. Some of the features include Custom Dashboards, Trend and Portfolio Monitoring Alerts, dozens of Highly Customized Reports, and Data Export (Excel, PDF, Email). Definitions The following are definitions for some domain-specific terminology referenced in this white paper. [1] Supply Chain A supply chain is a system of organizations, people, technology, activities, information and resources involved in moving a product or service from supplier to customer. Supply chain activities transform natural resources, raw materials and components into a finished product that is delivered to the end customer. The following is an overview of an electronics manufacturing supply chain. Manufacturing Marketing R&D Design Assembly Distribution Sales The following simplified supply chain diagram is an example of how NVIDIA, a global technology company and GPU manufacturer, supplies IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 10 of 12
  • 11. directly to the customer, to PC vendors, and retailers. Each point in the supply chain may include hundreds of companies across many tiers. Direct PC Semiconductor Vendors Materials Component Customers Makers / Semiconduct or Foundries Indirect PC Resellers / Semiconductor Vendors Retailers Manufacturing Equipment Distributors [2] Channel Check In financial analysis, a channel check is third-party research on a company's business based on collecting information from the distribution channels of the company. It may be conducted in order to value the company or to perform due diligence in various contexts. Industries where channel checks are more often conducted include retail, technology, commodities, etc. Channel checks can give insights complementary to balance sheet analysis, such as distributor and retailer attitudes towards a product and its competitors, seasonal and geographic variation, and inventory levels (notably channel stuffing). IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 11 of 12
  • 12. About IDT Partners IDT Partners is a New York City based technology solutions and web application development firm specializing in enterprise-level web application development, custom product development, and technology solutions that help customers accelerate growth, capitalize on new market opportunities, and optimize operational efficiency by leveraging the latest technology. Contact IDT Partners at info@idtpartners.com IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 12 of 12