SlideShare una empresa de Scribd logo
1 de 6
Descargar para leer sin conexión
Building an Analytical Platform
Sybase IQ ile Analitik Platform
I was recently asked to build an analyti-   3. Provide a way to automate the                 An incidental but equally useful
cal platform for a project. But what is an        running of the statistical data models,     consequence of using a column-store
analytical platform? The client, a retailer,      once developed, so that they can be         database such as SAP Sybase IQ is that
described it as a database where it could         run without engaging the statistical        there is no advantage in creating a star
store data and as a front end where it            development resources.                      schema as a data model. Instead, hold-
could do statistical work. This work             Of course, time was of the essence           ing all the data in one large wide table is
would range from simple means and              and costs had to be as low as possible –                                                   -
standard deviations through to more            but we’ve come to expect that with             ing each column with a key means that
complex predictive analytics that could                                                       the underlying storage of data is a star
be used, for example, to analyze past          Step 1: The database                           schema. Creating a star schema in a
performance of a customer to assess the          Our chosen solution for the database         column-store database rather than a
likelihood that the customer will exhibit a    was an SAP® Sybase® IQ database, a             large single table would mean incurring
future behavior. Or it might involve using     technology our client was already famil-       unnecessary additional join and process-
models to classify customers into groups       iar with. SAP Sybase IQ is a column-store      ing overhead.
and ultimately to bring the two processes      database. This means that instead of             As a result of choosing SAP
together into an area known as decision        storing all the data in its rows, as many      Sybase IQ’s column-store database
models. The customer had also come up          other databases do, the data is organized      we are able to have a data model that
with an innovative way to resource the         on disk by the columns. For example if a       consists of a number of simple single
                                                                                              table data sets (one table for each
work placements to master’s degree             have the text of each country (for exam-
students studying statistics at the local      ple, “United Kingdom”) stored many             that is quick to load and to query.
university and arranged for them to work       times. In a column-store database the            It should be noted that this type of
with the customer insight team to              text is stored only once and given a
describe and develop the advanced              unique ID. This is repeated for each           online transaction processing (OLTP)
models. All the customer needed was a          column and therefore the “row” of data         applications because of the cost of doing
platform to work with.                         consists of a list of IDs linked to the data   small inserts and updates. However, this
   From a systems architecture and             held for each column.                          is not relevant for this particular
development perspective, we could
describe the requirements in three rela-       reporting and analytical databases.              The solution can be deployed only on
tively simple statements:                                                                     a Linux platform. We use Linux for three
1. Build a database with a very simple                                                        reasons. First, RStudio Server Edition is
    data model that could be easily            used. In our example, “United Kingdom”         not yet available for Microsoft Windows.
    loaded, that was capable of support-       would occupy 14 bytes, while the ID            Second, precompiled packages for all
    ing high-performance queries, and          might occupy only 1 byte – reducing the        elements of the solution on Linux reduce
    that did not consume a massive             storage for that one value in that one
    amount of disk space. It would also        column by a ratio of 14:1 – and this           environments are normally cheaper than
    ideally be capable of being placed in                                                     Windows environments due to the cost
    the cloud.                                 the data. Furthermore, because there is        of the operating system license. We
2. Create a Web-based interface that           less data on the disk, the time taken to       chose CentOS because it is a Red Hat
    would allow users to securely log on,      read the data from disk and to process         derivative that is free.
    to write statistical programs that                                                           One additional advantage of this solu-
    could use the database as a source of      which massively speeds up the queries          tion for some organizations is the ability
    data, and to output reports and graph-     too. Finally, each column is already           to deploy it in the cloud. Since the solu-
    ics and well as to populate other          indexed, which again helps the overall                                                 -
    tables (for example, target lists) as a    query speed.                                   ered, and since all querying is done via a
    result of statistical models.                                                             Web interface, it is possible to use any




SAP White Paper – Building an Analytical Platform                                                                                         3
colocation or cloud-based hosting            your environment, but these are well                          At this point data has to be loaded and
provider. Colocation or cloud deploy-        documented on the source Web sites                          the statisticians can get to work.
                                             and in general automatically download if                    Obviously this is more time consuming
systems management overhead, and             you are using a tool such as yum.                           than the build, and over the days and
access for both data delivery and data         The next step was to get access to the                    weeks the analysts created their models
access. The system requires SSH access       data held in our SAP Sybase IQ server.                      and produced the results.
for management; FTP, SFTP, or SCP for        This proved to also be very straightfor-                      For this exercise we used our in-house
                                             ward. There is a SAP Sybase white paper                     extract, transform, and load (ETL) tool to
port open. The RStudio server uses the                                                                   create a repeatable data extraction and
server login accounts for security but                                                                   load process, but it would have been
can also be tied to existing LDAP            describes the process that can be simply                    possible to use any of a wide range of
infrastructure.                              stated as:                                                  tools that are available for this process
Step 2: Statistical tools and Web              Install the R JDBC package                                Step 3: Automatically running the
interface                                      Set up a JDBC connection                                  statistical models
   There are a number of statistical tools     Establish your connection                                   Eventually a number of models for
in the market. Most are very expensive,        Query the table                                           analyzing the data had been created and
prohibitively so in this case, and the         We now have an R object that contains                     we were ready to move into a production
associated skills are hard to come by        data sourced from SAP Sybase IQ that                        environment. We automated the load of
and expensive. However, since 1993 an        we can work with. And what is amazing is                    the data into the agreed single-table
open-source programming language             that it took me less than half a day to                     structure and wanted to also run the
called R (www.r-project.org) for statisti-   build the platform from scratch.                            data models.
cal computing and graphics has been
under development. It is now widely used
among statisticians for developing statis-
tical software and data analysis, is used
by many universities, and is predicted to                         Analytical Platform Server
become the most widely used statistical
package by 2015. The R project provides
                                                                                                  R Studio
a command line and graphical interface                                     R
                                                                                                Server Edition
as well as a large open-source library of
useful routines (http:/ /cran.r-project.                         R/JDBC
org) and it is available as packaged soft-                    Connection
ware for most platforms including Linux.
   In addition, a second open-source proj-                                     SAP
ect called RStudio (http:/  /rstudio.org)                                     Sybase
provides a single integrated development                              R/JDBC Connection
                                                                                IQ
environment for R and can be deployed        (S) FTP/SCP
on a local machine or as a Web-based          File Delivery                                                   Write to
service using the server’s security model.                                                                    Database                  Any Network
In this case, we implemented the server                                                                                             Connected Computer
                                                                               Read File               ETL                        with a Browser Accessing
edition in order to make the entire envi-                                                                                        the R Studio Server Edition
                                                                                                      Engine
ronment Web based.
   So in two simple steps (download and
install R, followed by download and
install RStudio) we can install a full                                              CentOS
Web-based statistical environment. Note
                                                                                           ©2012 Data Management & Warehousing


packages may be required depending on
SAP Sybase IQ has the functionality          ConCluSionS                                 ABout the Author

                                                                                           David Walker has been involved with business
These C++ programs “talk” to a process          Business intelligence requirements are
                                                                                           intelligence and data warehousing for over
known as Rserve, which in turn executes       changing and business users are moving
the R program and returns the results         more and more from historical reporting
to SAP Sybase IQ. This allows R func-         into predictive analytics in an attempt to
tions to be embedded directly into SAP        get both a better and deeper under-          Data Management & Warehousing (http://
                                                                                           datamgmt.com) in 1995.
Sybase IQ SQL commands. While setting         standing of their data.
this up requires a little more program-         Traditionally, building an analytical      David and his team have worked around
ming experience, it does mean that all        platform has required an expensive infra-    the world on projects designed to deliver
processing can be done within SAP             structure and a considerable amount of
Sybase IQ.                                    time for setup and deployment.               converting data into information and by
   Conversely, it is possible to run R from     By combining the high performance,
                                                                                           exploit that information.
the command line and call the program         low footprint of SAP Sybase IQ with the
that in turn uses the RJDBC connection        open-source R and RStudio statistical        David’s project work has given him experi-
to read and write data to the database.       packages, it is possible to quickly deploy   ence in a wide variety of industries including
   Having a choice of methods is very         an analytical platform in the cloud for                                                  -
                                                                                           facturing, transportation, and public sector
helpful as it means that it can be inte-      which there are readily available skills.
                                                                                           as well as a broad and deep knowledge of
grated with the ETL environment in the          This infrastructure can be used both       business intelligence and data warehousing
most appropriate way. If the ETL tool         for rapid prototyping on analytical          technologies.
                                              models and for running completed
function (UDF) route is the most attrac-      models on new data sets to deliver
tive. However, if the ETL tool supports       greater insight into the data.
host callouts (as ours does) then running
R programs from a command line callout
is quicker than developing the UDF.




SAP White Paper – Building an Analytical Platform                                                                                           5
www.sap.com/contactsap




12/08 ©2012 SAP AG. All rights reserved.
SAP, R/3, SAP NetWeaver, Duet, PartnerEdge, ByDesign,
SAP BusinessObjects Explorer, StreamWork, SAP HANA, and
other SAP products and services mentioned herein as well as
their respective logos are trademarks or registered trademarks
of SAP AG in Germany and other countries.

Business Objects and the Business Objects logo, BusinessObjects,
Crystal Reports, Crystal Decisions, Web Intelligence, Xcelsius, and other
Business Objects products and services mentioned herein as well as their
respective logos are trademarks or registered trademarks of Business
Objects Software Ltd. Business Objects is an SAP company.

Sybase and Adaptive Server, iAnywhere, Sybase 365, SQL Anywhere, and
other Sybase products and services mentioned herein as well as their
respective logos are trademarks or registered trademarks of Sybase Inc.
Sybase is an SAP company.

Crossgate, m@gic EDDY, B2B 360°, and B2B 360° Services are registered
trademarks of Crossgate AG in Germany and other countries. Crossgate
is an SAP company.

All other product and service names mentioned are the trademarks of
their respective companies. Data contained in this document serves


These materials are subject to change without notice. These materials

for informational purposes only, without representation or warranty of
any kind, and SAP Group shall not be liable for errors or omissions with
respect to the materials. The only warranties for SAP Group products and
services are those that are set forth in the express warranty statements
accompanying such products and services, if any. Nothing herein should
be construed as constituting an additional warranty.

Más contenido relacionado

La actualidad más candente

Hadoop MapReduce Framework
Hadoop MapReduce FrameworkHadoop MapReduce Framework
Hadoop MapReduce FrameworkEdureka!
 
Intro to Hybrid Data Warehouse
Intro to Hybrid Data WarehouseIntro to Hybrid Data Warehouse
Intro to Hybrid Data WarehouseJonathan Bloom
 
Auto­matic Para­meter Tun­ing for Data­bases and Big Data Sys­tems
Auto­matic Para­meter Tun­ing for Data­bases and Big Data Sys­tems Auto­matic Para­meter Tun­ing for Data­bases and Big Data Sys­tems
Auto­matic Para­meter Tun­ing for Data­bases and Big Data Sys­tems Jiaheng Lu
 
Hadoop Technologies
Hadoop TechnologiesHadoop Technologies
Hadoop Technologieszahid-mian
 
Big data processing with apache spark part1
Big data processing with apache spark   part1Big data processing with apache spark   part1
Big data processing with apache spark part1Abbas Maazallahi
 
XML Parsing with Map Reduce
XML Parsing with Map ReduceXML Parsing with Map Reduce
XML Parsing with Map ReduceEdureka!
 
Hadoop Ecosystem Architecture Overview
Hadoop Ecosystem Architecture Overview Hadoop Ecosystem Architecture Overview
Hadoop Ecosystem Architecture Overview Senthil Kumar
 
Learning How to Learn Hadoop
Learning How to Learn HadoopLearning How to Learn Hadoop
Learning How to Learn HadoopSilicon Halton
 
Which NoSQL Database to Combine with Spark for Real Time Big Data Analytics?
Which NoSQL Database to Combine with Spark for Real Time Big Data Analytics?Which NoSQL Database to Combine with Spark for Real Time Big Data Analytics?
Which NoSQL Database to Combine with Spark for Real Time Big Data Analytics?IJCSIS Research Publications
 
Hadoop Summit 2012 - Hadoop and Vertica: The Data Analytics Platform at Twitter
Hadoop Summit 2012 - Hadoop and Vertica: The Data Analytics Platform at TwitterHadoop Summit 2012 - Hadoop and Vertica: The Data Analytics Platform at Twitter
Hadoop Summit 2012 - Hadoop and Vertica: The Data Analytics Platform at TwitterBill Graham
 
Shared slides-edbt-keynote-03-19-13
Shared slides-edbt-keynote-03-19-13Shared slides-edbt-keynote-03-19-13
Shared slides-edbt-keynote-03-19-13Daniel Abadi
 
Hadoop Interview Questions and Answers by rohit kapa
Hadoop Interview Questions and Answers by rohit kapaHadoop Interview Questions and Answers by rohit kapa
Hadoop Interview Questions and Answers by rohit kapakapa rohit
 
Architecting Big Data Ingest & Manipulation
Architecting Big Data Ingest & ManipulationArchitecting Big Data Ingest & Manipulation
Architecting Big Data Ingest & ManipulationGeorge Long
 
Case Study Real Time Olap Cubes
Case Study Real Time Olap CubesCase Study Real Time Olap Cubes
Case Study Real Time Olap Cubesmister_zed
 
Hadoop Seminar Report
Hadoop Seminar ReportHadoop Seminar Report
Hadoop Seminar ReportAtul Kushwaha
 
Introduction to Hadoop and MapReduce
Introduction to Hadoop and MapReduceIntroduction to Hadoop and MapReduce
Introduction to Hadoop and MapReduceeakasit_dpu
 

La actualidad más candente (20)

Understanding hdfs
Understanding hdfsUnderstanding hdfs
Understanding hdfs
 
Hadoop MapReduce Framework
Hadoop MapReduce FrameworkHadoop MapReduce Framework
Hadoop MapReduce Framework
 
Intro to Hybrid Data Warehouse
Intro to Hybrid Data WarehouseIntro to Hybrid Data Warehouse
Intro to Hybrid Data Warehouse
 
Auto­matic Para­meter Tun­ing for Data­bases and Big Data Sys­tems
Auto­matic Para­meter Tun­ing for Data­bases and Big Data Sys­tems Auto­matic Para­meter Tun­ing for Data­bases and Big Data Sys­tems
Auto­matic Para­meter Tun­ing for Data­bases and Big Data Sys­tems
 
Hadoop Technologies
Hadoop TechnologiesHadoop Technologies
Hadoop Technologies
 
Big data processing with apache spark part1
Big data processing with apache spark   part1Big data processing with apache spark   part1
Big data processing with apache spark part1
 
Hadoop technology doc
Hadoop technology docHadoop technology doc
Hadoop technology doc
 
XML Parsing with Map Reduce
XML Parsing with Map ReduceXML Parsing with Map Reduce
XML Parsing with Map Reduce
 
Hadoop Ecosystem Architecture Overview
Hadoop Ecosystem Architecture Overview Hadoop Ecosystem Architecture Overview
Hadoop Ecosystem Architecture Overview
 
Learning How to Learn Hadoop
Learning How to Learn HadoopLearning How to Learn Hadoop
Learning How to Learn Hadoop
 
Which NoSQL Database to Combine with Spark for Real Time Big Data Analytics?
Which NoSQL Database to Combine with Spark for Real Time Big Data Analytics?Which NoSQL Database to Combine with Spark for Real Time Big Data Analytics?
Which NoSQL Database to Combine with Spark for Real Time Big Data Analytics?
 
Prashanth Kumar_Hadoop_NEW
Prashanth Kumar_Hadoop_NEWPrashanth Kumar_Hadoop_NEW
Prashanth Kumar_Hadoop_NEW
 
Hadoop Summit 2012 - Hadoop and Vertica: The Data Analytics Platform at Twitter
Hadoop Summit 2012 - Hadoop and Vertica: The Data Analytics Platform at TwitterHadoop Summit 2012 - Hadoop and Vertica: The Data Analytics Platform at Twitter
Hadoop Summit 2012 - Hadoop and Vertica: The Data Analytics Platform at Twitter
 
002 Introduction to hadoop v3
002   Introduction to hadoop v3002   Introduction to hadoop v3
002 Introduction to hadoop v3
 
Shared slides-edbt-keynote-03-19-13
Shared slides-edbt-keynote-03-19-13Shared slides-edbt-keynote-03-19-13
Shared slides-edbt-keynote-03-19-13
 
Hadoop Interview Questions and Answers by rohit kapa
Hadoop Interview Questions and Answers by rohit kapaHadoop Interview Questions and Answers by rohit kapa
Hadoop Interview Questions and Answers by rohit kapa
 
Architecting Big Data Ingest & Manipulation
Architecting Big Data Ingest & ManipulationArchitecting Big Data Ingest & Manipulation
Architecting Big Data Ingest & Manipulation
 
Case Study Real Time Olap Cubes
Case Study Real Time Olap CubesCase Study Real Time Olap Cubes
Case Study Real Time Olap Cubes
 
Hadoop Seminar Report
Hadoop Seminar ReportHadoop Seminar Report
Hadoop Seminar Report
 
Introduction to Hadoop and MapReduce
Introduction to Hadoop and MapReduceIntroduction to Hadoop and MapReduce
Introduction to Hadoop and MapReduce
 

Destacado

Franz bartels actividad1_mapa_c
Franz bartels actividad1_mapa_cFranz bartels actividad1_mapa_c
Franz bartels actividad1_mapa_cFranz Manare
 
Frases celebres del fracaso
Frases celebres del fracasoFrases celebres del fracaso
Frases celebres del fracasomvazquezfranco
 
Ktdigieco201211
Ktdigieco201211Ktdigieco201211
Ktdigieco201211deepersnet
 
UTE_La estrategia en los emprendimientos sociales YESENIA CASTILLO
UTE_La estrategia en los emprendimientos sociales YESENIA CASTILLOUTE_La estrategia en los emprendimientos sociales YESENIA CASTILLO
UTE_La estrategia en los emprendimientos sociales YESENIA CASTILLOjosucalis
 
Ay, cómo duele crecer. roberto jorge saller
Ay, cómo duele crecer. roberto jorge sallerAy, cómo duele crecer. roberto jorge saller
Ay, cómo duele crecer. roberto jorge sallerRobertoSaller
 
Westport Innovations Presentation
Westport Innovations PresentationWestport Innovations Presentation
Westport Innovations PresentationCALSTART
 
Curs per a deixar de fumar - Sessió 5
Curs per a deixar de fumar - Sessió 5Curs per a deixar de fumar - Sessió 5
Curs per a deixar de fumar - Sessió 5psqvuoc
 
Luis calderon byron melara!
Luis calderon byron melara!Luis calderon byron melara!
Luis calderon byron melara!birolooo
 
Gtc200 guia-para-la-implementacion-de-iso-9001-en-inst-educativas
Gtc200 guia-para-la-implementacion-de-iso-9001-en-inst-educativasGtc200 guia-para-la-implementacion-de-iso-9001-en-inst-educativas
Gtc200 guia-para-la-implementacion-de-iso-9001-en-inst-educativasJohn Triana
 
Utgifter i jul julklappar och övriga julutgifter
Utgifter i jul   julklappar och övriga julutgifterUtgifter i jul   julklappar och övriga julutgifter
Utgifter i jul julklappar och övriga julutgifterNordea Bank
 
Curso búsqueda de empleo
Curso búsqueda de empleoCurso búsqueda de empleo
Curso búsqueda de empleoiLabora
 
Luis diaz politicas ambientales
Luis diaz politicas ambientalesLuis diaz politicas ambientales
Luis diaz politicas ambientalesLuis Daniel
 

Destacado (15)

Franz bartels actividad1_mapa_c
Franz bartels actividad1_mapa_cFranz bartels actividad1_mapa_c
Franz bartels actividad1_mapa_c
 
Frases celebres del fracaso
Frases celebres del fracasoFrases celebres del fracaso
Frases celebres del fracaso
 
Ktdigieco201211
Ktdigieco201211Ktdigieco201211
Ktdigieco201211
 
UTE_La estrategia en los emprendimientos sociales YESENIA CASTILLO
UTE_La estrategia en los emprendimientos sociales YESENIA CASTILLOUTE_La estrategia en los emprendimientos sociales YESENIA CASTILLO
UTE_La estrategia en los emprendimientos sociales YESENIA CASTILLO
 
Np viu incorporada en la junta qualificadora coneixements de valencià
Np viu incorporada en la junta qualificadora coneixements de valenciàNp viu incorporada en la junta qualificadora coneixements de valencià
Np viu incorporada en la junta qualificadora coneixements de valencià
 
Ay, cómo duele crecer. roberto jorge saller
Ay, cómo duele crecer. roberto jorge sallerAy, cómo duele crecer. roberto jorge saller
Ay, cómo duele crecer. roberto jorge saller
 
Westport Innovations Presentation
Westport Innovations PresentationWestport Innovations Presentation
Westport Innovations Presentation
 
Curs per a deixar de fumar - Sessió 5
Curs per a deixar de fumar - Sessió 5Curs per a deixar de fumar - Sessió 5
Curs per a deixar de fumar - Sessió 5
 
Luis calderon byron melara!
Luis calderon byron melara!Luis calderon byron melara!
Luis calderon byron melara!
 
Gtc200 guia-para-la-implementacion-de-iso-9001-en-inst-educativas
Gtc200 guia-para-la-implementacion-de-iso-9001-en-inst-educativasGtc200 guia-para-la-implementacion-de-iso-9001-en-inst-educativas
Gtc200 guia-para-la-implementacion-de-iso-9001-en-inst-educativas
 
Utgifter i jul julklappar och övriga julutgifter
Utgifter i jul   julklappar och övriga julutgifterUtgifter i jul   julklappar och övriga julutgifter
Utgifter i jul julklappar och övriga julutgifter
 
Curriculo de Educacion Sexual UNESCO Primera Parte
Curriculo de Educacion Sexual UNESCO Primera ParteCurriculo de Educacion Sexual UNESCO Primera Parte
Curriculo de Educacion Sexual UNESCO Primera Parte
 
Curso búsqueda de empleo
Curso búsqueda de empleoCurso búsqueda de empleo
Curso búsqueda de empleo
 
KSUY 5328 Hafta 11
KSUY 5328 Hafta 11KSUY 5328 Hafta 11
KSUY 5328 Hafta 11
 
Luis diaz politicas ambientales
Luis diaz politicas ambientalesLuis diaz politicas ambientales
Luis diaz politicas ambientales
 

Similar a Sybase IQ ile Analitik Platform

A Survey on Data Mapping Strategy for data stored in the storage cloud 111
A Survey on Data Mapping Strategy for data stored in the storage cloud  111A Survey on Data Mapping Strategy for data stored in the storage cloud  111
A Survey on Data Mapping Strategy for data stored in the storage cloud 111NavNeet KuMar
 
Unstructured Datasets Analysis: Thesaurus Model
Unstructured Datasets Analysis: Thesaurus ModelUnstructured Datasets Analysis: Thesaurus Model
Unstructured Datasets Analysis: Thesaurus ModelEditor IJCATR
 
Sap Interview Questions - Part 1
Sap Interview Questions - Part 1Sap Interview Questions - Part 1
Sap Interview Questions - Part 1ReKruiTIn.com
 
Testistanbul 2016 - Keynote: "Performance Testing of Big Data" by Roland Leusden
Testistanbul 2016 - Keynote: "Performance Testing of Big Data" by Roland LeusdenTestistanbul 2016 - Keynote: "Performance Testing of Big Data" by Roland Leusden
Testistanbul 2016 - Keynote: "Performance Testing of Big Data" by Roland LeusdenTurkish Testing Board
 
NETWORK TRAFFIC ANALYSIS: HADOOP PIG VS TYPICAL MAPREDUCE
NETWORK TRAFFIC ANALYSIS: HADOOP PIG VS TYPICAL MAPREDUCENETWORK TRAFFIC ANALYSIS: HADOOP PIG VS TYPICAL MAPREDUCE
NETWORK TRAFFIC ANALYSIS: HADOOP PIG VS TYPICAL MAPREDUCEcscpconf
 
A Comparative Survey Based on Processing Network Traffic Data Using Hadoop Pi...
A Comparative Survey Based on Processing Network Traffic Data Using Hadoop Pi...A Comparative Survey Based on Processing Network Traffic Data Using Hadoop Pi...
A Comparative Survey Based on Processing Network Traffic Data Using Hadoop Pi...IJCSES Journal
 
NETWORK TRAFFIC ANALYSIS: HADOOP PIG VS TYPICAL MAPREDUCE
NETWORK TRAFFIC ANALYSIS: HADOOP PIG VS TYPICAL MAPREDUCENETWORK TRAFFIC ANALYSIS: HADOOP PIG VS TYPICAL MAPREDUCE
NETWORK TRAFFIC ANALYSIS: HADOOP PIG VS TYPICAL MAPREDUCEcsandit
 
Cloud-Native Patterns for Data-Intensive Applications
Cloud-Native Patterns for Data-Intensive ApplicationsCloud-Native Patterns for Data-Intensive Applications
Cloud-Native Patterns for Data-Intensive ApplicationsVMware Tanzu
 
Database Integrated Analytics using R InitialExperiences wi
Database Integrated Analytics using R InitialExperiences wiDatabase Integrated Analytics using R InitialExperiences wi
Database Integrated Analytics using R InitialExperiences wiOllieShoresna
 
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...Cloudera, Inc.
 
Aucfanlab Datalake - Big Data Management Platform -
Aucfanlab Datalake - Big Data Management Platform -Aucfanlab Datalake - Big Data Management Platform -
Aucfanlab Datalake - Big Data Management Platform -Aucfan
 
Otimizações de Projetos de Big Data, Dw e AI no Microsoft Azure
Otimizações de Projetos de Big Data, Dw e AI no Microsoft AzureOtimizações de Projetos de Big Data, Dw e AI no Microsoft Azure
Otimizações de Projetos de Big Data, Dw e AI no Microsoft AzureLuan Moreno Medeiros Maciel
 
Agile data warehousing
Agile data warehousingAgile data warehousing
Agile data warehousingSneha Challa
 
Performance evaluation and estimation model using regression method for hadoo...
Performance evaluation and estimation model using regression method for hadoo...Performance evaluation and estimation model using regression method for hadoo...
Performance evaluation and estimation model using regression method for hadoo...redpel dot com
 
Maharshi_Amin_416
Maharshi_Amin_416Maharshi_Amin_416
Maharshi_Amin_416mamin1411
 
IRJET- Big Data Processes and Analysis using Hadoop Framework
IRJET- Big Data Processes and Analysis using Hadoop FrameworkIRJET- Big Data Processes and Analysis using Hadoop Framework
IRJET- Big Data Processes and Analysis using Hadoop FrameworkIRJET Journal
 
Agile data lake? An oxymoron?
Agile data lake? An oxymoron?Agile data lake? An oxymoron?
Agile data lake? An oxymoron?samthemonad
 
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010Bhupesh Bansal
 

Similar a Sybase IQ ile Analitik Platform (20)

A Survey on Data Mapping Strategy for data stored in the storage cloud 111
A Survey on Data Mapping Strategy for data stored in the storage cloud  111A Survey on Data Mapping Strategy for data stored in the storage cloud  111
A Survey on Data Mapping Strategy for data stored in the storage cloud 111
 
Unstructured Datasets Analysis: Thesaurus Model
Unstructured Datasets Analysis: Thesaurus ModelUnstructured Datasets Analysis: Thesaurus Model
Unstructured Datasets Analysis: Thesaurus Model
 
Facade
FacadeFacade
Facade
 
Sap Interview Questions - Part 1
Sap Interview Questions - Part 1Sap Interview Questions - Part 1
Sap Interview Questions - Part 1
 
Testistanbul 2016 - Keynote: "Performance Testing of Big Data" by Roland Leusden
Testistanbul 2016 - Keynote: "Performance Testing of Big Data" by Roland LeusdenTestistanbul 2016 - Keynote: "Performance Testing of Big Data" by Roland Leusden
Testistanbul 2016 - Keynote: "Performance Testing of Big Data" by Roland Leusden
 
NETWORK TRAFFIC ANALYSIS: HADOOP PIG VS TYPICAL MAPREDUCE
NETWORK TRAFFIC ANALYSIS: HADOOP PIG VS TYPICAL MAPREDUCENETWORK TRAFFIC ANALYSIS: HADOOP PIG VS TYPICAL MAPREDUCE
NETWORK TRAFFIC ANALYSIS: HADOOP PIG VS TYPICAL MAPREDUCE
 
A Comparative Survey Based on Processing Network Traffic Data Using Hadoop Pi...
A Comparative Survey Based on Processing Network Traffic Data Using Hadoop Pi...A Comparative Survey Based on Processing Network Traffic Data Using Hadoop Pi...
A Comparative Survey Based on Processing Network Traffic Data Using Hadoop Pi...
 
NETWORK TRAFFIC ANALYSIS: HADOOP PIG VS TYPICAL MAPREDUCE
NETWORK TRAFFIC ANALYSIS: HADOOP PIG VS TYPICAL MAPREDUCENETWORK TRAFFIC ANALYSIS: HADOOP PIG VS TYPICAL MAPREDUCE
NETWORK TRAFFIC ANALYSIS: HADOOP PIG VS TYPICAL MAPREDUCE
 
Cloud-Native Patterns for Data-Intensive Applications
Cloud-Native Patterns for Data-Intensive ApplicationsCloud-Native Patterns for Data-Intensive Applications
Cloud-Native Patterns for Data-Intensive Applications
 
Database Integrated Analytics using R InitialExperiences wi
Database Integrated Analytics using R InitialExperiences wiDatabase Integrated Analytics using R InitialExperiences wi
Database Integrated Analytics using R InitialExperiences wi
 
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
 
Aucfanlab Datalake - Big Data Management Platform -
Aucfanlab Datalake - Big Data Management Platform -Aucfanlab Datalake - Big Data Management Platform -
Aucfanlab Datalake - Big Data Management Platform -
 
Otimizações de Projetos de Big Data, Dw e AI no Microsoft Azure
Otimizações de Projetos de Big Data, Dw e AI no Microsoft AzureOtimizações de Projetos de Big Data, Dw e AI no Microsoft Azure
Otimizações de Projetos de Big Data, Dw e AI no Microsoft Azure
 
Agile data warehousing
Agile data warehousingAgile data warehousing
Agile data warehousing
 
Performance evaluation and estimation model using regression method for hadoo...
Performance evaluation and estimation model using regression method for hadoo...Performance evaluation and estimation model using regression method for hadoo...
Performance evaluation and estimation model using regression method for hadoo...
 
Maharshi_Amin_416
Maharshi_Amin_416Maharshi_Amin_416
Maharshi_Amin_416
 
IRJET- Big Data Processes and Analysis using Hadoop Framework
IRJET- Big Data Processes and Analysis using Hadoop FrameworkIRJET- Big Data Processes and Analysis using Hadoop Framework
IRJET- Big Data Processes and Analysis using Hadoop Framework
 
Agile data lake? An oxymoron?
Agile data lake? An oxymoron?Agile data lake? An oxymoron?
Agile data lake? An oxymoron?
 
Event Driven Architecture
Event Driven ArchitectureEvent Driven Architecture
Event Driven Architecture
 
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010
 

Más de Sybase Türkiye

Italya Posta Teskilatı Sybase Afaria Kullaniyot
Italya Posta Teskilatı Sybase Afaria KullaniyotItalya Posta Teskilatı Sybase Afaria Kullaniyot
Italya Posta Teskilatı Sybase Afaria KullaniyotSybase Türkiye
 
SAP REAL TIME DATA PLATFORM WITH SYBASE SUPPORT
SAP REAL TIME DATA PLATFORM WITH SYBASE SUPPORTSAP REAL TIME DATA PLATFORM WITH SYBASE SUPPORT
SAP REAL TIME DATA PLATFORM WITH SYBASE SUPPORTSybase Türkiye
 
SAP Sybase Event Streaming Processing
SAP Sybase Event Streaming ProcessingSAP Sybase Event Streaming Processing
SAP Sybase Event Streaming ProcessingSybase Türkiye
 
Sybase IQ ile Muhteşem Performans
Sybase IQ ile Muhteşem PerformansSybase IQ ile Muhteşem Performans
Sybase IQ ile Muhteşem PerformansSybase Türkiye
 
Mobil Uygulama Geliştirme Klavuzu
Mobil Uygulama Geliştirme KlavuzuMobil Uygulama Geliştirme Klavuzu
Mobil Uygulama Geliştirme KlavuzuSybase Türkiye
 
Mobile Device Management for Dummies
Mobile Device Management for DummiesMobile Device Management for Dummies
Mobile Device Management for DummiesSybase Türkiye
 
SAP Sybase Data Management
SAP Sybase Data Management SAP Sybase Data Management
SAP Sybase Data Management Sybase Türkiye
 
Appcelerator report-q2-2012
Appcelerator report-q2-2012Appcelerator report-q2-2012
Appcelerator report-q2-2012Sybase Türkiye
 
Sybase PowerDesigner Vs Erwin
Sybase PowerDesigner Vs ErwinSybase PowerDesigner Vs Erwin
Sybase PowerDesigner Vs ErwinSybase Türkiye
 
Elastic Platform for Business Analytics
Elastic Platform for Business AnalyticsElastic Platform for Business Analytics
Elastic Platform for Business AnalyticsSybase Türkiye
 
Information Architech and DWH with PowerDesigner
Information Architech and DWH with PowerDesignerInformation Architech and DWH with PowerDesigner
Information Architech and DWH with PowerDesignerSybase Türkiye
 
Real-Time Loading to Sybase IQ
Real-Time Loading to Sybase IQReal-Time Loading to Sybase IQ
Real-Time Loading to Sybase IQSybase Türkiye
 
Mobile Application Strategy
Mobile Application StrategyMobile Application Strategy
Mobile Application StrategySybase Türkiye
 
Mobile is the new face of business
Mobile is the new face of businessMobile is the new face of business
Mobile is the new face of businessSybase Türkiye
 
Sybase SUP Mobil Uygulama Geliştirme Genel Bilgilendirme
Sybase SUP Mobil Uygulama Geliştirme Genel BilgilendirmeSybase SUP Mobil Uygulama Geliştirme Genel Bilgilendirme
Sybase SUP Mobil Uygulama Geliştirme Genel BilgilendirmeSybase Türkiye
 

Más de Sybase Türkiye (20)

Italya Posta Teskilatı Sybase Afaria Kullaniyot
Italya Posta Teskilatı Sybase Afaria KullaniyotItalya Posta Teskilatı Sybase Afaria Kullaniyot
Italya Posta Teskilatı Sybase Afaria Kullaniyot
 
SAP REAL TIME DATA PLATFORM WITH SYBASE SUPPORT
SAP REAL TIME DATA PLATFORM WITH SYBASE SUPPORTSAP REAL TIME DATA PLATFORM WITH SYBASE SUPPORT
SAP REAL TIME DATA PLATFORM WITH SYBASE SUPPORT
 
SAP Sybase Event Streaming Processing
SAP Sybase Event Streaming ProcessingSAP Sybase Event Streaming Processing
SAP Sybase Event Streaming Processing
 
Sybase IQ ile Muhteşem Performans
Sybase IQ ile Muhteşem PerformansSybase IQ ile Muhteşem Performans
Sybase IQ ile Muhteşem Performans
 
Mobil Uygulama Geliştirme Klavuzu
Mobil Uygulama Geliştirme KlavuzuMobil Uygulama Geliştirme Klavuzu
Mobil Uygulama Geliştirme Klavuzu
 
Mobile Device Management for Dummies
Mobile Device Management for DummiesMobile Device Management for Dummies
Mobile Device Management for Dummies
 
SAP Sybase Data Management
SAP Sybase Data Management SAP Sybase Data Management
SAP Sybase Data Management
 
Sybase IQ ve Big Data
Sybase IQ ve Big DataSybase IQ ve Big Data
Sybase IQ ve Big Data
 
SAP EIM
SAP EIM SAP EIM
SAP EIM
 
Appcelerator report-q2-2012
Appcelerator report-q2-2012Appcelerator report-q2-2012
Appcelerator report-q2-2012
 
Sybase PowerDesigner Vs Erwin
Sybase PowerDesigner Vs ErwinSybase PowerDesigner Vs Erwin
Sybase PowerDesigner Vs Erwin
 
Elastic Platform for Business Analytics
Elastic Platform for Business AnalyticsElastic Platform for Business Analytics
Elastic Platform for Business Analytics
 
Actionable Architecture
Actionable Architecture Actionable Architecture
Actionable Architecture
 
Information Architech and DWH with PowerDesigner
Information Architech and DWH with PowerDesignerInformation Architech and DWH with PowerDesigner
Information Architech and DWH with PowerDesigner
 
Why modeling matters ?
Why modeling matters ?Why modeling matters ?
Why modeling matters ?
 
Welcome introduction
Welcome introductionWelcome introduction
Welcome introduction
 
Real-Time Loading to Sybase IQ
Real-Time Loading to Sybase IQReal-Time Loading to Sybase IQ
Real-Time Loading to Sybase IQ
 
Mobile Application Strategy
Mobile Application StrategyMobile Application Strategy
Mobile Application Strategy
 
Mobile is the new face of business
Mobile is the new face of businessMobile is the new face of business
Mobile is the new face of business
 
Sybase SUP Mobil Uygulama Geliştirme Genel Bilgilendirme
Sybase SUP Mobil Uygulama Geliştirme Genel BilgilendirmeSybase SUP Mobil Uygulama Geliştirme Genel Bilgilendirme
Sybase SUP Mobil Uygulama Geliştirme Genel Bilgilendirme
 

Último

How Accurate are Carbon Emissions Projections?
How Accurate are Carbon Emissions Projections?How Accurate are Carbon Emissions Projections?
How Accurate are Carbon Emissions Projections?IES VE
 
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...DianaGray10
 
UiPath Studio Web workshop series - Day 8
UiPath Studio Web workshop series - Day 8UiPath Studio Web workshop series - Day 8
UiPath Studio Web workshop series - Day 8DianaGray10
 
Salesforce Miami User Group Event - 1st Quarter 2024
Salesforce Miami User Group Event - 1st Quarter 2024Salesforce Miami User Group Event - 1st Quarter 2024
Salesforce Miami User Group Event - 1st Quarter 2024SkyPlanner
 
UiPath Studio Web workshop series - Day 7
UiPath Studio Web workshop series - Day 7UiPath Studio Web workshop series - Day 7
UiPath Studio Web workshop series - Day 7DianaGray10
 
Artificial Intelligence & SEO Trends for 2024
Artificial Intelligence & SEO Trends for 2024Artificial Intelligence & SEO Trends for 2024
Artificial Intelligence & SEO Trends for 2024D Cloud Solutions
 
Building Your Own AI Instance (TBLC AI )
Building Your Own AI Instance (TBLC AI )Building Your Own AI Instance (TBLC AI )
Building Your Own AI Instance (TBLC AI )Brian Pichman
 
OpenShift Commons Paris - Choose Your Own Observability Adventure
OpenShift Commons Paris - Choose Your Own Observability AdventureOpenShift Commons Paris - Choose Your Own Observability Adventure
OpenShift Commons Paris - Choose Your Own Observability AdventureEric D. Schabell
 
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve DecarbonizationUsing IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve DecarbonizationIES VE
 
Cybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptxCybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptxGDSC PJATK
 
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCostKubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCostMatt Ray
 
9 Steps For Building Winning Founding Team
9 Steps For Building Winning Founding Team9 Steps For Building Winning Founding Team
9 Steps For Building Winning Founding TeamAdam Moalla
 
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...Will Schroeder
 
Machine Learning Model Validation (Aijun Zhang 2024).pdf
Machine Learning Model Validation (Aijun Zhang 2024).pdfMachine Learning Model Validation (Aijun Zhang 2024).pdf
Machine Learning Model Validation (Aijun Zhang 2024).pdfAijun Zhang
 
UiPath Studio Web workshop series - Day 6
UiPath Studio Web workshop series - Day 6UiPath Studio Web workshop series - Day 6
UiPath Studio Web workshop series - Day 6DianaGray10
 
Designing A Time bound resource download URL
Designing A Time bound resource download URLDesigning A Time bound resource download URL
Designing A Time bound resource download URLRuncy Oommen
 
VoIP Service and Marketing using Odoo and Asterisk PBX
VoIP Service and Marketing using Odoo and Asterisk PBXVoIP Service and Marketing using Odoo and Asterisk PBX
VoIP Service and Marketing using Odoo and Asterisk PBXTarek Kalaji
 
UiPath Solutions Management Preview - Northern CA Chapter - March 22.pdf
UiPath Solutions Management Preview - Northern CA Chapter - March 22.pdfUiPath Solutions Management Preview - Northern CA Chapter - March 22.pdf
UiPath Solutions Management Preview - Northern CA Chapter - March 22.pdfDianaGray10
 
Crea il tuo assistente AI con lo Stregatto (open source python framework)
Crea il tuo assistente AI con lo Stregatto (open source python framework)Crea il tuo assistente AI con lo Stregatto (open source python framework)
Crea il tuo assistente AI con lo Stregatto (open source python framework)Commit University
 

Último (20)

How Accurate are Carbon Emissions Projections?
How Accurate are Carbon Emissions Projections?How Accurate are Carbon Emissions Projections?
How Accurate are Carbon Emissions Projections?
 
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
 
UiPath Studio Web workshop series - Day 8
UiPath Studio Web workshop series - Day 8UiPath Studio Web workshop series - Day 8
UiPath Studio Web workshop series - Day 8
 
Salesforce Miami User Group Event - 1st Quarter 2024
Salesforce Miami User Group Event - 1st Quarter 2024Salesforce Miami User Group Event - 1st Quarter 2024
Salesforce Miami User Group Event - 1st Quarter 2024
 
UiPath Studio Web workshop series - Day 7
UiPath Studio Web workshop series - Day 7UiPath Studio Web workshop series - Day 7
UiPath Studio Web workshop series - Day 7
 
Artificial Intelligence & SEO Trends for 2024
Artificial Intelligence & SEO Trends for 2024Artificial Intelligence & SEO Trends for 2024
Artificial Intelligence & SEO Trends for 2024
 
Building Your Own AI Instance (TBLC AI )
Building Your Own AI Instance (TBLC AI )Building Your Own AI Instance (TBLC AI )
Building Your Own AI Instance (TBLC AI )
 
OpenShift Commons Paris - Choose Your Own Observability Adventure
OpenShift Commons Paris - Choose Your Own Observability AdventureOpenShift Commons Paris - Choose Your Own Observability Adventure
OpenShift Commons Paris - Choose Your Own Observability Adventure
 
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve DecarbonizationUsing IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
 
Cybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptxCybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptx
 
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCostKubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
 
20150722 - AGV
20150722 - AGV20150722 - AGV
20150722 - AGV
 
9 Steps For Building Winning Founding Team
9 Steps For Building Winning Founding Team9 Steps For Building Winning Founding Team
9 Steps For Building Winning Founding Team
 
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
 
Machine Learning Model Validation (Aijun Zhang 2024).pdf
Machine Learning Model Validation (Aijun Zhang 2024).pdfMachine Learning Model Validation (Aijun Zhang 2024).pdf
Machine Learning Model Validation (Aijun Zhang 2024).pdf
 
UiPath Studio Web workshop series - Day 6
UiPath Studio Web workshop series - Day 6UiPath Studio Web workshop series - Day 6
UiPath Studio Web workshop series - Day 6
 
Designing A Time bound resource download URL
Designing A Time bound resource download URLDesigning A Time bound resource download URL
Designing A Time bound resource download URL
 
VoIP Service and Marketing using Odoo and Asterisk PBX
VoIP Service and Marketing using Odoo and Asterisk PBXVoIP Service and Marketing using Odoo and Asterisk PBX
VoIP Service and Marketing using Odoo and Asterisk PBX
 
UiPath Solutions Management Preview - Northern CA Chapter - March 22.pdf
UiPath Solutions Management Preview - Northern CA Chapter - March 22.pdfUiPath Solutions Management Preview - Northern CA Chapter - March 22.pdf
UiPath Solutions Management Preview - Northern CA Chapter - March 22.pdf
 
Crea il tuo assistente AI con lo Stregatto (open source python framework)
Crea il tuo assistente AI con lo Stregatto (open source python framework)Crea il tuo assistente AI con lo Stregatto (open source python framework)
Crea il tuo assistente AI con lo Stregatto (open source python framework)
 

Sybase IQ ile Analitik Platform

  • 3. I was recently asked to build an analyti- 3. Provide a way to automate the An incidental but equally useful cal platform for a project. But what is an running of the statistical data models, consequence of using a column-store analytical platform? The client, a retailer, once developed, so that they can be database such as SAP Sybase IQ is that described it as a database where it could run without engaging the statistical there is no advantage in creating a star store data and as a front end where it development resources. schema as a data model. Instead, hold- could do statistical work. This work Of course, time was of the essence ing all the data in one large wide table is would range from simple means and and costs had to be as low as possible – - standard deviations through to more but we’ve come to expect that with ing each column with a key means that complex predictive analytics that could the underlying storage of data is a star be used, for example, to analyze past Step 1: The database schema. Creating a star schema in a performance of a customer to assess the Our chosen solution for the database column-store database rather than a likelihood that the customer will exhibit a was an SAP® Sybase® IQ database, a large single table would mean incurring future behavior. Or it might involve using technology our client was already famil- unnecessary additional join and process- models to classify customers into groups iar with. SAP Sybase IQ is a column-store ing overhead. and ultimately to bring the two processes database. This means that instead of As a result of choosing SAP together into an area known as decision storing all the data in its rows, as many Sybase IQ’s column-store database models. The customer had also come up other databases do, the data is organized we are able to have a data model that with an innovative way to resource the on disk by the columns. For example if a consists of a number of simple single table data sets (one table for each work placements to master’s degree have the text of each country (for exam- students studying statistics at the local ple, “United Kingdom”) stored many that is quick to load and to query. university and arranged for them to work times. In a column-store database the It should be noted that this type of with the customer insight team to text is stored only once and given a describe and develop the advanced unique ID. This is repeated for each online transaction processing (OLTP) models. All the customer needed was a column and therefore the “row” of data applications because of the cost of doing platform to work with. consists of a list of IDs linked to the data small inserts and updates. However, this From a systems architecture and held for each column. is not relevant for this particular development perspective, we could describe the requirements in three rela- reporting and analytical databases. The solution can be deployed only on tively simple statements: a Linux platform. We use Linux for three 1. Build a database with a very simple reasons. First, RStudio Server Edition is data model that could be easily used. In our example, “United Kingdom” not yet available for Microsoft Windows. loaded, that was capable of support- would occupy 14 bytes, while the ID Second, precompiled packages for all ing high-performance queries, and might occupy only 1 byte – reducing the elements of the solution on Linux reduce that did not consume a massive storage for that one value in that one amount of disk space. It would also column by a ratio of 14:1 – and this environments are normally cheaper than ideally be capable of being placed in Windows environments due to the cost the cloud. the data. Furthermore, because there is of the operating system license. We 2. Create a Web-based interface that less data on the disk, the time taken to chose CentOS because it is a Red Hat would allow users to securely log on, read the data from disk and to process derivative that is free. to write statistical programs that One additional advantage of this solu- could use the database as a source of which massively speeds up the queries tion for some organizations is the ability data, and to output reports and graph- too. Finally, each column is already to deploy it in the cloud. Since the solu- ics and well as to populate other indexed, which again helps the overall - tables (for example, target lists) as a query speed. ered, and since all querying is done via a result of statistical models. Web interface, it is possible to use any SAP White Paper – Building an Analytical Platform 3
  • 4. colocation or cloud-based hosting your environment, but these are well At this point data has to be loaded and provider. Colocation or cloud deploy- documented on the source Web sites the statisticians can get to work. and in general automatically download if Obviously this is more time consuming systems management overhead, and you are using a tool such as yum. than the build, and over the days and access for both data delivery and data The next step was to get access to the weeks the analysts created their models access. The system requires SSH access data held in our SAP Sybase IQ server. and produced the results. for management; FTP, SFTP, or SCP for This proved to also be very straightfor- For this exercise we used our in-house ward. There is a SAP Sybase white paper extract, transform, and load (ETL) tool to port open. The RStudio server uses the create a repeatable data extraction and server login accounts for security but load process, but it would have been can also be tied to existing LDAP describes the process that can be simply possible to use any of a wide range of infrastructure. stated as: tools that are available for this process Step 2: Statistical tools and Web Install the R JDBC package Step 3: Automatically running the interface Set up a JDBC connection statistical models There are a number of statistical tools Establish your connection Eventually a number of models for in the market. Most are very expensive, Query the table analyzing the data had been created and prohibitively so in this case, and the We now have an R object that contains we were ready to move into a production associated skills are hard to come by data sourced from SAP Sybase IQ that environment. We automated the load of and expensive. However, since 1993 an we can work with. And what is amazing is the data into the agreed single-table open-source programming language that it took me less than half a day to structure and wanted to also run the called R (www.r-project.org) for statisti- build the platform from scratch. data models. cal computing and graphics has been under development. It is now widely used among statisticians for developing statis- tical software and data analysis, is used by many universities, and is predicted to Analytical Platform Server become the most widely used statistical package by 2015. The R project provides R Studio a command line and graphical interface R Server Edition as well as a large open-source library of useful routines (http:/ /cran.r-project. R/JDBC org) and it is available as packaged soft- Connection ware for most platforms including Linux. In addition, a second open-source proj- SAP ect called RStudio (http:/ /rstudio.org) Sybase provides a single integrated development R/JDBC Connection IQ environment for R and can be deployed (S) FTP/SCP on a local machine or as a Web-based File Delivery Write to service using the server’s security model. Database Any Network In this case, we implemented the server Connected Computer Read File ETL with a Browser Accessing edition in order to make the entire envi- the R Studio Server Edition Engine ronment Web based. So in two simple steps (download and install R, followed by download and install RStudio) we can install a full CentOS Web-based statistical environment. Note ©2012 Data Management & Warehousing packages may be required depending on
  • 5. SAP Sybase IQ has the functionality ConCluSionS ABout the Author David Walker has been involved with business These C++ programs “talk” to a process Business intelligence requirements are intelligence and data warehousing for over known as Rserve, which in turn executes changing and business users are moving the R program and returns the results more and more from historical reporting to SAP Sybase IQ. This allows R func- into predictive analytics in an attempt to tions to be embedded directly into SAP get both a better and deeper under- Data Management & Warehousing (http:// datamgmt.com) in 1995. Sybase IQ SQL commands. While setting standing of their data. this up requires a little more program- Traditionally, building an analytical David and his team have worked around ming experience, it does mean that all platform has required an expensive infra- the world on projects designed to deliver processing can be done within SAP structure and a considerable amount of Sybase IQ. time for setup and deployment. converting data into information and by Conversely, it is possible to run R from By combining the high performance, exploit that information. the command line and call the program low footprint of SAP Sybase IQ with the that in turn uses the RJDBC connection open-source R and RStudio statistical David’s project work has given him experi- to read and write data to the database. packages, it is possible to quickly deploy ence in a wide variety of industries including Having a choice of methods is very an analytical platform in the cloud for - facturing, transportation, and public sector helpful as it means that it can be inte- which there are readily available skills. as well as a broad and deep knowledge of grated with the ETL environment in the This infrastructure can be used both business intelligence and data warehousing most appropriate way. If the ETL tool for rapid prototyping on analytical technologies. models and for running completed function (UDF) route is the most attrac- models on new data sets to deliver tive. However, if the ETL tool supports greater insight into the data. host callouts (as ours does) then running R programs from a command line callout is quicker than developing the UDF. SAP White Paper – Building an Analytical Platform 5
  • 6. www.sap.com/contactsap 12/08 ©2012 SAP AG. All rights reserved. SAP, R/3, SAP NetWeaver, Duet, PartnerEdge, ByDesign, SAP BusinessObjects Explorer, StreamWork, SAP HANA, and other SAP products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of SAP AG in Germany and other countries. Business Objects and the Business Objects logo, BusinessObjects, Crystal Reports, Crystal Decisions, Web Intelligence, Xcelsius, and other Business Objects products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of Business Objects Software Ltd. Business Objects is an SAP company. Sybase and Adaptive Server, iAnywhere, Sybase 365, SQL Anywhere, and other Sybase products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of Sybase Inc. Sybase is an SAP company. Crossgate, m@gic EDDY, B2B 360°, and B2B 360° Services are registered trademarks of Crossgate AG in Germany and other countries. Crossgate is an SAP company. All other product and service names mentioned are the trademarks of their respective companies. Data contained in this document serves These materials are subject to change without notice. These materials for informational purposes only, without representation or warranty of any kind, and SAP Group shall not be liable for errors or omissions with respect to the materials. The only warranties for SAP Group products and services are those that are set forth in the express warranty statements accompanying such products and services, if any. Nothing herein should be construed as constituting an additional warranty.