BigDoor is an online marketing platform that partners with brands to offer loyalty programs where users earn virtual currency for actions and can exchange the currency for rewards. BigDoor aims to increase registration, engagement, and loyalty for its partners. It faces challenges in aggregating and analyzing user data from different sources like transactional databases and log files to evaluate its goals and meet partner metrics.
BigDoor Data Goals
Prove that we are meeting Partner goals
Registration: Are people registering?
Registration rate of control and exposed groups
Engagement: Are participants more engaged?
Actions per user in control and exposed groups
Loyalty: Do participants return?
Daily unique users vs. monthly unique users
Data Challenges
Peak: ~800 requests per second
Business data -> Transactional SQL DB
Optimized for write speed and flexibility
Unregistered user requests -> Apache logs
Flat text files
Need all data in one place
Fast queries
Easy to slice and dice
Drop us a line any time!
Contact: eva@bigdoor.com
Speaker notes
Thank you for having me! My name is Eva Monsen and I am a business intelligence developer at BigDoor. I am responsible for making sure BigDoor’s data can answer questions. I’ll be giving a brief real-world example of “big data”. I’ll give an introduction to BigDoor, its customers, and its product, and then I’ll talk about the BigDoor data pipeline, which is the technology path that data takes to get from its raw form to reports and visualizations.
What is BigDoor? BigDoor helps large companies do marketing through loyalty programs. These partner companies already have an online presence and are looking to grow their online user base by acquiring new users and engaging and retaining them long term. BigDoor adds to the partner website a program where users earn virtual currency and exchange that currency for rewards.
Here’s an example. One of our partner companies is PacSun, a fairly large clothing retailer with both brick-and-mortar stores and a website. PacSun wants to increase online sales and create relationships with its online customers. So they have added BigDoor’s product to their website. BigDoor is a white-label product, which means it appears to be integrated with the rest of the website, but under the covers these widgets are run by BigDoor and make web API requests to BigDoor servers. Those web requests form the basis for the raw data I will be talking about. The highlighted areas are BigDoor JavaScript widgets. On the top is a user profile picture, their currency balance (which PacSun has chosen to call “points”), and some links. On the bottom is what we call the “task bar” or “dock”, which shows some actions that the user can take to earn points.
Here is PacSun’s rewards page. Users can exchange the virtual currency they have earned for items appearing on this page. Rewards include sweepstakes entries, coupons, and physical merchandise. The rewards list is also served by BigDoor servers.
BigDoor receives web API requests whenever a user sees our widgets, registers, logs in, logs out, or takes an action that affects their currency balance, such as completing a task or redeeming a reward. These are some example questions we ask of the data from those requests, and some metrics that we use to answer them. One way we can measure the answers is to show some users the BigDoor UI and not others. Those shown the BigDoor UI are the “exposed” group and those not shown it are the “control” group. We can prove whether BigDoor is effective at driving registrations, for example, by looking at the difference in registration rates between the control and exposed groups. If BigDoor is doing its job right, the registration rate should be higher in the exposed group. We also use control and exposed groups to measure user engagement, by looking at the number of actions a user takes while logged in. Whether users return is currently measured by comparing the number of unique visitors per day to the number of unique visitors for the trailing 30 days. Equal numbers would usually mean 100% of users are returning to the site daily.
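As a toy illustration of the control/exposed comparison, here is a minimal Python sketch that computes a per-group registration rate. The event records and field names are invented for illustration; the real numbers come from queries against our warehouse, not from in-memory lists like this.

```python
# Hypothetical sketch of the control vs. exposed comparison.
# Event records and field names are invented for illustration.

def registration_rate(events, group):
    """Fraction of distinct visitors in `group` who registered."""
    visitors = {e["user"] for e in events if e["group"] == group}
    registered = {e["user"] for e in events
                  if e["group"] == group and e["type"] == "register"}
    return len(registered) / len(visitors) if visitors else 0.0

events = [
    {"user": "u1", "group": "exposed", "type": "visit"},
    {"user": "u1", "group": "exposed", "type": "register"},
    {"user": "u2", "group": "exposed", "type": "register"},
    {"user": "u3", "group": "exposed", "type": "visit"},
    {"user": "u4", "group": "control", "type": "register"},
    {"user": "u5", "group": "control", "type": "visit"},
    {"user": "u6", "group": "control", "type": "visit"},
]

print(registration_rate(events, "exposed"))  # 2 of 3 visitors registered
print(registration_rate(events, "control"))  # 1 of 3 visitors registered
```

In this made-up sample the exposed group registers at twice the control rate, which is the kind of difference that would suggest the widgets are doing their job.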
We answer these questions in the form of reports built using Tableau Software. This is just one example of such a report that shows the number of unique users per hour, per day, and per trailing month.
We face many challenges with the BigDoor data pipeline. One million requests per hour is actually a fairly small number in the big data world, but it is enough that we need to constantly load data so that our reporting can stay up to date. Most API requests by registered users result in updates or inserts to the transactional database, which is a MySQL database like you may have seen in your coursework. It keeps track of registered users’ profiles, currency balances, badges, reward redemptions, and so on. Requests by unregistered users only end up in our Apache logs, flat files with raw request data such as the query string. We want to combine all of the information in the Apache logs and the transactional database into one place, where it is easy and fast to query, and slice and dice by partner, date, user group, and so on.
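To make the flat-file side concrete, here is a small Python sketch of pulling fields of interest out of an Apache access-log line. The log format and the query parameters (partner_id, user_id, action) are assumptions for illustration, not BigDoor’s actual request schema.

```python
import re
from urllib.parse import urlparse, parse_qs

# Assumed Apache "common log"-style line; real field names differ.
LOG_RE = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<ts>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) \S+" (?P<status>\d{3}) \S+'
)

def parse_request(line):
    """Extract timestamp and hypothetical query params from one log line."""
    m = LOG_RE.match(line)
    if not m:
        return None  # malformed line; real pipelines count these
    qs = parse_qs(urlparse(m.group("path")).query)
    return {
        "timestamp": m.group("ts"),
        "partner_id": qs.get("partner_id", [None])[0],
        "user_id": qs.get("user_id", [None])[0],
        "action": qs.get("action", [None])[0],
    }

line = ('203.0.113.9 - - [01/Apr/2012:10:15:32 -0700] '
        '"GET /api/track?partner_id=42&user_id=u7&action=login HTTP/1.1" '
        '200 512')
print(parse_request(line))
```

Every request, registered or not, leaves a line like this, which is why the logs are the only record of unregistered users.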
Finally, the guts of the system. This is the pipeline: how data is written to our system and ultimately read out into reports.

First, all web requests go through our load balancer, which dispatches those requests to a number of identical hosts. (I’ve labeled these “app hosts” because that is the term we use internally.) I’ve shown three here, but we usually have many more than that. The app hosts write data to the transactional database, and they also send their Apache logs (the flat files) to a log processing server every two minutes.

The log processing server does some interesting work. Using multiple parallel processes, it parses every request in every Apache log and extracts some information of interest, such as the request timestamp, the partner id, the user id, and the type of action the user took. It produces output files to be consumed by the next step in the pipeline. This type of work is ideally suited to a distributed processing system like Hadoop, which is what Adam will be talking about next. Ours is a custom-built system, written in Python.

ETL stands for “Extract, Transform, Load”, which is what this box does to the data. In this case, it extracts data from the transactional database and from the log processing server, transforms that data through a series of steps, and loads it into a data warehouse. You can look at the data warehouse as essentially a record of all of the partner configuration, user information, and every action taken by every user. There are many existing ETL products out there; our ETL system is custom and written in Ruby.

Finally, ETL summarizes all of that data into a series of tables in what I am calling the aggregation database. These summary tables are very small in comparison to those in the data warehouse and are queried directly by Tableau to generate summary reports.
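The parallel parse-then-aggregate shape of the log processing step can be sketched like this in Python. The tab-separated line format, field order, and function names here are stand-ins for illustration, not the real system:

```python
from multiprocessing import Pool
from collections import Counter

# Toy version of the log processing step: fan lines out to worker
# processes, parse each one, then aggregate the results.

def extract_action(line):
    """Assumed line format: <timestamp>\t<partner_id>\t<user_id>\t<action>."""
    ts, partner, user, action = line.split("\t")
    return (partner, action)

def summarize(lines, workers=2):
    """Parse lines in parallel and count events per (partner, action)."""
    with Pool(workers) as pool:
        pairs = pool.map(extract_action, lines)
    return Counter(pairs)

if __name__ == "__main__":
    lines = [
        "2012-04-01T10:00:00\tpacsun\tu1\tlogin",
        "2012-04-01T10:00:02\tpacsun\tu2\tredeem",
        "2012-04-01T10:00:05\tpacsun\tu1\tlogin",
    ]
    print(summarize(lines))
```

A system like Hadoop generalizes exactly this map-then-reduce shape across many machines, which is why the notes call it a natural fit for this workload.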
I know I’ve gone through a lot of information very quickly, but I hope that you now have some idea of what happens to data in the real world. I’ll take a few questions now, and I am always checking email and would love to go into depth about any of this, or general software engineering questions, with you later. Thanks!