Why Serve an Undercooked Big Data Solution? Go for the Well-Baked One
Introduction
Today, Big Data is the buzzword in the information technology domain. There is a great deal of
discussion, and at the same time confusion, about this revolution called Big Data. In the client’s mind
(most of the time), Big Data can do wonders with their data through its intelligent processing power and
analytical features. Most often, the client is on cloud nine, visualizing a utopian
environment where everything gets resolved.
Through this article, I want to deconstruct that belief. I want to share what I have learned about how to
approach a Big Data solution correctly, so that the insights it delivers are in line with the client’s expectations.
The intention of this blog is not to provide a technical solution for Big Data problems, but a process
for approaching a Big Data solution in a smart way.
Understand the Client Requirement
Often we come across a successfully implemented Big Data solution with optimal processing time, jazzy
visualization and analytical capabilities that nevertheless fails to meet the client’s expectations.
Therefore, we need to understand why a client wants to go for a Big Data solution. Without
this insight, we may not be able to produce the desired output in line with what they anticipate.
Image 1: Client mind mapping
Once we understand the reason behind the client’s decision to go for a Big Data solution, it’s
time to extrapolate from that study. We have to come up with a blueprint that
highlights our suggestions and recommendations for executing the Big Data solution.
Image 2: The inference from the client’s expectation and outcome
A First Person Account
Let me explain this concept with a real-world example and walk you through a smart way to
execute the solution and present the outcomes.
The table below gives a brief overview of a client’s problem and how they expect it to be resolved.
Client Situation: A leading retailer in the USA wants to enhance its existing Decision Support System (DSS) application. The DSS application presently serves a huge user base (50+) on a data volume of about 250 TB. It runs on a traditional RDBMS and is not able to scale to meet this expectation.
Client Requirement: According to the client’s IT Director, if their existing application is moved to Big Data (by hosting it on the Microsoft Azure cloud), it should resolve their problem on both cost and performance.
Image 3: Smart Way Inference
Points to ponder for implementing Big Data Solution
i) Baseline the Solution Coverage
Based on the client situation described above, we need to come up with a solution that meets the following
goals:
• Reduce the cost of data storage
• Provide a stable code base
• Implement a scalable solution
• Deliver a cost-effective application
From the inference above, I came up with two use cases (scenarios):
Use Case 1
i. Ingest one unit of the data set (1 TB) into the system. Repeat this procedure for 30 units.
ii. Run this use case on 10-, 50- and 100-node clusters.
Use Case 2
i. Read one unit of data through multiple requests. Treat these as supplier requests, which can
range in number from 1 to 40K.
ii. Run this use case on 10-, 50- and 100-node clusters.
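The two use cases can be written down as an explicit list of test runs before any cluster is provisioned. The sketch below is only an illustration: the cluster sizes, the 30 units and the 1 to 40K range come from the use cases, while the intermediate request levels and all of the names (`REQUEST_LEVELS`, `build_test_matrix`) are assumptions made for the example.

```python
from itertools import product

CLUSTER_SIZES = [10, 50, 100]   # node counts named in both use cases
INGEST_UNITS_TB = 30            # Use Case 1: thirty units of 1 TB each

# Use Case 2 only gives the range 1 to 40K; these sample points are an
# assumption chosen for illustration, not values from the original POC.
REQUEST_LEVELS = [1, 100, 1_000, 10_000, 40_000]


def build_test_matrix():
    """Return one dict per planned run, covering both use cases."""
    runs = []
    # Use Case 1: one ingest run per cluster size.
    for nodes in CLUSTER_SIZES:
        runs.append({"use_case": "ingest", "nodes": nodes,
                     "units_tb": INGEST_UNITS_TB})
    # Use Case 2: every cluster size paired with every request level.
    for nodes, requests in product(CLUSTER_SIZES, REQUEST_LEVELS):
        runs.append({"use_case": "read", "nodes": nodes,
                     "requests": requests})
    return runs


if __name__ == "__main__":
    for run in build_test_matrix():
        print(run)
```

Listing the runs up front makes it easy to agree with the client on how many executions the POC will involve before any environment cost is incurred.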
ii) Configuring the Environment
This is yet another crucial phase of implementation. Identify the optimal infrastructure to validate the
solution. Once we have identified the vendor (e.g., Microsoft) or an infrastructure provider (e.g.,
Big Decisions or the client’s IT department) to implement the Big Data solution, keep the following aspects in
mind:
• Is the infrastructure stable? Validate the following:
   - CPU and memory
   - I/O
   - Storage (in our case it should hold 250 TB)
• Do you have adequate access rights to execute? Check the following:
   - Edge node access to the cluster
   - Root folder access
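Some of these checks can be scripted on the edge node before the POC starts. The following is a minimal sketch using only the Python standard library; the function name `check_environment` and its thresholds are hypothetical, and it inspects only the local machine. Capacity on a distributed store such as the 250 TB target would have to be checked through that storage service's own tooling, which is not shown here.

```python
import os
import shutil


def check_environment(path=".", min_cpus=1, min_free_bytes=0):
    """Run basic local checks and return a dict of pass/fail flags.

    path           -- folder the POC code needs to read and write
    min_cpus       -- minimum logical CPU count expected on this node
    min_free_bytes -- minimum free space expected on the disk holding path
    """
    usage = shutil.disk_usage(path)  # named tuple: total, used, free
    return {
        "cpu_ok": (os.cpu_count() or 0) >= min_cpus,
        "storage_ok": usage.free >= min_free_bytes,
        "read_ok": os.access(path, os.R_OK),
        "write_ok": os.access(path, os.W_OK),
    }


if __name__ == "__main__":
    # Hypothetical thresholds: 8 CPUs and 100 GiB of free local disk.
    report = check_environment(".", min_cpus=8, min_free_bytes=100 * 1024**3)
    for name, passed in report.items():
        print(f"{name}: {'PASS' if passed else 'FAIL'}")
```

Running a script like this on day one surfaces missing access rights early, when they are still cheap to fix.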
iii) Identifying the Core Team
No solution can be productive unless we get the right people. We need to engage
the right resources to run the solution.
In this sample POC (Proof of Concept), we would need an expert or a COE to support the core team.
There is certainly no time for training, so we have to identify a core team that can get up to
speed in no time.
iv) All Set to Go
Once there is clarity on the objective of the exercise (here, the POC), with the right resources and an optimal
environment, you can expect the outcome to align with the client’s expectation. It is now time to draft a
plan for implementing the solution.
For instance, you can determine the storage types (Blob, Azure Data Lake Store) and tool stacks
(Spark, Hive, ADLF) for data processing and querying. Create a code base and execute it over the
environments.
NOTE: It is mandatory to complete the above steps before moving on to the next one. All of the
above activities can be executed in parallel.
v) Present the Insights and Recommendations
With clarity on the objectives, it is easier to meet the client’s expectation when implementing
the Big Data solution. The last phase of the exercise captures statistics by running the code on the
configured environments. We derive metrics and insights from the execution outcome and present them
to the client with recommendations. Refer to the image below for a sample outcome.
I would like to conclude with the saying, “Don’t just meet the expectations. Exceed them.” And with
this approach you are bound to exceed the expectations of the client.
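One simple way to turn the captured statistics into comparable metrics is to compute throughput per cluster size. This is a sketch under assumptions: the record fields (`nodes`, `units_tb`, `elapsed_seconds`) and both function names are hypothetical, and the input values in the usage section are placeholders, not measurements from the POC described here.

```python
def throughput_tb_per_hour(units_tb, elapsed_seconds):
    """Convert a volume in TB and a duration in seconds to TB per hour."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return units_tb / (elapsed_seconds / 3600)


def summarize(runs):
    """Return one summary row per run, ordered by cluster size.

    Each run is a dict with the keys nodes, units_tb and elapsed_seconds.
    """
    rows = []
    for run in sorted(runs, key=lambda r: r["nodes"]):
        rate = throughput_tb_per_hour(run["units_tb"], run["elapsed_seconds"])
        rows.append({
            "nodes": run["nodes"],
            "tb_per_hour": rate,
            # Per-node rate shows whether adding nodes still pays off.
            "tb_per_hour_per_node": rate / run["nodes"],
        })
    return rows


if __name__ == "__main__":
    # Placeholder timings for illustration only; replace with real results.
    example_runs = [
        {"nodes": 50, "units_tb": 30, "elapsed_seconds": 3600},
        {"nodes": 10, "units_tb": 30, "elapsed_seconds": 7200},
    ]
    for row in summarize(example_runs):
        print(row)
```

Presenting the per-node rate next to the total rate helps the client weigh performance against cost, which were the two concerns raised in the requirement.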

A practice to perfect the big data solution
