SlideShare una empresa de Scribd logo
1 de 2
SevenBestPracticesForRevolutionizingYourData
A complete revisitingof the corporate dataarchitecture anditsrespective bestpracticesisinorder
because of cloudcomputingandbigchangesin computingtechnologyandsoftware development.In
some cases,a complete inversionhasoccurred(inthe bestway) tosolve aparticularproblem.Tobe
competitive,organizationsneedtotake advantage of these new waysof doingthings.Massive data
and informationisoutthere if we canjustgrasp it. Below are some principlesandpracticesonhow
we can betterdeal withdatagoingforward.
Store First,Analyze Later:Diskischeap. We can’t alwayspredictwhatdatawill be importantlater.
Store firstand ask questionslater.Withscalable infrastructure andtoday’shardware economics,it’s
okayif a piece of data turnsout to neverbe used.The schemaflexibilityof NoSQLtechnology
facilitatesthis. Forexample,withacustomerdocument,addingadditional fieldsof informationata
laterdate is easyeven if theywere notenvisionedinitially.
DefaulttoReal-time:Historically,dataprocessingandanalysishasbeendone viabatchprocessing.
We defaultedtobatchprocessingbecause it’scomputationallyefficienthowever,givenMoore’slaw
and the passage of time we nowhave much more powerat our disposal.We canaffordto do more
workto get real-time answersinsteadof answerstomorrow. NoSQLandfast storage technologies
(suchas solidstate disk) make real-time possible. Yourorganizationshoulddeliver
recommendations,personalizationandbusinessmetricsimmediately.Defaulttoreal-timeandgoto
batch onlywhennecessary.
Structure Shouldn’tHoldYouBack: It’seasyto store basic stockinformation –for example (ticker,
high,low,close) –inany database. What abouta complete derivative security?How dowe store
that inthe database,especiallygiventhatnew securitiesare inventedall the time? A legal
contract’s terms? Howdo we store polymorphicinformationordatawe weren’taware of a priori?
Historicallyafewmethodshave beenmostcommon:the relational database fordatawithvery
precise structuring;completelyunstructureddata(“BLOBs”);andthingsinthe middle,suchas
spreadsheets. The lattertwoformatsare mostlyuselessfor integrationintoyourapplications,yet
the volume of suchdata is massive. Withthe rise of dynamicdocument-orienteddatamodels(using
JSON),semi-structured,complex structured,andpolymorphicdatacan be stored,accessedand
organizedjustasefficientlyasthe more rigidlystructureddatathathasbeenindatabases
traditionally.
AgilityIsKey:The software developmentworldhasmovedfromclassic“waterfall”software
developmentlifecyclestomore agile,oriterative,methodologies(forexample,Scrum). These
methods’rapiditerationallowsorganizationstodeliverfeaturesandenhancementstoendusers
quicklyandeffectively.Toworkthisway,we neednew toolsthatare agile-compatible — version
control,continuousintegration,programminglanguageshave adaptedalready. We needsimilar
adaptionbythe database if we want to make software developmentnimbleandproductive. NoSQL
technologiesfacilitate iterationinthe datamodel muchthe same way as youiterate withyourcode.
One Size Doesn’tFitAll:One-size-fits-all isover. Use multiple database technologiesaspartof your
standardenterprise technologyplatform. Youwon’twantdozens – that wouldbe far toocomplex –
but more than one isoptimal. A goodmodel forthe future isto have three primarytools:an DBMS,
a relational datawarehouse andaNoSQLdatabase. For each projector sub-problem, use whichever
tool is best. Augmentwithniche tools(e.g.,atime seriesdatabase) forspecial cases.The above
approach ishighlycompatiblewithservice-orientedarchitectures,whichyoushouldbe using.
Monolithichub-and-spoke architecturesleadtolate projectsandunchangeable systems. Instead,
buildwebserviceswitheachone potentiallyhavingitsowndatabase ordatamart behindit.
Go Commodity:The rise of commodityhardware asa viable productionplatformhasmade it
possible todeploymulti-node systemsquickly.Newerdatabase technologiesare designedwith
commodityserversinmind.Companiesare movingawayfrom“bigiron”serversandembrace this
approach.By adoptinga commodityserverdeploymentmodel,thereislessof adependencyon
proprietarymechanismsandvendorlock-inisoftenavoided.Findthe sweetspotonthe price-
performance curve andbuyserversof that size. Don’tbuy$1k servers,you’ll have toomanyto
manage (or evenplugin!) Butdon’tgo toobig either.Manyorganizationsare standardizingon$10k
commodityXeon(orAMD) basedserverswithgigabitEthernet.
Use SolidState Drives –a Lot: Traditional spinningdiskshave increasedincapacityanddata transfer
ratesby a factor of one thousand,yetthe randomi/otimeshave barelybudgedoveradecade. If
youare doingany randomI/Oat all,youshoulduse SSDsinstead.CommoditySATA-style SSDscan
worksurprisinglywell. Be sure to mirrorthem – theystill fail eventhoughthereare nomovingparts
(exceptelectrons!) Reserve20%+of the disk’sspace as un-partitionedtogive the drive roomto
optimize randomwritesandavoidexcess“write amplification”.
For sequentialI/O,sticktospinningdisks. Thus,use spinningdisksforHadoopbatchprocessingand
for backups. Some have predictedeventually99% of data maybe storedonspinningdisksyet99%
of accesseswill be happeningonSSDs.Withspinning disksbeingthe mainplace forbackups,thatis
conceivable.
Source : forbes.com
Recommendedby:
JonCohn ,CTO, VP IT Architecture
https://www.linkedin.com/in/jonacohn
joncohn@comcast.net
"JonCohn ExtonPA""JonCohn Exton""JonCohnEvolution"

Más contenido relacionado

Más de Jon Cohn

Jon Cohn Exton PA - Technology Trends – 2016 and beyond
Jon Cohn Exton PA - Technology Trends – 2016 and beyondJon Cohn Exton PA - Technology Trends – 2016 and beyond
Jon Cohn Exton PA - Technology Trends – 2016 and beyondJon Cohn
 
Jon Cohn Exton PA - Rationalizing Application Portfolios
Jon Cohn Exton PA - Rationalizing Application PortfoliosJon Cohn Exton PA - Rationalizing Application Portfolios
Jon Cohn Exton PA - Rationalizing Application PortfoliosJon Cohn
 
Jon Cohn Exton PA - Next Gen Enterprise Information Technology
Jon Cohn Exton PA - Next Gen Enterprise Information TechnologyJon Cohn Exton PA - Next Gen Enterprise Information Technology
Jon Cohn Exton PA - Next Gen Enterprise Information TechnologyJon Cohn
 
Jon Cohn Exton PA - Healthcare - Enterprise Architecture
Jon Cohn Exton PA - Healthcare - Enterprise Architecture Jon Cohn Exton PA - Healthcare - Enterprise Architecture
Jon Cohn Exton PA - Healthcare - Enterprise Architecture Jon Cohn
 
Jon Cohn Exton PA - ERP Predictions
Jon Cohn Exton PA - ERP PredictionsJon Cohn Exton PA - ERP Predictions
Jon Cohn Exton PA - ERP PredictionsJon Cohn
 
Jon Cohn Exton PA - Enterprise Architecture - Best Practices
Jon Cohn Exton PA - Enterprise Architecture - Best PracticesJon Cohn Exton PA - Enterprise Architecture - Best Practices
Jon Cohn Exton PA - Enterprise Architecture - Best PracticesJon Cohn
 
Jon Cohn Exton PA - EA and Innovation
Jon Cohn Exton PA - EA and InnovationJon Cohn Exton PA - EA and Innovation
Jon Cohn Exton PA - EA and InnovationJon Cohn
 
Jon Cohn Exton PA - Data Governance – Best Practices
Jon Cohn Exton PA - Data Governance – Best PracticesJon Cohn Exton PA - Data Governance – Best Practices
Jon Cohn Exton PA - Data Governance – Best PracticesJon Cohn
 
Jon cohn exton pa corporate data architecture
Jon cohn exton pa   corporate data architectureJon cohn exton pa   corporate data architecture
Jon cohn exton pa corporate data architectureJon Cohn
 
Big Data Architecture
Big Data ArchitectureBig Data Architecture
Big Data ArchitectureJon Cohn
 
Jon Cohn Exton PA - Microservices anti
Jon Cohn Exton PA - Microservices antiJon Cohn Exton PA - Microservices anti
Jon Cohn Exton PA - Microservices antiJon Cohn
 
Jon Cohn Exton PA - Knowledge management in software architecture
Jon Cohn Exton PA - Knowledge management in software architectureJon Cohn Exton PA - Knowledge management in software architecture
Jon Cohn Exton PA - Knowledge management in software architectureJon Cohn
 
Jon Cohn Exton PA - Resume
Jon Cohn Exton PA - ResumeJon Cohn Exton PA - Resume
Jon Cohn Exton PA - ResumeJon Cohn
 
Jon A Cohn - CTO / VP / Sr Director - joncohn@comcast.net
Jon A Cohn - CTO / VP / Sr Director - joncohn@comcast.netJon A Cohn - CTO / VP / Sr Director - joncohn@comcast.net
Jon A Cohn - CTO / VP / Sr Director - joncohn@comcast.netJon Cohn
 
Where is enterprise architecture in healthcare
Where is enterprise architecture in healthcareWhere is enterprise architecture in healthcare
Where is enterprise architecture in healthcareJon Cohn
 
The big data architecture dilemma for ci os
The big data architecture dilemma for ci osThe big data architecture dilemma for ci os
The big data architecture dilemma for ci osJon Cohn
 
Enterprise architecture
Enterprise architectureEnterprise architecture
Enterprise architectureJon Cohn
 
Best practices
Best practicesBest practices
Best practicesJon Cohn
 
9 enterprise tech trends for 2016 and beyond
9 enterprise tech trends for 2016 and beyond9 enterprise tech trends for 2016 and beyond
9 enterprise tech trends for 2016 and beyondJon Cohn
 
8 enterprise software predictions
8 enterprise software predictions8 enterprise software predictions
8 enterprise software predictionsJon Cohn
 

Más de Jon Cohn (20)

Jon Cohn Exton PA - Technology Trends – 2016 and beyond
Jon Cohn Exton PA - Technology Trends – 2016 and beyondJon Cohn Exton PA - Technology Trends – 2016 and beyond
Jon Cohn Exton PA - Technology Trends – 2016 and beyond
 
Jon Cohn Exton PA - Rationalizing Application Portfolios
Jon Cohn Exton PA - Rationalizing Application PortfoliosJon Cohn Exton PA - Rationalizing Application Portfolios
Jon Cohn Exton PA - Rationalizing Application Portfolios
 
Jon Cohn Exton PA - Next Gen Enterprise Information Technology
Jon Cohn Exton PA - Next Gen Enterprise Information TechnologyJon Cohn Exton PA - Next Gen Enterprise Information Technology
Jon Cohn Exton PA - Next Gen Enterprise Information Technology
 
Jon Cohn Exton PA - Healthcare - Enterprise Architecture
Jon Cohn Exton PA - Healthcare - Enterprise Architecture Jon Cohn Exton PA - Healthcare - Enterprise Architecture
Jon Cohn Exton PA - Healthcare - Enterprise Architecture
 
Jon Cohn Exton PA - ERP Predictions
Jon Cohn Exton PA - ERP PredictionsJon Cohn Exton PA - ERP Predictions
Jon Cohn Exton PA - ERP Predictions
 
Jon Cohn Exton PA - Enterprise Architecture - Best Practices
Jon Cohn Exton PA - Enterprise Architecture - Best PracticesJon Cohn Exton PA - Enterprise Architecture - Best Practices
Jon Cohn Exton PA - Enterprise Architecture - Best Practices
 
Jon Cohn Exton PA - EA and Innovation
Jon Cohn Exton PA - EA and InnovationJon Cohn Exton PA - EA and Innovation
Jon Cohn Exton PA - EA and Innovation
 
Jon Cohn Exton PA - Data Governance – Best Practices
Jon Cohn Exton PA - Data Governance – Best PracticesJon Cohn Exton PA - Data Governance – Best Practices
Jon Cohn Exton PA - Data Governance – Best Practices
 
Jon cohn exton pa corporate data architecture
Jon cohn exton pa   corporate data architectureJon cohn exton pa   corporate data architecture
Jon cohn exton pa corporate data architecture
 
Big Data Architecture
Big Data ArchitectureBig Data Architecture
Big Data Architecture
 
Jon Cohn Exton PA - Microservices anti
Jon Cohn Exton PA - Microservices antiJon Cohn Exton PA - Microservices anti
Jon Cohn Exton PA - Microservices anti
 
Jon Cohn Exton PA - Knowledge management in software architecture
Jon Cohn Exton PA - Knowledge management in software architectureJon Cohn Exton PA - Knowledge management in software architecture
Jon Cohn Exton PA - Knowledge management in software architecture
 
Jon Cohn Exton PA - Resume
Jon Cohn Exton PA - ResumeJon Cohn Exton PA - Resume
Jon Cohn Exton PA - Resume
 
Jon A Cohn - CTO / VP / Sr Director - joncohn@comcast.net
Jon A Cohn - CTO / VP / Sr Director - joncohn@comcast.netJon A Cohn - CTO / VP / Sr Director - joncohn@comcast.net
Jon A Cohn - CTO / VP / Sr Director - joncohn@comcast.net
 
Where is enterprise architecture in healthcare
Where is enterprise architecture in healthcareWhere is enterprise architecture in healthcare
Where is enterprise architecture in healthcare
 
The big data architecture dilemma for ci os
The big data architecture dilemma for ci osThe big data architecture dilemma for ci os
The big data architecture dilemma for ci os
 
Enterprise architecture
Enterprise architectureEnterprise architecture
Enterprise architecture
 
Best practices
Best practicesBest practices
Best practices
 
9 enterprise tech trends for 2016 and beyond
9 enterprise tech trends for 2016 and beyond9 enterprise tech trends for 2016 and beyond
9 enterprise tech trends for 2016 and beyond
 
8 enterprise software predictions
8 enterprise software predictions8 enterprise software predictions
8 enterprise software predictions
 

Último

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Principled Technologies
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024SynarionITSolutions
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024The Digital Insurer
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 

Último (20)

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 

Seven best practices for revolutionizing your data

  • 1. SevenBestPracticesForRevolutionizingYourData A complete revisitingof the corporate dataarchitecture anditsrespective bestpracticesisinorder because of cloudcomputingandbigchangesin computingtechnologyandsoftware development.In some cases,a complete inversionhasoccurred(inthe bestway) tosolve aparticularproblem.Tobe competitive,organizationsneedtotake advantage of these new waysof doingthings.Massive data and informationisoutthere if we canjustgrasp it. Below are some principlesandpracticesonhow we can betterdeal withdatagoingforward. Store First,Analyze Later:Diskischeap. We can’t alwayspredictwhatdatawill be importantlater. Store firstand ask questionslater.Withscalable infrastructure andtoday’shardware economics,it’s okayif a piece of data turnsout to neverbe used.The schemaflexibilityof NoSQLtechnology facilitatesthis. Forexample,withacustomerdocument,addingadditional fieldsof informationata laterdate is easyeven if theywere notenvisionedinitially. DefaulttoReal-time:Historically,dataprocessingandanalysishasbeendone viabatchprocessing. We defaultedtobatchprocessingbecause it’scomputationallyefficienthowever,givenMoore’slaw and the passage of time we nowhave much more powerat our disposal.We canaffordto do more workto get real-time answersinsteadof answerstomorrow. NoSQLandfast storage technologies (suchas solidstate disk) make real-time possible. Yourorganizationshoulddeliver recommendations,personalizationandbusinessmetricsimmediately.Defaulttoreal-timeandgoto batch onlywhennecessary. Structure Shouldn’tHoldYouBack: It’seasyto store basic stockinformation –for example (ticker, high,low,close) –inany database. What abouta complete derivative security?How dowe store that inthe database,especiallygiventhatnew securitiesare inventedall the time? A legal contract’s terms? Howdo we store polymorphicinformationordatawe weren’taware of a priori? Historicallyafewmethodshave beenmostcommon:the relational database fordatawithvery precise structuring;completelyunstructureddata(“BLOBs”);andthingsinthe middle,suchas spreadsheets. The lattertwoformatsare mostlyuselessfor integrationintoyourapplications,yet the volume of suchdata is massive. Withthe rise of dynamicdocument-orienteddatamodels(using JSON),semi-structured,complex structured,andpolymorphicdatacan be stored,accessedand organizedjustasefficientlyasthe more rigidlystructureddatathathasbeenindatabases traditionally. AgilityIsKey:The software developmentworldhasmovedfromclassic“waterfall”software developmentlifecyclestomore agile,oriterative,methodologies(forexample,Scrum). These methods’rapiditerationallowsorganizationstodeliverfeaturesandenhancementstoendusers quicklyandeffectively.Toworkthisway,we neednew toolsthatare agile-compatible — version control,continuousintegration,programminglanguageshave adaptedalready. We needsimilar adaptionbythe database if we want to make software developmentnimbleandproductive. NoSQL technologiesfacilitate iterationinthe datamodel muchthe same way as youiterate withyourcode. One Size Doesn’tFitAll:One-size-fits-all isover. Use multiple database technologiesaspartof your standardenterprise technologyplatform. Youwon’twantdozens – that wouldbe far toocomplex – but more than one isoptimal. A goodmodel forthe future isto have three primarytools:an DBMS,
  • 2. a relational datawarehouse andaNoSQLdatabase. For each projector sub-problem, use whichever tool is best. Augmentwithniche tools(e.g.,atime seriesdatabase) forspecial cases.The above approach ishighlycompatiblewithservice-orientedarchitectures,whichyoushouldbe using. Monolithichub-and-spoke architecturesleadtolate projectsandunchangeable systems. Instead, buildwebserviceswitheachone potentiallyhavingitsowndatabase ordatamart behindit. Go Commodity:The rise of commodityhardware asa viable productionplatformhasmade it possible todeploymulti-node systemsquickly.Newerdatabase technologiesare designedwith commodityserversinmind.Companiesare movingawayfrom“bigiron”serversandembrace this approach.By adoptinga commodityserverdeploymentmodel,thereislessof adependencyon proprietarymechanismsandvendorlock-inisoftenavoided.Findthe sweetspotonthe price- performance curve andbuyserversof that size. Don’tbuy$1k servers,you’ll have toomanyto manage (or evenplugin!) Butdon’tgo toobig either.Manyorganizationsare standardizingon$10k commodityXeon(orAMD) basedserverswithgigabitEthernet. Use SolidState Drives –a Lot: Traditional spinningdiskshave increasedincapacityanddata transfer ratesby a factor of one thousand,yetthe randomi/otimeshave barelybudgedoveradecade. If youare doingany randomI/Oat all,youshoulduse SSDsinstead.CommoditySATA-style SSDscan worksurprisinglywell. Be sure to mirrorthem – theystill fail eventhoughthereare nomovingparts (exceptelectrons!) Reserve20%+of the disk’sspace as un-partitionedtogive the drive roomto optimize randomwritesandavoidexcess“write amplification”. For sequentialI/O,sticktospinningdisks. Thus,use spinningdisksforHadoopbatchprocessingand for backups. Some have predictedeventually99% of data maybe storedonspinningdisksyet99% of accesseswill be happeningonSSDs.Withspinning disksbeingthe mainplace forbackups,thatis conceivable. Source : forbes.com Recommendedby: JonCohn ,CTO, VP IT Architecture https://www.linkedin.com/in/jonacohn joncohn@comcast.net "JonCohn ExtonPA""JonCohn Exton""JonCohnEvolution"