How Stuff Works: Technology @ Fisheye Analytics
Ashwin Reddy Gayam
Part 1: What a day looks like
http://engineering.fisheyeanalytics.com
3 layers of technology
In the spirit of sharing knowledge, I will discuss, in a series of presentations, how we at Fisheye Analytics engineer large-scale software systems that solve complex problems. Fisheye runs its technology on 30 servers, with programs running 24x7 to deliver insightful media intelligence to its clients. The technology can be divided into three layers:
1. Crawling & Search Engines
2. Analytics Processing
3. Client Applications
What a day looks like
In this presentation, I'd like to shed some light on what a day in our server farm looks like.
In AWS on the US East Coast, a handful of proprietary web crawlers, running on a cluster of machines, download tens of gigabytes of data a day by scouring millions of web pages and the Twitter and Facebook APIs.
Another set of indexers indexes the fetched data using Solr, after which the data is ready to be searched.
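
To make the indexing step concrete, here is a minimal sketch of what "index the data so it can be searched" means. It is a toy inverted index written in plain Python, not Solr and not our indexer code; the class name `TinyIndex`, the document ids, and the sample texts are all made up for illustration.

```python
from collections import defaultdict
import re


class TinyIndex:
    """Toy inverted index: maps each lowercase term to the ids of documents containing it."""

    def __init__(self):
        self.postings = defaultdict(set)

    def add(self, doc_id, text):
        # Tokenize on runs of letters and digits; a real analyzer does far more.
        for term in re.findall(r"[a-z0-9]+", text.lower()):
            self.postings[term].add(doc_id)

    def search(self, query):
        # AND semantics: return ids of documents that contain every query term.
        terms = re.findall(r"[a-z0-9]+", query.lower())
        if not terms:
            return set()
        result = set(self.postings.get(terms[0], set()))
        for term in terms[1:]:
            result &= self.postings.get(term, set())
        return result


# Hypothetical articles, standing in for crawled pages.
index = TinyIndex()
index.add("a1", "Fisheye Analytics tracks media coverage")
index.add("a2", "Media intelligence from Twitter and Facebook")
hits = index.search("media coverage")  # only "a1" contains both terms
```

A production search engine adds ranking, stemming, persistence, and distribution on top of this basic term-to-documents mapping.
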
What a day looks like
Message queues (using ActiveMQ) get flooded with millions of messages and act as the backbone through which all machines in the cluster exchange information. Peaks of up to a thousand messages a second are not uncommon.
Meanwhile, in the Singapore farm, database servers running MySQL (partitioned) see peaks of 600 transactions per second (reads plus writes).
Also in the Singapore farm, 7 different kinds of analytics programs (called DPPs internally), all highly multithreaded, feast on over 40 CPU cores in a separate cluster.
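
The pattern of several worker threads draining a shared message queue can be sketched as follows. This is an illustration only: it uses Python's in-process `queue.Queue` as a stand-in for a network broker such as ActiveMQ, and the function name `run_workers`, the worker count, and the doubling handler are assumptions, not our DPP code.

```python
import queue
import threading


def run_workers(messages, handler, num_workers=4):
    """Fan messages out to worker threads through a shared queue; return the handled results."""
    inbox = queue.Queue()
    results = []
    lock = threading.Lock()

    def worker():
        while True:
            msg = inbox.get()
            if msg is None:  # sentinel: no more work for this worker
                inbox.task_done()
                break
            out = handler(msg)
            with lock:  # protect the shared results list
                results.append(out)
            inbox.task_done()

    threads = [threading.Thread(target=worker) for _ in range(num_workers)]
    for t in threads:
        t.start()
    for msg in messages:
        inbox.put(msg)
    for _ in threads:
        inbox.put(None)  # one sentinel per worker
    inbox.join()
    for t in threads:
        t.join()
    return results


# Hypothetical workload: 1000 messages, each "processed" by doubling it.
processed = run_workers(range(1000), lambda n: n * 2)
```

Results arrive in whatever order the workers finish, so callers should not rely on ordering. In CPython, threads suit I/O-bound handlers; CPU-bound work that needs many cores is usually spread across processes instead.
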
What a day looks like
Shell scripts make real-time backups of articles stored on our NAS into the cloud (AWS EBS), encrypted with 128-bit AES.
Copies of the MySQL binary logs are continuously transferred to two different locations for incremental backup.
Health monitoring programs run all day long, measuring message queue sizes, server uptimes, etc., and send email and text alerts to our mobile phones whenever they sense anything abnormal. Programs and servers that stop unexpectedly are also restarted automatically by the health monitoring programs.
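
A single pass of the check, alert, and restart loop described above might look like the sketch below. Everything in it is hypothetical: the `Check` structure, the queue names, the depth threshold, and the alert and restart hooks are placeholders, since the real monitors, thresholds, and alert channels are not shown in this presentation.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Check:
    name: str
    is_healthy: Callable[[], bool]  # returns True when the component looks fine
    restart: Callable[[], None]     # hypothetical hook that restarts the component


def run_checks(checks: List[Check], alert: Callable[[str], None]) -> List[str]:
    """Run every check once; alert on and restart anything unhealthy. Returns the failing names."""
    failed = []
    for check in checks:
        try:
            ok = check.is_healthy()
        except Exception:
            ok = False  # a probe that crashes is treated as a failure
        if not ok:
            failed.append(check.name)
            alert(f"{check.name} is unhealthy, restarting")
            check.restart()
    return failed


# Made-up numbers: a queue depth above the threshold counts as abnormal.
queue_depth = {"crawl-queue": 120, "index-queue": 50_000}
MAX_DEPTH = 10_000
restarted, alerts = [], []
checks = [
    Check(name, lambda n=name: queue_depth[n] <= MAX_DEPTH, lambda n=name: restarted.append(n))
    for name in queue_depth
]
failed = run_checks(checks, alerts.append)
```

In a real deployment `alert` would send an email or text message and `run_checks` would be called on a timer; here both are reduced to list appends so the flow is easy to follow.
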
What a day looks like
A central log monitoring server (internally called FishMon) pulls the logs of various programs and servers at regular intervals and stores them centrally, giving developers a rich interface in which to glance through them and catch and fix bugs.
For client reporting and analytics, client data in specific formats is indexed with the Sphinx search engine and queried by Media Lens (one of our products) and by our client report generators.
Replicated databases, search engine indexes, and message queues act as hot spares that take over when their masters go down.
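
The hot-spare idea can be illustrated from the client's point of view: try the master first and fall back to a replica when it is unavailable. This is a simplified sketch under assumed names (`Unavailable`, `query_with_failover`, and the two fake backends); real failover for databases, search indexes, and message brokers is usually handled by the systems themselves or by their client libraries.

```python
class Unavailable(Exception):
    """Raised by a backend that cannot serve the request."""


def query_with_failover(backends, request):
    """Try each backend in order (master first, then spares); return (backend name, result)."""
    errors = []
    for name, call in backends:
        try:
            return name, call(request)
        except Unavailable as exc:
            errors.append(f"{name}: {exc}")
    raise Unavailable("all backends failed: " + "; ".join(errors))


def master(request):
    # Simulate the master being down.
    raise Unavailable("connection refused")


def replica(request):
    # Simulate a healthy hot spare.
    return f"result for {request!r}"


served_by, result = query_with_failover(
    [("master", master), ("replica", replica)], "SELECT 1"
)
```

Ordering the backend list is the whole policy here: whichever entry comes first is preferred, and spares are consulted only after it fails.
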
More to it!
There is definitely more to it; this is just a starter. After all, we boast state-of-the-art technology in several areas. We build these large, complex systems with only a handful of people. Sounds like a startup?
If this kind of work gives you a kick, wait no longer and build your career with us. We are always looking for passionate and talented engineers who can help us build technology that makes a difference.
More information about our software architectures is coming in future presentations.
Thank you!
Thank you, everyone, and feel free to write to me at ashwin.reddy@fisheyeanalytics.com. You can always drop by our Singapore office and say hi to our great engineers.
