1. Page1
Diyotta, Inc. All Rights Reserved
Analyze Telecom Fraud at Hadoop Scale
29th June 2016
Sanjay Vyas
Co-founder & COO, Diyotta
2. Page2
Diyotta, Inc. All Rights Reserved
Telecom-Relevant Glossary
• CDR – Call Detail Record
• Any phone call generates a CDR
• IPDR – IP Detail Record
• Any internet browsing activity generates an IPDR
• IVR – Interactive Voice Response
• Automated telephone response system usually for typical queries
3. Page3
Diyotta, Inc. All Rights Reserved
Fraud in Telecom
Global Mobile Industry
$2.2T
Revenue Losses due to
Fraud
$46.3B
Reference: http://www.argyledata.com/files/Telecom-Fraud-101-eBook.pdf
4. Page4
Diyotta, Inc. All Rights Reserved
A Day In Life of Telecom Data (Fraud Use-Case)
Source Systems
Ingestion Pipelines
Target Data Sets
Fraud Analysis
Minataur
5. Page5
Diyotta, Inc. All Rights Reserved
Legacy State for Fraud Analytics
Monolithic
Script Based
file Ingestion
Minataur
Fraud Application
• Limited capacity for processing
• Cannot Scale for Volume/Velocity
• Cannot do on-demand real-time Analytics
6. Page6
Diyotta, Inc. All Rights Reserved
Business Challenges
• Business not able to
• analyze IPDR Data due to the sheer
volume
• ingest streaming data from IVR
systems for fraud analysis
• Perform on-demand real-time fraud
analytics for deeper insights
7. Page7
Diyotta, Inc. All Rights Reserved
IT Challenges with Hadoop Adoption
• Skill Gap
• Limited in-house expertise on evolving technologies and keep up the pace
• Enterprise Standards
• Manual coding suffers from quality/maintenance issues and is inconsistent
• Scalability across data and technology
• Real-time, social media, multi-processing engines
• Data Lineage