An Introduction to JStorm
Longda Feng (zhongyan.feng@alibaba-inc.com)
Longda Feng 
Alibaba 
Agenda 
Background 
Basic Concepts & Scenarios
Why start JStorm?
JStorm vs Storm
Questions and Answers
Who are we? 
The JStorm team was among the earliest Storm users in China.
- Storm 0.5.1/0.5.4/0.6.0/0.6.2/0.7.0/0.7.1
- JStorm 0.7.1/0.9.0/0.9.1/0.9.2/0.9.3/…
Our Duties
- Application Development
- JStorm System Development
- JStorm System Operation
Who Is Using JStorm?
Many small Chinese companies use JStorm
How Big? 
More than 3000 servers 
More than 3 trillion messages per day 
What is JStorm? 
JStorm is a distributed programming framework
- Similar to Hadoop MapReduce but designed for real-time/in-memory scenarios
- Users can build powerful distributed applications from very simple APIs
What is JStorm? 
Storm redesigned in Java.
- Proven stable in huge clusters
- Much faster
- Much more powerful
Basic Concept
Pipelined data processing
Advantage 1 
Easy to learn:
- Simple building blocks: Topology/Spout/Bolt APIs
- Out-of-the-box RPC, fault tolerance, and real-time data grouping & combining
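
The building blocks can be illustrated with a toy model. This is a plain-Python sketch, not the JStorm API: the names `sentence_spout`, `split_bolt`, `count_bolt`, and `run_topology` are hypothetical. A spout is the source of a stream, each bolt consumes one stream and emits another, and the topology wires them together in order.

```python
from collections import Counter
from typing import Callable, Iterable, Iterator, List, Tuple


def sentence_spout() -> Iterator[str]:
    """Hypothetical spout: the source of the stream."""
    yield from ["storm is fast", "jstorm is fast"]


def split_bolt(stream: Iterable[str]) -> Iterator[str]:
    """Hypothetical bolt: splits each sentence into words."""
    for sentence in stream:
        yield from sentence.split()


def count_bolt(stream: Iterable[str]) -> Iterator[Tuple[str, int]]:
    """Hypothetical bolt: emits a running (word, count) pair per input word."""
    counts: Counter = Counter()
    for word in stream:
        counts[word] += 1
        yield word, counts[word]


def run_topology(spout: Callable, bolts: List[Callable]) -> list:
    """Chain the spout through every bolt and collect the final stream."""
    stream = spout()
    for bolt in bolts:
        stream = bolt(stream)
    return list(stream)


result = run_topology(sentence_spout, [split_bolt, count_bolt])
```

In a real topology each stage runs in parallel on different workers; the sketch only shows the data flow.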
Advantage 2 
Excellent Scalability 
 Horizontally Scalable 
 DAG-based 
 Adjustable parallelism of each component 
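
Adjustable parallelism means each component runs as several tasks, and a grouping decides which task receives each tuple. The sketch below models a key-based (fields-style) grouping in plain Python. `task_for_key` and `distribute` are hypothetical helpers, and `zlib.crc32` is only a stand-in for whatever hash the framework actually uses.

```python
import zlib


def task_for_key(key: str, parallelism: int) -> int:
    """Map a grouping key to one of `parallelism` task indexes."""
    if parallelism < 1:
        raise ValueError("parallelism must be at least 1")
    # crc32 is deterministic across runs, unlike Python's built-in hash() for str.
    return zlib.crc32(key.encode("utf-8")) % parallelism


def distribute(keys, parallelism):
    """Return one list of keys per task."""
    tasks = [[] for _ in range(parallelism)]
    for key in keys:
        tasks[task_for_key(key, parallelism)].append(key)
    return tasks
```

Raising `parallelism` adds tasks without changing the rule that equal keys always meet in the same task, which is what keeps per-key counting correct.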
Stable 
Guarantees fault tolerance
- No single point of failure
• Nimbus HA
• Any Supervisor can be shut down
- A new worker is spawned automatically to replace a failed one
Accuracy 
The acking framework guarantees no loss of data.
The transaction framework guarantees data accuracy.
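
Storm tracks each tuple tree with a single XOR value: every tuple id is XORed in once when the tuple is emitted and once when it is acked, so the value returns to zero exactly when the whole tree is processed. The sketch below assumes JStorm keeps this scheme; `AckTracker` is a hypothetical class, not a JStorm type.

```python
class AckTracker:
    """Tracks outstanding tuples per root id with one XOR value each."""

    def __init__(self):
        self._pending = {}  # root_id -> XOR of all outstanding tuple ids

    def emit(self, root_id, tuple_id):
        """Register a new (non-zero) tuple id in the tree of root_id."""
        self._pending[root_id] = self._pending.get(root_id, 0) ^ tuple_id

    def ack(self, root_id, tuple_id):
        """Ack one tuple; return True when the whole tree is processed.

        Children must be registered with emit() before their parent is acked,
        otherwise the value could reach zero too early.
        """
        value = self._pending[root_id] ^ tuple_id
        if value == 0:
            del self._pending[root_id]
            return True
        self._pending[root_id] = value
        return False

    def pending(self):
        """Root ids whose trees are not fully acked yet."""
        return set(self._pending)
```

A root id that stays in `pending()` past a timeout would be replayed by the spout, which is how no data is lost.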
Scenarios 
Stateless computation
- All data comes from tuples
Use cases:
- Log analysis
- Pipelined systems
- Message conversion
- Statistical analysis
- Real-time recommendation algorithms
Why start JStorm?
The Storm community was not as active as we had expected
- Tailored for enterprise environments
- Fixed critical bugs in Storm
- Provided professional technical support and sped up application development
- Reduced operational cost

How Many Versions?
https://github.com/alibaba/JStorm/releases
- 0.9.6 (2014/9/22)
- 0.9.5.1 (2014/9/14)
- 0.9.5 (2014/8/27)
- 0.9.4.1 (2014/8/15)
- 0.9.4 (2014/7/18)
- 0.9.3.1 (2014/5/31)
- 0.9.3 (2014/5/10)
- 0.9.2 (2014/4/8)
- 0.9.1 (2014/1/24)
- 0.9.0 (2013/12/30)
- 0.7.1 (2013/4/28)
JStorm is a superset of Storm 
Programs that run on Storm can run on JStorm without code changes
More stable (1) -- Nimbus HA
Nimbus HA
- Dual-Nimbus HA
More stable (2) -- RPC
Netty supports 2 RPC modes
- Async
- Sync
• The sending speed matches the receiving speed, so the data flow is more stable.
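
One way to picture the sync mode is a bounded window of unacknowledged messages: once the window is full, the sender has to wait for the receiver. The sketch below is a single-threaded model of that idea only. It is not how Netty or JStorm implements it, and `SyncChannel` and its `window` parameter are hypothetical.

```python
from collections import deque


class SyncChannel:
    """The sender may have at most `window` undelivered messages in flight."""

    def __init__(self, window: int):
        if window < 1:
            raise ValueError("window must be at least 1")
        self.window = window
        self.in_flight = deque()
        self.delivered = []

    def try_send(self, msg) -> bool:
        """Return False when the sender must wait for the receiver."""
        if len(self.in_flight) >= self.window:
            return False
        self.in_flight.append(msg)
        return True

    def receive_one(self):
        """Receiver takes the oldest in-flight message, freeing one window slot."""
        msg = self.in_flight.popleft()
        self.delivered.append(msg)
        return msg
```

Because the backlog can never exceed `window`, a slow receiver slows the sender down instead of letting queues grow without bound.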
More stable (3) -- resource isolation
A misbehaving worker cannot interfere with others
- CPU isolation with cgroups
- Memory isolation
Resource quotas can be enforced on each group (before 0.9.5)
More stable (4) -- Monitor
Monitor every component in your topology
- Many more metrics (70+) than Storm
Supports user-defined metrics
Supports user-defined alerts
More stable (5) -- CPU usage
Better CPU utilization
- Improved disruptor implementation
• CPU usage drops from 300% to 10% when the processing queue is full
- Avoids CPU spin-waiting
• nextTuple/ack/fail work is moved to a different thread
More stable (6) -- more catch
Added try-catch everywhere:
- Nimbus/supervisor main threads
- Spout/bolt initialization/cleanup
- All I/O operations, serialization/deserialization
- All ZK operations
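
The idea behind catching everywhere is that one failing step should be logged and contained rather than take down the thread that runs it. A minimal Python sketch follows; the `guarded` helper and the `worker` logger name are hypothetical, and JStorm itself does this with Java try-catch blocks.

```python
import logging

logger = logging.getLogger("worker")


def guarded(step_name, func, *args, on_error=None):
    """Run func(*args); log and contain any exception.

    Returns (True, result) on success and (False, None) on failure.
    """
    try:
        return True, func(*args)
    except Exception as exc:  # deliberately broad: the caller must keep running
        logger.error("%s failed: %r", step_name, exc)
        if on_error is not None:
            on_error(exc)
        return False, None
```

The `on_error` hook is where a real system would report the failure, for example to a monitoring component.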
More stable (7) -- ZK
Reduced unnecessary ZK usage:
- Removed useless watchers
- Increased ZK heartbeat frequency
- Detects failed workers without a full scan of the entire ZK directory
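
Failure detection by heartbeat comes down to comparing each worker's last heartbeat time with a timeout. The slides do not say where JStorm keeps the heartbeats or how it avoids scanning ZooKeeper, so the sketch below simply works on an in-memory dict; `find_dead_workers` and its parameters are hypothetical.

```python
def find_dead_workers(last_heartbeat, now, timeout_secs):
    """Return the sorted ids of workers whose last heartbeat is older than timeout_secs.

    last_heartbeat maps worker id -> timestamp (seconds) of its latest heartbeat.
    """
    return sorted(
        worker for worker, ts in last_heartbeat.items() if now - ts > timeout_secs
    )
```

Keeping such a table in the process that makes scheduling decisions means a failure check does not need a listing of every ZooKeeper node.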
More stable (8) -- other
Improved GC tuning.
Guaranteed that all workers are killed after a kill command is issued
Guaranteed a single supervisor/nimbus per instance
Avoided excessive use of local ports by the Netty client
…
More powerful scheduler
Balances tasks with regard to:
- CPU
- Memory
- Network
CPU assignment 
By default, each worker is assigned a single CPU slot
Applications can be configured to use more slots
Why:
- At Alimama, some tasks create extra threads for other work, so one CPU slot is not enough
Memory Usage 
Default worker memory is 2 GB
Applications can be configured to use more memory slots
Why:
- In the Alipay Mdrill application, the Solr bolt requests much more memory
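
Slot-based scheduling can be sketched as a fit check followed by a choice among the machines that fit. The default of one CPU slot per worker comes from the slides. Everything else here is an assumption for illustration: the `Supervisor` record, the `place_worker` function, the size of a memory slot, and the "most free capacity first" rule.

```python
from dataclasses import dataclass


@dataclass
class Supervisor:
    name: str
    free_cpu_slots: int
    free_mem_slots: int


def place_worker(supervisors, cpu_slots=1, mem_slots=1):
    """Place one worker on the fitting supervisor with the most free capacity.

    Returns the supervisor name, or None if no supervisor has enough free slots.
    The chosen supervisor's free slots are reduced in place.
    """
    candidates = [
        s for s in supervisors
        if s.free_cpu_slots >= cpu_slots and s.free_mem_slots >= mem_slots
    ]
    if not candidates:
        return None
    best = max(candidates, key=lambda s: (s.free_cpu_slots, s.free_mem_slots))
    best.free_cpu_slots -= cpu_slots
    best.free_mem_slots -= mem_slots
    return best.name
```

A memory-hungry component such as the Solr bolt mentioned above would simply call this with a larger `mem_slots` value.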
Smarter Balancing 
With the JStorm scheduler:
- Tasks that exchange data heavily tend to be assigned to the same worker to avoid network cost.
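
A simple way to get this behaviour is a greedy pass over task pairs, heaviest traffic first, merging their groups while the result still fits into one worker. This is only an illustration of the idea, not the JStorm scheduler's algorithm; `colocate`, the traffic numbers, and `worker_capacity` are hypothetical.

```python
def colocate(tasks, traffic, worker_capacity):
    """Group tasks so that heavily communicating pairs share a worker.

    tasks: list of task names.
    traffic: dict mapping (task_a, task_b) -> messages exchanged.
    worker_capacity: maximum number of tasks per worker.
    Returns a list of groups (each a sorted list of task names).
    """
    group_of = {t: {t} for t in tasks}
    # Visit pairs from heaviest to lightest traffic.
    for (a, b), _ in sorted(traffic.items(), key=lambda kv: -kv[1]):
        ga, gb = group_of[a], group_of[b]
        if ga is gb or len(ga) + len(gb) > worker_capacity:
            continue
        merged = ga | gb
        for t in merged:
            group_of[t] = merged
    # Collect each distinct group once, in the order tasks were given.
    seen, groups = set(), []
    for t in tasks:
        g = group_of[t]
        if id(g) not in seen:
            seen.add(id(g))
            groups.append(sorted(g))
    return groups
```

Pairs that end up in the same group exchange tuples in memory, so only the lighter links cross the network.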
User Defined Scheduler 
Users can pin a task to a designated worker
- Users can set how many CPU slots/memory slots are used
Why:
- In the Taobao TAE project, some bolts need to run on user-defined nodes
Task on Different Node 
Tasks of one component can be scheduled to run on different nodes
Why:
- In Alipay Mdrill, the Solr bolts must run on different nodes
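
Placing every task of a component on its own node is an anti-affinity constraint. A minimal sketch, with the hypothetical helper `spread_across_nodes`, pairs tasks with nodes one to one and refuses to place anything when there are too few nodes:

```python
def spread_across_nodes(task_ids, nodes):
    """Assign each task of one component to a different node.

    Raises ValueError if there are more tasks than nodes.
    """
    if len(task_ids) > len(nodes):
        raise ValueError("not enough nodes to place every task on a different node")
    return dict(zip(task_ids, nodes))
```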
Task on Single Node 
All tasks can be scheduled to run on a single node.
Why:
- Taobao TLog has many small jobs; to reduce network cost, all tasks of one job must run on a single node.
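
Packing a whole job onto one node needs a node with enough free slots for every task. The sketch below picks the tightest fit so that larger nodes stay available for larger jobs. That best-fit rule is a design choice made for this example, and `pick_single_node` is a hypothetical helper, not JStorm's policy.

```python
def pick_single_node(free_slots, tasks_needed):
    """Return the node with the fewest free slots that still fits the whole job.

    free_slots maps node name -> number of free slots.
    Returns None if no single node can host all tasks.
    """
    fitting = [(free, node) for node, free in free_slots.items() if free >= tasks_needed]
    if not fitting:
        return None
    return min(fitting)[1]
```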
Old Assignment
“Last Assignment Policy”
- By default, a task runs on the machine it ran on previously
Why:
- In Alibaba CDO, users wanted to reuse the old workers when restarting an application
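
The policy can be sketched as: keep a task on its previous node when that node is still alive, otherwise fall back to another node. The round-robin fallback and the name `assign_with_history` are assumptions for illustration.

```python
def assign_with_history(tasks, previous, alive_nodes):
    """Prefer each task's previous node; fall back to round-robin over alive nodes.

    tasks: list of task ids.
    previous: dict mapping task id -> node it ran on last time.
    alive_nodes: list of nodes that are currently available.
    """
    if not alive_nodes:
        raise ValueError("no alive nodes")
    assignment = {}
    fallback = 0
    for task in tasks:
        old = previous.get(task)
        if old in alive_nodes:
            assignment[task] = old
        else:
            assignment[task] = alive_nodes[fallback % len(alive_nodes)]
            fallback += 1
    return assignment
```

Reusing the previous node lets a restarted application find local state, such as caches or files, where it left it.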
Pluggable
Able to run on:
Hadoop YARN (more stable than Storm)
Alibaba Apsara Cloud System
Alibaba Elastic Resource Pool
Classloader 
Resolves jar conflicts between applications and JStorm
More convenient UI 
More useful stats collected and displayed. 
Browse Worker Log in UI 
Support libjar
No need to assemble all dependency jars into one jar
- Submit libjars with the libjar parameter
- Supports worker.classpath
Faster
6 servers (24 cores/98 GB)
18 Spout/18 Bolt/18 Acker
[Figure: Throughput vs workers. x-axis: workers (0 to 60); y-axis: polltuples/10s (0 to 12,000,000); series: jstorm, storm. Data labels as extracted, with series and x positions not recoverable: 9280598, 10818815, 9065965, 6819139, 5610201, 6243680, 6830500, 5595900, 5474180, 3379800]
JStorm at a sending speed of 41W/s (410,000 per second)
Storm at a sending speed of 41W/s (410,000 per second)
Why Faster 
Reduced memory copying in the ZeroMQ transport
Dedicated deserializing thread
Better-tuned sampling logic
Better-tuned acking framework
Better-tuned GC
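
The dedicated deserializing thread can be pictured as a small pipeline: one thread only decodes raw bytes and hands finished tuples to the thread that runs user logic. The sketch below uses Python threads and JSON purely as stand-ins for the real serializer. The names `start_deserializer`, `drain`, and `_STOP` are hypothetical.

```python
import json
import queue
import threading

_STOP = object()  # sentinel that shuts the pipeline down


def start_deserializer(raw_queue, tuple_queue):
    """Decode raw bytes on a dedicated thread and forward the decoded tuples."""
    def loop():
        while True:
            raw = raw_queue.get()
            if raw is _STOP:
                tuple_queue.put(_STOP)
                return
            tuple_queue.put(json.loads(raw.decode("utf-8")))

    thread = threading.Thread(target=loop, daemon=True)
    thread.start()
    return thread


def drain(tuple_queue):
    """Executor side: collect decoded tuples until the stop sentinel arrives."""
    out = []
    while True:
        item = tuple_queue.get()
        if item is _STOP:
            return out
        out.append(item)


raw_q, tuple_q = queue.Queue(), queue.Queue()
worker = start_deserializer(raw_q, tuple_q)
for payload in (b'["a", 1]', b'["b", 2]'):
    raw_q.put(payload)
raw_q.put(_STOP)
tuples = drain(tuple_q)
worker.join()
```

Splitting the work this way keeps decoding cost off the thread that executes spout and bolt code.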
Other Improvements
- More than 100 improvements
- https://github.com/alibaba/JStorm/blob/master/history.md
- Fixed contention in topology assignment
- Reset the rebalance/reassigned worker timeout to 4 minutes
- Graceful worker shutdown
- Improved the thrift server
- Avoided mistakenly killing workers while rebalancing jobs
- …
More Documentation
https://github.com/alibaba/JStorm/wiki
Google group: JStorm-user@googlegroups.com
Wangwang: JStorm
QQ: 228374502
Laiwang: JStorm
Join us
You are welcome to join us
- zhongyan.feng@alibaba-inc.com
纪君祥(Longda Feng)
