13. We crawl:
1,932,823 tweets/day (Japanese)
607,749 tweets/day (English)
(We’re now focusing on Japanese users
due to API limits)
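
As a rough feel for the sustained load these daily totals imply, the figures can be converted to per-second rates. This is only arithmetic on the numbers above; it assumes traffic is spread evenly over the day, which real traffic is not, and the helper name is made up for illustration.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400


def per_second(tweets_per_day: int) -> float:
    """Average rate, assuming tweets arrive evenly across the day."""
    return tweets_per_day / SECONDS_PER_DAY


japanese = per_second(1_932_823)
english = per_second(607_749)

print(f"Japanese: {japanese:.1f} tweets/s")  # Japanese: 22.4 tweets/s
print(f"English:  {english:.1f} tweets/s")   # English:  7.0 tweets/s
```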
We have:
1,336,444 page views (PVs) / month
25. System Diagram: Rough Sketch
BuzzDAS: Buzz Data Analysis System
Components: crawler module, Fulltext Search, Analyzer module,
Web Frontend, Notification module
26. System Diagram: Crawler
Twitter crawler: Typhoeus, EventMachine
MQ: RabbitMQ
crawl scheduling controller
language guesser: libtextcat
Users DB: PostgreSQL (language, post frequency)
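
The slide says only that a controller schedules crawls and that the Users DB records each user's post frequency. One plausible way to combine the two is sketched below: users who post often are revisited sooner. This is an assumption, not the actual BuzzDAS policy; `crawl_interval`, `CrawlScheduler`, and both interval bounds are hypothetical names and values.

```python
import heapq

MIN_INTERVAL = 60.0    # seconds; hypothetical lower bound between visits
MAX_INTERVAL = 3600.0  # seconds; hypothetical upper bound between visits


def crawl_interval(posts_per_hour: float) -> float:
    """Aim for about one new post per visit, clamped to the bounds above."""
    if posts_per_hour <= 0:
        return MAX_INTERVAL
    return max(MIN_INTERVAL, min(MAX_INTERVAL, 3600.0 / posts_per_hour))


class CrawlScheduler:
    """Min-heap of (next_due_time, user_id); the earliest due user is on top."""

    def __init__(self):
        self._heap = []
        self._freq = {}

    def add_user(self, user_id, posts_per_hour, now=0.0):
        self._freq[user_id] = posts_per_hour
        due = now + crawl_interval(posts_per_hour)
        heapq.heappush(self._heap, (due, user_id))

    def pop_due(self, now):
        """Return users due at `now`, rescheduling each for its next visit."""
        due_users = []
        while self._heap and self._heap[0][0] <= now:
            _, user_id = heapq.heappop(self._heap)
            due_users.append(user_id)
            next_due = now + crawl_interval(self._freq[user_id])
            heapq.heappush(self._heap, (next_due, user_id))
        return due_users


scheduler = CrawlScheduler()
scheduler.add_user("busy", posts_per_hour=30)    # revisit every 120 s
scheduler.add_user("quiet", posts_per_hour=0.5)  # clamped to 3600 s
first = scheduler.pop_due(now=120.0)
```

In a setup like the one on the slide, the user IDs returned by `pop_due` would presumably be published to the message queue for the crawler to consume, but that wiring is not shown in the source.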
27. BuzzDAS: Buzz Data Analysis System
crawler
MQ
importer
fulltext search engine: groonga
analyzer
Web Service API: Sinatra
memcached
Web Frontend: Rails
pagecache: varnish
notifier bot: Twitter4R, net/irc
28. analyzer
tokenizer
phrase extractor
detects changes in phrase occurrence
Reference Index: keeps posts from the last 24 hours
Recent Index: keeps posts from the most recent 1 hour
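
The idea of comparing a recent window against a longer reference window can be sketched as follows. The slide does not give the scoring formula, so the ratio with additive smoothing used here is an assumption, and a whitespace tokenizer with word bigrams stands in for the real tokenizer and phrase extractor. `phrase_counts` and `buzz_scores` are hypothetical names.

```python
from collections import Counter


def phrase_counts(posts, n=2):
    """Count word n-grams, at most once per post."""
    counts = Counter()
    for post in posts:
        tokens = post.lower().split()
        grams = {" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}
        counts.update(grams)
    return counts


def buzz_scores(recent_posts, reference_posts, n=2, smoothing=1.0):
    """Rank phrases by (rate in recent window) / (smoothed rate in reference)."""
    recent = phrase_counts(recent_posts, n)
    reference = phrase_counts(reference_posts, n)
    n_recent = max(len(recent_posts), 1)
    n_reference = max(len(reference_posts), 1)
    scores = {}
    for phrase, count in recent.items():
        recent_rate = count / n_recent
        # Smoothing keeps phrases unseen in the reference window finite.
        reference_rate = (reference[phrase] + smoothing) / (n_reference + smoothing)
        scores[phrase] = recent_rate / reference_rate
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Toy data standing in for the 24-hour and 1-hour indexes.
reference_posts = ["good morning everyone"] * 20 + ["lunch time now"] * 4
recent_posts = [
    "earthquake just now",
    "big earthquake just now",
    "good morning everyone",
]
ranked = buzz_scores(recent_posts, reference_posts)
```

Because rates are normalized by the number of posts in each window, the two windows can differ in length. Phrases that are common all day, such as "good morning" in the toy data, score below 1, while phrases that appear only in the recent window rise to the top.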
29. Try http://buzztter.com and give
me your feedback please!
If you are interested in these
keywords:
groonga, AMQP, RabbitMQ, Typhoeus,
EventMachine, PostgreSQL, libtextcat,
Sinatra, Rails, Twitter4R, net/irc,
memcached, PrefixSpan, MeCab, TF-IDF, ...
or our BuzzDAS Engine,
please contact me!
My name is Yoji Shidara.