Se ha denunciado esta presentación.
Utilizamos tu perfil de LinkedIn y tus datos de actividad para personalizar los anuncios y mostrarte publicidad más relevante. Puedes cambiar tus preferencias de publicidad en cualquier momento.
1 
Elasticsearch 
Amir Sedighi 
Twitter: @amirsedighi 
Blog: http://hexican.com 
Email: sedighi@gmail.com 
Oct 2014
2 
References 
● http://elasticsearch.org/ 
● https://www.found.no/foundation/elasticsearch-in-production/ 
● https://www....
3 
● Thanks to Alex Brasetvik (@alexbrasetvik) 
from @foundsays, for the slides. 
● Thanks to Leslie Hawthorn (@lhawthorn)...
Powered by Lucene, Search Stuffs 
● 1999 Doug Cutting 
● 2003 Doug Cutting 
● 2004 Yonik Seeley 
● 2010 Shay Banon
5 
● Full-Text Search Library. 
● Free & Open-Source 
● Features: 
– Indexes & Analyzes Data 
– Tokenizing 
– Filtering 
–...
6 
● Free and Open-Source 
● Java (Cross-platform) 
● Real-Time Analytical Search Engine 
● Distributed 
● Highly Availabl...
7
8
Shard
Inverted Index
One Index Per a Day
A Partial Query
The filtered Query Graph
50 
Question 
● Can ES be used as a "NoSQL"-database?
51 
Production and Deployment 
● Keeping End-users Happy. 
● Tracking Quality of Service and Healthy.
52 
Agenda 
● Memory (Performance and Reliability) 
● Security 
● Networking (Reliability)
53 
Memory 
● Search engines have a great appetite for 
memory! 
– Caches, caches, caches 
● Field and filter caches 
● In...
54 
Comparison 
● RDBMSs are built to store. They Put good 
things in memory, and will flush to disk when 
there is no mem...
55 
Question 
● What if you don't provide them enough 
memory?
Question 
● What if you don't provide them enough 
memory?
57 
Out Of Memory 
● In the best case: 
– Your Indexing or Search Request simply failed. 
● More: 
– Cluster state corrupt...
58 
Warning Signs 
● ES provides lots of end-points to give you 
insights into it. 
– Resource Usage 
● Cache Sizes 
● Hea...
59 
Marvel
Try it on the Cloud by http://found.no 
60
61 
BigDesk
62 
Paramedic
63 
Memory Constraints 
● Large heaps are expensive to garbage collect. 
– JVM can no longer user pointer compression if 
...
64 
Security 
● Everyone is most welcome. 
● Auth(z) things aren't ES business. 
– You are the gatekeeper 
● Upon the role...
65 
Security Shield is coming soon
66 
Networking 
● ES works great, on a single node. 
● ES is impressively easy to use for being a 
distributed system. 
● ...
67 
Networking
68 
Networking
69 
Networking in a Log Manager
70 
Suggestions 
● Have enough memory to keep your nodes 
reliable. 
● Have majority of nodes. 
● Favor filters over match...
71 
Questions?
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
Próxima SlideShare
Cargando en…5
×

An Introduction to Elasticsearch for Beginners

This is an introduction to Elasticsearch, based on Alex Brazetvik presentations, Elasticsearch from the bottom up and Elasticsearch in production.

  • Sé el primero en comentar

An Introduction to Elasticsearch for Beginners

  1. 1. 1 Elasticsearch Amir Sedighi Twitter: @amirsedighi Blog: http://hexican.com Email: sedighi@gmail.com Oct 2014
  2. 2. 2 References ● http://elasticsearch.org/ ● https://www.found.no/foundation/elasticsearch-in-production/ ● https://www.found.no/foundation/sizing-elasticsearch/ ● https://www.found.no/foundation/elasticsearch-as-nosql/ ● https://www.found.no/foundation/elasticsearch-from-the-bottom-up/
  3. 3. 3 ● Thanks to Alex Brasetvik (@alexbrasetvik) from @foundsays, for the slides. ● Thanks to Leslie Hawthorn (@lhawthorn) from @elasticsearch, for the stickers.
  4. 4. Powered by Lucene, Search Stuffs ● 1999 Doug Cutting ● 2003 Doug Cutting ● 2004 Yonik Seeley ● 2010 Shay Banon
  5. 5. 5 ● Full-Text Search Library. ● Free & Open-Source ● Features: – Indexes & Analyzes Data – Tokenizing – Filtering – Wildcards – Aggregation – Sorting
  6. 6. 6 ● Free and Open-Source ● Java (Cross-platform) ● Real-Time Analytical Search Engine ● Distributed ● Highly Available ● RESTful
  7. 7. 7
  8. 8. 8
  9. 9. Shard
  10. 10. Inverted Index
  11. 11. One Index Per a Day
  12. 12. A Partial Query
  13. 13. The filtered Query Graph
  14. 14. 50 Question ● Can ES be used as a "NoSQL"-database?
  15. 15. 51 Production and Deployment ● Keeping End-users Happy. ● Tracking Quality of Service and Healthy.
  16. 16. 52 Agenda ● Memory (Performance and Reliability) ● Security ● Networking (Reliability)
  17. 17. 53 Memory ● Search engines have a great appetite for memory! – Caches, caches, caches ● Field and filter caches ● Index building
  18. 18. 54 Comparison ● RDBMSs are built to store. They Put good things in memory, and will flush to disk when there is no memory. – Slower but working. – Timeout is a client matter. ● Search-Engines are built for speed. – Fast running or not running. – Assumption: You've provided enough memory.
  19. 19. 55 Question ● What if you don't provide them enough memory?
  20. 20. Question ● What if you don't provide them enough memory?
  21. 21. 57 Out Of Memory ● In the best case: – Your Indexing or Search Request simply failed. ● More: – Cluster state corrupted. – Crashed Netty. ● Just don't end up there in your production cluster.
  22. 22. 58 Warning Signs ● ES provides lots of end-points to give you insights into it. – Resource Usage ● Cache Sizes ● Heap Space ● There are Monitoring Tools. – Profile your queries and optimize them.
  23. 23. 59 Marvel
  24. 24. Try it on the Cloud by http://found.no 60
  25. 25. 61 BigDesk
  26. 26. 62 Paramedic
  27. 27. 63 Memory Constraints ● Large heaps are expensive to garbage collect. – JVM can no longer user pointer compression if heap goes beyond 32GB. – Keep heap < 32GB ● Single Machine with Huge amount of Memory/SSD. – Multiple nodes on super-fast machine with SSD and big amount of RAM. (Note: Replicas, SPF) ● Scale-Out
  28. 28. 64 Security ● Everyone is most welcome. ● Auth(z) things aren't ES business. – You are the gatekeeper ● Upon the role, limit the user requests applying filters. – Out of memory is a critical issue. (Attacks) – Unfiltered or unnecessary queries are pretty memory consuming.
  29. 29. 65 Security Shield is coming soon
  30. 30. 66 Networking ● ES works great, on a single node. ● ES is impressively easy to use for being a distributed system. ● ES Supports lots of different network topologies.
  31. 31. 67 Networking
  32. 32. 68 Networking
  33. 33. 69 Networking in a Log Manager
  34. 34. 70 Suggestions ● Have enough memory to keep your nodes reliable. ● Have majority of nodes. ● Favor filters over matching queries. ● Have an eye on the cluster (Health). ● Don't let user to run faceted queries or reduce the frequency.
  35. 35. 71 Questions?

×