Se ha denunciado esta presentación.
Utilizamos tu perfil de LinkedIn y tus datos de actividad para personalizar los anuncios y mostrarte publicidad más relevante. Puedes cambiar tus preferencias de publicidad en cualquier momento.

Apache SOLR in AEM 6

8.731 visualizaciones

Publicado el

Introduction to Apache SOLR and configuring Apache SOLR with AEM 6

Publicado en: Tecnología

Apache SOLR in AEM 6

  1. 1. Introduction to Apache SOLR in Adobe AEM 6 Dr.Yash Mody, PhD CTO | Tekno Point Consulting
  2. 2. About Me Adobe AEM,Apache Hadoop Instructor & Consultant Application Architecture and Design Consultant Need I say more?  
  3. 3.  
  4. 4. Information Retrieval Document Term Inverted Index Term Frequency (tf) Skip Pointers Positional Index Collection Frequency Document Frequency (df) Inverse Frequency Idf = Log10(N/df) Term Frequency Inverse Document Frequency tf-idf = tf * Idf  
  5. 5. More??? PHEW! No Way  
  6. 6. Apache SOLR Fire Powered Lucene Distributed Replicated Remote And just for the record its… SEARCH On LUCENE w/REPLICATION (TBHPHB)  
  7. 7. Installation Unpack SOLR distribution Add solr.war to webapps Add –Dsolr.solr.home = … OR  
  8. 8. Getting solr ready Starting SOLR cd /usr/local/Cellar/solr/4.7.2/libexec/example/ - jetty java -jar start.jar http://localhost:8983/solr/#/ Adding content using  
  9. 9. Index and search Indexing Data java -jar post.jar solr.xml Searching http://localhost:8983/solr/select?q=solr&wt=json  
  10. 10. Configurations Configurations are done in 2 xml files schema.xml – SOLR index configurations solrconfig.xml – SOLR configurations  
  11. 11. Indexing Indexing is using HTTP POST. So indexed can be posted to SOLR via a web request Data can be pulled using Data Import Handler (uses HTTP GET or DB) SOLR can index binary content (textual + metadata) from docs, video, mp3, images and other binary content  
  12. 12. Search Search features: Paging, Filtering, Sorting, Faceting Results: xml (Default), json, php, ruby, python etc. Query Parser: used to interpret queries. 2 types of query parsers Lucene Query Syntax Parser DisMax Parser (Disjunction Max)  
  13. 13. Solr integration approaches Crawl using an external crawler like Nutch or Heritrix CQ servlets to serialize content into a Solr (JSON/XML) JCR Observer for page modifications to trigger indexing to Solr.  
  14. 14. AEM 6 2 Types In Built Remote (For distributed) Zookeeper (for setting up a cluster) Shard – horizontal Partition Replication – no of copies of the index files  
  15. 15. SOLR things we didn’t see  
  16. 16. Thanks @yash_mody