This document provides an overview of Apache Solr and how to integrate it with Drupal 7. Apache Solr is an open source enterprise search platform built using Lucene. It provides advanced text analysis, faceted search, and geospatial search capabilities. The document outlines how to install Solr, configure the required Solr files, copy the Drupal configuration files, and set up search pages. It also discusses useful Drupal modules for integrating Solr and some powerful hook functions for customizing the integration.
1. Drupal7 & Apache Solr
By Nikolay Ignatov
OpenSource Department
Senior PHP Developer
2. About me
• Nikolay Ignatov
• Using PHP since 2006
• Working with Drupal since 2009
• Love to learn new technologies
(PHP Libevent: http://php.net/manual/en/book.libevent.php)
4. What is Apache Solr?
• Open source enterprise search platform
• Written in Java
• Built using Lucene Core (Apache Lucene Project)
• SOLR = Searching On Lucene w/Replication
• Standalone server using Tomcat or Jetty
• REST-like API
5. Some Solr features
• Advanced, Configurable Text Analysis
• Faceted Search and Filtering
• Geospatial Search
• Rich Document Parsing and Indexing (PDF, Word, HTML, etc.) using Apache Tika
• More on http://lucene.apache.org/solr/
6. Installing Apache Solr
• Java 1.5 or greater installed.
• A servlet container such as Tomcat, Jetty or Resin
• Download Apache Solr from http://apache.skknet.net/lucene/solr/
• Follow the installation instructions from http://wiki.apache.org/solr/SolrInstall
• The Apache Solr download already includes Jetty
Thank you very much for joining me today. My name is Nikolay Ignatov and I will speak about Drupal 7 and Apache Solr integration. If any questions pop up during the session, please write them down and we can discuss them at the end. Before we begin, how many of you speak English? How many of you don't understand English? That will decide which language I present in.
As I said, my name is Nikolay Ignatov and I work at ProPeople as a Senior PHP developer and team lead. I have been coding in PHP since 2006 and working with Drupal since 2009 (the year I joined the ProPeople team). I love to learn new technologies and experiment with them. I recently reviewed this cool library for PHP, Libevent. I'm almost sure that it will displace Node.js, because it does the same thing but with all the power of PHP. (We can discuss this after the session, because I know there are people in the room who do not agree with me.)
Does anybody know what Apache Solr is? (I know that a lot of the ProPeople guys know about it.)
Apache Solr is an open source enterprise search platform. It is a fast-growing search engine written in Java. It is built using Lucene Core, which provides Java-based indexing and search technologies. All this makes Solr a powerful standalone full-text search server that runs within a servlet container such as Tomcat or Jetty. It has a REST-like API. You put documents in it (called "indexing") via XML, JSON or binary over HTTP. You query it via HTTP GET and receive XML, JSON, or binary results.
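To make the REST-like API concrete, here is a minimal Python sketch that only builds the HTTP requests described above without sending them. The base URL, the `/update` and `/select` paths, and the document fields are assumptions for a default local install. The exact update path for JSON differs between Solr versions, so check it against your own setup.

```python
import json
from urllib.parse import urlencode

# Assumption: a local single-core Solr instance on the default Jetty port.
SOLR_BASE = "http://localhost:8983/solr"


def build_update_request(docs):
    """Return (url, headers, body) for indexing documents as JSON over HTTP POST."""
    # commit=true asks Solr to make the documents searchable right away.
    url = SOLR_BASE + "/update?" + urlencode({"commit": "true"})
    headers = {"Content-Type": "application/json"}
    body = json.dumps(docs)
    return url, headers, body


def build_select_url(query, rows=10):
    """Return the HTTP GET URL for a search that asks for JSON results."""
    params = {"q": query, "wt": "json", "rows": rows}
    return SOLR_BASE + "/select?" + urlencode(params)


if __name__ == "__main__":
    # Hypothetical document; the field names depend on your schema.
    print(build_update_request([{"id": "node/1", "title": "Hello Solr"}]))
    print(build_select_url("title:hello"))
```

The `wt` parameter selects the response format, which is how the same query can return XML, JSON, or binary results.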
Some features that Apache Solr supports: Advanced, Configurable Text Analysis. Faceted Search and Filtering: faceted search is a technique for accessing information organized into categories (facets), so that users can narrow down the results by filtering. I will show you examples of these two features in a while. Geospatial Search: what is this? For example, if you have a location content type that stores geo locations, you can index these locations in Solr and after that fetch, for example, all nearby locations without any special calculation. All of it is done in Solr. Rich Document Parsing and Indexing using Apache Tika. You can see more nice features on the Solr website.
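As a rough illustration of the faceting and geospatial features, the sketch below builds the query parameters Solr uses for them: `facet=true` with `facet.field`, and a `{!geofilt}` filter query whose distance `d` is given in kilometres. The field names (`type`, `author`, `location`) and the coordinates are hypothetical and would have to exist in your own schema.

```python
from urllib.parse import urlencode, parse_qs


def facet_params(query, facet_fields):
    """Return query parameters that ask Solr for facet counts on the given fields."""
    params = [("q", query), ("wt", "json"), ("facet", "true")]
    # facet.field may be repeated, once per field to count.
    params += [("facet.field", field) for field in facet_fields]
    return params


def nearby_filter(field, lat, lon, km):
    """Return a filter query keeping documents within `km` kilometres of a point."""
    return ("fq", "{!geofilt sfield=%s pt=%s,%s d=%s}" % (field, lat, lon, km))


if __name__ == "__main__":
    # Hypothetical fields: 'type' and 'author' facets, 'location' spatial field.
    params = facet_params("*:*", ["type", "author"])
    params.append(nearby_filter("location", 42.7, 23.3, 10))
    print(urlencode(params))
```

The distance calculation happens inside Solr, which is why the calling code only has to pass a point and a radius.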
Before we can start Apache Solr we should have the following: a PC with Java 1.5 or greater, and a servlet container such as Tomcat, Jetty or Resin. If we have these on our computer, we can download Apache Solr from the web page and follow the installation instructions from the Solr wiki page. Then we can check that Apache Solr is set up correctly.
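One way to check the setup is Solr's ping handler. The sketch below assumes the handler is reachable at `<base>/admin/ping` and that, with `wt=json`, it reports a top-level `"status": "OK"`. With a multi-core setup the core name would be part of the base URL, and the base URL used here is an assumption for a default local install.

```python
import json
from urllib.request import urlopen


def ping_url(base_url):
    """Build the ping URL for a Solr base URL (assumed handler path)."""
    return base_url.rstrip("/") + "/admin/ping?wt=json"


def ping_ok(body):
    """Return True if a ping response body reports status OK."""
    try:
        data = json.loads(body)
    except ValueError:
        # Not JSON at all, e.g. an HTML error page from the servlet container.
        return False
    return isinstance(data, dict) and data.get("status") == "OK"


def is_solr_up(base_url, timeout=5):
    """Send the ping request; any network or HTTP error counts as 'not up'."""
    try:
        with urlopen(ping_url(base_url), timeout=timeout) as resp:
            return ping_ok(resp.read().decode("utf-8"))
    except OSError:
        return False


if __name__ == "__main__":
    # Assumption: Solr running locally on the default Jetty port.
    print(is_solr_up("http://localhost:8983/solr"))
```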
Explain the special Solr files.
Install the Apache Solr module and the Facet API module. Copy the predefined Solr configuration from the module directory. Configure the environment and index the content.
Show how we can configure weights in the administration. Enable and set up facets.