1. Search Engine
Optimization & Drupal!
Keywords!
Page Optimization!
Site Optimization!
Ninja Toolkit!
Robert Carr rob@cognimatic.com!
2. Keywords!
The words or phrases typed into search
engine:!
!
!
!
Keywords define market and organizes
audience!
3. Keyword Research!
Market: who is your intended audience, and
what is the competition?!
!
Purpose: what does your site do?!
Brand – product, organization, pedigree!
Conversion – transactions, leads!
4. Finding Keywords!
Look at own site search statistics!
Look at competitorsʼ sites and keywords!
Monitor trends (Google Trends and Zeitgeist)!
Try out keywords in Google, Yahoo, Bing,
Wikipedia and real people!
!
…and paid tools and services!
!
!
6. Page Structure! And
where
to
sprinkle
keywords
My
Great
Page
|
UCLA
Page <title>! /tle
module
Page
h"p://ucla.edu/page-‐/tle
SemanticPath
auto
&
Clean
URLs
URL! Home
|
About
|
Calendar
|
Maps
Menu system! Title!
H1-H6 for titles and sections!
Keywords!!
Important text first!
But donʼt overdo it!
Meaningful filenames!
Link to stuff (more juice)!
W3C/508 compliance!
Interesting, fresh content!
8. Advanced Page Metadata!
Dublin Core – describe digital media and more thorough tagging
system.!
!
RDF (Resource Description Framework) – W3C spec
for relating information and adding semantic value to text.!
!
PICS labels – Platform for Internet Content Selection (Parental/Student
filters)… but use dwindling.!
!
Short URLs <link rel="shorturl" href="http://ucla.in/1234" />
drupal.org/project/nodewords
9. Site Optimization!
robots.txt - www.robotstxt.org!
sitemap.xml - www.sitemaps.org!
Speed [of page load] - ranking and humans!
Opcode/page caching!
Yslow for other tips (CDN, sprites, compression…)!
Redirects – HTTP 301 for page moves!
HTTP 403/404 Error pages – for humans!
Google Analytics!
11. Drupal Tools – Whole Site!
XML Sitemap (2.0+)
drupal.org/project/xmlsitemap!
Text sitemap drupal.org/project/site_map!
Boost drupal.org/project/boost!
Thrifty 404 module drupal.org/project/thrifty404!
Google Analytics
drupal.org/project/google_analytics!
Site Verification drupal.org/project/site_verify!
!
!
12. Drupal SEO Tools (condensed)!
In simple terms…
drupal.org/project/
seo_checklist!
13. Ninjaʼs Tools of Choice!
Google account and toolbar http://toolbar.google.com!
Yahoo https://siteexplorer.search.yahoo.com/!
Bing www.bing.com/webmaster !
Google Analytics www.google.com/analytics!
Google Webmaster Tools www.google.com/webmasters/tools!
Google Adwords Keyword Tool https://adwords.google.com/select/KeywordToolExternal!
SEO Plugin for Firefox http://tools.seobook.com/firefox/seo-for-firefox.html!
W3C et al validators: !
http://validator.w3.org!
WAVE toolbar (http://wave.webaim.org) !
Firefox Web Developer toolbar!
14. Great Reference!
• Drupal 6 SEO (Ben Finklea, Packt, Sep 2009)!
• SEO Warrior (John Jerkovic, OʼReilly, Nov 2009)!
• Yahoo Style Guide (Yahoo, July 2010) !
But see also:
groups.drupal.org/search-engine-optimization!
15. The End!
Keywords!
Page Optimization!
Site Optimization!
Ninja Toolkit!
Notas del editor
Not a Ninja… but the areas I’m going to cover are brief intro to the world of SEO“Why am I not #1 when I type my name into Google?”
Defining a market – people searching for a specific productKeywords aggregate searchers into specific audiences – need to channel audience to the right content
Brand – company/product brand awareness, or building credibilityConversion – (conversion rates), selling products, email subscription, white paper downloads…
Own site – server records – referrals; Google Tools suite – popular search termsOther sites – Google Adwords Keyword ToolRight terms: terms where your site ranks highest, but need to trade off amount of competition with level of traffic (subjective analysis)Paid tools: Google Adwords, SEO Consultants (he’s called Shimon BTW) etc
Could stop presentation! Good content will always be found, regardless of SEO
URLs – avoid parameterised ?id=12345 etc; use descriptive terms (keywords); hypens to join words (no spaces or underscores); use dates structures when appropriateMenu – puts current page into context, and allows search engines and users find related materialH1 for page title/heading only – ONE occurrence; use H2-H6 for section headings (do not apply CSS to body text to simulate headings). Turn off CSS for insight.If inserting PDFs or images, use a meaningful URI, eg, man-relaxing-beach-sun.jpgIt’s a webpage, not a document, so link stuff: PAGE RANK – Google Toolbar. Ranks revised 3-4 times/year. Link to pages with high RANK, then you get a bit of their juice… Even more juice if high ranking pages link to you.Tag all content correctly (ie, ALT, TITLE tags etc). Good for accessibility; great for SEOOh – might say this a few times – WRITE INTERESTING CONTENT… lay it out in visually appealing manner to attract users in the BLINK moment (Blink – Malcolm Gladwell). Google caches every 5-6 days - keep it up to date as your ranking will slip over time if not updated.
KEYWORDS – largely ignored by search engines due to abuse in 90sDESCRIPTION – used as text summary in SE listingCOPYRIGHT – should be on every pageGEO – add positional info (now appears on Google listings). ICBM = sad geek humour (drop bomb here)ROBOTS – tells search engines whether to index page or not, or tell them to ignore embedded URLs (useful if page includes public comments)CANONICAL URL – if content can be accessed by duplicated URLs (esp on parameterised URLs, or sites that use REWRITE rules), this lets the Search Engine now definitive reference and stops you being penalised for duplicate contentTITLE tag – place in HEAD use keywords – can be same as first H1… but most important/unique keywords first (ie, priority LEFT to RIGHT)REDIRECT – if a page’s URL moves and you cannot configure a server-side redirect, there are various methods to auto forward users to the new page. At very least put a link to new page.
Dublin Core: 15 elements Title Creator SubjectDescriptionPublisherContributorDateTypeFormatIdentifierSourceLanguageRelation CoverageRightsRDF: Food for Robots. Essentially embedding XML type descriptors around information to define relationships and meaning. Types of RDF = FOAF and RSS. Really useful is linked to external taxonomies , eg, Thompson-Reuters Open Calais for tagging with named entities (people, organizations, geographies, books, albums etc), facts and events. Web 3.0 stuff.(Triples: subject-predicate-object expressions: “The sky has the color blue”: a subject denoting "the sky", a predicate denoting "has the color", and an object denoting "blue".)PICS: AOL usersShort URLs (W3C approved?) – TinyURL was banned! Bit.lyGna.rl Get with Twitter generation. May die a death soon
Robots.txt – food for robots – tells search engine crawlers what content to look at, what to ignore and how often to crawl. Google will drop you site ranking (or even ignore) without such a file in the ROOT directorySitemaps – XML file useful for search engines to work out the information architecture of your site; text sitemaps can be useful to your human visitors tooPage Load – not only frustrating for human viewers, Google (as of March 2010) now factors page load time into Rank calculation. Lots of techniques for improving HTTP requests – Yslow is the only real tool for this.Redirects – assume old URLs never die. Server side tweaks – HTTP 301 redirect for permanent changes; 307 for temporary redirects. Helps users, and informs search engines so you don’t get penalized for duplicate contentError pages – make a custom error page for 404 errors (page no longer exists) with something helpful – will keep users on your site longer; same for (403) access deniedGoogle Analytics – Uses Javascript on each page to track all access into, within and out of your site. Great tool for site analysis – but Google now knows exactly how popular your site really is(… page ranking again!). Can analyze site server log (doesn’t need JS) but data not as rich
Google Account – passport top all their various tools. Toolbar gives page rankingGoogle Analytics – track your users – you’ll need to embed code in each page of your site. Wait at least a few days to get some useful info (see other CWP Presentation)Google Webmaster Tools: List top search queries, keywords, crawler stats, 404 errors, broken links, malware check, site performance… Need to upload verification file to site root.Google Adwords: Type in keywords and see stats across whole web for related terms (competition, monthly traffic country/global)SEO Plugin has some useful features, but for ucla.edusubdomains (MOST OF US), most analysis is for the overall domain of ucla.eduW3C/508 validators… Firefox Web Developer Toolbar probably the most useful toolAlso… Google Website Optimizer (A/B testing) – which type of layout/functionality is more effective (version A, or version B?)