Getting Started with Amazon CloudSearch

Introduction to Amazon CloudSearch

© 2012 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified or distributed in whole or in part without the express consent of Amazon.com, Inc.

What You Can Expect To Learn In This Webinar

Amazon CloudSearch details
How search works
How to set up and configure your search domain
CloudSearch pricing
Where to find additional resources


Introduction To Amazon CloudSearch
Fully-managed, full-featured search service
Automatically scales for data & traffic
Handles both structured and unstructured data
Near real-time indexing
Up and running in less than 1 hour


How Search Works


Introduction to Search


Inverted Index

US President
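
As a rough illustration of the idea behind this slide: an inverted index maps each term to the set of documents that contain it, so a query such as "US President" reduces to intersecting the document sets of its terms. The sketch below is generic Python with made-up sample documents. It is not a description of how CloudSearch builds its index internally.

```python
from collections import defaultdict


def build_inverted_index(docs):
    """Map each lowercase term to the set of document IDs that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return index


def search(index, query):
    """Return IDs of documents containing every query term (AND semantics)."""
    terms = query.lower().split()
    if not terms:
        return set()
    postings = [index.get(term, set()) for term in terms]
    return set.intersection(*postings)


# Hypothetical sample documents, for illustration only.
docs = {
    1: "us president biographies",
    2: "history of the us senate",
    3: "the president of france",
}
index = build_inverted_index(docs)
matches = search(index, "US President")  # only document 1 has both terms
```

A real search engine would also tokenize punctuation, apply stemming and stopword rules, and store positions for phrase matching; those are left out here.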


Search On The Web
Relevance/Ranking
Faceting
Range Searching
Fielded Searching
Boolean Queries
Complex Relevance


Features Overview
Full Featured Search
• Free text, structured data, and Boolean search
• Faceting
• Customizable relevance ranking
• Fielded and range search
• Result sorting
• Text processing options
• Near real-time indexing

Easy to Set Up and Use
• HTTP endpoints: configuration, document upload, search
• Web console
• Command line tools
• APIs


Amazon CloudSearch Architecture
[Architecture diagram]
SEARCH CLIENT (www.example.com): sends search requests and receives search results
SEARCH DEVELOPER: uses the search tester, sends documents, creates and manages domains

SEARCH ENDPOINT (Search API, Console) -> ACCESS CONTROL -> SEARCH SERVICE
• Search documents

DOCUMENT SERVICE ENDPOINT (Document Service API, Command Line Tools, Console) -> ACCESS CONTROL -> DOCUMENT SERVICE
• Add documents
• Update documents
• Delete documents

CONFIGURATION SERVICE ENDPOINT (Configuration API, Command Line Tools, Console) -> ACCESS CONTROL -> CONFIGURATION SERVICE
• Create domains
• Configure domains
• Delete domains


Creating an Amazon CloudSearch Domain

1. Create a search domain
2. Upload documents
3. Configure search fields and text processing options
4. Integrate CloudSearch into your application


Create Search Domain
Amazon CloudSearch Console
• 3 easy steps
• Hides complexity
• Management dashboard


Create Search Domain


Upload Documents
Console (good for small data sets)

Command Line
• cs-post-sdf --source <file> [other options]
• curl -d @<file> [other options]
Direct-to-API
• http://<endpoint>/2011-02-01/documents/batch
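
For the direct-to-API route, a minimal Python sketch of preparing the batch upload request might look like the following. The endpoint host is a placeholder, the helper name `build_batch_request` is made up for this example, and the JSON content type is an assumption that fits a JSON-formatted batch. The request is only constructed, not sent, so the sketch runs without network access.

```python
import json
import urllib.request

API_VERSION = "2011-02-01"  # version string shown in the slide URLs


def build_batch_request(doc_endpoint, batch):
    """Build (but do not send) a POST request carrying a document batch as JSON."""
    url = f"http://{doc_endpoint}/{API_VERSION}/documents/batch"
    body = json.dumps(batch).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Placeholder endpoint; substitute your domain's document service endpoint.
request = build_batch_request("doc-example.invalid", [])
# urllib.request.urlopen(request) would send it; omitted so the sketch stays offline.
```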


Upload Documents
Search Data Format (SDF)
[ {"type": "add",
   "id": "sombzze12a8c134960",
   "version": 5,
   "lang": "en",
   "fields": {
     "title": "The History Buff’s Guide to the Presidents",
     "author": "Thomas R. Flagel",
     "year": "2007",
     "book_id": "sombzze12a8c134960",
     "popularity": 449425,
     "genre": ["biographies", "politics", "social science"]
   } },
...]
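
A batch shaped like the example above can be assembled programmatically. The following is a sketch under stated assumptions: the helper names are made up, and the size check applies the 5 MB per-batch limit from the pricing slide, interpreted here as 5 * 1024 * 1024 bytes.

```python
import json

# 5 MB batch limit from the pricing slide; the exact byte count is an assumption.
MAX_BATCH_BYTES = 5 * 1024 * 1024


def make_add_operation(doc_id, version, fields, lang="en"):
    """Return one SDF "add" operation with the same keys as the slide's example."""
    return {"type": "add", "id": doc_id, "version": version, "lang": lang, "fields": fields}


def serialize_batch(operations):
    """Serialize operations to JSON bytes, refusing batches over the size limit."""
    body = json.dumps(operations).encode("utf-8")
    if len(body) > MAX_BATCH_BYTES:
        raise ValueError(f"batch is {len(body)} bytes; limit is {MAX_BATCH_BYTES}")
    return body


batch = [
    make_add_operation(
        "sombzze12a8c134960",
        5,
        {"title": "The History Buff's Guide to the Presidents", "year": "2007"},
    )
]
body = serialize_batch(batch)
```

Checking the size before sending lets a loader split a large data set into several batches instead of having an oversized upload rejected.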


Upload Documents
Automatic Scaling: Data
Amazon CloudSearch adds capacity
• Automatically
• Seamlessly

[Diagram: as DATA (document quantity and size) grows, a single search instance (Index Partition 1, Copy 1) becomes n search instances, each holding one index partition (Index Partition 1 through n, Copy 1 of each)]


Configure Search Domain
Automatic configuration detection
Easy to update
Fully customizable


Configuration
Field types: text, literal, uint
Options: search, facet, result
Defaults and sources


Custom Ranking
Simple syntax
Use integer fields


Search Integration
Easy-to-integrate HTTP endpoint
• http(s)://<endpoint>/2011-02-01/search

Full-featured query language

Queries are specified as URL parameters
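
Because queries are plain URL parameters, a search URL can be built with the standard library alone. A minimal sketch, assuming a placeholder endpoint host and a made-up helper name:

```python
from urllib.parse import urlencode

API_VERSION = "2011-02-01"  # version string shown in the slide URLs


def build_search_url(search_endpoint, params):
    """Build a search URL; each key in params becomes a URL query parameter."""
    return f"http://{search_endpoint}/{API_VERSION}/search?{urlencode(params)}"


# Placeholder endpoint; substitute your domain's search endpoint.
url = build_search_url("search-example.invalid", {"q": "us presidents"})
# url ends with "/2011-02-01/search?q=us+presidents"
```

`urlencode` takes care of escaping spaces and punctuation, which matters once queries contain quotes or parentheses.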


Search Integration
Console
Full text search
Text relevance
Facets


Search Integration
APIs
Full-text search
• http://<endpoint>/2011-02-01/search?q=us+presidents
Complex and fielded search
• bq=(and title:'us presidents' genre:'history')
Retrieving facet counts
• q=us+presidents&facet=genre
Custom Ranking
• q=us+presidents&rank=custom,text_relevance
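
The fielded Boolean query above can be composed from parts and then URL-encoded. This is a sketch with made-up helper names; it does not handle values that themselves contain single quotes, which would need escaping.

```python
from urllib.parse import parse_qs, urlencode


def field_term(field, value):
    """Format one fielded clause such as title:'us presidents'."""
    return f"{field}:'{value}'"


def bq_and(*clauses):
    """Combine clauses into a single (and ...) Boolean query expression."""
    return "(and " + " ".join(clauses) + ")"


bq = bq_and(field_term("title", "us presidents"), field_term("genre", "history"))
query_string = urlencode({"bq": bq})  # percent-encodes quotes, colons, parentheses

# Decoding the query string recovers the original expression unchanged.
round_trip = parse_qs(query_string)["bq"][0]
```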


Search Integration
APIs
Retrieving data
• q=us+presidents&return-fields=title,actor,director

Pagination
• q=us+presidents&size=20&start=200

Integer range search
• bq=year:1970..1980
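
The pagination example can be derived from a page number. In this sketch `start` is treated as a zero-based offset of the first hit to return, so zero-based page 10 at 20 results per page gives `start=200`. The helper name and the zero-based page convention are assumptions made for illustration.

```python
from urllib.parse import urlencode


def page_params(query, page, size=20):
    """Return search parameters for a zero-based page number."""
    if page < 0 or size <= 0:
        raise ValueError("page must be >= 0 and size must be > 0")
    return {"q": query, "size": size, "start": page * size}


params = page_params("us presidents", page=10)
query_string = urlencode(params)  # "q=us+presidents&size=20&start=200"
```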


Automatic Scaling: Data & Traffic
[Diagram: growth in DATA (document quantity and size) adds index partitions, from one search instance (Index Partition 1, Copy 1) to Index Partitions 1 through n. Growth in TRAFFIC (search request volume and complexity) adds copies of each partition, up to Copy n.]


Pricing


Pricing Model Parameters
Search instances
Document uploads
IndexDocuments requests


Pricing Model

1. Search Instance Types*
Search Instance Type    Cost per hour
Small                   $0.12
Large                   $0.48
XLarge                  $0.68

2. Document Upload Charge
$0.10 per 1,000 batch uploads
Each batch is limited to 5 MB

3. IndexDocuments Request Charge
$0.98 per GB of data in the search domain


Pricing Example
1 million documents
Average document size: 1 KB
80K updates per day
1 million queries per day
1 IndexDocuments request per month

Cost: $97/month
1 Small Search Instance
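
The arithmetic behind an estimate like this can be sketched from the three pricing parameters. The rates come from the pricing slide. The 30-day month, the rounding of 1 million 1 KB documents to 1 GB, and especially `batches_per_day` are assumptions: the slide does not say how the 80K daily updates are grouped into batches, so the value below (about 25 updates per batch) was picked by hand.

```python
HOURLY_RATE = {"small": 0.12, "large": 0.48, "xlarge": 0.68}  # USD per instance-hour
UPLOAD_RATE_PER_1000_BATCHES = 0.10  # USD
INDEX_RATE_PER_GB = 0.98  # USD per GB in the domain, per IndexDocuments request


def monthly_cost(instance_type, instances, batches_per_day, index_requests,
                 data_gb, days=30):
    """Estimate one month's cost from instances, batch uploads, and index requests."""
    instance_cost = HOURLY_RATE[instance_type] * 24 * days * instances
    upload_cost = batches_per_day * days / 1000 * UPLOAD_RATE_PER_1000_BATCHES
    index_cost = index_requests * data_gb * INDEX_RATE_PER_GB
    return instance_cost + upload_cost + index_cost


# 1 million documents at about 1 KB each is roughly 1 GB of data.
# batches_per_day=3200 is a guess, not a figure from the slide.
estimate = monthly_cost("small", 1, batches_per_day=3200, index_requests=1, data_gb=1.0)
```

With these inputs the estimate is about $96.98: $86.40 for the instance, $9.60 for uploads, and $0.98 for indexing. That is close to the slide's $97, but only because of the hand-picked batching figure; a different batching strategy changes the upload portion proportionally.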


CloudSearch Users


Top Requested Features
Multi-region
Multi-AZ
Languages
Auto-complete
Highlights


Resources
Amazon CloudSearch Overview Page
http://aws.amazon.com/cloudsearch/
• FAQs
• Community Forum
• Documentation & Getting Started Tutorial (IMDb)
Demos and Tutorials
• What Is Amazon CloudSearch
• Introducing Amazon CloudSearch (Features)
• Building a Search Application Using Amazon CloudSearch

