24. XPath Parser
Scrapy is a web crawling framework
used to crawl websites and extract
structured data from their pages.
# Regular Expression
Every character is
treated the same
# Alternative: XPath
An HTML document can be
treated as structured data
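
A minimal sketch of the contrast above. It uses Python's standard-library `re` and `xml.etree.ElementTree` (which supports only a limited XPath subset) instead of Scrapy's own selectors, and the HTML snippet and variable names are made up for illustration.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical, well-formed snippet standing in for a real page.
PAGE = """<html><body>
<div class="header"><h1><a href="/home">Home</a></h1></div>
<div class="content"><p><a href="/news">News</a></p></div>
</body></html>"""

# Regular expression: the page is one flat string, so every <a> looks alike.
regex_links = re.findall(r'<a href="([^"]+)">', PAGE)

# XPath: the page is a tree, so we can ask for the link inside the header only.
root = ET.fromstring(PAGE)
xpath_links = [a.get("href")
               for a in root.findall("./body/div[@class='header']/h1/a")]

print(regex_links)  # ['/home', '/news']
print(xpath_links)  # ['/home']
```

The regular expression cannot tell the two links apart without extra pattern work, while the path expression selects by position in the document structure.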
25. XPath is like an “address”
# C://Python27
# html/body/div[@class="wrapper"]/div[@class="header.clearfix"]/h1[@class="logo"]/a
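
The address analogy can be tried out with a small sketch. It assumes a made-up page whose nesting mirrors the path on the slide (class values copied literally from the slide) and uses the standard-library `xml.etree.ElementTree`, not Scrapy. Note that attribute tests in XPath are written with `@`, as in `@class`.

```python
import xml.etree.ElementTree as ET

# Hypothetical page; the nesting mirrors the path on the slide.
PAGE = """<html><body>
<div class="wrapper">
  <div class="header.clearfix">
    <h1 class="logo"><a href="/">Site name</a></h1>
  </div>
</div>
</body></html>"""

root = ET.fromstring(PAGE)  # root is the <html> element

# Each step narrows the search, like one folder level in a file path.
# The path is relative to <html>, so it starts at "body".
path = ("body/div[@class='wrapper']/div[@class='header.clearfix']"
        "/h1[@class='logo']/a")
link = root.find(path)

print(link.text, link.get("href"))  # Site name /
```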
33. #3 Daily options strike and price strength distribution chart
Find
the “URL pattern”
Crawl pages and
scrape data
Store into DB
Visualization:
histogram
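
The four steps above can be sketched end to end as follows. This is only an illustration under stated assumptions: the URL pattern, the page layout, and the table and column names are all hypothetical. No network request is made. An in-memory SQLite database stands in for MSSQL, and a text histogram stands in for the chart.

```python
import sqlite3
import xml.etree.ElementTree as ET
from collections import Counter

# 1. Hypothetical URL pattern: only the date part changes between pages.
URL_PATTERN = "http://example.com/options?date={date}"
urls = [URL_PATTERN.format(date=d) for d in ("2014-01-02", "2014-01-03")]

# Stand-in for a downloaded page; a real crawler would fetch each URL.
SAMPLE_PAGE = """<html><body><table id="quotes">
<tr><td class="strike">8500</td><td class="price">120</td></tr>
<tr><td class="strike">8600</td><td class="price">75</td></tr>
<tr><td class="strike">8600</td><td class="price">80</td></tr>
</table></body></html>"""

def scrape(page):
    """2. Return (strike, price) pairs found in the quotes table."""
    root = ET.fromstring(page)
    rows = root.findall("./body/table[@id='quotes']/tr")
    return [(int(r.find("td[@class='strike']").text),
             int(r.find("td[@class='price']").text)) for r in rows]

# 3. Store into a DB (in-memory SQLite here instead of MSSQL).
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE quotes (url TEXT, strike INTEGER, price INTEGER)")
for url in urls:
    db.executemany("INSERT INTO quotes VALUES (?, ?, ?)",
                   [(url, s, p) for s, p in scrape(SAMPLE_PAGE)])

# 4. Histogram: number of stored rows per strike.
histogram = Counter(s for (s,) in db.execute("SELECT strike FROM quotes"))
for strike in sorted(histogram):
    print(strike, "#" * histogram[strike])
```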
Google Developer Tools
Python
Scrapy
XPath Helper
Django -> MSSQL
Excel
Windows
Platform