The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
Web Information Systems Introduction and Origin of World Wide Web
1. Web Information Systems
Tanvi Banerjee, Amit Sheth, et al.
1Kno.e.sis Center / Dept. of Computer Science & Engineering
Wright State University
One of the courses under
Big and Smart Data Science Certificate.
2. Flipped Classroom
Most classes will have three components:
Before the class: Prerecorded online video and
other reading material
During the class: discussions/Q&A of what was
reviewed before the class; exercises with
consultation if needed
Exercises – some of which will be graded
Web Information Systems Course
3. 3
Class 1: Module 1:
Preliminaries and Objectives
Image source: CISCO
4. “More data has been created in the last
three years than in all the past 40,000 years”
- Teradata
Course Context: How to manage, use
and benefit from Data (on the Web)
This statistics is for all data, but almost all of the data flows on the Internet and is accessible via the Web
5. Course Objectives
"The World Wide Web is the universe of network-
accessible information, an embodiment of human
knowledge.” [TBL]. The Web has become the
largest repository of documents and data, and the
largest source of information and knowledge.
How do you effectively access and utilize Web
data?
How do you build data centric Web applications?
How do you convert data into information? How
do you build, share and use knowledge?
Web Information Systems Course
7. Course Plan
Learn and exercise variety of techniques,
technologies and tools that focus on collecting
and using data for building rich data and
information centric Web applications
Do exercises to become comfortable with the
techniques and technologies
Do a substantial project
Some additional assignments and more
indepth/demanding projects for graduate
students
Web Information Systems Course
8. 8
Class 1: Module 2
Internet and Web:
Brief History and Review
Web Information Systems Course
9. Internet (the Net) vs WWW (the Web)
The Net: “a world-wide system of computer
networks, … users at any computer can get
information (if they have permissions) get
information from any other computer” [Review:
ARPANet,TCP/IP,… ]
The Web: “all the resources and users on the
internet that are using HTTP, a system of
internetworked hypertext documents, the
universe of network-accessible information”
[Review: http, html, URL, …; also Web History,
Original Proposal]
10. The World Wide Web functions
as a layer on top of the Internet
Source: Wikipedia
Web Information Systems Course
11. Web standards
Web Design & Applications: HTML, CSS, SVG,
Ajax
Web Architecture: URIs, HTTP, …
XML Technology: XML, XML Namespaces,
XML Schema, XSLT, ...
Web of Services: WSDL, choreography, policy
specifications,
Semantic Web: RDF, SPARQL, OWL, and
SKOS
Browser & Authoring, Web of Devices
Web Information Systems Course
http://www.w3.org/standards/
12. Let us focus on Content/Data
Markup language- html; URIs/URLs; linking <A
HREF>
Browser: Mosaic: 1993
Dynamic updates on Web pages: Javascript:
1995
Deepweb – structured data on the web,
dynamically rendered web pages, often nonpublic
data
Directory/Cataloging: Yahoo, Looksmart, DMOZ
Web pages: 2B in 2002, 11+B in 2005, 22+B in 2009
Search: Lycos (1994), Altavista/Excite (1995), Google (1998)
Web Information Systems Course
13. Evolution of the Web
Web 1.0 (pre 2002)
Retronym. Few content creators, many consumer;
Personal web sites, Static Pages, XML (1996)
Web 2.0 (1999/2002/2003)
Dynamic pages, Web as a platform, RW Web - UGC,
mass use participation- blogging, social web, Web
APIs, mashups, growth of media
Web 3.0 (1999/2009)
Semantic Web, broader content type- esp IoT
Web Information Systems Course
14. A review of first 5 years of WWW
WWW implementation started: end of 1990;
usable browser – Mosaic: 1993. Here is my
1995 review [Slides, Video first 36 mins] of the first 3-5
years of the Web and some prognosis. The
video made in 2013 also makes connections
between the thoughts presented in 1995 and
have it has turned out in the present.
Web Information Systems Course
15. Web 1.0 vs Web 2.0
Web 1.0 Web 2.0
Static/personal web pages,
served from server file
system.
Dynamic HTML, often created from RDBMS
HTML 3.2, custom
extensions, GIFs
More standardization, better Web authoring
tools, CSS etc, more client side code, rich user
experience
Limited/controlled authoring Extensive user participation/user generated
content; anyone can author, broader forms of
content
CGI, JSP, ASP Software as a service, APIs, mashups; Client:
AJAX/XML/JSON/jQuery/DOM, Flash/Flex;
Server: PHP, Python, RUBY, PERL
Directory, simple search Decline of directory; Sophisticated search;
PageRank,
Social Web/Networking. Blogging.
Web Information Systems Course
16. Review & Discussion Questions
URI vs URL
Is www prefix in URL necessary?
What document specification language was the basis of HTML (& married w/
the concept of hypertext)?
What is DNS? Are these two same or different? http://74.125.224.72/ and
http://www.google.com
Is the Web a proper subset of Internet? What is the most used Internet
application? What is the most used Web application?
What was the first inflexion point for the Web?
What format/ data representation (hint: it is a Web standard) made data
exchange and syndication across the We possible?
Web Information Systems Course
17. Review & Discussion Questions
Background for the following questions:
See what survives of Yahoo! director today and DMOZ. Looksmart
history which will give you one case study about a directory that was
successful in the past. Then read this excellent review of past and
current search engines with some information on directories.
Reflect on the the challenges of maintaining a directory for the Web.?
What was the key advance of Google over initial search engines?
Why did search engines replace directories (Yahoo!, Looksmart,..)?
Web Information Systems Course