SlideShare una empresa de Scribd logo
1 de 42
Descargar para leer sin conexión
Our World is Socio-technical
Markus Luczak-Roesch | @mluczak
Senior Lecturer in Information Systems
School of Information Management
Victoria University of Wellington
Humans in the information age
• IT project management
• Business process design
• Application development
• User experience design
• Business and systems analysis
Data insights
Information Management
Today’s road ahead
• From Dickens to data science
http://www.morguefile.com
• Citizen scientists in the classroom
• What is the World Wide Web and why should we preserve it?
The World Wide Web
The World Wide WebThe Web - a decentralized hypermedia system
Basic client-server architecture
Hypertext gateway to represent
databases as hypertext
The REST is historyThe REST is historyThe REST is history
The REST is history
The REST is history
The REST is history
The REST is history
What is REST?
• Representational State Transfer
• generic architectural style for network-based systems
• architectural style of the WorldWide Web
• literature: Roy Thomas Fielding. 2000. Architectural Styles and the Design of Network-Based Software
Architectures. Ph.D. Dissertation. University of California, Irvine. AAI9980887.
REST principles
RepresentationalState Transfer
1. client-server
2. stateless
3. caching
4. uniform interface
1. resource identification
2. manipulation of representations
3. self-descriptive messaging
4. Hypermedia
5. layered system
6. code-on-demand
HTTP
• Hypertext Transfer Protocol
• transfer data between Web servers and clients
• transport protocol is TCP
• textbased
• literature: R. Fielding, J. Gettys, J. Mogul, H. Frystyk, L. Masinter, P. Leach und T. Berners-Lee. Hypertext
Transfer Protocol - HTTP/1.1. RFC 2616, http://www.ietf.org/rfc/rfc2616.txt
HTTP
• stateless
• request-response
t
Client Server
Request
Response
TCP	lifecycle
Establish connection
Terminate connection
HTTP
GET / HTTP/1.1
User-Agent: Mozilla/5.0 … Firefox/10.0.3
Host: markus-luczak.de:80
Accept: */*
HTTP/1.1 200 OK
Server: Apache/2.0.49
Content-Language: en
Content-Type: text/html
Content-length: 2990
<!DOCTYPE html>
<html xml:lang="en"
…
Client
Server
Markup languages
• most prominent example: HTML (HyperText Markup Language)
• to structure Web documents
What’s the problem with HTML (& XML & databases & …)?
Apple
<Apple>
<Pear>
Data silos
The Semantic Web vision
“The Semantic Web is an
extension of the current web
in which information is given
well-defined meaning, better
enabling computers and
people to work in
cooperation.“
Berners-Lee, Hendler, and Lassila, 2001.
Meatadata
• Data about data
• describe content
• in the best case
metadata is machine processable
§Author
§Title
§Date
§…
§Species:	Android
§Height…
Content
Metadata
Web of Linked Data
RDF
RDF
RDF
RDF	Links
Technical example
M. Luczak-Rösch,R. Heese. Linked Data Authoring forNon-Experts.In proceedings of LDOW
2009, co-located with World Wide Web Conference, Madrid, Spain.
of the loomp OCA interface elements
ubset
he ap-
esign:
e per-
d the
s. We
f text
n MS
n dif-
high-
strat-
n the
extual
s, we
each
(e.g.,
l con-
e pro-
p be-
A
BC
Fig. 8. loomp screenshot – authoring with annotation elements
4.2. Effectiveness of OCA design: Usability
As the core motivation and purpose for loomp was
Fig. 3. Core elements of the loomp Domain Ontology
mashup
fragment
loomp content data
data:fragment4
Frankfurt
data: fragment3
data:mashup1
data:fragment5
data:fragemet6
data:mashup2
loomp annotation data
rdf:type
dbp:uri3
dbp:uri1
Frankfurtrdf:label
rdf:type
M. Luczak-
Roesch
foaf:name
M. Luczak-
Roesch
rdf:_3
rdf:_1
loomp:contains
Ext. Linked Data (e.g. DBpedia)
dbp:uri1 Frankfurt/Main
dbp:uri2 Frankfurt (Oder)
dbp:uri3 city
loomp:
annotation9
Frankfurt
M. Luczak-
Roesch
loomp vocabulary
Country
loomp:
annotation10
City
dbp:uri3
foaf:Person
rdf:_2
rdf:_2
rdf:_1
1
2
3
4
external ontology
foaf:Person foaf:name
Symbolic links (expressing identity)
RDF relationships
data:resource7
loomp:hasRDFa full-text as
XHTML+
RDFa snippet
5
6
7
Fig. 4. Selected elements of the loomp content model with example encoding for content and annotations
typed as loomp:Fragment and loomp:Mashup,
respectively (cf. Figure 3).
ple, the shared fragment4 is linked via rdf:_1 to
mashup1 and via rdf:_3 to mashup2. Fragments
Markus Luczak-Rösch,Ralf Heese, Adrian Paschke,"FutureContent Authoring",In
Nodilities – The Magazineof the Semantic Web, Issue11, pp.17-18, 2010.
Populating the Semantic Web
“Dresden University”
correct
meaningful
54
1416
“Light	and	heating	are	turned	off”	
Hinze, A.,Heese, R., Luczak-Rösch, M., & Paschke, A. (2012).
Semanticenrichmentbynon-experts: usabilityof manual
annotation tools. In The SemanticWeb–ISWC 2012 (pp. 165-
181). Springer Berlin Heidelberg.
Semantics for the rest of us?
Citizen scientists in the classroom
What is online citizen science?
From Galaxy Zoo to the Zooniverse
The online citizen science workflow
n words
n words
k slices of n words
(e.g. n=1,000)
...
1 2 3 k
Abel Magwitch
Joe Gargery,
Mrs Joe Gargery, Pip
Joe Gargery,
Mrs Joe Gargery, Pip
+
Lexicon of characters(can alsobe
a list of other entitiessuch as
placesor phrases)
- net
- stat
Directed network
Dynamicnetworkvisualization
source texts from
Project Gutenberg
Matched information
• Abel Magwitch
• Joe Gargary
• Mrs Joe Gargary
• Pip
Matched information
• Abel Magwitch
automatic/semi-automatic
methodsto detectmatch entities
in text, topicdetection, sentiment
analysis, part-of-speech tagging)
configure
n words
n words
k slices of n words
(e.g. n=1,000)
...
1 2 3 k
Abel Magwitch
Joe Gargery,
Mrs Joe Gargery, Pip
Joe Gargery,
Mrs Joe Gargery, Pip
+
Lexicon of characters(can alsobe
a list of other entitiessuch as
placesor phrases)
-
-
Directed network
Dynamicnetworkvisualization
source texts from
Project Gutenberg
Matched information
• Abel Magwitch
• Joe Gargary
• Mrs Joe Gargary
• Pip
Matched information
• Abel Magwitch
automatic/semi-automatic
methodsto detectmatch entities
in text, topicdetection, sentiment
analysis, part-of-speech tagging)
configure
n words
n words
k slices of n words
(e.g. n=1,000)
...
1 2 3 k
Abel Magwitch
Joe Gargery,
Mrs Joe Gargery, Pip
Joe Gargery,
Mrs Joe Gargery, Pip
+
Lexicon of characters(can alsobe
a list of other entitiessuch as
placesor phrases)
- network
- statistic
Directed network
Dynamicnetworkvisualization
source texts from
Project Gutenberg
Matched information
• Abel Magwitch
• Joe Gargary
• Mrs Joe Gargary
• Pip
Matched information
• Abel Magwitch
automatic/semi-automatic
methodsto detectmatch entities
in text, topicdetection, sentiment
analysis, part-of-speech tagging)
configure
n words
n words
k slices of n words
(e.g.n=1,000)
...
1 2 3 k
Abel Magwitch
Joe Gargery,
Mrs Joe Gargery, Pip
Joe Gargery,
Mrs Joe Gargery, Pip
+
Lexicon of characters(can alsobe
a list of other entitiessuch as
placesor phrases)
- network visuali
- statistics and ot
Directed network
Dynamicnetworkvisualization
source texts from
Project Gutenberg
Matched information
• Abel Magwitch
• Joe Gargary
• Mrs Joe Gargary
• Pip
Matched information
• Abel Magwitch
automatic/semi-automatic
methodsto detectmatch entities
in text, topicdetection, sentiment
analysis,part-of-speech tagging)
configure
n words
n words
k slices of n words
(e.g. n=1,000)
...
1 2 3 k
Abel Magwitch
Joe Gargery,
Mrs Joe Gargery, Pip
Joe Gargery,
Mrs Joe Gargery, Pip
+
Lexicon of characters(can alsobe
a list of other entitiessuch as
placesor phrases)
- network vis
- statistics an
Directed network
Dynamicnetworkvisualization
source texts from
Project Gutenberg
Matched information
• Abel Magwitch
• Joe Gargary
• Mrs Joe Gargary
• Pip
Matched information
• Abel Magwitch
automatic/semi-automatic
methodsto detectmatch entities
in text, topicdetection, sentiment
analysis, part-of-speech tagging)
configure
Online	
Citizen	
Science	
Platform
Machine	
Learning
Scientific	
Results
Citizen scientists
Our project: Detecting NZ native birds using AI
• Recorded sound around Zealandia
• Segment the recordings and upload them to Zooniverse for people to decide if they have bird calls
or not (and if they have, what bird is it)
• Use the data from Zooniverse to train and test our AI
Hanny’s Voorwerp
Galaxy Zoo [2007]
Green Pea Galaxies
Galaxy Zoo [2007]
Yellow Balls
Milky Way [2009]
Circumbinary Planet Ph1b
Planet Hunters [2012]
Convict Worm
Seafloor Explorer [2012]
Spanish Flu
Operation War Diaries [2014]
Serendipitous discoveries through talk
Upcoming project: Understanding the impact of
online citizen science participation on the
development of science capabilities of primary
age children.
From Dickens to Data Science
Dickens and the Serial Novel Form
All fourteen of Dickens’scompleted novels
were published serially in weekly or monthly
installments.
From “Sketches” to Novels
1836-37
1864-65
“I have endeavoured in the progress of this Tale, to resist the
temptation of the current Monthly Number, and to keep a
steadier eye upon the general purpose and design.”
Preface to Martin Chuzzlewit (1844)
Managing Characters
Dickens’ Working Notes for His Novels. Edited by Harry Stone, U of Chicago Press, 1987.
TIC approach applied to Victorian novels
n words
n words
k slices of n words
(e.g. n=1,000)
...
1 2 3 k
Abel Magwitch
Joe Gargery,
Mrs Joe Gargery, Pip
Joe Gargery,
Mrs Joe Gargery, Pip
+
Lexicon of characters(can alsobe
a list of other entitiessuch as
placesor phrases)
- network visualisations
- statistics and other measures
Directed network
Dynamicnetworkvisualization
source texts from
Project Gutenberg
Matched information
• Abel Magwitch
• Joe Gargary
• Mrs Joe Gargary
• Pip
Matched information
• Abel Magwitch
automatic/semi-automatic
methodsto detectmatch entities
in text, topicdetection, sentiment
analysis, part-of-speech tagging)
configure
analyze
Understanding character management
Information-theoretic properties of literature
Pickwick Our Mutual Friend
Not-so-distant Reading
Distant Reading “tackles literary problems by
scientific means: hypothesis-testing,
computational modeling, quantitative
analysis....understanding literature not by
studying particular texts, but by aggregating and
analyzing massive amounts of data.”[1]
[1]Shulz, Kathryn. “What is Distant Reading?” New York Times, 24 June2011.
An new interpretative practice?
Demo
https://stia.shinyapps.io/tlit/
● At the School of Information Management we focus on
○ humans in the information age
○ data insights
● Topics at the intersection of
○ computer science (e.g. data management, data analytics,
software development)
○ behavioral science (e.g. organizational decision making,
social networks)
○ social science (organizational processes, inequalities)
Thanks	to	my	collaborators
• Dr Adam	Grener (Dickens	project)
• Emma	Fenton	(Dickens	project)
• Tom	Goldfinch	(Dickens	project)
• Isabel	Parker	(Dickens	project)
• Victor	Anton	(NZ	bird	identification)
• Jacob	Woods	 (NZ	bird	identification)
• Dr Dayle Anderson	(Citizen	science	in	educ.)
• Dr Cathal Doyle	(Citizen	science	in	educ.)
• …

Más contenido relacionado

La actualidad más candente

Question answering in linked data
Question answering in linked dataQuestion answering in linked data
Question answering in linked dataReza Ramezani
 
The Art of Social Media Analysis with Twitter & Python-OSCON 2012
The Art of Social Media Analysis with Twitter & Python-OSCON 2012The Art of Social Media Analysis with Twitter & Python-OSCON 2012
The Art of Social Media Analysis with Twitter & Python-OSCON 2012OSCON Byrum
 
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...Marko Rodriguez
 
Prateek Jain dissertation defense, Kno.e.sis, Wright State University
Prateek Jain dissertation defense, Kno.e.sis, Wright State UniversityPrateek Jain dissertation defense, Kno.e.sis, Wright State University
Prateek Jain dissertation defense, Kno.e.sis, Wright State UniversityPrateek Jain
 
Question Answering over Linked Data: Challenges, Approaches & Trends (Tutoria...
Question Answering over Linked Data: Challenges, Approaches & Trends (Tutoria...Question Answering over Linked Data: Challenges, Approaches & Trends (Tutoria...
Question Answering over Linked Data: Challenges, Approaches & Trends (Tutoria...Andre Freitas
 
The Network Data Structure in Computing
The Network Data Structure in ComputingThe Network Data Structure in Computing
The Network Data Structure in ComputingMarko Rodriguez
 
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...DeVonne Parks, CEM
 
Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Jon Voss
 
[DSC x TAAI 2016] 林守德 / 人工智慧與機器學習在推薦系統上的應用
[DSC x TAAI 2016] 林守德 / 人工智慧與機器學習在推薦系統上的應用[DSC x TAAI 2016] 林守德 / 人工智慧與機器學習在推薦系統上的應用
[DSC x TAAI 2016] 林守德 / 人工智慧與機器學習在推薦系統上的應用台灣資料科學年會
 
The Hitchhiker's Guide to Machine Learning with Python & Apache Spark
The Hitchhiker's Guide to Machine Learning with Python & Apache SparkThe Hitchhiker's Guide to Machine Learning with Python & Apache Spark
The Hitchhiker's Guide to Machine Learning with Python & Apache SparkKrishna Sankar
 
Databases and Ontologies: Where do we go from here?
Databases and Ontologies:  Where do we go from here?Databases and Ontologies:  Where do we go from here?
Databases and Ontologies: Where do we go from here?Maryann Martone
 
Towards Context-Aware Search and Analysis on Social Media Data
Towards Context-Aware Search and Analysis on Social Media DataTowards Context-Aware Search and Analysis on Social Media Data
Towards Context-Aware Search and Analysis on Social Media DataLeon Derczynski
 
Books and Webs: Pulling the Down Rows
Books and Webs: Pulling the Down RowsBooks and Webs: Pulling the Down Rows
Books and Webs: Pulling the Down RowsPeter Brantley
 
Why language technology can’t handle Game of Thrones (yet)
Why language technology can’t handle Game of Thrones (yet)Why language technology can’t handle Game of Thrones (yet)
Why language technology can’t handle Game of Thrones (yet)Marieke van Erp
 
User-Generated Content on Social Media
User-Generated Content on Social MediaUser-Generated Content on Social Media
User-Generated Content on Social MediaMeena Nagarajan
 
A Deep Survey of the Digital Resource Landscape: Perspectives from the Neuros...
A Deep Survey of the Digital Resource Landscape:Perspectives from the Neuros...A Deep Survey of the Digital Resource Landscape:Perspectives from the Neuros...
A Deep Survey of the Digital Resource Landscape: Perspectives from the Neuros...Maryann Martone
 

La actualidad más candente (20)

Question answering in linked data
Question answering in linked dataQuestion answering in linked data
Question answering in linked data
 
The Art of Social Media Analysis with Twitter & Python-OSCON 2012
The Art of Social Media Analysis with Twitter & Python-OSCON 2012The Art of Social Media Analysis with Twitter & Python-OSCON 2012
The Art of Social Media Analysis with Twitter & Python-OSCON 2012
 
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
 
Prateek Jain dissertation defense, Kno.e.sis, Wright State University
Prateek Jain dissertation defense, Kno.e.sis, Wright State UniversityPrateek Jain dissertation defense, Kno.e.sis, Wright State University
Prateek Jain dissertation defense, Kno.e.sis, Wright State University
 
Question Answering over Linked Data: Challenges, Approaches & Trends (Tutoria...
Question Answering over Linked Data: Challenges, Approaches & Trends (Tutoria...Question Answering over Linked Data: Challenges, Approaches & Trends (Tutoria...
Question Answering over Linked Data: Challenges, Approaches & Trends (Tutoria...
 
PhD thesis defense of Christopher Thomas
PhD thesis defense of Christopher ThomasPhD thesis defense of Christopher Thomas
PhD thesis defense of Christopher Thomas
 
The Network Data Structure in Computing
The Network Data Structure in ComputingThe Network Data Structure in Computing
The Network Data Structure in Computing
 
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
 
Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.
 
[DSC x TAAI 2016] 林守德 / 人工智慧與機器學習在推薦系統上的應用
[DSC x TAAI 2016] 林守德 / 人工智慧與機器學習在推薦系統上的應用[DSC x TAAI 2016] 林守德 / 人工智慧與機器學習在推薦系統上的應用
[DSC x TAAI 2016] 林守德 / 人工智慧與機器學習在推薦系統上的應用
 
The Hitchhiker's Guide to Machine Learning with Python & Apache Spark
The Hitchhiker's Guide to Machine Learning with Python & Apache SparkThe Hitchhiker's Guide to Machine Learning with Python & Apache Spark
The Hitchhiker's Guide to Machine Learning with Python & Apache Spark
 
Sanderson Shout It Out: LOUD
Sanderson Shout It Out: LOUDSanderson Shout It Out: LOUD
Sanderson Shout It Out: LOUD
 
Semantic web Santhosh N Basavarajappa
Semantic web   Santhosh N BasavarajappaSemantic web   Santhosh N Basavarajappa
Semantic web Santhosh N Basavarajappa
 
Databases and Ontologies: Where do we go from here?
Databases and Ontologies:  Where do we go from here?Databases and Ontologies:  Where do we go from here?
Databases and Ontologies: Where do we go from here?
 
Towards Context-Aware Search and Analysis on Social Media Data
Towards Context-Aware Search and Analysis on Social Media DataTowards Context-Aware Search and Analysis on Social Media Data
Towards Context-Aware Search and Analysis on Social Media Data
 
Books and Webs: Pulling the Down Rows
Books and Webs: Pulling the Down RowsBooks and Webs: Pulling the Down Rows
Books and Webs: Pulling the Down Rows
 
Why language technology can’t handle Game of Thrones (yet)
Why language technology can’t handle Game of Thrones (yet)Why language technology can’t handle Game of Thrones (yet)
Why language technology can’t handle Game of Thrones (yet)
 
Semantics based Summarization of Entities in Knowledge Graphs
Semantics based Summarization of Entities in Knowledge GraphsSemantics based Summarization of Entities in Knowledge Graphs
Semantics based Summarization of Entities in Knowledge Graphs
 
User-Generated Content on Social Media
User-Generated Content on Social MediaUser-Generated Content on Social Media
User-Generated Content on Social Media
 
A Deep Survey of the Digital Resource Landscape: Perspectives from the Neuros...
A Deep Survey of the Digital Resource Landscape:Perspectives from the Neuros...A Deep Survey of the Digital Resource Landscape:Perspectives from the Neuros...
A Deep Survey of the Digital Resource Landscape: Perspectives from the Neuros...
 

Similar a Our World is Socio-technical

An Introduction to NOSQL, Graph Databases and Neo4j
An Introduction to NOSQL, Graph Databases and Neo4jAn Introduction to NOSQL, Graph Databases and Neo4j
An Introduction to NOSQL, Graph Databases and Neo4jDebanjan Mahata
 
Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...
Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...
Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...giuseppe_futia
 
Machines are people too
Machines are people tooMachines are people too
Machines are people tooPaul Groth
 
Development of Semantic Web based Disaster Management System
Development of Semantic Web based Disaster Management SystemDevelopment of Semantic Web based Disaster Management System
Development of Semantic Web based Disaster Management SystemNIT Durgapur
 
myExperiment and the Rise of Social Machines
myExperiment and the Rise of Social MachinesmyExperiment and the Rise of Social Machines
myExperiment and the Rise of Social MachinesDavid De Roure
 
Open Data - Principles and Techniques
Open Data - Principles and TechniquesOpen Data - Principles and Techniques
Open Data - Principles and TechniquesBernhard Haslhofer
 
The web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedThe web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedSören Auer
 
Overview AG AKSW
Overview AG AKSWOverview AG AKSW
Overview AG AKSWSören Auer
 
What do we want computers to do for us?
What do we want computers to do for us? What do we want computers to do for us?
What do we want computers to do for us? Andrea Volpini
 
The Real-time Web in the Age of Agents
The Real-time Web in the Age of AgentsThe Real-time Web in the Age of Agents
The Real-time Web in the Age of AgentsJoshua Shinavier
 
Ontologies for multimedia: the Semantic Culture Web
Ontologies for multimedia: the Semantic Culture WebOntologies for multimedia: the Semantic Culture Web
Ontologies for multimedia: the Semantic Culture WebGuus Schreiber
 
Question Answering - Application and Challenges
Question Answering - Application and ChallengesQuestion Answering - Application and Challenges
Question Answering - Application and ChallengesJens Lehmann
 
Effective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP SystemsEffective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP SystemsAndre Freitas
 
Neno/Fhat: Semantic Network Programming Language and Virtual Machine Specific...
Neno/Fhat: Semantic Network Programming Language and Virtual Machine Specific...Neno/Fhat: Semantic Network Programming Language and Virtual Machine Specific...
Neno/Fhat: Semantic Network Programming Language and Virtual Machine Specific...Marko Rodriguez
 
Social Semantic (Sensor) Web
Social Semantic (Sensor) WebSocial Semantic (Sensor) Web
Social Semantic (Sensor) WebDavid Crowley
 

Similar a Our World is Socio-technical (20)

An Introduction to NOSQL, Graph Databases and Neo4j
An Introduction to NOSQL, Graph Databases and Neo4jAn Introduction to NOSQL, Graph Databases and Neo4j
An Introduction to NOSQL, Graph Databases and Neo4j
 
Irish Digital Libraries Summit
Irish Digital Libraries SummitIrish Digital Libraries Summit
Irish Digital Libraries Summit
 
Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...
Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...
Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...
 
Machines are people too
Machines are people tooMachines are people too
Machines are people too
 
Development of Semantic Web based Disaster Management System
Development of Semantic Web based Disaster Management SystemDevelopment of Semantic Web based Disaster Management System
Development of Semantic Web based Disaster Management System
 
Ir1
Ir1Ir1
Ir1
 
myExperiment and the Rise of Social Machines
myExperiment and the Rise of Social MachinesmyExperiment and the Rise of Social Machines
myExperiment and the Rise of Social Machines
 
Semantic Technologies for Big Sciences including Astrophysics
Semantic Technologies for Big Sciences including AstrophysicsSemantic Technologies for Big Sciences including Astrophysics
Semantic Technologies for Big Sciences including Astrophysics
 
Open Data - Principles and Techniques
Open Data - Principles and TechniquesOpen Data - Principles and Techniques
Open Data - Principles and Techniques
 
The web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedThe web of interlinked data and knowledge stripped
The web of interlinked data and knowledge stripped
 
Overview AG AKSW
Overview AG AKSWOverview AG AKSW
Overview AG AKSW
 
What do we want computers to do for us?
What do we want computers to do for us? What do we want computers to do for us?
What do we want computers to do for us?
 
The Real-time Web in the Age of Agents
The Real-time Web in the Age of AgentsThe Real-time Web in the Age of Agents
The Real-time Web in the Age of Agents
 
2014_WWW_BTOR
2014_WWW_BTOR2014_WWW_BTOR
2014_WWW_BTOR
 
Ontologies for multimedia: the Semantic Culture Web
Ontologies for multimedia: the Semantic Culture WebOntologies for multimedia: the Semantic Culture Web
Ontologies for multimedia: the Semantic Culture Web
 
Question Answering - Application and Challenges
Question Answering - Application and ChallengesQuestion Answering - Application and Challenges
Question Answering - Application and Challenges
 
Effective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP SystemsEffective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP Systems
 
Neno/Fhat: Semantic Network Programming Language and Virtual Machine Specific...
Neno/Fhat: Semantic Network Programming Language and Virtual Machine Specific...Neno/Fhat: Semantic Network Programming Language and Virtual Machine Specific...
Neno/Fhat: Semantic Network Programming Language and Virtual Machine Specific...
 
Semtech2006
Semtech2006Semtech2006
Semtech2006
 
Social Semantic (Sensor) Web
Social Semantic (Sensor) WebSocial Semantic (Sensor) Web
Social Semantic (Sensor) Web
 

Más de Markus Luczak-Rösch

Not re-decentralizing the Web is not only a missed opportunity, it is irrespo...
Not re-decentralizing the Web is not only a missed opportunity, it is irrespo...Not re-decentralizing the Web is not only a missed opportunity, it is irrespo...
Not re-decentralizing the Web is not only a missed opportunity, it is irrespo...Markus Luczak-Rösch
 
Analysing literature through the lens of information theory and network science
Analysing literature through the lens of information theory and network scienceAnalysing literature through the lens of information theory and network science
Analysing literature through the lens of information theory and network scienceMarkus Luczak-Rösch
 
Transcending our views to sequential data
Transcending our views to sequential data Transcending our views to sequential data
Transcending our views to sequential data Markus Luczak-Rösch
 
The Web Science MacroScope: Mixed-methods Approach for Understanding Web Acti...
The Web Science MacroScope: Mixed-methods Approach for Understanding Web Acti...The Web Science MacroScope: Mixed-methods Approach for Understanding Web Acti...
The Web Science MacroScope: Mixed-methods Approach for Understanding Web Acti...Markus Luczak-Rösch
 
Context-free data analysis with Transcendental Information Cascades.
Context-free data analysis with Transcendental Information Cascades.Context-free data analysis with Transcendental Information Cascades.
Context-free data analysis with Transcendental Information Cascades.Markus Luczak-Rösch
 
From coincidence to purposeful flow? Properties of transcendental information...
From coincidence to purposeful flow? Properties of transcendental information...From coincidence to purposeful flow? Properties of transcendental information...
From coincidence to purposeful flow? Properties of transcendental information...Markus Luczak-Rösch
 
When resources collide: Towards a theory of coincidence in information spaces...
When resources collide: Towards a theory of coincidence in information spaces...When resources collide: Towards a theory of coincidence in information spaces...
When resources collide: Towards a theory of coincidence in information spaces...Markus Luczak-Rösch
 
Observation and Analysis of Social Machines
Observation and Analysis of Social MachinesObservation and Analysis of Social Machines
Observation and Analysis of Social MachinesMarkus Luczak-Rösch
 
Zooniverse - Through the Observatory
Zooniverse - Through the ObservatoryZooniverse - Through the Observatory
Zooniverse - Through the ObservatoryMarkus Luczak-Rösch
 
loomp - semantic content authoring
loomp - semantic content authoringloomp - semantic content authoring
loomp - semantic content authoringMarkus Luczak-Rösch
 
Statistical Analysis of Web of Data Usage
Statistical Analysis of Web of Data UsageStatistical Analysis of Web of Data Usage
Statistical Analysis of Web of Data UsageMarkus Luczak-Rösch
 

Más de Markus Luczak-Rösch (12)

Not re-decentralizing the Web is not only a missed opportunity, it is irrespo...
Not re-decentralizing the Web is not only a missed opportunity, it is irrespo...Not re-decentralizing the Web is not only a missed opportunity, it is irrespo...
Not re-decentralizing the Web is not only a missed opportunity, it is irrespo...
 
Analysing literature through the lens of information theory and network science
Analysing literature through the lens of information theory and network scienceAnalysing literature through the lens of information theory and network science
Analysing literature through the lens of information theory and network science
 
Web of Data Usage Mining
Web of Data Usage MiningWeb of Data Usage Mining
Web of Data Usage Mining
 
Transcending our views to sequential data
Transcending our views to sequential data Transcending our views to sequential data
Transcending our views to sequential data
 
The Web Science MacroScope: Mixed-methods Approach for Understanding Web Acti...
The Web Science MacroScope: Mixed-methods Approach for Understanding Web Acti...The Web Science MacroScope: Mixed-methods Approach for Understanding Web Acti...
The Web Science MacroScope: Mixed-methods Approach for Understanding Web Acti...
 
Context-free data analysis with Transcendental Information Cascades.
Context-free data analysis with Transcendental Information Cascades.Context-free data analysis with Transcendental Information Cascades.
Context-free data analysis with Transcendental Information Cascades.
 
From coincidence to purposeful flow? Properties of transcendental information...
From coincidence to purposeful flow? Properties of transcendental information...From coincidence to purposeful flow? Properties of transcendental information...
From coincidence to purposeful flow? Properties of transcendental information...
 
When resources collide: Towards a theory of coincidence in information spaces...
When resources collide: Towards a theory of coincidence in information spaces...When resources collide: Towards a theory of coincidence in information spaces...
When resources collide: Towards a theory of coincidence in information spaces...
 
Observation and Analysis of Social Machines
Observation and Analysis of Social MachinesObservation and Analysis of Social Machines
Observation and Analysis of Social Machines
 
Zooniverse - Through the Observatory
Zooniverse - Through the ObservatoryZooniverse - Through the Observatory
Zooniverse - Through the Observatory
 
loomp - semantic content authoring
loomp - semantic content authoringloomp - semantic content authoring
loomp - semantic content authoring
 
Statistical Analysis of Web of Data Usage
Statistical Analysis of Web of Data UsageStatistical Analysis of Web of Data Usage
Statistical Analysis of Web of Data Usage
 

Último

Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhousejana861314
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfSumit Kumar yadav
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 

Último (20)

Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhouse
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 

Our World is Socio-technical

  • 1. Our World is Socio-technical Markus Luczak-Roesch | @mluczak Senior Lecturer in Information Systems School of Information Management Victoria University of Wellington
  • 2. Humans in the information age • IT project management • Business process design • Application development • User experience design • Business and systems analysis Data insights Information Management
  • 3. Today’s road ahead • From Dickens to data science http://www.morguefile.com • Citizen scientists in the classroom • What is the World Wide Web and why should we preserve it?
  • 5.
  • 6.
  • 7.
  • 8. The World Wide WebThe Web - a decentralized hypermedia system Basic client-server architecture Hypertext gateway to represent databases as hypertext
  • 9. The REST is historyThe REST is historyThe REST is history
  • 10. The REST is history The REST is history The REST is history The REST is history
  • 11. What is REST? • Representational State Transfer • generic architectural style for network-based systems • architectural style of the WorldWide Web • literature: Roy Thomas Fielding. 2000. Architectural Styles and the Design of Network-Based Software Architectures. Ph.D. Dissertation. University of California, Irvine. AAI9980887.
  • 12. REST principles RepresentationalState Transfer 1. client-server 2. stateless 3. caching 4. uniform interface 1. resource identification 2. manipulation of representations 3. self-descriptive messaging 4. Hypermedia 5. layered system 6. code-on-demand
  • 13. HTTP • Hypertext Transfer Protocol • transfer data between Web servers and clients • transport protocol is TCP • textbased • literature: R. Fielding, J. Gettys, J. Mogul, H. Frystyk, L. Masinter, P. Leach und T. Berners-Lee. Hypertext Transfer Protocol - HTTP/1.1. RFC 2616, http://www.ietf.org/rfc/rfc2616.txt
  • 14. HTTP • stateless • request-response t Client Server Request Response TCP lifecycle Establish connection Terminate connection
  • 15. HTTP GET / HTTP/1.1 User-Agent: Mozilla/5.0 … Firefox/10.0.3 Host: markus-luczak.de:80 Accept: */* HTTP/1.1 200 OK Server: Apache/2.0.49 Content-Language: en Content-Type: text/html Content-length: 2990 <!DOCTYPE html> <html xml:lang="en" … Client Server
  • 16. Markup languages • most prominent example: HTML (HyperText Markup Language) • to structure Web documents
  • 17. What’s the problem with HTML (& XML & databases & …)? Apple <Apple> <Pear>
  • 19. The Semantic Web vision “The Semantic Web is an extension of the current web in which information is given well-defined meaning, better enabling computers and people to work in cooperation.“ Berners-Lee, Hendler, and Lassila, 2001.
  • 20. Meatadata • Data about data • describe content • in the best case metadata is machine processable §Author §Title §Date §… §Species: Android §Height… Content Metadata
  • 21. Web of Linked Data RDF RDF RDF RDF Links
  • 23. M. Luczak-Rösch,R. Heese. Linked Data Authoring forNon-Experts.In proceedings of LDOW 2009, co-located with World Wide Web Conference, Madrid, Spain. of the loomp OCA interface elements ubset he ap- esign: e per- d the s. We f text n MS n dif- high- strat- n the extual s, we each (e.g., l con- e pro- p be- A BC Fig. 8. loomp screenshot – authoring with annotation elements 4.2. Effectiveness of OCA design: Usability As the core motivation and purpose for loomp was Fig. 3. Core elements of the loomp Domain Ontology mashup fragment loomp content data data:fragment4 Frankfurt data: fragment3 data:mashup1 data:fragment5 data:fragemet6 data:mashup2 loomp annotation data rdf:type dbp:uri3 dbp:uri1 Frankfurtrdf:label rdf:type M. Luczak- Roesch foaf:name M. Luczak- Roesch rdf:_3 rdf:_1 loomp:contains Ext. Linked Data (e.g. DBpedia) dbp:uri1 Frankfurt/Main dbp:uri2 Frankfurt (Oder) dbp:uri3 city loomp: annotation9 Frankfurt M. Luczak- Roesch loomp vocabulary Country loomp: annotation10 City dbp:uri3 foaf:Person rdf:_2 rdf:_2 rdf:_1 1 2 3 4 external ontology foaf:Person foaf:name Symbolic links (expressing identity) RDF relationships data:resource7 loomp:hasRDFa full-text as XHTML+ RDFa snippet 5 6 7 Fig. 4. Selected elements of the loomp content model with example encoding for content and annotations typed as loomp:Fragment and loomp:Mashup, respectively (cf. Figure 3). ple, the shared fragment4 is linked via rdf:_1 to mashup1 and via rdf:_3 to mashup2. Fragments Markus Luczak-Rösch,Ralf Heese, Adrian Paschke,"FutureContent Authoring",In Nodilities – The Magazineof the Semantic Web, Issue11, pp.17-18, 2010. Populating the Semantic Web
  • 24. “Dresden University” correct meaningful 54 1416 “Light and heating are turned off” Hinze, A.,Heese, R., Luczak-Rösch, M., & Paschke, A. (2012). Semanticenrichmentbynon-experts: usabilityof manual annotation tools. In The SemanticWeb–ISWC 2012 (pp. 165- 181). Springer Berlin Heidelberg. Semantics for the rest of us?
  • 25. Citizen scientists in the classroom
  • 26. What is online citizen science?
  • 27. From Galaxy Zoo to the Zooniverse
  • 28. The online citizen science workflow n words n words k slices of n words (e.g. n=1,000) ... 1 2 3 k Abel Magwitch Joe Gargery, Mrs Joe Gargery, Pip Joe Gargery, Mrs Joe Gargery, Pip + Lexicon of characters(can alsobe a list of other entitiessuch as placesor phrases) - net - stat Directed network Dynamicnetworkvisualization source texts from Project Gutenberg Matched information • Abel Magwitch • Joe Gargary • Mrs Joe Gargary • Pip Matched information • Abel Magwitch automatic/semi-automatic methodsto detectmatch entities in text, topicdetection, sentiment analysis, part-of-speech tagging) configure n words n words k slices of n words (e.g. n=1,000) ... 1 2 3 k Abel Magwitch Joe Gargery, Mrs Joe Gargery, Pip Joe Gargery, Mrs Joe Gargery, Pip + Lexicon of characters(can alsobe a list of other entitiessuch as placesor phrases) - - Directed network Dynamicnetworkvisualization source texts from Project Gutenberg Matched information • Abel Magwitch • Joe Gargary • Mrs Joe Gargary • Pip Matched information • Abel Magwitch automatic/semi-automatic methodsto detectmatch entities in text, topicdetection, sentiment analysis, part-of-speech tagging) configure n words n words k slices of n words (e.g. n=1,000) ... 1 2 3 k Abel Magwitch Joe Gargery, Mrs Joe Gargery, Pip Joe Gargery, Mrs Joe Gargery, Pip + Lexicon of characters(can alsobe a list of other entitiessuch as placesor phrases) - network - statistic Directed network Dynamicnetworkvisualization source texts from Project Gutenberg Matched information • Abel Magwitch • Joe Gargary • Mrs Joe Gargary • Pip Matched information • Abel Magwitch automatic/semi-automatic methodsto detectmatch entities in text, topicdetection, sentiment analysis, part-of-speech tagging) configure n words n words k slices of n words (e.g.n=1,000) ... 1 2 3 k Abel Magwitch Joe Gargery, Mrs Joe Gargery, Pip Joe Gargery, Mrs Joe Gargery, Pip + Lexicon of characters(can alsobe a list of other entitiessuch as placesor phrases) - network visuali - statistics and ot Directed network Dynamicnetworkvisualization source texts from Project Gutenberg Matched information • Abel Magwitch • Joe Gargary • Mrs Joe Gargary • Pip Matched information • Abel Magwitch automatic/semi-automatic methodsto detectmatch entities in text, topicdetection, sentiment analysis,part-of-speech tagging) configure n words n words k slices of n words (e.g. n=1,000) ... 1 2 3 k Abel Magwitch Joe Gargery, Mrs Joe Gargery, Pip Joe Gargery, Mrs Joe Gargery, Pip + Lexicon of characters(can alsobe a list of other entitiessuch as placesor phrases) - network vis - statistics an Directed network Dynamicnetworkvisualization source texts from Project Gutenberg Matched information • Abel Magwitch • Joe Gargary • Mrs Joe Gargary • Pip Matched information • Abel Magwitch automatic/semi-automatic methodsto detectmatch entities in text, topicdetection, sentiment analysis, part-of-speech tagging) configure Online Citizen Science Platform Machine Learning Scientific Results Citizen scientists
  • 29. Our project: Detecting NZ native birds using AI • Recorded sound around Zealandia • Segment the recordings and upload them to Zooniverse for people to decide if they have bird calls or not (and if they have, what bird is it) • Use the data from Zooniverse to train and test our AI
  • 30. Hanny’s Voorwerp Galaxy Zoo [2007] Green Pea Galaxies Galaxy Zoo [2007] Yellow Balls Milky Way [2009] Circumbinary Planet Ph1b Planet Hunters [2012] Convict Worm Seafloor Explorer [2012] Spanish Flu Operation War Diaries [2014] Serendipitous discoveries through talk
  • 31. Upcoming project: Understanding the impact of online citizen science participation on the development of science capabilities of primary age children.
  • 32. From Dickens to Data Science
  • 33. Dickens and the Serial Novel Form All fourteen of Dickens’scompleted novels were published serially in weekly or monthly installments.
  • 34. From “Sketches” to Novels 1836-37 1864-65 “I have endeavoured in the progress of this Tale, to resist the temptation of the current Monthly Number, and to keep a steadier eye upon the general purpose and design.” Preface to Martin Chuzzlewit (1844)
  • 35. Managing Characters Dickens’ Working Notes for His Novels. Edited by Harry Stone, U of Chicago Press, 1987.
  • 36. TIC approach applied to Victorian novels n words n words k slices of n words (e.g. n=1,000) ... 1 2 3 k Abel Magwitch Joe Gargery, Mrs Joe Gargery, Pip Joe Gargery, Mrs Joe Gargery, Pip + Lexicon of characters(can alsobe a list of other entitiessuch as placesor phrases) - network visualisations - statistics and other measures Directed network Dynamicnetworkvisualization source texts from Project Gutenberg Matched information • Abel Magwitch • Joe Gargary • Mrs Joe Gargary • Pip Matched information • Abel Magwitch automatic/semi-automatic methodsto detectmatch entities in text, topicdetection, sentiment analysis, part-of-speech tagging) configure analyze
  • 38. Information-theoretic properties of literature Pickwick Our Mutual Friend
  • 39. Not-so-distant Reading Distant Reading “tackles literary problems by scientific means: hypothesis-testing, computational modeling, quantitative analysis....understanding literature not by studying particular texts, but by aggregating and analyzing massive amounts of data.”[1] [1]Shulz, Kathryn. “What is Distant Reading?” New York Times, 24 June2011.
  • 42. ● At the School of Information Management we focus on ○ humans in the information age ○ data insights ● Topics at the intersection of ○ computer science (e.g. data management, data analytics, software development) ○ behavioral science (e.g. organizational decision making, social networks) ○ social science (organizational processes, inequalities) Thanks to my collaborators • Dr Adam Grener (Dickens project) • Emma Fenton (Dickens project) • Tom Goldfinch (Dickens project) • Isabel Parker (Dickens project) • Victor Anton (NZ bird identification) • Jacob Woods (NZ bird identification) • Dr Dayle Anderson (Citizen science in educ.) • Dr Cathal Doyle (Citizen science in educ.) • …