A presentation to the Owen Graduate School of Management (Vanderbilt University) about social media and some of the technology behind the future uses of social media that are likely to shape the future of the Web as we know it.
Privacy, Ethics, and Future Uses of the Social Web
1. 1!
Privacy, Ethics, and Future Uses of the Social Web"
Prepared for Owen Graduate School of Management (Vanderbilt University)!
April 3, 2014!
Matthew A. Russell (Chief Technology Officer @ Digital Reasoning)!
Twitter: @ptwobrussell & @dreasoning!
4. Hello, My Name Is ... Matthew!
• Background in Computer Science
• Data mining, AI, machine learning, etc.
• CTO @ Digital Reasoning Systems
• Moving toward cognitive computing
• Author
• 5 published books on technology (just for fun)
• CrossFit, triathlon, Bikram hot yoga
• Stress management
4!
5. The only easy day was yesterday.
-- Motto of the U.S. Navy SEALs
5!
6. It pays to be a winner.
-- Motto of the U.S. Navy SEALs
6!
8. Data Exhaust => Digital Fingerprints!
• World population: ~7B people
• Facebook: 1.15B users
• Twitter: 500M users
• Google+ 343M users
• LinkedIn: 238M users
• ~200M+ blogs (conservative estimate)
8!
9. • An open source software (OSS) project
• http://bit.ly/MiningTheSocialWeb2E
• A (rewritten) book
• http://bit.ly/135dHfs
• Accessible to (virtually) everyone
• Virtual machine with turn-key coding templates for
data science experiments
• Think of the book as "premium" support for the OSS
project
Transforming Curiosity Into Insight!
9!
10. Table of Contents (1/2)!
• Chapter 1 - Mining Twitter: Exploring Trending Topics,
Discovering What People Are Talking About, and More
• Chapter 2 - Mining Facebook: Analyzing Fan Pages,
Examining Friendships, and More
• Chapter 3 - Mining LinkedIn: Faceting Job Titles,
Clustering Colleagues, and More
• Chapter 4 - Mining Google+: Computing Document
Similarity, Extracting Collocations, and More
• Chapter 5 - Mining Web Pages: Using Natural Language
Processing to Understand Human Language,
Summarize Blog Posts, and More
• Chapter 6 - Mining Mailboxes: Analyzing Who's Talking
to Whom About What, How Often, and More
10!
11. Table of Contents (2/2)!
• Chapter 7 - Mining GitHub: Inspecting Software
Collaboration Habits, Building Interest Graphs, and More
• Chapter 8 - Mining the Semantically Marked-Up Web:
Extracting Microformats, Inferencing over RDF, and More
• Chapter 9 - Twitter Cookbook
• Appendix A - Information About This Machine's Virtual
Machine Experience
• Appendix B - OAuth Primer
• Appendix C - Python and IPython Notebook Tips & Tricks
11!
12. Anatomy of Each Chapter!
• Brief Intro
• Objectives
• API Primer
• Analysis Technique(s)
• Data Visualization
• Recap
• Suggested Exercises
• Recommended Resources
12!
13. Why You Should Use IPython
Notebook!
• Because it's great for hacking
• And hacking is usually the first step
• Because it's great for collaboration
• Sharing/publishing results is trivial
• Because the UX is as easy as working in a
notepad
• Think of it as "executable paper"
• In short, it's a terrific learning platform for
novices and experts alike
13!
22. Machine Learning!
• A program that learns (improves)
from experience (data)
according to some objective
• Supervised learning
• Unsupervised learning
• Reinforcement learning
• How to do it
• Program mathematical
models and hope for the
best...
• How to do it well
• Program state-of-the-art
mathematical models with
sufficient representative data
22!
23. Knowledge is a process of piling up facts;
wisdom lies in their simplification.
--Martin Fischer
23!
26. Is Privacy Already an Illusion?!
• Digital happenings circa 2014
• The Cloud
• Social Media
• Deep Learning
• The Internet of Things
• Internet.org
26!
28. If you have something that you
don’t want anyone to know,
maybe you shouldn’t be doing it
in the first place.
-- Eric Schmidt, (former) CEO of Google
28!
29. Influences on Ethics!
• Capitalism, economics, & marketing
• A for-profit corporation's fiduciary duty: To
maximize the common stock's value
• How to do it? By transacting commerce
• How do it well? By advertising more effectively
than competitors
• How to do it really well? With highly relevant
personalized ads (recommenders)
• Terms of Service (ToS) - The legal extent of
ethical obligations?
29!
30. If you're not paying for the product, you
are the product.
-- Savvy consumers everywhere one day (?)
30!
31. For the wisdom of this world is
foolishness...
-- Saint Paul
31!
32. The Future of the Web...!
• The Blue Pill: All of your precious data housed remotely
and controlled by a few of the world's most powerful
international corporations
• The Red Pill: A distributed cloud controlled by no one
with decentralized data and anonymity online as the
status quo
• The Purple Pill: Meet somewhere in the middle (?)
• Significant legislative reforms concerning consumer
data (?)
• Consumer education with more transparency (?)
• Resurgence of local/offline storage and anonymity
online (?)
32!
33. The real danger is the gradual erosion of
individual liberties through automation,
integration, and interconnection of many
small, separate record-keeping systems,
each of which alone may seem innocuous,
even benevolent, and wholly justifiable.
-- Anonymous (U. S. Privacy Study Commission, 1977)
33!
34. .
There are two primary choices in life:
to accept conditions as they exist,
or accept the responsibility for
changing them.
-- Dennis Waitley
34!