SlideShare una empresa de Scribd logo
1 de 9
Descargar para leer sin conexión
A Survey on Web Use: How People Access, Consume, Keep, and Organize Web
  Content
  Seungyon Claire Lee, Eamonn O'Brien-Strain, Jerry Liu, Qian Lin

  HP Laboratories
  HPL-2011-203

 Keyword(s):
  Content repository; web content; current practice; user experience


 Abstract:
  We present the results from a preliminary survey concerning how people access, consume, keep, and
  organize seven types of web content. From the results of the survey, we highlight design suggestion for
  Tangible Web (TW), our cloud content repository system that enables users to clip, save, format, and
  organize web content through active discovery and passive delivery.




External Posting Date: October 22, 2011 [Fulltext]   Approved for External Publication
Internal Posting Date: October 22, 2011 [Fulltext]


 Copyright 2011 Hewlett-Packard Development Company, L.P.
A Survey on Web Use: How People Access,
           Consume, Keep, and Organize Web Content

Seungyon Claire Lee                          Abstract
Hewlett Packard Laboratories                 We present the results from a preliminary survey
1501 Page Mill Road                          concerning how people access, consume, keep, and
Palo Alto, CA 94304 USA                      organize seven types of web content. From the results
claire.sylee@hp.com                          of the survey, we highlight design suggestion for
                                             Tangible Web (TW), our cloud content repository
Eamonn O’Brien-Strain
                                             system that enables users to clip, save, format, and
Hewlett Packard Laboratories
1501 Page Mill Road
                                             organize web content through active discovery and
Palo Alto, CA 94304 USA                      passive delivery.
eamonn.obrien-strain@hp.com
                                             Keywords
Jerry Liu                                    Content repository, web content, current practice
Hewlett Packard Laboratories
1501 Page Mill Road                          ACM Classification Keywords
Palo Alto, CA 94304 USA
                                             K8.3. Personal computing: Management/maintenance
jerry.liu@hp.com

Qian Lin
                                             General Terms
Hewlett Packard Laboratories                 Design.
1501 Page Mill Road
Palo Alto, CA 94304 USA                      Introduction
qian.lin@hp.com                              The history of graphical user interface (GUI) is tightly
                                             coupled with the continuous evolution of relevant user
                                             experience (UX). Even though they may look obvious
Copyright is held by the author/owner(s).    and straightforward, interactions with many familiar
CHI 2012, May 5–10, 2012, Austin, TX, USA.   metaphors within the GUI are the results from careful
ACM xxx-x-xxxx-xxxx-x/xx/xx.                 design iteration over the decades. In that sense, UX in
                                             current practice is efficient resource to reflect things
2




                                       that can be accepted naturally when designing new          users contact the content of interest (Figure 1): active
                                       systems. This paper presents the results from a            discovery and passive delivery. Active discovery refers
                                       preliminary survey that concerns lessons learned from      to the situation where users are manually moving from
                                       current practice (how people access, consume, keep,        page to page to search information to clip. In this case,
                                       and organize web content) to support the design of         TW is installed as an add-on of a web browser and is
                                       Tangible Web, a cloud content repository. Note that        accessible any time while the user browses the web.
                                       since cloud computing blurs the boundary of personal       EverNote ( evernote.com) and Diigo (diigo.com)
                                       computing between local machine and web                    employ this approach.
                                       environment, the survey focuses on the web content
                                       while including some desktop metaphor as well.             On the other hand, passive delivery generates
                                                                                                  automatic content feed from preregistered resources
                                       Tangible Web: Cloud Content Repository                     such as users’ existing social network service (SNS) or
                                       Tangible Web (TW) is a web content repository service      favorite websites. Examples of such approach include
                                       that enables users to clip, save, format, and organize     Google Reader (google.com/reader), FlipBoard
                                       web content (image, text, video, URL) in a private         (flipboard.com) and many personalized home pages
                                       account. The notion of clipping is inspired by the         (google.com/ig, my.yahoo.com). In this approach, TW
                                       ephemeral nature of web content, which is frequently       highlights how to integrate one-stop reading experience
                                       updated and may disappear for a given URL. Thus,           of daily updates with appropriate clipping capabilities.
                                       similar to Zoetrope that concerns temporal change of
                                       web content by time [1], the clipped content in TW can     Assuming that the nature of the content type (active
Figure 1. Two approaches to clip
                                       be saved either as a live view that applies the updates    discovery or passive delivery) may affect the relevant
content: active discovery by which
                                       of the pointer or a frozen view that captures the          user behavior, we encountered several design
the user moves from page to page to
                                       snapshot at the moment that the content is clipped. By     questions while implementing TW (e.g., Will the content
clip resources (top) and passive
                                       using TW, users select multiple discrete areas in the      type and access method affect user behavior on
delivery by which the user stays in
                                       web page as a single entity to clip so that any            clipping and organizing?). In search of answers for such
TW home to read daily feed delivered
                                       unwanted content on the web page can be disregarded        questions, we perform a survey to learn about current
from preregistered resources
                                       (advertisement, page navigation object). The clipped       practice on how people use web content on a daily
(bottom).
                                       content can be reformatted with other clipped entity       basis. We expect that the lessons learned from such
                                       contained in the TW repository. When a clipped entity is   general behavior would assist us to predict requests
                                       presented, either independently or organized with other    from potential TW users and to find design strategy to
                                       clipped entities, the combined text streams reflow to      fulfill these requests.
                                       generate appropriate page composition with the print-
                                       friendly formatting.                                       Study overview
                                                                                                  The study specifies seven content types and four
                                       With respect to the method for accessing the content to    interaction types in the survey design.
                                       clip, TW has two approaches depending on how the
3




                                      Passive delivery                                            created as a result of organizing a URL that is
                                      <1> Up-to-date news                                         kept/saved on a browser during or after consuming the
                                      <2> Entertainment (video, music)                            web content.). In the survey, behaviors in this four step
                                      <3> Personal content posted by others (blog, SNS)           cycle are integrated with relevant metaphors and
                                      <4> Personal content posted by me (blog, SNS)               systems such as bookmark and folder, metaphor in e-
                                      Active discovery
                                                                                                  mail, file browser, and web browser. Researchers have
                                                                                                  studied matured UX in such systems with respect to the
                                      <5> Reference (knowledge, education)
                                                                                                  consumption of web media [7] and strategic
                                      <6> Work related document (work, school)
                                                                                                  management of emails by searching, tagging, and
                                      <7> Supporting document to assist real world activities
                                                                                                  using folders [2, 4, 5, 8]. Unlike these studies that
                                            (receipt, bill, coupon, banking info)
                                                                                                  highlight a single behavior, our study focuses on the
                                      Table 1. Content type characteristics                       flow of the behavior cycle to understand overall trends
                                                                                                  in the interaction cycle.
                                      Content type: Passive delivery vs. active discovery
                                      The survey explores user behavior for interacting with      Survey and participants
                                      seven content types that can rely on either passive         The survey includes four steps of behavior cycle (Figure
Figure 2. Interaction cycle in four   delivery or active discovery (Table 1). Passive delivery    2) that are incorporated with seven content types
steps                                 content (<1> News, <2> Entertainment, <3> Content           (Table 1) and relevant metaphors. We recruited 435
                                      posted by others, <4> Content posted by me) may fit         volunteers to participant in the online survey (age 18-
                                      with the content that users want to know in a time-         75, mean age = 35.54) who are college students or
                                      sensitive manner on a daily basis, whereas active           working professionals.
                                      discovery content (<5> References, <6> Work related
                                      document,-<7> Supporting document for real world            Result: Access
                                      activities) reflect content that might be less casual and   Behaviors and methods for access to the web indicate
                                      would reflect users’ moment-by-moment interests or          the moment that the user initiates the interaction with
                                      urgent needs. Here, and throughout the paper, the           web content. Methods to access web content could be
                                      content type will be described as a number bracket          the results of organizing (e.g., Bookmark), which is
                                      such as “<1>” for news content.                             another node in the interaction cycle. We integrate
                                                                                                  seven content types with four access methods (Type
                                      Interaction type: Four steps of behavior cycle              URL, search keyword, click bookmark, click link) to
                                      We cluster users’ general behavior on the web as four       learn the relationship between content type and access
                                      steps within a cycle: access, consume, keep, and            method.
                                      organize (Figure 2). In this cycle, behaviors on the web
                                      are somewhat reciprocal such that the result of one         Only two methods were selected with content-specific
                                      behavior often affects consecutive behavior (e.g., A        behavior (bookmark and keyword search). The
                                      bookmark that is used as a method for access is             selection of access method is partially affected by the
4




                                      content type. Keyword search is the most preferred          quantities), would help us to find the dominance or
                                      method to access entertainment and reference content        absence of the leading websites by content type.
                                      (<2, 5>), whereas bookmark is the most preferred
                                      method to access to SNS feeds (<3,4>), websites for         Q: Excluding e-mail and search engines, what is the
                                      work or school (<6>), and websites that provide             website that you access every day? (Open-ended)
                                      supporting document to assist real world activity such
                                      as banking (<7>). For these content types, the
                                      preference on keyword search and bookmark is
                                      inversely proportional. However, no dominant method
                                      was observed for news access (<1>).

                                      Question: Through what methods do you access the
                                      following types of content? (Select all applied each
                                      content type): Graph shows two main methods only




                                                                                                  Figure 4. Daily visit of web content by popularity: (A) In SNS,
                                                                                                  two leaders attract most visitors (91.16%).; (B) In
                                                                                                  entertainment category, 39 non-popular websites attract
                                                                                                  visitors (72.88%) more than two leading websites.


                                      Figure 3. Access method by content type                     Among 173 websites from 435 entries in eight
                                                                                                  categories (Figure 4), the ten most popular websites
                                      Result: Consume                                             (Figure 5) and the rest 163 websites obtained 55.86%
Figure 5. Ten most popular            Questions in this interaction cycle explore content types   and 44.14% of votes, respectively. The top ten
websites (small number of unique      by popularity to learn what people want to check from       websites are included in top four dominant categories:
websites that have large number of    the web on a daily basis. We focus on integrating the       two websites in SNS, four websites in news, two
visitors for each) and the rest 163   popularity of website with the effect of popular website    websites in shopping, and two websites in
websites in a long tail (large        such as dominant leadership or long tail [2]. The notion    entertainment. Among these categories, the most and
number of unique websites that        of the long tail, which has been widely accepted to         least significant effects of leading websites were found
have small number of visitors for     describe retailing strategy of selling (selling a large     in SNS and entertainment categories, respectively. In
each)                                 number of unique items each of which is sold with small     SNS category, two leading websites attract large
                                                                                                  number of visitors (91.16%, Figure 4-A). On the other
5




hand, in entertainment category, large number of non-
dominant websites are found (39 out of 41, 95.12%).
Even though the number of visitors is small in each
website, the sum is larger (72.88%, Figure 4-B) than
the visitors of two leading websites.

People’s daily visit on the ten most popular websites
(55.86%) is slightly higher than the preference to visit
the rest unique, yet non-popular websites (44.14%).        Figure 6. Three personae in keeping web content.
The frequent visit on such diverse websites, which is
especially observed in entertainment content, indicates    Since different personae were found, analyzing
the preference to consume content by personal interest     behavior to keep web content is performed with Group
rather than by common interest of others.                  B and C. Group A result is not included in the analysis.
                                                           In Figure 7 and 8, the result from Groups B and C are
Result: Keep                                               illustrated as line and bar graph, respectively, such that
With respect to the behavior on keeping web contents       line graph is found only in content <5>, <6>, and <7>.
on a daily basis, which implies the moment that the        Stars on each graph present the dominant method of
public content becomes personal content, we are            which the preference is statistically significant after
mostly concerned with how and why people select            CHI-square (p<.05). Empty and filled stars stand for
between saving and printing, what the relevant             the result from Group B and C, respectively.
methods are, and how content type affects decision
making.                                                    Q: Which method do you use to keep web content as
                                                           yours? (Select one method for each content type.)
Three personae were found in the question that asks
about the preferred method among four choices (print,
save, others, and not applicable) to keep content in
seven content types (Figure 6): keeping web content
very rarely (Group A), conditionally (Group B), and
actively (Group C). Group A (14.05%) does not keep
web content in any content type. Group B (38.10%)
conditionally keeps web content. They keep at least one
content type that relies on active discovery (<5, 6, 7>)
while keep none of the content in passive delivery
category (<1>-<4>). Group C, which shows the most
active keeping behavior, keeps all types of given
                                                           Figure 7. Keeping web content by content type and persona
content.
6




Group B (Figure 7) prefers saving to printing for          Result: Organize
keeping web content, with statistical significance         The behavior for organization is related with the
(p<.05) in work related document (<6>) and                 preferences and strategies to manage personal
supporting document for daily activity such as shopping    information by using three methods: bookmark, email,
or banking (<7>). Group C prefers saving to printing       and file browser. The preference for using bookmarks,
(p<.05) to keep all content type.                          email, and file browser are 45.84%, 23.91%, and
                                                           30.24%, respectively (Figure 9).
To learn about preferred method to save web content,
we provided five methods in another question:              Q: How do you organize content saved from the web?
bookmark, downloading to the local machine, uploading      (Select all applied)
to cloud storage, post to SNS, copy and paste to the
local machine text editor and save. Among these
methods, bookmark and download are mostly preferred.
The preference for downloading work related document
(<6>) to the local machine is discovered from Group B
(Figure 8, p<.05). For all content types in passive
delivery category (<1, 2, 3, 4>), Group C prefers
bookmark (p<.05). However, for <6> and <7> in
active discovery category, Group C prefers to download
the content into the local machine (p<.05).
                                                           Figure 9. Organizing content saved from the web
Q: Which method do you use to save web content?
                                                           Q: How many bookmarks do you have in bookmark
                                                           menu and toolbar in your default web browser?




Figure 8. Saving web content by content type and persona

                                                           Figure 10. Managing bookmark by using toolbar and menu
7




Q: How do you further organize/categorize to manage                 Design implication
bookmark, emails, and files? (Select all applied)                   Lessons learned from this survey (Table 2) contribute
                                                                    to building content-specific design strategies of TW,
                                                                    which aims to include contents by both passive delivery
                                                                    and active discovery (Figure 1).

                                                                    Access: Bookmark vs. Keyword search
                                                                    Bookmark is the most dominant method to access SNS,
                                                                    work related document, and supporting document (<3,
                                                                    4, 6, 7>). Thus, TW can actively utilize bookmarks as a
                                                                    source of automatic or passive delivery. However, since
Figure 11. Methods to organize content saved from the web           not only those contents but also bookmarks could be
                                                                    highly private by the nature, TW should consider
The use of bookmark toolbar significantly drops when                privacy issue carefully when augmenting daily
the number of bookmarks exceeds 25 (Figure 10). This                interaction. On the other hand, keyword search is
tendency is stronger when the number of bookmarks in                preferred to accessing entertainment and reference
the menu is less than 50 (solid lines in Figure 10).                content (<2, 5>). Such content could be highly
When the number of bookmarks in the toolbar is less                 situational, hard to predict, and require manual
than 25 (Toolbar:1-25 in Figure 10), people tend to                 operation. Thus, automating such activity through user
have more items in the bookmark menu. However, this                 analytics would be helpful in predicting user interest
difference is not observed if the number of bookmarks               and profile. As Liu et al. developed a system to
on the toolbar exceeds 25. Among various metaphors                  recommend personalized news content by analyzing
that assist organizing, folder (51.27%), which probably             click behavior [6], entertainment and reference content
has the longest history of use and is widely applied, is            (<2>, <5>) can be recommended in TW by analyzing
the most preferred metaphor (Figure 11).                            history of relevant activities: content clip; keyword
                                                                    search; and link click.
       Access                    Consume                    Keep
<1> Various     2nd popular content. Has decent leaders. Bookmark   Consume: Find the niche for entertainment portal
<2> Search      4th popular content. Has weak leaders.   Bookmark   Unlike SNS (<3, 4>), entertainment content (<2>)
<3> Bookmark 1st popular content. Has strong leaders. Bookmark      does not have a significant leader. Among 41 websites
<4> Bookmark 1st popular content. Has strong leaders. Bookmark
                                                                    that constitute entertainment content, two top websites
                                                                    draw only 27.12% of traffic, which implies relatively
<5> Search      Less popular content. No leaders.        Various
                                                                    weak hold on user consumption and long tail of the
<6> Bookmark Less popular content. No leaders            Download
                                                                    market share. Such a long tail (72.88% of traffic from
<7> Bookmark 3rd popular content. Has decent leaders. Download
                                                                    39 websites) refers to the need of entertainment portal
Table 2. Summary                                                    that aggregates various, yet user-specific, requests.
8




Keep: Passive delivery vs. active discovery                   design. Although not explored in this preliminary
Passive delivery content is saved by bookmarks.               survey, examining the effects of content type with
However, active discovery content, except references,         respect to the relationship between browsing history
is downloaded to a local machine, which is a more             and organizing strategy in emails, bookmarks, and files
private space. Reference content (<5>) is saved by            would be a viable direction for further investigation.
various methods with similar preference (bookmark,
post to SNS, or download). From this result, we assume        Acknowledgement
that TW should carefully detect such user preference.         We thank to staff members of Communispace for their
The fact that the news, entertainment, and SNS feeds          help on recruiting participants and collecting data.
(<1>-<4>) are mostly saved as bookmark reflects the
characteristics of the content that is consumed,              References
updated, and disposed frequently. Thus, TW may                [1] Adar, E., Dontcheva, M., Fogarty, J. and Weld, D.
highlight the timely delivery and appropriate visibility of   S. 2008. Zoetrope: interacting with the ephemeral web.
                                                              In Proc. UIST 2008. ACM Press (2008), 239-248.
such content while concerning information overload. On
the other hand, work related documents and supporting         [2] Ames, M. and Naaman, M. Why we tag:
                                                              motivations for annotation in mobile and online media.
documents for the real world activities (<6>, <7>)
                                                              In Proc. CHI 2007. ACM Press (2007), 971-980.
may require advanced features saved in TW
repository: ensuring secure management and clustering         [3] Anderson, C. The long tail: why the future of
                                                              business is selling less of more. New York: Hyperion,
saved content with other clipped content of other
                                                              2006.
application for further modification.
                                                              [4] Fisher, D., Brush, A. J., Gleave, E. and Smith, . A.
                                                              Revisiting Whittaker & Sidner's "email overload" ten
Organize: Integrating diverse concept with simplicity         years later. In Proc. CSCW 2006. ACM Press (2006).
In this step, even the content clipped from the public        309-312.
web is considered as a personal resource. We observe
                                                              [5] Jones, W., Phuwanartnurak, A. J., Gill, R. and
that the dominant preferences are for bookmark and            Bruce, H. Don't take my folders away!: organizing
folder metaphors. In TW, which will eventually blur the       personal information to get things done. In Proc. CHI
boundary among relevant applications, current practice        EA 2005. ACM Press, 1505-1508.
to organize bookmarks on the web browser, files in the        [6] Liu, J., Dolan, P. and Pedersen, E.R. Personalized
file browser on a local machine, and folders or other         news recommendation based on click behavior. In
metaphors on the email client will be seamlessly              Proc.IUI 2010. ACM Press (2010), 31-40.
aggregated. Thus, the challenges in designing an              [7] Savolainen, R. Filtering and withdrawing: strategies
organizing assistant in TW would be the integration of        for coping with information overload in everyday
diverse concepts with simplicity: how to optimize             contexts. J. Inf. Sci. 33, 5 (October 2007), 611-621.
diverse organizing method to fulfill various need and         [8] Whittaker, S. and Sidner, C. Email overload:
context and how to provide concrete structure in a            exploring personal information management of email.
universal manner to convey general theme in the UX            In Proc. CHI 1996, ACM Press (1996). 276-283.

Más contenido relacionado

Similar a Hpl 2011-203

Babouk: Focused Web Crawling for Corpus Compilation and Automatic Terminology...
Babouk: Focused Web Crawling for Corpus Compilation and Automatic Terminology...Babouk: Focused Web Crawling for Corpus Compilation and Automatic Terminology...
Babouk: Focused Web Crawling for Corpus Compilation and Automatic Terminology...Christophe Tricot
 
opencdn_iecco18.pptx
opencdn_iecco18.pptxopencdn_iecco18.pptx
opencdn_iecco18.pptxssuser3855be
 
Nuxeo Fact Sheet
Nuxeo Fact SheetNuxeo Fact Sheet
Nuxeo Fact SheetNuxeo
 
Linked Data Generation for the University Data From Legacy Database
Linked Data Generation for the University Data From Legacy Database  Linked Data Generation for the University Data From Legacy Database
Linked Data Generation for the University Data From Legacy Database dannyijwest
 
Sitkoski Metadata Proposal - Final
Sitkoski Metadata Proposal - FinalSitkoski Metadata Proposal - Final
Sitkoski Metadata Proposal - FinalKeith Sitkoski
 
Web 2.0: Implications for Library Services
Web 2.0: Implications for Library ServicesWeb 2.0: Implications for Library Services
Web 2.0: Implications for Library ServicesADINET Ahmedabad
 
Interactive Video Search and Browsing Systems
Interactive Video Search and Browsing SystemsInteractive Video Search and Browsing Systems
Interactive Video Search and Browsing SystemsAndrea Ferracani
 
OER Rapid Innovation
OER Rapid InnovationOER Rapid Innovation
OER Rapid InnovationJisc
 
Why I teach Content Strategy in Information Architecture
Why I teach Content Strategy in Information ArchitectureWhy I teach Content Strategy in Information Architecture
Why I teach Content Strategy in Information ArchitectureMisty Weaver
 
Web engineering notes unit 2
Web engineering notes unit 2Web engineering notes unit 2
Web engineering notes unit 2inshu1890
 
An Improved Annotation Based Summary Generation For Unstructured Data
An Improved Annotation Based Summary Generation For Unstructured DataAn Improved Annotation Based Summary Generation For Unstructured Data
An Improved Annotation Based Summary Generation For Unstructured DataMelinda Watson
 
Maximum Spanning Tree Model on Personalized Web Based Collaborative Learning ...
Maximum Spanning Tree Model on Personalized Web Based Collaborative Learning ...Maximum Spanning Tree Model on Personalized Web Based Collaborative Learning ...
Maximum Spanning Tree Model on Personalized Web Based Collaborative Learning ...ijcseit
 
Re-usable metadata, re-usable content
Re-usable metadata, re-usable contentRe-usable metadata, re-usable content
Re-usable metadata, re-usable contentPaul Walk
 
Topic Modeling : Clustering of Deep Webpages
Topic Modeling : Clustering of Deep WebpagesTopic Modeling : Clustering of Deep Webpages
Topic Modeling : Clustering of Deep Webpagescsandit
 
Topic Modeling : Clustering of Deep Webpages
Topic Modeling : Clustering of Deep WebpagesTopic Modeling : Clustering of Deep Webpages
Topic Modeling : Clustering of Deep Webpagescsandit
 
Inverted textindexing
Inverted textindexingInverted textindexing
Inverted textindexingKhwaja Aamer
 

Similar a Hpl 2011-203 (20)

Babouk: Focused Web Crawling for Corpus Compilation and Automatic Terminology...
Babouk: Focused Web Crawling for Corpus Compilation and Automatic Terminology...Babouk: Focused Web Crawling for Corpus Compilation and Automatic Terminology...
Babouk: Focused Web Crawling for Corpus Compilation and Automatic Terminology...
 
opencdn_iecco18.pptx
opencdn_iecco18.pptxopencdn_iecco18.pptx
opencdn_iecco18.pptx
 
Nuxeo Fact Sheet
Nuxeo Fact SheetNuxeo Fact Sheet
Nuxeo Fact Sheet
 
Linked Data Generation for the University Data From Legacy Database
Linked Data Generation for the University Data From Legacy Database  Linked Data Generation for the University Data From Legacy Database
Linked Data Generation for the University Data From Legacy Database
 
Sitkoski Metadata Proposal - Final
Sitkoski Metadata Proposal - FinalSitkoski Metadata Proposal - Final
Sitkoski Metadata Proposal - Final
 
Web 2.0: Implications for Library Services
Web 2.0: Implications for Library ServicesWeb 2.0: Implications for Library Services
Web 2.0: Implications for Library Services
 
Interactive Video Search and Browsing Systems
Interactive Video Search and Browsing SystemsInteractive Video Search and Browsing Systems
Interactive Video Search and Browsing Systems
 
Interactive Video Search and Browsing Systems
Interactive Video Search and Browsing SystemsInteractive Video Search and Browsing Systems
Interactive Video Search and Browsing Systems
 
OER Rapid Innovation
OER Rapid InnovationOER Rapid Innovation
OER Rapid Innovation
 
Metadata and Scotland’s information environment: potential benefits of Web 2.0
Metadata and Scotland’s information environment: potential benefits of Web 2.0Metadata and Scotland’s information environment: potential benefits of Web 2.0
Metadata and Scotland’s information environment: potential benefits of Web 2.0
 
Why I teach Content Strategy in Information Architecture
Why I teach Content Strategy in Information ArchitectureWhy I teach Content Strategy in Information Architecture
Why I teach Content Strategy in Information Architecture
 
Web engineering notes unit 2
Web engineering notes unit 2Web engineering notes unit 2
Web engineering notes unit 2
 
duepuntozero
duepuntozeroduepuntozero
duepuntozero
 
An Improved Annotation Based Summary Generation For Unstructured Data
An Improved Annotation Based Summary Generation For Unstructured DataAn Improved Annotation Based Summary Generation For Unstructured Data
An Improved Annotation Based Summary Generation For Unstructured Data
 
Maximum Spanning Tree Model on Personalized Web Based Collaborative Learning ...
Maximum Spanning Tree Model on Personalized Web Based Collaborative Learning ...Maximum Spanning Tree Model on Personalized Web Based Collaborative Learning ...
Maximum Spanning Tree Model on Personalized Web Based Collaborative Learning ...
 
Re-usable metadata, re-usable content
Re-usable metadata, re-usable contentRe-usable metadata, re-usable content
Re-usable metadata, re-usable content
 
Website Migration Planning
Website Migration PlanningWebsite Migration Planning
Website Migration Planning
 
Topic Modeling : Clustering of Deep Webpages
Topic Modeling : Clustering of Deep WebpagesTopic Modeling : Clustering of Deep Webpages
Topic Modeling : Clustering of Deep Webpages
 
Topic Modeling : Clustering of Deep Webpages
Topic Modeling : Clustering of Deep WebpagesTopic Modeling : Clustering of Deep Webpages
Topic Modeling : Clustering of Deep Webpages
 
Inverted textindexing
Inverted textindexingInverted textindexing
Inverted textindexing
 

Más de StephanieLeBadezet (20)

Data mining
Data miningData mining
Data mining
 
Data mining
Data miningData mining
Data mining
 
7
77
7
 
6
66
6
 
5
55
5
 
4
44
4
 
3
33
3
 
2
22
2
 
1
11
1
 
Were all connected the power of the social media ecosystem
Were all connected the power of the social media ecosystemWere all connected the power of the social media ecosystem
Were all connected the power of the social media ecosystem
 
Swws
SwwsSwws
Swws
 
Were all connected the power of the social media ecosystem
Were all connected the power of the social media ecosystemWere all connected the power of the social media ecosystem
Were all connected the power of the social media ecosystem
 
Swws
SwwsSwws
Swws
 
Lis651n12a.a4
Lis651n12a.a4Lis651n12a.a4
Lis651n12a.a4
 
Lis651n12a.a4
Lis651n12a.a4Lis651n12a.a4
Lis651n12a.a4
 
Lis651n12a.a4
Lis651n12a.a4Lis651n12a.a4
Lis651n12a.a4
 
Imc131 ager
Imc131 agerImc131 ager
Imc131 ager
 
Imc102 tiago
Imc102 tiagoImc102 tiago
Imc102 tiago
 
First monday
First mondayFirst monday
First monday
 
First monday
First mondayFirst monday
First monday
 

Último

Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 

Último (20)

Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 

Hpl 2011-203

  • 1. A Survey on Web Use: How People Access, Consume, Keep, and Organize Web Content Seungyon Claire Lee, Eamonn O'Brien-Strain, Jerry Liu, Qian Lin HP Laboratories HPL-2011-203 Keyword(s): Content repository; web content; current practice; user experience Abstract: We present the results from a preliminary survey concerning how people access, consume, keep, and organize seven types of web content. From the results of the survey, we highlight design suggestion for Tangible Web (TW), our cloud content repository system that enables users to clip, save, format, and organize web content through active discovery and passive delivery. External Posting Date: October 22, 2011 [Fulltext] Approved for External Publication Internal Posting Date: October 22, 2011 [Fulltext]  Copyright 2011 Hewlett-Packard Development Company, L.P.
  • 2. A Survey on Web Use: How People Access, Consume, Keep, and Organize Web Content Seungyon Claire Lee Abstract Hewlett Packard Laboratories We present the results from a preliminary survey 1501 Page Mill Road concerning how people access, consume, keep, and Palo Alto, CA 94304 USA organize seven types of web content. From the results claire.sylee@hp.com of the survey, we highlight design suggestion for Tangible Web (TW), our cloud content repository Eamonn O’Brien-Strain system that enables users to clip, save, format, and Hewlett Packard Laboratories 1501 Page Mill Road organize web content through active discovery and Palo Alto, CA 94304 USA passive delivery. eamonn.obrien-strain@hp.com Keywords Jerry Liu Content repository, web content, current practice Hewlett Packard Laboratories 1501 Page Mill Road ACM Classification Keywords Palo Alto, CA 94304 USA K8.3. Personal computing: Management/maintenance jerry.liu@hp.com Qian Lin General Terms Hewlett Packard Laboratories Design. 1501 Page Mill Road Palo Alto, CA 94304 USA Introduction qian.lin@hp.com The history of graphical user interface (GUI) is tightly coupled with the continuous evolution of relevant user experience (UX). Even though they may look obvious Copyright is held by the author/owner(s). and straightforward, interactions with many familiar CHI 2012, May 5–10, 2012, Austin, TX, USA. metaphors within the GUI are the results from careful ACM xxx-x-xxxx-xxxx-x/xx/xx. design iteration over the decades. In that sense, UX in current practice is efficient resource to reflect things
  • 3. 2 that can be accepted naturally when designing new users contact the content of interest (Figure 1): active systems. This paper presents the results from a discovery and passive delivery. Active discovery refers preliminary survey that concerns lessons learned from to the situation where users are manually moving from current practice (how people access, consume, keep, page to page to search information to clip. In this case, and organize web content) to support the design of TW is installed as an add-on of a web browser and is Tangible Web, a cloud content repository. Note that accessible any time while the user browses the web. since cloud computing blurs the boundary of personal EverNote ( evernote.com) and Diigo (diigo.com) computing between local machine and web employ this approach. environment, the survey focuses on the web content while including some desktop metaphor as well. On the other hand, passive delivery generates automatic content feed from preregistered resources Tangible Web: Cloud Content Repository such as users’ existing social network service (SNS) or Tangible Web (TW) is a web content repository service favorite websites. Examples of such approach include that enables users to clip, save, format, and organize Google Reader (google.com/reader), FlipBoard web content (image, text, video, URL) in a private (flipboard.com) and many personalized home pages account. The notion of clipping is inspired by the (google.com/ig, my.yahoo.com). In this approach, TW ephemeral nature of web content, which is frequently highlights how to integrate one-stop reading experience updated and may disappear for a given URL. Thus, of daily updates with appropriate clipping capabilities. similar to Zoetrope that concerns temporal change of web content by time [1], the clipped content in TW can Assuming that the nature of the content type (active Figure 1. Two approaches to clip be saved either as a live view that applies the updates discovery or passive delivery) may affect the relevant content: active discovery by which of the pointer or a frozen view that captures the user behavior, we encountered several design the user moves from page to page to snapshot at the moment that the content is clipped. By questions while implementing TW (e.g., Will the content clip resources (top) and passive using TW, users select multiple discrete areas in the type and access method affect user behavior on delivery by which the user stays in web page as a single entity to clip so that any clipping and organizing?). In search of answers for such TW home to read daily feed delivered unwanted content on the web page can be disregarded questions, we perform a survey to learn about current from preregistered resources (advertisement, page navigation object). The clipped practice on how people use web content on a daily (bottom). content can be reformatted with other clipped entity basis. We expect that the lessons learned from such contained in the TW repository. When a clipped entity is general behavior would assist us to predict requests presented, either independently or organized with other from potential TW users and to find design strategy to clipped entities, the combined text streams reflow to fulfill these requests. generate appropriate page composition with the print- friendly formatting. Study overview The study specifies seven content types and four With respect to the method for accessing the content to interaction types in the survey design. clip, TW has two approaches depending on how the
  • 4. 3 Passive delivery created as a result of organizing a URL that is <1> Up-to-date news kept/saved on a browser during or after consuming the <2> Entertainment (video, music) web content.). In the survey, behaviors in this four step <3> Personal content posted by others (blog, SNS) cycle are integrated with relevant metaphors and <4> Personal content posted by me (blog, SNS) systems such as bookmark and folder, metaphor in e- Active discovery mail, file browser, and web browser. Researchers have studied matured UX in such systems with respect to the <5> Reference (knowledge, education) consumption of web media [7] and strategic <6> Work related document (work, school) management of emails by searching, tagging, and <7> Supporting document to assist real world activities using folders [2, 4, 5, 8]. Unlike these studies that (receipt, bill, coupon, banking info) highlight a single behavior, our study focuses on the Table 1. Content type characteristics flow of the behavior cycle to understand overall trends in the interaction cycle. Content type: Passive delivery vs. active discovery The survey explores user behavior for interacting with Survey and participants seven content types that can rely on either passive The survey includes four steps of behavior cycle (Figure Figure 2. Interaction cycle in four delivery or active discovery (Table 1). Passive delivery 2) that are incorporated with seven content types steps content (<1> News, <2> Entertainment, <3> Content (Table 1) and relevant metaphors. We recruited 435 posted by others, <4> Content posted by me) may fit volunteers to participant in the online survey (age 18- with the content that users want to know in a time- 75, mean age = 35.54) who are college students or sensitive manner on a daily basis, whereas active working professionals. discovery content (<5> References, <6> Work related document,-<7> Supporting document for real world Result: Access activities) reflect content that might be less casual and Behaviors and methods for access to the web indicate would reflect users’ moment-by-moment interests or the moment that the user initiates the interaction with urgent needs. Here, and throughout the paper, the web content. Methods to access web content could be content type will be described as a number bracket the results of organizing (e.g., Bookmark), which is such as “<1>” for news content. another node in the interaction cycle. We integrate seven content types with four access methods (Type Interaction type: Four steps of behavior cycle URL, search keyword, click bookmark, click link) to We cluster users’ general behavior on the web as four learn the relationship between content type and access steps within a cycle: access, consume, keep, and method. organize (Figure 2). In this cycle, behaviors on the web are somewhat reciprocal such that the result of one Only two methods were selected with content-specific behavior often affects consecutive behavior (e.g., A behavior (bookmark and keyword search). The bookmark that is used as a method for access is selection of access method is partially affected by the
  • 5. 4 content type. Keyword search is the most preferred quantities), would help us to find the dominance or method to access entertainment and reference content absence of the leading websites by content type. (<2, 5>), whereas bookmark is the most preferred method to access to SNS feeds (<3,4>), websites for Q: Excluding e-mail and search engines, what is the work or school (<6>), and websites that provide website that you access every day? (Open-ended) supporting document to assist real world activity such as banking (<7>). For these content types, the preference on keyword search and bookmark is inversely proportional. However, no dominant method was observed for news access (<1>). Question: Through what methods do you access the following types of content? (Select all applied each content type): Graph shows two main methods only Figure 4. Daily visit of web content by popularity: (A) In SNS, two leaders attract most visitors (91.16%).; (B) In entertainment category, 39 non-popular websites attract visitors (72.88%) more than two leading websites. Figure 3. Access method by content type Among 173 websites from 435 entries in eight categories (Figure 4), the ten most popular websites Result: Consume (Figure 5) and the rest 163 websites obtained 55.86% Figure 5. Ten most popular Questions in this interaction cycle explore content types and 44.14% of votes, respectively. The top ten websites (small number of unique by popularity to learn what people want to check from websites are included in top four dominant categories: websites that have large number of the web on a daily basis. We focus on integrating the two websites in SNS, four websites in news, two visitors for each) and the rest 163 popularity of website with the effect of popular website websites in shopping, and two websites in websites in a long tail (large such as dominant leadership or long tail [2]. The notion entertainment. Among these categories, the most and number of unique websites that of the long tail, which has been widely accepted to least significant effects of leading websites were found have small number of visitors for describe retailing strategy of selling (selling a large in SNS and entertainment categories, respectively. In each) number of unique items each of which is sold with small SNS category, two leading websites attract large number of visitors (91.16%, Figure 4-A). On the other
  • 6. 5 hand, in entertainment category, large number of non- dominant websites are found (39 out of 41, 95.12%). Even though the number of visitors is small in each website, the sum is larger (72.88%, Figure 4-B) than the visitors of two leading websites. People’s daily visit on the ten most popular websites (55.86%) is slightly higher than the preference to visit the rest unique, yet non-popular websites (44.14%). Figure 6. Three personae in keeping web content. The frequent visit on such diverse websites, which is especially observed in entertainment content, indicates Since different personae were found, analyzing the preference to consume content by personal interest behavior to keep web content is performed with Group rather than by common interest of others. B and C. Group A result is not included in the analysis. In Figure 7 and 8, the result from Groups B and C are Result: Keep illustrated as line and bar graph, respectively, such that With respect to the behavior on keeping web contents line graph is found only in content <5>, <6>, and <7>. on a daily basis, which implies the moment that the Stars on each graph present the dominant method of public content becomes personal content, we are which the preference is statistically significant after mostly concerned with how and why people select CHI-square (p<.05). Empty and filled stars stand for between saving and printing, what the relevant the result from Group B and C, respectively. methods are, and how content type affects decision making. Q: Which method do you use to keep web content as yours? (Select one method for each content type.) Three personae were found in the question that asks about the preferred method among four choices (print, save, others, and not applicable) to keep content in seven content types (Figure 6): keeping web content very rarely (Group A), conditionally (Group B), and actively (Group C). Group A (14.05%) does not keep web content in any content type. Group B (38.10%) conditionally keeps web content. They keep at least one content type that relies on active discovery (<5, 6, 7>) while keep none of the content in passive delivery category (<1>-<4>). Group C, which shows the most active keeping behavior, keeps all types of given Figure 7. Keeping web content by content type and persona content.
  • 7. 6 Group B (Figure 7) prefers saving to printing for Result: Organize keeping web content, with statistical significance The behavior for organization is related with the (p<.05) in work related document (<6>) and preferences and strategies to manage personal supporting document for daily activity such as shopping information by using three methods: bookmark, email, or banking (<7>). Group C prefers saving to printing and file browser. The preference for using bookmarks, (p<.05) to keep all content type. email, and file browser are 45.84%, 23.91%, and 30.24%, respectively (Figure 9). To learn about preferred method to save web content, we provided five methods in another question: Q: How do you organize content saved from the web? bookmark, downloading to the local machine, uploading (Select all applied) to cloud storage, post to SNS, copy and paste to the local machine text editor and save. Among these methods, bookmark and download are mostly preferred. The preference for downloading work related document (<6>) to the local machine is discovered from Group B (Figure 8, p<.05). For all content types in passive delivery category (<1, 2, 3, 4>), Group C prefers bookmark (p<.05). However, for <6> and <7> in active discovery category, Group C prefers to download the content into the local machine (p<.05). Figure 9. Organizing content saved from the web Q: Which method do you use to save web content? Q: How many bookmarks do you have in bookmark menu and toolbar in your default web browser? Figure 8. Saving web content by content type and persona Figure 10. Managing bookmark by using toolbar and menu
  • 8. 7 Q: How do you further organize/categorize to manage Design implication bookmark, emails, and files? (Select all applied) Lessons learned from this survey (Table 2) contribute to building content-specific design strategies of TW, which aims to include contents by both passive delivery and active discovery (Figure 1). Access: Bookmark vs. Keyword search Bookmark is the most dominant method to access SNS, work related document, and supporting document (<3, 4, 6, 7>). Thus, TW can actively utilize bookmarks as a source of automatic or passive delivery. However, since Figure 11. Methods to organize content saved from the web not only those contents but also bookmarks could be highly private by the nature, TW should consider The use of bookmark toolbar significantly drops when privacy issue carefully when augmenting daily the number of bookmarks exceeds 25 (Figure 10). This interaction. On the other hand, keyword search is tendency is stronger when the number of bookmarks in preferred to accessing entertainment and reference the menu is less than 50 (solid lines in Figure 10). content (<2, 5>). Such content could be highly When the number of bookmarks in the toolbar is less situational, hard to predict, and require manual than 25 (Toolbar:1-25 in Figure 10), people tend to operation. Thus, automating such activity through user have more items in the bookmark menu. However, this analytics would be helpful in predicting user interest difference is not observed if the number of bookmarks and profile. As Liu et al. developed a system to on the toolbar exceeds 25. Among various metaphors recommend personalized news content by analyzing that assist organizing, folder (51.27%), which probably click behavior [6], entertainment and reference content has the longest history of use and is widely applied, is (<2>, <5>) can be recommended in TW by analyzing the most preferred metaphor (Figure 11). history of relevant activities: content clip; keyword search; and link click. Access Consume Keep <1> Various 2nd popular content. Has decent leaders. Bookmark Consume: Find the niche for entertainment portal <2> Search 4th popular content. Has weak leaders. Bookmark Unlike SNS (<3, 4>), entertainment content (<2>) <3> Bookmark 1st popular content. Has strong leaders. Bookmark does not have a significant leader. Among 41 websites <4> Bookmark 1st popular content. Has strong leaders. Bookmark that constitute entertainment content, two top websites draw only 27.12% of traffic, which implies relatively <5> Search Less popular content. No leaders. Various weak hold on user consumption and long tail of the <6> Bookmark Less popular content. No leaders Download market share. Such a long tail (72.88% of traffic from <7> Bookmark 3rd popular content. Has decent leaders. Download 39 websites) refers to the need of entertainment portal Table 2. Summary that aggregates various, yet user-specific, requests.
  • 9. 8 Keep: Passive delivery vs. active discovery design. Although not explored in this preliminary Passive delivery content is saved by bookmarks. survey, examining the effects of content type with However, active discovery content, except references, respect to the relationship between browsing history is downloaded to a local machine, which is a more and organizing strategy in emails, bookmarks, and files private space. Reference content (<5>) is saved by would be a viable direction for further investigation. various methods with similar preference (bookmark, post to SNS, or download). From this result, we assume Acknowledgement that TW should carefully detect such user preference. We thank to staff members of Communispace for their The fact that the news, entertainment, and SNS feeds help on recruiting participants and collecting data. (<1>-<4>) are mostly saved as bookmark reflects the characteristics of the content that is consumed, References updated, and disposed frequently. Thus, TW may [1] Adar, E., Dontcheva, M., Fogarty, J. and Weld, D. highlight the timely delivery and appropriate visibility of S. 2008. Zoetrope: interacting with the ephemeral web. In Proc. UIST 2008. ACM Press (2008), 239-248. such content while concerning information overload. On the other hand, work related documents and supporting [2] Ames, M. and Naaman, M. Why we tag: motivations for annotation in mobile and online media. documents for the real world activities (<6>, <7>) In Proc. CHI 2007. ACM Press (2007), 971-980. may require advanced features saved in TW repository: ensuring secure management and clustering [3] Anderson, C. The long tail: why the future of business is selling less of more. New York: Hyperion, saved content with other clipped content of other 2006. application for further modification. [4] Fisher, D., Brush, A. J., Gleave, E. and Smith, . A. Revisiting Whittaker & Sidner's "email overload" ten Organize: Integrating diverse concept with simplicity years later. In Proc. CSCW 2006. ACM Press (2006). In this step, even the content clipped from the public 309-312. web is considered as a personal resource. We observe [5] Jones, W., Phuwanartnurak, A. J., Gill, R. and that the dominant preferences are for bookmark and Bruce, H. Don't take my folders away!: organizing folder metaphors. In TW, which will eventually blur the personal information to get things done. In Proc. CHI boundary among relevant applications, current practice EA 2005. ACM Press, 1505-1508. to organize bookmarks on the web browser, files in the [6] Liu, J., Dolan, P. and Pedersen, E.R. Personalized file browser on a local machine, and folders or other news recommendation based on click behavior. In metaphors on the email client will be seamlessly Proc.IUI 2010. ACM Press (2010), 31-40. aggregated. Thus, the challenges in designing an [7] Savolainen, R. Filtering and withdrawing: strategies organizing assistant in TW would be the integration of for coping with information overload in everyday diverse concepts with simplicity: how to optimize contexts. J. Inf. Sci. 33, 5 (October 2007), 611-621. diverse organizing method to fulfill various need and [8] Whittaker, S. and Sidner, C. Email overload: context and how to provide concrete structure in a exploring personal information management of email. universal manner to convey general theme in the UX In Proc. CHI 1996, ACM Press (1996). 276-283.