Novelle: A collaborative open source writing tool software

Introduction Structure Architecture Conclusions

Novelle
A collaborative open source writing tool software

Federico Gobbo & Michele Chinosi
{federico.gobbo,michele.chinosi}@uninsubria.it
Universit` dell’Insubria
a
Varese, Italy


What we learned from hypertexts

The digital revolution and the medium of writing

Since the Web Era (1991), new forms and techniques of writings
emerged, whose structural traits are still unclear, and the
role/distinction between authors and readers beging to collapse.

The optimists say: ‘hypertexts realize our postmodern and
decostructionist dream of an ‘opera aperta’ (open work)’.

On the contrary, the pessimists say: ‘authors have lost their power
in this openness’.



The main ideas behind Novelle

Our aim is to ﬁnd a way to build new texts which is fully satisfying
for authors/users/active readers and whose structure is clear, i.e.
suitable for linguistic computation. These are the main ideas of
Novelle.

We feel that the problems faced by hypertexts are much more the
sames of blogs and wikis. We didn’t want to reinvent the wheel, so
we gave a lot of attention of the known literature, in particular on
hypertexts.



The analysis of terminology on hypertexts

The technical (Cunningham, McCloud) and philosophical (Nelson,
Bolter, Landow) description of the known problems faced by
hypertexts had helped us to design Novelle. We started from
terminology.

Terms as ‘chapter’, ‘page’ or ‘footnote’ become senseless in the
new texts, or they highly change their meaning. What seems to be
lost is the relations, the texture underpinning the text itself –
etimologically, ‘texture’ and ‘text’ both derive from the late Latin
term textum, coined by the Roman Rhetorician Quintilianus.



Terms we adopt

Some keywords we found useful for our analysis:

web canvas instead of ‘web page’, much more clear and not
dependent from printing, from web comics (McCloud);

lexias, i.e. autonomous units of a hypertext, form hypertexts in
education (Bolter).

transclusion, i.e. a kind of quotation, but with the ability to follow
the evolution of the original document (see later).


Known problems and proposed solutions

Known problems as traced by Nelson, 1992

the framing problem, i.e. how to extract sub-collections
without loss of context information.




comparing complex alternatives, i.e. how to get parallel or
alternative versions of the same document.




typology of links, i.e. how to order links avoiding confusion.




typology of links, i.e. how to order links avoiding confusion.
version control, i.e. how to keep track of the history of every
document.



Which problems are already solved?

the framing problem is not solved.




the framing problem is not solved. We think the best
approximation is to let the users be able to extract
subcollection on-the-ﬂy, i.e. extractions are not permanent,
they are view of lexias.




comparing complex alternatives is not solved.




comparing complex alternatives is not solved. A cue is given
by the document history model by wikis, but it works only on
a chronological basis.




typology of links is not solved.




typology of links is not solved. The (X)HTML standard(s) of
the anchor tag is too generic to give a typology.




version control is solved




version control is solved by the wiki model of document
history, so we keep this solution.



The wiki model of document history

a very old an old the current the last
version version version version

the document
creation destruction
history timeline

an edit a restore

sandbox


Ownership and licencing

The problem of ownership

In order to address the remaining problems, we found that we had
to make a choice for the problem of ownership.





The Blog Way. Blogs follow the annotation model, where a single
lexia is central and the others are comments, sometimes organized
in threads (“write once, read many”). Advantage: suitable for a
lot of licences.





The Blog Way. Blogs follow the annotation model, where a single
lexia is central and the others are comments, sometimes organized
in threads (“write once, read many”). Advantage: suitable for a
lot of licences.

The Wiki Way. Wikis follow an free-to-edit model, where every
lexia is central: no authorship, no signature, no hierarchy (“write
many, read many”). Disadvantage: only GPL-like licences.



Solution: free licencing

We decided to let users/authors free to choose the licence of their
works, as in case or narrative or creative works ownership is treated
as authorship, even by fellows of free culture (Stallman, Lessig). In
other words, each user owns his own lexias (as blogs).



Solution: free licencing

We decided to let users/authors free to choose the licence of their
works, as in case or narrative or creative works ownership is treated
as authorship, even by fellows of free culture (Stallman, Lessig). In
other words, each user owns his own lexias (as blogs).

So, we decided to implement every standard Creative Commons
(cc) Licence version 2.0. Consequently, everybody is free to
comment everything, but freedom of everything may be denied
depending on the licence. This have to be chosen on lexia creation.



How to manage editing and create derivative works

an old the current
version version

the document
creation
history timeline

creation of a a new document
derivative work history timeline



For people who don’t like pictures

A user may let others edit his work, i.e. no No-Deriv option in the
CC licence. If so, he has the right to retain or refuse the attribution
after the edits – for comparison: wikis let only to restore
documents along history, you can’t “fork contents”; nor blogs.

In this case, a new history timeline start and a derivative link will
be put to mark the derivation from the original work.



Transclusion: beyond quotation

the current
version

the document
history timeline

transclusion

a freezed quotation

an other document
history timeline



Again, for people who don’t like pictures

Every user may comment and quote works by others – max. 10%
of the original document may be quoted, as in the Italian Law. We
call the quoted text transclusion, following Nelson. Unlike the
cut-and-paste text, a transclusion retains a link to recall the
original context (quotation link), so it never points to out-to-date
data.

Furthermore, a transclusion may let the author/user to be living,
i.e. to be kept up-to-date along the history timeline of the original
document.



An up-to-date transclusion example

an old the current
version version

the document
history timeline

transclusion

an up-to-date
quotation

an other document
history timeline



From transclusion to a typology of links

A quotation link is a special case of the deep links, i.e. every link
that let users change the context. A web page becomes a view of
lexias, where relations/diﬀerences/comparison between lexias are
rendered visually in a single canvas. The choice of the (CC) licence
may be a guide for the user.





So, we distinguish between shallow links, i.e. links in a single view
of lexias, and deep links, i.e. links that let users change view or
context.





So, we distinguish between shallow links, i.e. links in a single view
of lexias, and deep links, i.e. links that let users change view or
context.

Finally, links to web materials out of our system will be marked as
external links.


New solutions to classic hypertext’s problems

Our solutions of open problems of hypertexts

the framing problem should be solved by deep links and web
canvas as views of lexias.




comparing complex alternatives should be solved by
transclusions and the document history model by wikis.




typology of links, i.e. shallow vs. deep (quotation, derivative)
and external links should avoid chaos.




typology of links, i.e. shallow vs. deep (quotation, derivative)
and external links should avoid chaos.
version control is already solved by wikis.


A simple overview

The Architecture of Novelle
This is a basic scheme of Novelle multi-tier architecture

GUI

AJAX
Ruby on Rails
RDBMS

XML

DBMS / Filesystem


Why XML for data?

XML, eXtensible Markup Language

We choose XML as language and meta-language because we
want to be able to save messages with their meanings.


Why XML for data?


XML is a W3C standard.


Why XML for data?


XML lets us extend and connect Novelle with other
applications.


Why XML for data?


XML lets us extend and connect Novelle with other
applications.
Storing separately data from their representations lets a
system run more eﬃciently and quickly.


Why XML for data?

XML suites our needs for the Repository

We use XML trees to store together data, metadata,
messages and their meanings.


Why XML for data?


We tried to use other more classical solutions such
commercial databases (e.g. Oracle) or open-source software
(e.g. PostgreSQL, MySQL).


Why XML for data?


None of these solutions let us store eﬃciently our data
structure.


Why XML for data?


None of these solutions let us store eﬃciently our data
structure.
An Entity-Relationship schema can’t map exactly Novelle’s
data architecture.


Why XML for data?

Why RDBMS don’t ﬁt our needs

The commercial solutions we tested haven’t native support for
XML data except an emulation layer that maps XML trees
into E-R model to store data into tables, losing most of the
expressive power. These products also introduce a cost for
purchase and use them.


Why XML for data?

Why RDBMS don’t ﬁt our needs

The commercial solutions we tested haven’t native support for
XML data except an emulation layer that maps XML trees
into E-R model to store data into tables, losing most of the
expressive power. These products also introduce a cost for
purchase and use them.
Other free RDBMS, such as the most famous MySQL or
PostgreSQL, are compatible with XML data but they also
store XML trees in a relational schema or map the entire XML
tree as BLOB (Binary Large OBject) storing it in one large
table.


Why XML for data?

Native XML databases

We tried using Xindice (by Apache Group), eXist, Ozone as
native XML databases.


Why XML for data?


While Ozone, as an Object Oriented XML native database,
requires a large amount of memory for working on entire XML
trees, Xindice and eXist are two more interesting projects.


Why XML for data?


Xindice has not been developed since april 2004, so it is very
diﬃcult to adopt it for a new project.


Why XML for data?


Xindice has not been developed since april 2004, so it is very
diﬃcult to adopt it for a new project.
eXist is more usable and stable, but it doesn’t implement full
XML standard and its performances are not so good (yet).


Why XML for data?

XML Repository in the filesystem

Like most blogs and wikis, we choose to store Novelle XML
repository on time-based filesystem structure. Our representation is
a directory tree that reflects quite well our idea of history.


Why XML for data?

XML Repository in the filesystem

Like most blogs and wikis, we choose to store Novelle XML
repository on time-based filesystem structure. Our representation is
a directory tree that reflects quite well our idea of history.
For every message Novelle stores three XML documents:
The message itself
Its past history
The filesystem directory tree


Ruby on Rails and AJAX

Ruby on Rails: an open-source web framework

The Ruby on Rails (RoR) framework lets us a quick develop
cycle for web applications without the need to rewrite common
functions and classes (DRY - Don’t Repeat Yourself).




It provides XML builder and gdiﬀ/gpatch libraries. In
particular:




particular:
gdiﬀ/gpatch creates a patch from two ﬁles.




particular:
gdiff/gpatch creates a patch from two files.
XML Builder offers a set of classes to menage XML files.



Ruby on Rails for the document history model

Using gdiﬀ/gpatch we can implement our history model in an
easy way and saving space.




Moving across a history means to retrieve a ﬁxed number of
subsequent patches.




Moving across a history means to retrieve a ﬁxed number of
subsequent patches.
RoR doesn’t support XML native databases, so we temporarily
use a RDBMS only for RoR needs.



AJAX: Asyncronous Javascript And XML

AJAX is a web development tecnique for creating interactive
web applications. It uses a combination of XHTML,
Javascript, XML, CSS, DOM and the XMLHTTPRequest
object.




object.
XMLHTTPRequest lets clients ask servers to give some
particular data using asyncronous handshake, while users can
still continue using the web application.




object.
XMLHTTPRequest lets clients ask servers to give some
particular data using asyncronous handshake, while users can
still continue using the web application.
Ruby on Rails fully supports AJAX.


Access Points

Access Points as a ﬂexible model for navigation

In Novelle users can search through histories using a simple
search engine.


Access Points


search engine.
The engine returns a list of meaning and a set of links
between them.


Access Points


search engine.
between them.
These links are represented with clickable images. Every
image is itself a map that the user can surf and/or open to
increase details level.


Access Points


search engine.
between them.
Users can create new typed links between lexias or
comment/modify every existing lexia if these actions are
granted by the author.


Access Points


search engine.
between them.
Users can create new typed links between lexias or
comment/modify every existing lexia if these actions are
granted by the author.
These modiﬁcations are stored into the history of the
document.


What is done, what’s still to do

Novelle’s milestones

Make a concept map to organize goals and common
terminology for the team – done.




Test technologies in order to ﬁnd out what may ﬁt our needs
– done.




– done.
Develop our ﬁrst prototype with the basic data structure – in
progress.




– done.
progress.
Let users test our prototype with real data and real needs, in
order to have feedback about GUI and features – to do.




– done.
progress.
Adjust GUI and features. Publish Novelle as an open-source
project – to do.




– done.
progress.
Adjust GUI and features. Publish Novelle as an open-source
project – to do.
Dig and extract sensible information automatically, thanks to
the well-structured context datas – to do.



Thanks

We wish to acknowledge Massimiliano Pepe for his collaboration.

Attribuzione - Non Commerciale - Condividi allo stesso modo 2.0 Italia

http://creativecommons.org/licenses/by-nc-sa/2.0/it/

You may have this ﬁle here: http://purl.org/net/fgobbo

Questions?

Novelle: A collaborative open source writing tool software

Recommended

Recommended

More Related Content

Similar to Novelle: A collaborative open source writing tool software

Similar to Novelle: A collaborative open source writing tool software (20)

More from Michele Chinosi

More from Michele Chinosi (13)

Recently uploaded

Recently uploaded (20)

Novelle: A collaborative open source writing tool software