Who Decides? Reinterpreting archival processes for the management of digital research
Gareth Knight, Centre for e-Research, King’s College London
Presentation Themes
What is a record?
What is a record?
What is a Record today? What is a Record tomorrow?
Re-evaluating Records management at King’s College London
New archiving challenges
Preservation Exemplars at King’s (PEKin)
Audit framework: PEKin audit framework combines sections of DAF, DRAMBORA, DIRKS & other audit work
Audit management practices
Types of digital information
Many types of lifecycle: Record lifecycle (variants: Information, data lifecycle); Access lifecycle (e.g. digital lifecycle)
Analysis of lifecycle risks
Recognised risks
Risk management Strategy
KCL Archives Preservation Repository
OAIS Reference Model
Technical Infrastructure
Alfresco Actions
Ingest Actions
Archiving actions
Content Models: Filing Cabinet, Drawer, Folder
Content Model examples: Each item represents a Fedora Object. Each FO holds user provided metadata. Different MD required at each layer (Filing Cabinet, Drawer, Folder).
Findings (so far)
Contact: http://www.kcl.ac.uk/iss/cerch/projects/portfolio/pekin.html Centre for e-Research: www.kcl.ac.uk/iss/cerch Archives and Information Management (AIM): http://www.kcl.ac.uk/iss/explore/team/aim

Editor's Notes

  1. I present a case study of work being performed at King’s College London to revise its records management strategy. The presentation covers three key areas: the importance of regularly re-evaluating the definitions and criteria for what constitutes a Record to an institution; the challenges that may be posed when attempting to manage the lifecycle of digital records; and the technical architecture and processes that KCL has adopted to manage digital records.
  2. This slide introduces a record, describing its purpose and role within an institution. Definitions provided by the ISO Records Management specification and the ICA establish a distinction between information and records. While an institution will produce or maintain many types of information, not all of this information will constitute a record.
  3. In this slide, I highlight key terms in the definitions that are useful for distinguishing between information and records. Key is the relationship between the data, key stakeholders and the set of activities that are performed within the institution. The Records Management standard places an emphasis upon activities related to business transactions & legal requirements, whereas the ICA adopts a broader perspective by considering the evidence base necessary to understand any type of activity. It is left to the institution to decide which business transactions or activities should be recorded.
  4. Criteria for determining the types of resource to maintain must be continuously reviewed and updated to meet the contemporary aims and objectives of the institution. The criteria for determining what is a Record may evolve over time as a result of many factors, including alterations to the legal framework and changing interpretations of the business of the institution. For an academic institution, records traditionally cover its operational conduct and teaching activities. In recent years, institutions have begun to recognise that other types of activity play a key role in their business and have developed broad criteria to encapsulate research and other types of information that represent the memory of the institution.
  5. We are currently addressing many of these issues at King’s College. College data is managed by a central archives service. The service currently collects business records, college heritage resources & private papers. For the most part, these resources are paper-based. Evidence of this can be seen in the preservation policy, which emphasises maintaining records in their original form & storing them in appropriate environmental conditions.
  6. In recent years there has been a recognition that an increasing amount of information of archival value is being created & published that is not covered by the policy, and that new practices are required to manage this content. Specifically, the growth of electronic creation & publication methods presents new challenges for the archival approach that is taken. Resources are increasingly complex, both in their composition (e.g. hybrid archives that are composed of paper & digital material) and in the type of content that is created (e.g. CAD drawings, interactive resources). Many of these resources do not have a paper equivalent. The frequency of content creation, revision and publication, together with technological dependencies, presents new challenges for an archivist.
  7. To examine these issues in detail, we obtained JISC funding and are currently developing a management solution that can be applied to the broad range of business and research data that is being produced by the college. Our strategy combines the skills and expertise of archivists and digital curators to ensure that digital resources of archival value remain accessible and usable for the period of retention. This is achieved through a two-layered strategy. First, we work with academic units to understand their production workflow and identify the types of risk that may limit long-term access. Second, we work with central services in the college to identify gaps and areas for improvement in their management infrastructure. The outcome of the project will be a digital repository for the preservation of research and business data of archival value and an overarching strategy for managing data within the college.
  8. To understand the type of information being produced and its management requirements, we developed a bespoke framework for auditing each academic unit. The framework draws upon and combines sections of several existing methodologies, including the Data Audit Framework, DRAMBORA and others. We have extended these by modifying the lifecycle model and extending the vocabulary that was used. Although certain aspects of these frameworks were designed for a digital repository environment, they have proved useful for understanding data management issues in the wider context.
  9. A data audit was applied to selected departments within the college that were identified as having data of particular value or at particular risk. We wished to understand the functional activities performed, the type of information created, the reason that the data was used, who was responsible for these resources and the time period that they were stored for. The audit process was time-consuming to perform and required perseverance. However, by the end of the audit, we were able to produce a detailed case study for each department or service.
  10. The audit identified many types of digital information being stored in each dept. These were frequently produced in-house, but also came from external sources. Each department considered their data assets to be valuable, but used different criteria for understanding value & adopted different approaches for maintaining them.
  11. A second aspect of the audit work was to develop an understanding of the digital object lifecycle. Digital information may be subject to several types of lifecycle. We have the record lifecycle (on left), which emphasises the creation & use of a resource, as well as an appraisal process that may result in its destruction or re-use for a different purpose. We also have the cycle of access, represented by the digital lifecycle, that focuses upon the access period of the storage format. An encoding format may be created by a business, which will publish it, revise it over time to provide new functionality and, at some stage, make a decision to withdraw the format from usage or make the specification available for use. The ability to access information stored in the format will often depend upon where the format is in its lifecycle. Content stored in an older format that has subsequently been revised, replaced or withdrawn may be more difficult to access and recreate.
  12. As a result of the recognition that digital information has a lifecycle, we must also recognise that events may occur that reduce the likelihood that long-term access can be retained. To understand these events, we applied a ‘light touch’ version of the DRAMBORA risk assessment methodology to each case study. DRAMBORA was originally developed to analyse risks that may occur in a digital repository. For the ‘light touch’ version, we focused upon risks during the lifecycle, from creation to dissemination, as well as aspects of the staff and technical infrastructure components that affected the object. Each risk was documented, describing when it may occur, the potential consequences and one or more mitigation strategies that could be implemented.
  13. As a result of the risk analysis, we identified a number of management issues. These may be classified into four categories. Storage issues were the most common. Many staff were reliant upon USB disks & various free & for-cost 3rd party systems to store their data. This presented the risk that data could be lost or corrupted at any stage. Authenticity was a concern for all staff. Several less technical staff were reliant upon printed copies because they did not trust the digital copy that had been produced. In addition, the archival value & retention requirements were unknown. Assessment of the value of digital information was intrinsic to many departments’ operations, but implementation varied between staff members.
  14. To mitigate the risk of loss or corruption of data during its lifecycle, we identified four key issues that we would have to address. The first, and perhaps most obvious, issue is that a technical infrastructure must be provided to each dept. to store their data. These systems should provide services to maintain authenticity and integrity. Second, education is required to help staff understand data management and archival principles, through provision of training events and supporting documents. Third, consistent policies are required at the institutional level that indicate the types of digital information that have archival value. Finally, procedures are required to acquire, curate and preserve data in an archival environment.
  15. To reduce the likelihood of many risks, we’re developing a preservation repository that will be used to manage college data of short & long-term value. For the last section of this presentation, I will examine various aspects of the repository. In abstract, it was identified that the repository should comply with necessary standards, such as OAIS and Trusted Repositories, and comply with records management strategies by providing bitstream and content preservation. It should also be able to interoperate with other institutional systems and provide controlled access to resources.
  16. A widely known repository specification is the OAIS Reference Model, which provides a consistent method of expressing the components of a digital repository: Ingest, Data Management, Archival Storage, Administration, Preservation Planning and Access. These components are responsible for processing the digital object at each stage of the workflow, changing it from a Submission Information Package into an Archival Information Package suitable for preservation and a Dissemination Information Package suitable for access & use.
  17. In practice, the preservation repository is composed of several technical components that enable the archive to fulfil the functions of an OAIS. The Alfresco CMS (on left of screen) is being used to provide the submission interface and data processing functions. Alfresco was chosen based upon its extensive plugin support & ability to create complex content models. Data is stored temporarily in Alfresco until the collection is closed. At this stage the data is transferred into Fedora Commons for archiving. Fedora stores all data and metadata in their various versions. To provide controlled access, we’re currently testing Muradora as a front-end and using add-ons such as Tika and Solr to provide metadata and full text searching.
  18. The processing workflow is expressed as a Business Processing Model (BPM). During the workflow, key activities are performed on each digital object. Alfresco actions are used to configure the type of events that occur, when they are initiated and the type of resource they are applied to. These are configured by defining various parameters. The activities can be organised into sequential or concurrent stages as required.
  19. The primary stage where actions are performed is on receipt of the data. Actions will include: checking that the data structure conforms to the specified content model (mentioned in a later slide); creation of fixity values for each file; format identification, which will lead into use of format-specific tools to perform technical metadata extraction; and conversion into appropriate preservation and dissemination formats. Each activity will be logged & stored as PREMIS Event metadata.
  20. At a later stage, the data and metadata will be archived. A collection is held in Alfresco until one of two conditions is met: the collection is designated as closed by an archivist, or a specified time period has elapsed, e.g. 2 months after deposit. The parameters that are set will vary, dependent upon the broad category of data. Actions will also be performed at later periods during the lifecycle, e.g. fixity checks will be performed to validate that the on-disk data is unchanged. Additional actions may be set by the archivist or depositor, for example to re-appraise content every 5 years to determine if it should continue to be stored, or to change the access status of the resource after an embargo.
  21. Content models define the organisational structure of different types of collection, the type of content that can be stored & the behaviours of individual components. For the repository, we adopted a three-layer structure for each collection type. The layers are organised hierarchically. A comparison may be made to a filing cabinet that contains a set of drawers, each of which may contain one or more folders.
  22. The diagrams on screen show examples of the content models that have been developed for committee records and research projects. A committee record is sub-divided into meetings that occurred on specific dates. Each meeting will possess an agenda, minutes of the previous meeting and various papers. At a later stage, the minutes of the current meeting will also be added. Research projects also have a three-layer structure, broken down by year of project, with each year separated into different types of content.
  23. To conclude, digital resources represent a challenge for archives to maintain, but also present opportunities for re-assessing the implementation of archival practices. We have found that data audit & risk analysis provide useful frameworks for understanding the information lifecycle and justifying the archiving of these resources. Although digital resources created for business and research purposes perform different functions in the institution, the process of managing them remains broadly similar and can be automated to some degree in a shared system. Finally, I think the process has demonstrated the value of applying archival techniques to the management of digital resources. It has been invaluable in understanding the storage and management requirements of the college and identifying further areas for development.
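
As an illustration of the ingest and archiving actions described in notes 19 and 20, the following is a minimal Python sketch, not the Alfresco or Fedora implementation used in the project. It shows a fixity value being created on receipt, a later fixity check, a simplified event log standing in for PREMIS Event metadata, and the rule that a collection is archived once an archivist closes it or a hold period elapses. All class and function names are hypothetical, SHA-256 is an assumed choice of digest, and the 60-day hold is an assumed reading of "2 months after deposit".

```python
import hashlib
from dataclasses import dataclass, field
from datetime import datetime, timedelta


@dataclass
class Event:
    """Simplified stand-in for a PREMIS Event record (not the real schema)."""
    event_type: str
    target: str
    outcome: str
    when: datetime


@dataclass
class Collection:
    """Hypothetical holding area for deposited files prior to archiving."""
    deposited: datetime
    closed_by_archivist: bool = False
    fixity: dict = field(default_factory=dict)  # file name -> SHA-256 hex digest
    events: list = field(default_factory=list)  # log of actions performed


def sha256_of(data: bytes) -> str:
    """Return the SHA-256 digest of the file content as a hex string."""
    return hashlib.sha256(data).hexdigest()


def ingest(collection: Collection, name: str, data: bytes, now: datetime) -> None:
    """Record a fixity value for one file and log the action as an event."""
    collection.fixity[name] = sha256_of(data)
    collection.events.append(Event("fixity creation", name, "success", now))


def check_fixity(collection: Collection, name: str, data: bytes, now: datetime) -> bool:
    """Recompute the digest at a later date and log whether the content is unchanged."""
    unchanged = collection.fixity[name] == sha256_of(data)
    outcome = "success" if unchanged else "failure"
    collection.events.append(Event("fixity check", name, outcome, now))
    return unchanged


def ready_to_archive(collection: Collection, now: datetime,
                     hold: timedelta = timedelta(days=60)) -> bool:
    """True once an archivist has closed the collection or the hold period has elapsed."""
    return collection.closed_by_archivist or now - collection.deposited >= hold
```

In a sketch like this the hold period would be set per broad category of data, as note 20 suggests, and the event log would be serialised into whatever metadata format the archive uses.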