<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Digital Preservation Blog</title>
	<atom:link href="http://digitalpreservation.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://digitalpreservation.wordpress.com</link>
	<description></description>
	<lastBuildDate>Wed, 20 Jul 2011 11:35:16 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='digitalpreservation.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>Digital Preservation Blog</title>
		<link>http://digitalpreservation.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://digitalpreservation.wordpress.com/osd.xml" title="Digital Preservation Blog" />
	<atom:link rel='hub' href='http://digitalpreservation.wordpress.com/?pushpress=hub'/>
		<item>
		<title>Certification &amp; Outsourcing</title>
		<link>http://digitalpreservation.wordpress.com/2006/03/24/certification-outsourcing/</link>
		<comments>http://digitalpreservation.wordpress.com/2006/03/24/certification-outsourcing/#comments</comments>
		<pubDate>Fri, 24 Mar 2006 15:51:54 +0000</pubDate>
		<dc:creator>Joss</dc:creator>
				<category><![CDATA[Costs]]></category>
		<category><![CDATA[Outsourcing]]></category>
		<category><![CDATA[Preservation]]></category>
		<category><![CDATA[Repositories]]></category>

		<guid isPermaLink="false">https://digitalpreservation.wordpress.com/2006/03/24/certification-outsourcing/</guid>
		<description><![CDATA[The last hour of the course was spent discussing the Trusted Digital Repositories (PDF) certification document and the related matter of outsourcing digital preservation to another institution. Two of the tutors were part of the group that have drafted the TDR document and another of the tutors was involved in the peer review process. They [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=digitalpreservation.wordpress.com&amp;blog=156049&amp;post=23&amp;subd=digitalpreservation&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>The last hour of the course was spent discussing the <a href="http://www.rlg.org/legacy/longterm/repositories.pdf">Trusted Digital Repositories</a> (PDF) certification document and the related matter of outsourcing digital preservation to another institution.</p>
<p>Two of the tutors were part of the group that have drafted the TDR document and another of the tutors was involved in the peer review process.  They gave some background to the document which grew out of a recommendation from a 1996 Task Force on Archiving of Digital Information.  They explained the basic difference between auditing a repository and certifying a a repository, essentially saying that an audit is an internal or external process evaluating you against ideals.  An audit is about continual improvement and does not necessarily have a pass/fail process.  Certification, on the other hand, does result in a pass or fail, usually of an audit-like process.  Certification may have elements of both processes such as <a href="http://www.iso.org/iso/en/iso9000-14000/understand/inbrief.html">ISO 9000</a> mandatory elements. </p>
<p>The authors of the TDR document were tasked to created a standard certification process or framework that can be implemented across domains or types of digital repositories.</p>
<p>The tutor who has been involved in the peer review process criticised the draft document on the following grounds: </p>
<ul>
<li>It sets the bar too high for many archives</li>
<li>The drafting of the document was not led by archivists</li>
<li>The authors claim that they are an international body, but in fact they are mostly N. American with some Europeans but no-one east of the Netherlands</li>
<li>The OAIS standard is flexible but the TDR certification which is based on OAIS is very specific about wht the archive should achieve</li>
</ul>
<p>However, before we throw the document out on the above criticisms, it is worth remembering that it&#8217;s only a draft and that it still offers a very useful tool for internal audit and self-improvement.  As one of the tutors listed all the reasons why we should care about certification, I realised that none of them specifically apply to AI but are really aimed at public institutions who have obligations to to funding bodies, regulatory bodies, service purchasers, and external depositors.  Of course, certification would still give a number of IRP staff a warm glow inside, smug with the satisfaction of international recognition of our work and the service we provide to our organisation, but I think the real value in the TDR document is that it provides practical guidance on how we can improve.  Also, having audited our digital preservation archives using the TDR document, we may come to the conclusion that we&#8217;re just not up to or interested in developing an archive for long-term preservation and that we might want to outsource some or all of the work to another institution.  Preferable one that <em>is </em>certified!</p>
<p>Outsourcing was touched upon at the very end of the hour and we were offered some step-by-step guidelines on approaching the possibility of outsourcing.  Here they are, straight from the PPT slides:</p>
<blockquote><p>
<strong>Rationale</strong><br />
We can&#8217;t all be experts in everything<br />
We need to carry out some tasks we are not well-equipped to do<br />
We may have resource reasons (money but no space)<br />
We may have policy reasons<br />
It may be cheaper</p>
<p><strong>Understand your needs</strong><br />
Define your problem before you look for a solution<br />
Otherwise you will buy the answer to someone else&#8217;s question<br />
Specify mechanisms to monitor and measure performance<br />
Look at the <a href="http://www.dpconline.org/graphics/guides/index.html#outsourcing">DPC document on outsourcing</a></p>
<p><strong>Outsourcing</strong><br />
Outsourcing digital preservation requirements may be an attractive option &#8211; particularly for smaller organisations<br />
But these are also the most vulnerable in terms of what they should expect to ask for and receive<br />
Having a system of certified repositories can help to provide assurance<br />
The checklist of requirements can help organisations find a good match between what they think they asked for and what they receive.</p></blockquote>
<p>That&#8217;s it really.  I should add that the <a href="http://www.ulcc.ac.uk">University of London Computing Centre</a> which is a ten minute walk from AI will almost certainly be certified as a TDR because one of the authors of the report, runs the preservation programme there.  And, yes, they take on consultancy work and are willing to discuss any outsourcing we might decide we want to do with them.</p>
<p>That&#8217;s the end of this blog. I hope you&#8217;ve enjoyed reading it and found at least some parts of it thought provoking.  As I said early in the week, I wanted to do it because Chris sent me on the course on the condition that I gave a presentation to interested staff when I returned, which I&#8217;m happy to do, although a presentation is probably not the way to go about it.  Hopefully this blog has provided some background reading (and light entertainment) for us to discuss in the near future. </p>
<p>See you on Monday.</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/digitalpreservation.wordpress.com/23/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/digitalpreservation.wordpress.com/23/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/digitalpreservation.wordpress.com/23/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/digitalpreservation.wordpress.com/23/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/digitalpreservation.wordpress.com/23/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/digitalpreservation.wordpress.com/23/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/digitalpreservation.wordpress.com/23/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/digitalpreservation.wordpress.com/23/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/digitalpreservation.wordpress.com/23/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/digitalpreservation.wordpress.com/23/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/digitalpreservation.wordpress.com/23/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/digitalpreservation.wordpress.com/23/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/digitalpreservation.wordpress.com/23/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/digitalpreservation.wordpress.com/23/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/digitalpreservation.wordpress.com/23/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/digitalpreservation.wordpress.com/23/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=digitalpreservation.wordpress.com&amp;blog=156049&amp;post=23&amp;subd=digitalpreservation&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://digitalpreservation.wordpress.com/2006/03/24/certification-outsourcing/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/91890bc5aa9197368aeaa4f2f0fbf011?s=96&#38;d=identicon" medium="image">
			<media:title type="html">digitalpreservation</media:title>
		</media:content>
	</item>
		<item>
		<title>The Internet Archive class project</title>
		<link>http://digitalpreservation.wordpress.com/2006/03/24/the-internet-archive-class-project/</link>
		<comments>http://digitalpreservation.wordpress.com/2006/03/24/the-internet-archive-class-project/#comments</comments>
		<pubDate>Fri, 24 Mar 2006 14:09:03 +0000</pubDate>
		<dc:creator>Joss</dc:creator>
				<category><![CDATA[Legal]]></category>
		<category><![CDATA[Metadata]]></category>
		<category><![CDATA[OAIS]]></category>
		<category><![CDATA[Preservation]]></category>
		<category><![CDATA[Repositories]]></category>
		<category><![CDATA[Risks]]></category>
		<category><![CDATA[Technologies]]></category>
		<category><![CDATA[Web]]></category>

		<guid isPermaLink="false">https://digitalpreservation.wordpress.com/2006/03/24/the-internet-archive-class-project/</guid>
		<description><![CDATA[We started the last morning of the training programme with presentations of our project. As I mentioned before, we&#8217;ve been looking at the Internet Archive (IA) as a way of &#8220;preserving an institution&#8217;s digital collections that are made available on the web.&#8221; The task then, was to examine whether the IA scored well with regards [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=digitalpreservation.wordpress.com&amp;blog=156049&amp;post=22&amp;subd=digitalpreservation&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>We started the last morning of the training programme with presentations of our project.  As I mentioned <a href="http://digitalpreservation.wordpress.com/2006/03/21/archiving-the-world-wide-web/">before</a>, we&#8217;ve been looking at the <a href="http://www.archive.org/about/about.php">Internet Archive</a> (IA) as a way of &#8220;preserving an institution&#8217;s digital collections that are made available on the web.&#8221;</p>
<p>The task then, was to examine whether the IA scored well with regards to the &#8216;three legs&#8217; and &#8216;five stages&#8217; I <a href="http://digitalpreservation.wordpress.com/2006/03/20/documents-legs-stages/">mentioned </a>at the beginning of the week. The assumption is that a &#8216;trusted digital repository&#8217; (basically, a &#8216;good archive&#8217;), should satisfy our requirements in the areas of resources, technology and organisational framework.  We were asked to score the IA against the five stages: Acknowledge, Act, Consolidate, Institutionalise and Externalise.  These basically represent five stages of maturity in a digital archive&#8217;s development.</p>
<p>If you haven&#8217;t spent much time on the IA&#8217;s website, it&#8217;s certainly worth doing so. It was originally  the initiative of one man, Brewster Kahle, who made millions in software development in the 1990s.   His foundation continues to fund around 50% of the IA&#8217;s expenses (we were given copies of the IA&#8217;s 990-PF Tax Returns for 2003 and excerpts from 1999, 2000, 2001 and 2002).  In 2003, the IA had expenses of just over $3m, so Brewster&#8217;s millions are essential for the sustainability of the IA.</p>
<p>This was a concern for all of us since we were asked to consider how sustainable the IA appears to be.  Now, admittedly, our figures are a couple of years out of date and we&#8217;ve no idea how Brewster&#8217;s foundation is set up and how sustainable that in itself is, but the donations from other companies over the years have been sporadic, project based and unpredicatable.</p>
<p>It&#8217;s an expensive operation to run when your mission is to collect every website in the world which is why in 2003, they spent $380,558 on disk and tape stock, $79,358 on misc. hardware and $27,002 on their phone/network charges.</p>
<p>The IA&#8217;s technological framework scored quite low with our class, too.  It&#8217;s web crawling is done by <a href="http://www.alexa.com/">Alexa </a>every two months, although if you look at the history of AI on the IA&#8217;s <a href="http://www.archive.org/web/web.php">Wayback Machine</a>, you&#8217;ll see that the results are more adhoc than that.  It&#8217;s worth pointing out that the IA provides researchers with <a href="http://www.archive.org/about/using.php">another </a>way of accessing the archived websites, although currently this is from a command line and they are developing tools to help make this easier.</p>
<p>We looked at how they approach long-term preservation, in particular the way the data is documented, stored, migrated, backed up and secured.  The IA stores the websites in ARC and DAT containers which are open formats and open source tools are available to work with these formats.  However, the original file formats harvested from the websites are not migrated to different formats for long-term preservation so proprietary file formats harvested, remain proprietary and therefore possibly inaccessible long-term.  The IA do also collect software and emulators in the hope that some kind of access will be possible in the future, even for obsolete formats.  It&#8217;s all a bit hit and miss, not least because their website has no formal policy or strategy for preservation.  It&#8217;s just a few paragraphs here and there which you have to draw your own conclusions from. The preservation metadata which they add to the harvested pages is minimal since it&#8217;s done by machine. You can see what they add by using the Wayback Machine and then looking at the source of one of the harvested web pages.</p>
<p>The lack of formal, detailed policy was also a concern to the class.  The IA have taken on such a huge task, but with the exception of some areas of their technological development, there&#8217;s very little to suggest that they have a strategy in place that will ensure the long-term preservation of their archive which is central to their mission.  The other major organisational concern was that technically their activities are illegal or at least waiting to be challenged.  Essentially, they are taking copyrighted material from owners without asking for their permission.  They do say on their website that they respect robot.txt commands so that you can tell their crawler not to archive your website and they also say that if you contact them and ask them to remove your website from their archive they will do so, but this places all of the responsibility on the rights holder rather than the IA, which is contrary to the usual methods of agreeing terms and conditions before copying/taking/using owned materials.</p>
<p>While on the subject of their web crawler, as I said earlier in the week, it has quite obvious problems in that is only works reliably with flat html files.  Javascript, database driven websites and in fact, anything interactive, including video, audio and animation, presents real problems to the IA.  So even if you were happy about the IA copying your material, the likelihood is that they will only be able to do part of the job.  This is increasingly apparent if you use their Wayback Machine as sites from a few years ago are much better archived than more modern sites which are increasingly using more interactive web technologies.  </p>
<p>They are trying to address some of these issues with a <a href="http://www.archive-it.org/">new subscription based service</a> which allows you to pay for tools to customise the way your website is archived, ensuring that all the right links are harvested on a schedule that suits your needs.  You can also add Dublin Core metadata. This has the potential to make the IA&#8217;s technology directly available to you at a price.  You can also request copies of their archive of your website (for a cost), and you can set up a private service so that your website archive is not available to anyone else, though this is clearly not what the IA would prefer you do. Unfortunately, this new service still suffers from the technological limitations I mention above and you are still advised that &#8220;as a general rule of thumb, simple html is the easiest to archive.&#8221;</p>
<p>So, as you can see, from our outsider view, we had major concerns about several key aspects (sustainabilty, technology, policy and strategy) of the IA.  However, all the above doesn&#8217;t directly address the class project&#8217;s central concern which was to examine how the IA might preserve a University&#8217;s digital collections that are being made available on the web.</p>
<p>What the IA do is archive web content, not the institutional digital collections.  Effectively, it&#8217;s like saying that they archive the ADAM web pages but not the actual ADAM digital archive which contains high resolution TIFF master images and a fuller set of metadata than is shown on the website.  The IA can archive a site so that you can see what that site looked (and to a lesser degree functioned) like at a certain period in time but they make no claims to be preserving the digital collections which the website is drawing from when it presents a user with the results of their search for images on women in Sudan.</p>
<p>That was key to the class project and, in a way, to the week&#8217;s course over all.  On the first day, we discussed how the OAIS model works, in that digital materials are ingested/acquired (the submission information package), then archived (the archive information package), and then made available in a different form (the dissemination package) according to the user or &#8216;designated community&#8217;.  At each stage, the information collected/presented is different.  The IA are only interested in the dissemination package, whereas as an institution, we&#8217;re interested in all three information packages, and as a preservation archive with resources invested in the creation and intellectual property rights of the digital objects, we&#8217;re especially interested in the archival information package.</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/digitalpreservation.wordpress.com/22/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/digitalpreservation.wordpress.com/22/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/digitalpreservation.wordpress.com/22/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/digitalpreservation.wordpress.com/22/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/digitalpreservation.wordpress.com/22/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/digitalpreservation.wordpress.com/22/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/digitalpreservation.wordpress.com/22/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/digitalpreservation.wordpress.com/22/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/digitalpreservation.wordpress.com/22/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/digitalpreservation.wordpress.com/22/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/digitalpreservation.wordpress.com/22/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/digitalpreservation.wordpress.com/22/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/digitalpreservation.wordpress.com/22/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/digitalpreservation.wordpress.com/22/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/digitalpreservation.wordpress.com/22/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/digitalpreservation.wordpress.com/22/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=digitalpreservation.wordpress.com&amp;blog=156049&amp;post=22&amp;subd=digitalpreservation&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://digitalpreservation.wordpress.com/2006/03/24/the-internet-archive-class-project/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/91890bc5aa9197368aeaa4f2f0fbf011?s=96&#38;d=identicon" medium="image">
			<media:title type="html">digitalpreservation</media:title>
		</media:content>
	</item>
		<item>
		<title>Access</title>
		<link>http://digitalpreservation.wordpress.com/2006/03/23/access/</link>
		<comments>http://digitalpreservation.wordpress.com/2006/03/23/access/#comments</comments>
		<pubDate>Thu, 23 Mar 2006 22:22:57 +0000</pubDate>
		<dc:creator>Joss</dc:creator>
				<category><![CDATA[Access]]></category>

		<guid isPermaLink="false">https://digitalpreservation.wordpress.com/2006/03/23/access/</guid>
		<description><![CDATA[The last class of the day before getting on with our Internet Archive project (which has been very instructive), was about providing access to archival collections. Good, common-sense advice was dished out which we all pretty much knew but were pleased to hear again: We preserve because we expect access. We must be able to [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=digitalpreservation.wordpress.com&amp;blog=156049&amp;post=21&amp;subd=digitalpreservation&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>The last class of the day before getting on with our Internet Archive project (which has been very instructive), was about providing access to archival collections.  Good, common-sense advice was dished out which we all pretty much knew but were pleased to hear again:</p>
<blockquote><p>We preserve because we expect access.<br />
We must be able to derive one from the other.<br />
We don&#8217;t have to let one dictate the other.<br />
Access needs may drive decisions at ingest e.g. on metadata.<br />
There are many ways to provide access.</p></blockquote>
<p>We were advised to &#8216;preserve enough to tell the story&#8217; and that good preservation refers to &#8216;preserving meaningful information through time&#8217;.  Quotes like that can come in handy sometimes.</p>
<p>Of course, depending on the archive&#8217;s remit, there are various reasons why we might have to provide access (<a href="http://www.opsi.gov.uk/ACTS/acts2000/20000036.htm">FOI</a>) or restrict access (<a href="http://www.opsi.gov.uk/ACTS/acts1998/19980029.htm">DPA</a>).  Fortunately only the latter applies to us right now.  There was also a brief discussion about redacting certain information before providing access, something that we do in a way with MAV&#8217;s products and transcripts for security reasons before they are put on the public database. I don&#8217;t knnow if we do it to documents, too.  Do we?</p>
<p>Finally, the OAIS standard clearly has a strong element dedicated to ensuring access.  It&#8217;s based around knowing your &#8216;designated community&#8217; which may be as broad as &#8216;people who can read english and use the internet&#8217; (The National Archives) or as narrow as &#8216;my friends and family&#8217; (imagine an online photo service like Flickr).  As usual some of the best advice was about planning ahead, being proactive abouut offering ways to access material, seperating the preservation infrastructure from the access infrastructure and collaborating with other repositories.</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/digitalpreservation.wordpress.com/21/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/digitalpreservation.wordpress.com/21/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/digitalpreservation.wordpress.com/21/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/digitalpreservation.wordpress.com/21/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/digitalpreservation.wordpress.com/21/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/digitalpreservation.wordpress.com/21/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/digitalpreservation.wordpress.com/21/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/digitalpreservation.wordpress.com/21/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/digitalpreservation.wordpress.com/21/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/digitalpreservation.wordpress.com/21/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/digitalpreservation.wordpress.com/21/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/digitalpreservation.wordpress.com/21/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/digitalpreservation.wordpress.com/21/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/digitalpreservation.wordpress.com/21/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/digitalpreservation.wordpress.com/21/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/digitalpreservation.wordpress.com/21/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=digitalpreservation.wordpress.com&amp;blog=156049&amp;post=21&amp;subd=digitalpreservation&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://digitalpreservation.wordpress.com/2006/03/23/access/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/f324c1265f6c2140e68f73e18453c7eb?s=96&#38;d=identicon" medium="image">
			<media:title type="html">josswinn</media:title>
		</media:content>
	</item>
		<item>
		<title>Legalities</title>
		<link>http://digitalpreservation.wordpress.com/2006/03/23/legalities/</link>
		<comments>http://digitalpreservation.wordpress.com/2006/03/23/legalities/#comments</comments>
		<pubDate>Thu, 23 Mar 2006 22:03:54 +0000</pubDate>
		<dc:creator>Joss</dc:creator>
				<category><![CDATA[Access]]></category>
		<category><![CDATA[Legal]]></category>
		<category><![CDATA[Risks]]></category>

		<guid isPermaLink="false">https://digitalpreservation.wordpress.com/2006/03/23/legalities/</guid>
		<description><![CDATA[This was a post-lunch crash course in Intellectual Property Rights, Copyright, Digital Rights Management, Freedom of Information Act, Data Protection Act and Legal Deposit. No depth but some useful highlights and another interesting case study from the ADS concering the mis-use of images and how they dealt with it. Basically, the ADS received a collection [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=digitalpreservation.wordpress.com&amp;blog=156049&amp;post=20&amp;subd=digitalpreservation&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>This was a post-lunch crash course in <a href="http://www.intellectual-property.gov.uk/">Intellectual Property Rights</a>, Copyright, Digital Rights Management, Freedom of Information Act, Data Protection Act and Legal Deposit.  No depth but some useful highlights and another interesting case study from the ADS concering the mis-use of images and how they dealt with it.  </p>
<p>Basically, the ADS received a collection of data, including images, from the excavation of <a href="http://ads.ahds.ac.uk/catalogue/projArch/spitalfields_var_2001/">Christ Church, Spitalfields</a>.  The <a href="http://ads.ahds.ac.uk/catalogue/projArch/spitalfields_var_2001/images.cfm">images </a>in this very interestining collection show the remains of bodies buried in the 18th century crypt. These images were found by a web site for necrophilia enthusiasts and some images were copied from the ADS site and republished on the sex-with-dead fan site.  They also provided a link through to the ADS for enthusiasts to grab more images for themselves.</p>
<p>This was their mistake, because the ADS noticed an unexpected spike in the use of its website and traced it back to the link from the other website.  It was the first time they&#8217;d had to deal with the mis-use of their digital collections and sought advice from the JISC legal team.  They were advised a six-point plan spread over 70 days.  The first was simply to contact the website, tell them they had broken the <a href="http://ads.ahds.ac.uk/catalogue/">licence </a>they agreed to on the ADS website and that they take the images down or else face further legal action.  And they did take them down. End of story.</p>
<p>This was a satisfactory result for the ADS because despite the mis-use of the images, it would have been a long and difficult legal process had the web site not taken them down.  </p>
<p>We&#8217;ve been thinking about such things for ADAM and intend to introduce a &#8216;handshake&#8217; agreement prior to the download of ADAM images.  Having seen how the ADS handle this &#8216;contract&#8217; with its users, I&#8217;m now inclined to just have users agree to a licence when they first enter an ADAM session rather than each time they click to download.  Legally it would appear to cover us.  Our present system is based on authentication into the AI Intranet and then trusting that the AI staff member will respect the terms and conditions that are displayed with each image, but we think we can do better than this with little inconvenience to users.  There will also be more changes to the way ADAM handles rights management and licence agreements.</p>
<p>The main piece of advice that the ADS gave from this example was that archives should not wait for the abuse of their content before forming a response but rather formulate a strategy for dealing with a potential incident so we can react quickly, methodically and legally.  Wayne, Claire and Tim will know more about whether we&#8217;ve had to deal with this already.  I&#8217;m not aware of such a strategy being in place though.  In late May, an IPR expert from the Open University will be giving a one-day workshop on IPR issues for AI staff, something we intend to run each year.  Having spent just an hour touching on such issues, I feel a day&#8217;s course would be well spent ensuring IS staff are informed of the risks and responsibilities involved in this area of our work.  Not least because the <a href="http://www.fipr.org/copyright/guide/">European Copyright Directive</a>, which applies to the UK, now makes breaking copyright protection a criminal offense rather than a civil offense, so theoretically someone could go to jail whereas it used to be that the individual/organisation would be fined based on the &#8216;loss&#8217; (financial, of reputation, of relationships, etc) to the rights owner.</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/digitalpreservation.wordpress.com/20/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/digitalpreservation.wordpress.com/20/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/digitalpreservation.wordpress.com/20/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/digitalpreservation.wordpress.com/20/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/digitalpreservation.wordpress.com/20/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/digitalpreservation.wordpress.com/20/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/digitalpreservation.wordpress.com/20/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/digitalpreservation.wordpress.com/20/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/digitalpreservation.wordpress.com/20/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/digitalpreservation.wordpress.com/20/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/digitalpreservation.wordpress.com/20/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/digitalpreservation.wordpress.com/20/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/digitalpreservation.wordpress.com/20/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/digitalpreservation.wordpress.com/20/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/digitalpreservation.wordpress.com/20/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/digitalpreservation.wordpress.com/20/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=digitalpreservation.wordpress.com&amp;blog=156049&amp;post=20&amp;subd=digitalpreservation&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://digitalpreservation.wordpress.com/2006/03/23/legalities/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/f324c1265f6c2140e68f73e18453c7eb?s=96&#38;d=identicon" medium="image">
			<media:title type="html">josswinn</media:title>
		</media:content>
	</item>
		<item>
		<title>Costs, risk management and business planning</title>
		<link>http://digitalpreservation.wordpress.com/2006/03/23/costs-risk-management-and-business-planning/</link>
		<comments>http://digitalpreservation.wordpress.com/2006/03/23/costs-risk-management-and-business-planning/#comments</comments>
		<pubDate>Thu, 23 Mar 2006 18:16:04 +0000</pubDate>
		<dc:creator>Joss</dc:creator>
				<category><![CDATA[Costs]]></category>
		<category><![CDATA[Risks]]></category>

		<guid isPermaLink="false">https://digitalpreservation.wordpress.com/2006/03/23/costs-risk-management-and-business-planning/</guid>
		<description><![CDATA[Not the way I would have chosen to start the day but it ended up being a useful morning discussing how to identify the organisational costs of running a digital archive and how to justify those cost and identify the benefits. We also discussed risk management, the implications of lifecycle management and how to cost [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=digitalpreservation.wordpress.com&amp;blog=156049&amp;post=19&amp;subd=digitalpreservation&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Not the way I would have chosen to start the day but it ended up being a useful morning discussing how to identify the organisational costs of running a digital archive and how to justify those cost and identify the benefits.  We also discussed risk management, the implications of lifecycle management and how to cost elements of an OAIS compliant archive.</p>
<p>We did an interesting exercise comparing the costs of running an <a href="http://arxiv.org/">e-prints archive at Cornell University</a> and The National Archive&#8217;s digital archive.  Not surprisingly, the two archive&#8217;s costs are radically different because their remit and services provided are radically different.  It costs TNA £18.76 to ingest/acquire a single file into their archive.  A huge sum compared to Cornell&#8217;s £0.56-£2.84.  This is not only because TNA&#8217;s remit is so much wider and therefore the ingest/acquisition process is much more complex, but because TNA operate in an environment where they catalogue the material themselves whereas Cornell have no catalogers but require the Professor submitting her document to provide and verify all the information/metadata.  Also, TNA have huge preservation costs because they are dealing with a legacy digital material which are 20-30 years old, when no preparation was made for long-term preservation of these materials.  Cornell on the other hand, are archiving simple, modern digital materials and their preservation activities are relatively easy and predicatable.</p>
<p>This raised a familiar and interesting question for me because we will be developing a facillity in ADAM for staff to upload images to a team catalogue and provide metadata for the image.  In an ideal world, the member of staff would provide full and accurate metadata which would require no validation and could be entered directly into ADAM and immediately available on the Intranet.  Of course, this is almost certainly impossible for AI.  It works for Cornell because the Professor has a vested and very personal interest to ensure that her article is made widely available and correctly cited through the submission of complete and accurate metadata.  Even then, an example was given where an academic catalogued their article with a single keyword representing their sole academic interest, disregarding the other subject areas which the article related to.  I asked people if they had any advice on how we could have AI staff more involved in the cataloguing process but no miracle answers were forthcoming.  Basically, while staff are essential providers of information about the digital object, supplying information only they might know, it&#8217;s an unacceptable organisational risk to then make those images directly available for other staff to reuse before AVR have checked and verified the metadata and, as is always the case, enriched it with further information.  And of course, staff might justifiably argue that they could be making better use of their time than extensively cataloguing images and checking copyright and license agreements.  There will be ways that we can ensure that the information provided to us is formed in a way that is easy to validate and enrich though and that&#8217;s the approach we&#8217;ll be taking with ADAM.</p>
<p>At one point while trying to breakdown the cost elements of a digital archive I realised that we were a room full of archivists trying to do the job that IT professionals have been doing for years.  The element costs involved in digital archiving such as hardware, software, licenses, support, development, fixtures and fittings, etc. are costs that we share with ITP.  Where IRP need to demonstrate costs is by detailing the work processes and therefore the staff time involved and the business reasons why archival preservation might require three or four times the storage requirements, a different approach to risk management, changes in data management, etc.  But, with the exception of staff time, a digital archive uses readily available IT solutions in a specific way.  I tried to make this point that we (archivists) are not the people best placed to cost IT systems but rather need to work with IT professionals and draw on their existing experience in planning, purchasing and maintaining systems.  I think that to an IT department, a digital archive is just another application of IT hardware, software and processes. Do you agree?</p>
<p>This wasn&#8217;t the first time I&#8217;ve found that archivists tend to look at a digital archive infrastructure as something new and perculiar to them and completely alien to IT professionals.  Sure, there might be different requirements that some IT staff might not be familiar with but it&#8217;s the archivist&#8217;s role to explain and justify these in business terms and in return, let the IT staff deliver the infrastructure requirements to meet the business case. It&#8217;s just data that needs to be treated a bit differently, that&#8217;s all.</p>
<p>Despite this frustration, this class had real practical value for me and was a morning well spent.</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/digitalpreservation.wordpress.com/19/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/digitalpreservation.wordpress.com/19/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/digitalpreservation.wordpress.com/19/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/digitalpreservation.wordpress.com/19/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/digitalpreservation.wordpress.com/19/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/digitalpreservation.wordpress.com/19/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/digitalpreservation.wordpress.com/19/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/digitalpreservation.wordpress.com/19/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/digitalpreservation.wordpress.com/19/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/digitalpreservation.wordpress.com/19/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/digitalpreservation.wordpress.com/19/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/digitalpreservation.wordpress.com/19/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/digitalpreservation.wordpress.com/19/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/digitalpreservation.wordpress.com/19/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/digitalpreservation.wordpress.com/19/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/digitalpreservation.wordpress.com/19/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=digitalpreservation.wordpress.com&amp;blog=156049&amp;post=19&amp;subd=digitalpreservation&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://digitalpreservation.wordpress.com/2006/03/23/costs-risk-management-and-business-planning/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/f324c1265f6c2140e68f73e18453c7eb?s=96&#38;d=identicon" medium="image">
			<media:title type="html">josswinn</media:title>
		</media:content>
	</item>
		<item>
		<title>NDAD &#8211; The National Digital Archive of Datasets.</title>
		<link>http://digitalpreservation.wordpress.com/2006/03/22/ndad-the-national-digital-archive-of-datasets/</link>
		<comments>http://digitalpreservation.wordpress.com/2006/03/22/ndad-the-national-digital-archive-of-datasets/#comments</comments>
		<pubDate>Wed, 22 Mar 2006 23:10:04 +0000</pubDate>
		<dc:creator>Joss</dc:creator>
				<category><![CDATA[OAIS]]></category>
		<category><![CDATA[Preservation]]></category>
		<category><![CDATA[Repositories]]></category>

		<guid isPermaLink="false">https://digitalpreservation.wordpress.com/2006/03/22/ndad-the-national-digital-archive-of-datasets/</guid>
		<description><![CDATA[The last session of the day before getting on with our class project about the Internet Archive was on NDAD. It&#8217;s a service that the ULCC perform for TNA by preserving UK government databases and records no longer in use. Sounds dull but of course these databases include the National Inventory of Woodland and Trees, [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=digitalpreservation.wordpress.com&amp;blog=156049&amp;post=18&amp;subd=digitalpreservation&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>The last session of the day before getting on with our class project about the Internet Archive was on <a href="http://www.ndad.nationalarchives.gov.uk/">NDAD</a>.  </p>
<p>It&#8217;s a service that the <a href="http://www.ulcc.ac.uk/">ULCC </a>perform for <a href="http://www.nationalarchives.gov.uk/">TNA  </a>by preserving UK government databases and records no longer in use.  Sounds dull but of course these <a href="http://www.ndad.nationalarchives.gov.uk/access/?sort=SERIES_TITLE">databases </a>include the <a href="http://www.ndad.nationalarchives.gov.uk/CRDA/3/quickref.html">National Inventory of Woodland and Trees</a>, a survey of <a href="http://www.ndad.nationalarchives.gov.uk/CRDA/17/quickref.html">British Bats</a>, statistics on how many <a href="http://www.ndad.nationalarchives.gov.uk/CRDA/58/quickref.html">accidents there are in the home</a>, <a href="http://www.ndad.nationalarchives.gov.uk/CRDA/1/quickref.html">crime </a>statistics and the names and assets of <a href="http://www.ndad.nationalarchives.gov.uk/CRDA/42/quickref.html">victims of Nazi persecution who were compensated by the UK government</a>.  These databases are all transferred to NDAD who migrate them into sustainable formats, document them with good metadata and make them available to search online. It sounds like a great place to work if you&#8217;re interested in the history of computing as some of these databases were the largest of their  kind at the time and represent significant historical moments in the history of computing.  They also have to deal with all the legacy software and hardware issues, data analysis, system design and digital conversion as well as the development of emulators, data recovery and so on.  Hacker heaven and it&#8217;s only a ten minute walk from AI.</p>
<p>We assessed NDAD against the OAIS standard and it does pretty well.  Since TNA essentially do the selection of the databases, NDAD have no negotiation in the Ingest stage, but together TNA and NDAD are a functioning OAIS archive. And it&#8217;s only a ten minute walk from AI! It feels a bit like saying you live only ten minutes from Buckingham Palace.</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/digitalpreservation.wordpress.com/18/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/digitalpreservation.wordpress.com/18/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/digitalpreservation.wordpress.com/18/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/digitalpreservation.wordpress.com/18/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/digitalpreservation.wordpress.com/18/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/digitalpreservation.wordpress.com/18/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/digitalpreservation.wordpress.com/18/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/digitalpreservation.wordpress.com/18/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/digitalpreservation.wordpress.com/18/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/digitalpreservation.wordpress.com/18/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/digitalpreservation.wordpress.com/18/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/digitalpreservation.wordpress.com/18/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/digitalpreservation.wordpress.com/18/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/digitalpreservation.wordpress.com/18/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/digitalpreservation.wordpress.com/18/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/digitalpreservation.wordpress.com/18/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=digitalpreservation.wordpress.com&amp;blog=156049&amp;post=18&amp;subd=digitalpreservation&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://digitalpreservation.wordpress.com/2006/03/22/ndad-the-national-digital-archive-of-datasets/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/91890bc5aa9197368aeaa4f2f0fbf011?s=96&#38;d=identicon" medium="image">
			<media:title type="html">digitalpreservation</media:title>
		</media:content>
	</item>
		<item>
		<title>Records Management and Digital Preservation</title>
		<link>http://digitalpreservation.wordpress.com/2006/03/22/records-management-and-digital-preservation/</link>
		<comments>http://digitalpreservation.wordpress.com/2006/03/22/records-management-and-digital-preservation/#comments</comments>
		<pubDate>Wed, 22 Mar 2006 23:02:18 +0000</pubDate>
		<dc:creator>Joss</dc:creator>
				<category><![CDATA[Preservation]]></category>
		<category><![CDATA[RM]]></category>

		<guid isPermaLink="false">https://digitalpreservation.wordpress.com/2006/03/23/records-management-and-digital-preservation/</guid>
		<description><![CDATA[I wondered whether the next two sessions would be a bit dry, but not so. They were both led by an Archivist from the University of London Computing Centre who, in his free time, has a Friday evening radio show on RessonanceFM. This was a brief intro to RM discussing how it fits in with [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=digitalpreservation.wordpress.com&amp;blog=156049&amp;post=17&amp;subd=digitalpreservation&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>I wondered whether the next two sessions would be a bit dry, but not so.  They were both led by an Archivist from the University of London Computing Centre who, in his free time, has a Friday evening <a href="http://www.thesoundprojector.com/">radio show</a> on RessonanceFM. </p>
<p>This was a brief intro to RM discussing how it fits in with the digital presentation process.  I&#8217;ve never been trained in RM so it was useful for me and these are the highlights of what I learned:</p>
<p>RM is the efficient control of the creation, reciept, maintenance, use, retention and disposition of records.  It&#8217;s an archival skill but overlaps with business analysis and it&#8217;s assisted by international standards such as <a href="http://online.northumbria.ac.uk/faculties/art/information_studies/Imri/rarea/rm/rmproj/rmcurr/Impl/Impl.htm">ISO 15489</a>.</p>
<p>Why do we need records? Well, they might be evidence, required for accountability, for decision-making and to record institutional &#8216;memory&#8217;.  Good records are authentic, accurate, accessible, complete and comprehensive.  They are compliant, effective and secure.  I was told that RM assists and supports an organisation&#8217;s business processes; it identifies and protects vital records, ensures legal and regulatory compliance, provides protection against litigation, and allows compliance with Freedom of Information legislation.</p>
<p>With the growth of digital records, there&#8217;s obviously been a massive quantative increase in information.  Digital records share the same issues as paper records such as acquisition, preservation, storage and retrieval but also present additional challenges.  Digital records are characterised by being easy to create, copy, share, modify and store in multiple locations. They can be complex, transient, vulnerable, software and hardware dependent.  </p>
<p>Good digital records management is an underlying framework to good digital curation.  </p>
<p>A sound migration plan is essential to good digital RM and inherent in the preservation planning recommended by OAIS.  A migration plan is an essential part of ensuring that formats are retained and readable throughout their lifecycle.  </p>
<p>Of course, digital records have metadata which also needs to be managed and assists with the authenication of a record. I was told that good electronic records management policy should cover the creation and capture of all corporate records within the RM system.  It should cover the design and management of indexing and naming schemes.  It should offer policy on the automated management of metadata, for retrieval and retention.  It should ensure that records are &#8216;locked down&#8217; to ensure their integrity and security.  It should also provide guidance on the retention, preservation and destruction of digital records.</p>
<p>We discussed which type of records are selected for management: vital records needed to sustain the organisation&#8217;s business; records essential for legal compliance; and records with mid to long-term administrative value. These kind of things should form the basis of a selection policy.<br />
It all sounds like archiving to me with the exception that there&#8217;s more destruction in RM. I do see how RM is more business focused though and the issues of preservation are not always so difficult when you might be retaining records for shorter periods of time.  Still, there&#8217;s no reason why RM shouldn&#8217;t fit into an OAIS environment.  The main characteristics of selection, validation, fixity, preservation planning, metadata standards and retrieval/access are clearly very simmilar.  Perhaps Fiona or Lynda can explain more to me when I get back.</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/digitalpreservation.wordpress.com/17/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/digitalpreservation.wordpress.com/17/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/digitalpreservation.wordpress.com/17/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/digitalpreservation.wordpress.com/17/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/digitalpreservation.wordpress.com/17/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/digitalpreservation.wordpress.com/17/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/digitalpreservation.wordpress.com/17/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/digitalpreservation.wordpress.com/17/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/digitalpreservation.wordpress.com/17/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/digitalpreservation.wordpress.com/17/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/digitalpreservation.wordpress.com/17/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/digitalpreservation.wordpress.com/17/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/digitalpreservation.wordpress.com/17/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/digitalpreservation.wordpress.com/17/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/digitalpreservation.wordpress.com/17/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/digitalpreservation.wordpress.com/17/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=digitalpreservation.wordpress.com&amp;blog=156049&amp;post=17&amp;subd=digitalpreservation&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://digitalpreservation.wordpress.com/2006/03/22/records-management-and-digital-preservation/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/91890bc5aa9197368aeaa4f2f0fbf011?s=96&#38;d=identicon" medium="image">
			<media:title type="html">digitalpreservation</media:title>
		</media:content>
	</item>
		<item>
		<title>A whole hour discussing file formats!</title>
		<link>http://digitalpreservation.wordpress.com/2006/03/22/a-whole-hour-discussing-file-formats/</link>
		<comments>http://digitalpreservation.wordpress.com/2006/03/22/a-whole-hour-discussing-file-formats/#comments</comments>
		<pubDate>Wed, 22 Mar 2006 21:46:48 +0000</pubDate>
		<dc:creator>Joss</dc:creator>
				<category><![CDATA[Access]]></category>
		<category><![CDATA[Madness]]></category>
		<category><![CDATA[Preservation]]></category>

		<guid isPermaLink="false">https://digitalpreservation.wordpress.com/2006/03/22/a-whole-hour-discussing-file-formats/</guid>
		<description><![CDATA[I departed from earth this afternoon. I&#8217;m not sure where I went but this session on file formats and then a further session on digital records management took me places I never thought I&#8217;d go. The title of this class was &#8216;File Formats: Matters to Consider&#8217;, and I found it fascinating. First, we were shown [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=digitalpreservation.wordpress.com&amp;blog=156049&amp;post=16&amp;subd=digitalpreservation&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>I departed from earth this afternoon. I&#8217;m not sure where I went but this session on file formats and then a further session on digital records management took me places I never thought I&#8217;d go.</p>
<p>The title of this class was &#8216;File Formats: Matters to Consider&#8217;, and I found it fascinating.</p>
<p>First, we were shown where file formats fit in the hierarchy of the IT system:</p>
<p>Semantic Layer<br />
Actions Layer<br />
Format Layer (Alright!)<br />
Filesystem Layer<br />
Media Layer</p>
<p>Then an anecdote about how some file formats and their creating applications are better used for some tasks and not for others.  The tutor knew someone who <strong>wrote a novel in Excel</strong> because he didn&#8217;t have any other software to hand and I guess curiosity didn&#8217;t get the better of him either.</p>
<p>We did a quick exercise in what features to look for in a file format for preservation purposes.  Not too difficult:</p>
<p>Open, documented, widely used and therefore supported, interoperable over different Operating Systems, lossless/no compression, metadata support, etc. etc.</p>
<p>Another anecdote was that ten years ago, two men wrote a <a href="http://www.amazon.com/gp/product/1565921615/102-1365634-0615318">book</a> detailing over 3000 graphic file formats.  As the number of formats grew, it was revised and issued on a CD-ROM.  Now it&#8217;s updated on the web.  I&#8217;m sure Tim would love it.</p>
<p>I&#8217;ll state this here:  ADAM handles two graphic file formats for a reason. They are both open, documented, widely used, well supported, interoperable and have metadata support.  The list of supported graphic file formats may double or triple over time, but 3000+ demonstrates what  an industry digital archives are having to deal with.</p>
<p>If you want guidance on file formats (and who doesn&#8217;t?), then look no further than these fine institutions:</p>
<p><a href="http://ahds.ac.uk/depositing/deposit-formats.htm">AHDS</a><br />
<a href="http://www.fcla.edu/digitalArchive/daInfo.htm">FCLA Digital Archive</a><br />
<a href="http://preserve.harvard.edu/resources/imageformats.html">Harvard University formats registry</a><br />
<a href="http://www.prestospace.org">PRESTOSPACE</a><br />
<a href="http://www.erpanet.org/events/2004/vienna/index.php">ERPAnet file formats<br />
</a><a href="http://www.digitalpreservation.gov/formats/intro/intro.shtml">Library of Congress</a> (my favourite).</p>
<p>We finished up by looking at the conversion of file formats, something which presents problems when you want to preserve the original integrity of the file&#8217;s content but in a more suitable or non-obsolete file format.</p>
<p>I could go on about file formats but let&#8217;s face it, we&#8217;ve both had enough for one day.  Let&#8217;s talk at length in the &#8216;breakout&#8217; area when I get back, OK?</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/digitalpreservation.wordpress.com/16/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/digitalpreservation.wordpress.com/16/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/digitalpreservation.wordpress.com/16/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/digitalpreservation.wordpress.com/16/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/digitalpreservation.wordpress.com/16/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/digitalpreservation.wordpress.com/16/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/digitalpreservation.wordpress.com/16/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/digitalpreservation.wordpress.com/16/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/digitalpreservation.wordpress.com/16/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/digitalpreservation.wordpress.com/16/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/digitalpreservation.wordpress.com/16/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/digitalpreservation.wordpress.com/16/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/digitalpreservation.wordpress.com/16/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/digitalpreservation.wordpress.com/16/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/digitalpreservation.wordpress.com/16/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/digitalpreservation.wordpress.com/16/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=digitalpreservation.wordpress.com&amp;blog=156049&amp;post=16&amp;subd=digitalpreservation&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://digitalpreservation.wordpress.com/2006/03/22/a-whole-hour-discussing-file-formats/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/91890bc5aa9197368aeaa4f2f0fbf011?s=96&#38;d=identicon" medium="image">
			<media:title type="html">digitalpreservation</media:title>
		</media:content>
	</item>
		<item>
		<title>Institutional Repositories</title>
		<link>http://digitalpreservation.wordpress.com/2006/03/22/institutional-repositories/</link>
		<comments>http://digitalpreservation.wordpress.com/2006/03/22/institutional-repositories/#comments</comments>
		<pubDate>Wed, 22 Mar 2006 21:16:02 +0000</pubDate>
		<dc:creator>Joss</dc:creator>
				<category><![CDATA[Access]]></category>
		<category><![CDATA[Preservation]]></category>
		<category><![CDATA[Repositories]]></category>
		<category><![CDATA[Tools]]></category>

		<guid isPermaLink="false">https://digitalpreservation.wordpress.com/2006/03/22/institutional-repositories/</guid>
		<description><![CDATA[Maybe I should think up snappier title headings to these blogs. Believe me, occasionally I&#8217;m sitting in the class wondering how the hell I got here. Though I should say that the quality of the training programme so far has been very high and I&#8217;m finding it very engaging. The tutors are decent, down-to-earth people [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=digitalpreservation.wordpress.com&amp;blog=156049&amp;post=15&amp;subd=digitalpreservation&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Maybe I should think up snappier title headings to these blogs.  Believe me, occasionally I&#8217;m sitting in the class wondering how the hell I got here. Though I should say that the quality of the training programme so far has been very high and I&#8217;m finding it very engaging.  The tutors are decent, down-to-earth people with practical advice. In other news, the stats for this blog suggest that most of ITP and IRP looked at it yesterday.  Tomorrow&#8217;s stats should be interesting&#8230; <img src='http://s1.wp.com/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> </p>
<p>Basically, this was a discussion on <a href="http://www.dspace.org/">DSpace </a>and the <a href="http://www.oclc.org/">OCLC</a>.  <a href="http://fedora.info/">Fedora </a>was mentioned but only briefly. That&#8217;s OK, because Fiona, Damon and I attended a conference on Fedora last year.  Damon&#8217;s an expert so ask him all the questions about Fedora&#8230; The implementation of a &#8216;trusted repository&#8217; is central to digital archiving and the two main course documents are the OAIS standard and the follow up document, <a href="http://www.rlg.org/legacy/longterm/repositories.pdf">Trusted Digital Repositories</a>. The TDR document basically goes through all the attributes and responsibilities that an OAIS compliant have. The report defines a TDR as:</p>
<blockquote><p>
A trusted digital repository is one whose mission is to provide reliable, long-term access to managed digital resources to its designated community, now and in the future. </p></blockquote>
<p>It&#8217;s a useful document for testing how well your institution is doing.</p>
<p>DSpace is a repository system that&#8217;s been developed at MIT.  It&#8217;s very popular (OK, so that&#8217;s a relative term&#8230;) in the USA and some UK institutions use it too.  From what I could see, it provides a customisable &#8216;repository out of the box&#8217; and shares some functionality with a Content Management System.</p>
<p>DSpace has three preservation service levels, providing functional preservation through &#8216;supported&#8217; (1) and &#8216;recognised&#8217; (2) file formats and bit-level (3) preservation.  I don&#8217;t think it is &#8216;OAIS compliant&#8217; but clearly it follows the basic OAIS functional model of Ingest of Submission Information Packages, creation of Archival Information Packages and the creation of Dissemination Information Packages.  The example we were shown worked very well for the submission and archiving of a document by an academic writer.  From the Fedora conference we attended, I&#8217;d got the impression DSpace was a bit crap, but there&#8217;s some competition between the two systems so that shouldn&#8217;t be surprising.  Fedora is a different animal really as it provides a suite of repository services which developers are expected to work with while DSpace is useable out of the box by people without programming skills.</p>
<p>OCLC, The Online Computer Library Centre provides a repository service for other institutions so is really an out-sourcing solution. Accessible over the web with OAIS-like functionality but, of course, still requires that you prepare your collection for Ingest.  </p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/digitalpreservation.wordpress.com/15/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/digitalpreservation.wordpress.com/15/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/digitalpreservation.wordpress.com/15/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/digitalpreservation.wordpress.com/15/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/digitalpreservation.wordpress.com/15/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/digitalpreservation.wordpress.com/15/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/digitalpreservation.wordpress.com/15/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/digitalpreservation.wordpress.com/15/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/digitalpreservation.wordpress.com/15/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/digitalpreservation.wordpress.com/15/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/digitalpreservation.wordpress.com/15/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/digitalpreservation.wordpress.com/15/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/digitalpreservation.wordpress.com/15/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/digitalpreservation.wordpress.com/15/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/digitalpreservation.wordpress.com/15/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/digitalpreservation.wordpress.com/15/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=digitalpreservation.wordpress.com&amp;blog=156049&amp;post=15&amp;subd=digitalpreservation&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://digitalpreservation.wordpress.com/2006/03/22/institutional-repositories/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/91890bc5aa9197368aeaa4f2f0fbf011?s=96&#38;d=identicon" medium="image">
			<media:title type="html">digitalpreservation</media:title>
		</media:content>
	</item>
		<item>
		<title>Initiatives &amp; Tools</title>
		<link>http://digitalpreservation.wordpress.com/2006/03/22/initiatives-tools/</link>
		<comments>http://digitalpreservation.wordpress.com/2006/03/22/initiatives-tools/#comments</comments>
		<pubDate>Wed, 22 Mar 2006 20:40:31 +0000</pubDate>
		<dc:creator>Joss</dc:creator>
				<category><![CDATA[Tools]]></category>

		<guid isPermaLink="false">https://digitalpreservation.wordpress.com/2006/03/22/initiatives-tools/</guid>
		<description><![CDATA[Before lunch, we discussed a number of initiatives and tools that are emerging in digital preservation. I&#8217;m just going to list a few right now because there are so many. UK Initiatives JISC - Funding body AHDS - Arts and Humanities Data Service DPC - Digital Preservation Coalition (course organisers) DCC - Digital Curation Centre. [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=digitalpreservation.wordpress.com&amp;blog=156049&amp;post=14&amp;subd=digitalpreservation&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Before lunch, we discussed a number of initiatives and tools that are emerging in digital preservation.  I&#8217;m just going to list a few right now because there are so many. </p>
<p><strong>UK Initiatives</strong></p>
<p><a href="http://www.jisc.ac.uk/">JISC </a>- Funding body<br />
<a href="http://ahds.ac.uk/">AHDS </a>- Arts and Humanities Data Service<br />
<a href="http://www.dpconline.org/graphics/">DPC </a>- Digital Preservation Coalition (course organisers)<br />
<a href="http://www.dcc.ac.uk/">DCC </a>- Digital Curation Centre. Courses, conferences, online forum, specifically interested in curatorial issues.<br />
<a href="http://www.ukoln.ac.uk/">UKOLN </a>- Advisory service<br />
<a href="http://www.data-archive.ac.uk/">UKDA </a>- UK Data Archive<br />
<a href="http://www.ndad.nationalarchives.gov.uk/">NDAD </a>- The National Digital Archive of Datasets<br />
<a href="http://www.webarchive.org.uk/">UKWAC </a>- UK Web Archving Consortium<br />
<a href="http://www.nationalarchives.gov.uk/">TNA </a>- The National Archives</p>
<p><strong>Non-UK Initiatives</strong></p>
<p><a href="http://www.archives.gov/era/">NARA </a>- Electronic Records Archive (USA) Project with $300m funding<br />
<a href="http://www.naa.gov.au/recordkeeping/preservation/digital/summary.html">National Archives of Australia Digital Preservation Service</a><br />
<a href="http://www.nla.gov.au/padi/">PADI</a> &#8211; National Library of Australia&#8217;s subject gateway to international digital preservation resources<br />
<a href="http://www.knaw.nl/ecpa/">ECPA</a> &#8211; The European Commission on Preservation and Access<br />
<a href="http://www.rlg.org/">RLG</a> &#8211; Research Libraries Group<br />
<a href="http://digitalpreservation.gov/">Library of Congress Digital Preservation</a><br />
<a href="http://www.erpanet.org/">ERPANET</a> &#8211; Electronic Resource Preservation and Access Network</p>
<p><strong>Tools</strong> (these were all demonstrated and very interesting &#8211; I think we could use one or two of them.  And they are all open source and written in Java which Merlin&#8217;s Team have expertise in).</p>
<p>PRONOM/DROID &#8211; <a href="http://www.nationalarchives.gov.uk/pronom/">PRONOM </a>is a file format registry at The National Archives.  <a href="http://www.nationalarchives.gov.uk/aboutapps/pronom/droid.htm">DROID </a>is a tool which works in conjunction with PRONOM.<br />
<a href="http://hul.harvard.edu/jhove/">JHOVE </a>- Validates digital files and extracts preservation metadata. Very cool. Really!<br />
<a href="http://www.natlib.govt.nz/en/whatsnew/4initiatives.html#extraction">National Library of New Zealand Metadata Extraction Tool</a><br />
<a href="http://xena.sourceforge.net/">XENA</a> &#8211; XML Electronic Normalisinig of Archives</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/digitalpreservation.wordpress.com/14/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/digitalpreservation.wordpress.com/14/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/digitalpreservation.wordpress.com/14/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/digitalpreservation.wordpress.com/14/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/digitalpreservation.wordpress.com/14/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/digitalpreservation.wordpress.com/14/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/digitalpreservation.wordpress.com/14/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/digitalpreservation.wordpress.com/14/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/digitalpreservation.wordpress.com/14/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/digitalpreservation.wordpress.com/14/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/digitalpreservation.wordpress.com/14/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/digitalpreservation.wordpress.com/14/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/digitalpreservation.wordpress.com/14/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/digitalpreservation.wordpress.com/14/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/digitalpreservation.wordpress.com/14/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/digitalpreservation.wordpress.com/14/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=digitalpreservation.wordpress.com&amp;blog=156049&amp;post=14&amp;subd=digitalpreservation&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://digitalpreservation.wordpress.com/2006/03/22/initiatives-tools/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/91890bc5aa9197368aeaa4f2f0fbf011?s=96&#38;d=identicon" medium="image">
			<media:title type="html">digitalpreservation</media:title>
		</media:content>
	</item>
	</channel>
</rss>
