{"id":795,"date":"2010-01-21T16:56:49","date_gmt":"2010-01-21T15:56:49","guid":{"rendered":"http:\/\/blog.soton.ac.uk\/keepit\/?p=795"},"modified":"2010-01-21T16:56:49","modified_gmt":"2010-01-21T15:56:49","slug":"preservation-file-formats-report-progresses-the-field","status":"publish","type":"post","link":"https:\/\/generic.wordpress.soton.ac.uk\/test-media\/2010\/01\/21\/preservation-file-formats-report-progresses-the-field\/","title":{"rendered":"Preservation file formats report progresses the field"},"content":{"rendered":"<p>File formats are a critical feature in digital preservation. This is hardly news to specialists, but in their <a title=\"Tag Archives: exemplar profiles, Diary, various entries\" href=\"http:\/\/blog.soton.ac.uk\/keepit\/tag\/exemplar-profiles\/\" target=\"_self\">objectives<\/a> the repository managers of our exemplars also expressed an interest in file formats, so I was interested\u00a0to discover what the recent report on file formats\u00a0from the Digital Preservation Coalition (DPC) might offer them.<\/p>\n<p><a href=\"http:\/\/www.dpconline.org\/\"><img loading=\"lazy\" decoding=\"async\" class=\"alignright size-full wp-image-798\" src=\"http:\/\/generic.wordpress.soton.ac.uk\/test-media\/wp-content\/blogs.dir\/sites\/52\/2010\/01\/dpc_logo.png\" alt=\"Digital Preservation Coalition logo\" width=\"200\" height=\"44\" \/><\/a> Malcolm Todd (The National Archives),\u00a0<a href=\"http:\/\/www.dpconline.org\/newsroom\/file-formats-for-preservation-technology-watch-report.html\" target=\"_self\">File Formats for Preservation<\/a>, DPC Technology Watch Report Series,\u00a0Report 09-02, 2 December 2009<\/p>\n<p>This is a well-researched, wide-ranging and deeply considered critical review of recent work on file formats.\u00a0The report is intended &#8216;to assist repository managers and the preservation community&#8217;. My impression is that it may work better for the latter than the former. 
&#8216;Repositories&#8217; is used as a generic term; institutional repositories are mentioned by reference only.<\/p>\n<p>In KeepIt our approach aims to be practical, joining up a series of tools for the workflow of file format management. This is based on an approach elaborated as <a title=\"Brown, Developing Practical Approaches to Active Preservation, IJDC, V2N1, 2007\" href=\"http:\/\/www.ijdc.net\/index.php\/ijdc\/article\/view\/37\" target=\"_self\">active preservation<\/a> at the National Archives, but not directly mentioned in the report. &#8220;The development and use of tools developed within the digital preservation community has a mostly separate literature from that of defining and implementing selection criteria.&#8221; (section 5)<\/p>\n<p>The report&#8217;s summary says &#8220;At the time of writing, there is apparent consensus on five main criteria for file format selection.&#8221; It goes on to list the criteria, but work here has already progressed beyond this. The <a title=\"Tarrant et al., Where the Semantic Web and Web 2.0 meet format risk management: P2 registry, ECS EPrints, 12 June 2009\" href=\"http:\/\/eprints.ecs.soton.ac.uk\/17556\/\" target=\"_self\">P2 registry<\/a> that this project described at <a title=\"iPres 2009 Conference Program\" href=\"http:\/\/www.cdlib.org\/services\/uc3\/iPres\/confsched.html\" target=\"_self\">iPres 2009<\/a> links hundreds of criteria for different formats.<\/p>\n<p>The report continues: &#8220;The main finding of this report is to support the proposal by Rog and van Wijk of the National Library of the Netherlands (2008) that such criteria should be used as a tool to work out the detailed implementation of a clear preservation strategy according to a prioritisation <em>appropriate to the repository<\/em>. 
This is essential to make sense of an otherwise bewildering array of considerations and provides key governance to ensure a preservation institution is managing the risk of obsolescence to its holdings.&#8221;<\/p>\n<p>In other words, there has to be a way of connecting the format information with the repository requirements. This is being done in KeepIt by integrating the P2 registry with a planning tool, <a title=\"Preservation planning depends on repository context, Diary, November 20, 2009 \" href=\"http:\/\/blog.soton.ac.uk\/keepit\/2009\/11\/20\/preservation-planning-depends-on-repository-context\/\" target=\"_self\">Plato<\/a>, developed by Andreas Rauber and colleagues at Vienna University of Technology for the Planets project. It&#8217;s this joined-up approach that has been conspicuously lacking from preservation file format management workflows so far &#8211; &#8220;Some of the current literature appears to minimise this (interdependence)&#8221;. Although a work in progress, this potentially ground-breaking approach will be the focal point of our ongoing\u00a0<a title=\"Digital preservation tools for repository managers, Diary, December 18, 2009 \" href=\"http:\/\/blog.soton.ac.uk\/keepit\/2009\/12\/18\/digital-preservation-training-for-repository-managers\/\" target=\"_self\">KeepIt course<\/a> on digital preservation tools (see module 4).<\/p>\n<p>&#8220;Integrating the ability of formats to represent information content into scoring criteria seems some way off except for very simple digital objects&#8221;, but <a title=\"The P2 Registry, presentation, iPres 2009\" href=\"http:\/\/eprints.ecs.soton.ac.uk\/17556\/6\/iPres2009.pdf\" target=\"_self\">not as far off as it may seem<\/a> (see slides 23, 24).<\/p>\n<p>My initial thought on the report was that it would be useful only to the extent that the work reviewed might be considered useful, but on detailed reading I have reappraised that view. 
The report&#8217;s effect is to progress the work it reports on, by bringing new insights and making little-noted connections explicit. This is probably the most complex aspect of the investigation, but it is worth the effort.<\/p>\n<p>Effectively, it&#8217;s describing the foundations for work that has already moved forward significantly: &#8220;this topic has progressed rapidly in the last decade. This research has improved considerably our understanding of effective format management strategies \u2013 even if the proliferation of initiatives and tools seems at first to render it less accessible.&#8221; I think it is fair to say this work is even further advanced than this report recognised.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>File formats are a critical feature in digital preservation. This is hardly news to specialists, but in their objectives the repository managers of our exemplars also expressed an interest in file formats, so I was interested\u00a0to discover what the recent report on file formats\u00a0from the Digital Preservation Coalition (DPC) might 
&hellip;<\/p>\n","protected":false},"author":869,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_et_pb_use_builder":"","_et_pb_old_content":"","_et_gb_content_width":"","footnotes":""},"categories":[4],"tags":[284],"class_list":["post-795","post","type-post","status-publish","format-standard","hentry","category-uncategorized","tag-file-formats"],"jetpack_featured_media_url":"","_links":{"self":[{"href":"https:\/\/generic.wordpress.soton.ac.uk\/test-media\/wp-json\/wp\/v2\/posts\/795","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/generic.wordpress.soton.ac.uk\/test-media\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/generic.wordpress.soton.ac.uk\/test-media\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/generic.wordpress.soton.ac.uk\/test-media\/wp-json\/wp\/v2\/users\/869"}],"replies":[{"embeddable":true,"href":"https:\/\/generic.wordpress.soton.ac.uk\/test-media\/wp-json\/wp\/v2\/comments?post=795"}],"version-history":[{"count":0,"href":"https:\/\/generic.wordpress.soton.ac.uk\/test-media\/wp-json\/wp\/v2\/posts\/795\/revisions"}],"wp:attachment":[{"href":"https:\/\/generic.wordpress.soton.ac.uk\/test-media\/wp-json\/wp\/v2\/media?parent=795"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/generic.wordpress.soton.ac.uk\/test-media\/wp-json\/wp\/v2\/categories?post=795"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/generic.wordpress.soton.ac.uk\/test-media\/wp-json\/wp\/v2\/tags?post=795"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}