Richard's Blog

Archivist on the digital archives team at State Records NSW

The latest version of siegfried, 1.5.0, has just been released. The big change is support for a second identifier type, freedesktop.org’s Shared MIME-Info specification.

By Richard, posted in Richard's Blog

13th Mar 2016  5:42 AM  2555 Reads  No comments

At my workplace, we write a lot of small scripts to encode preservation workflows. These scripts pipeline simple actions like munging metadata, moving files about, and calling other tools such as Tika and ImageMagick. Often these actions are conditional on the format of the file being processed: for example, we only want to run Tika over the formats for […]

By Richard, posted in Richard's Blog

18th Feb 2016  3:20 AM  3084 Reads  No comments

Siegfried v 1.0 is now available at http://www.itforarchivists.com/siegfried. Siegfried is a PRONOM-based file format identification tool. Key features are: complete implementation of PRONOM (byte and container signatures) reliable results (siegfried is tested against Ross Spencer’s skeleton suite and QA tested against DROID and FIDO output using http://github.com/richardlehane/comparator) fast matching without limiting the number of bytes […]

By Richard, posted in Richard's Blog

25th Mar 2015  5:02 AM  3332 Reads  No comments

A little while ago I introduced a new file format identification tool, siegfried, on this blog. Since then, I’ve been refining the tool, with a goal of a 1.0 release early next year (probably around March). I hope you don’t mind the occasional update about my progress on this blog. One of my goals with […]

By Richard, posted in Richard's Blog

12th Dec 2014  12:34 PM  2552 Reads  No comments

Ok. I know what you’re thinking. Do we really need another PRONOM-based, file format identification tool? A year or so I might have said “no” myself. In DROID and FIDO, we are already blessed with two brilliant tools. In my workplace, we’re very happy users of DROID. We trust it as the reference implementation of […]

By Richard, posted in Richard's Blog

27th Sep 2014  7:52 AM  15454 Reads  8 Comments