Xena (software)

Source: Wikipedia, the free encyclopedia.

Xena is open-source software for use in digital preservation. Xena is short for XML Electronic Normalising for Archives.

Xena is a Java application that was developed by the National Archives of Australia. It is available free of charge under the GNU General Public License.

Version 6.1.0 was released 31 July 2013. Source code and binaries for Linux, OS X and Windows are available from SourceForge. However, as of 2018, it is no longer maintained or supported.

Mode of operation

Xena attempts to avoid

PNG. If the file format is not supported or the Binary Normalisation option is selected, Xena will perform ASCII Base64
encoding on binary files and wrap the output in XML metadata. The resulting .xena file is plain text, although the content of the data itself is not directly human-readable. The exact original file can be retrieved by stripping the metadata and reversing the Base64 encoding, using an internal viewer.

Features

Platforms supported by Xena are

Mac OS X
.

Xena uses a series of plugins to identify file formats and convert them to an appropriate openly specified format.

Xena has an

application programming interface
which allows any reasonably skilled Java developer to develop a plugin to cover a new file type.

Xena can process individual files or whole directories. When processing a whole directory, it can preserve the original directory structure of the converted records.

Xena can create plain text versions of file formats such as

PDF, with the use of Tesseract (software)
.

The Xena interface or Xena Viewer can be used to view or export a Xena file (extension .xena) in its target file format. These files contain the normalised file as well as any extra information relevant to the normalisation process. The Xena Viewer supports bulk export of Xena files to target file formats.

Xena can be used via its

command line
.

For Xena to be fully functional, it requires a local installation of the following external software:

  • LibreOffice suite - to convert office documents to OpenDocument format
  • Tesseract - to create plain text versions of file formats
  • PNG
  • Readpst - to convert Microsoft Outlook PST files to XML. Readpst is part of the free and open source libpst software suite.
  • FLAC
    - to convert audio files to FLAC format. This is also required to play back audio files using Xena.

Supported file types

Xena will recognize and process the file types listed below, plus a few others of minor importance. Unsupported file types will automatically undergo binary normalization.

Office file formats:

Graphics:

Archive Files:

  • Files are extracted from
    WAR
    , Mac binary) and normalised into a separate Xena file. A Xena index file is created, which when opened in the internal Xena viewer will display the files in a table.

Audio files:

  • FLAC
    files.

Databases:

  • SQL files are processed as plain text wrapped in XML

Other file types:

  • HTML is converted to XHTML
  • TXT text files are stored as plain text wrapped in XML; CSS files are stored as plain text wrapped in XML

Reviews

An April 22, 2010 review in Practical e-Records rated Xena at 82/100 points. At present Xena has no target preservation format for video files.[1]

References

  1. ^ "Review of XENA Normalization Software". 2010-04-22. Archived from the original on 2012-07-08.

External links