File:Automated metadata extraction (IA automatedmetadat109454057).pdf

This is a file from the Wikimedia Commons
Source: Wikipedia, the free encyclopedia.

Original file(1,275 × 1,650 pixels, file size: 315 KB, MIME type: application/pdf, 84 pages)

Summary

Automated metadata extraction
Author
Migletz, James J.
Title
Automated metadata extraction
Publisher
Monterey, California. Naval Postgraduate School
Description

Metadata is data that describes data. Metadata has many uses in computer forensics, and the ability to extract it automatically benefits forensic investigations. This thesis presents a new technique for batch processing disk images and automatically extracting metadata from files and file contents. The technique is embodied in a program called fiwalk, whose plug-in architecture allows new metadata extractors to be readily incorporated. Output from fiwalk can be provided in multiple formats, such as ARFF and text. The plug-ins created for this thesis include one created by Simson Garfinkel for extracting metadata from .jpeg files, two for Microsoft Office documents (one for releases prior to Office 2007 and one for Office 2007), and a default plug-in for extracting metadata from .gif, .pdf, and .mp3 files. To better understand the metadata available in common file formats such as .doc, .docx, .odt, .pdf, .mp3, .mp4, .jpeg, .tiff, and .gif, an examination of these formats is provided.
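
The plug-in idea described above can be illustrated with a minimal sketch. This is not fiwalk's actual interface: the names `EXTRACTORS`, `register`, `extract`, and `to_text` are hypothetical, and the sketch reads ordinary files rather than disk images. It only shows how extractors keyed by file extension might be registered and how their results could be emitted as plain text.

```python
from pathlib import Path
from typing import Callable, Dict

# Hypothetical registry (not fiwalk's API): maps a lowercase file
# extension to a function that returns metadata for that file type.
Extractor = Callable[[Path], Dict[str, str]]
EXTRACTORS: Dict[str, Extractor] = {}


def register(*extensions: str) -> Callable[[Extractor], Extractor]:
    """Decorator that registers an extractor for one or more extensions."""
    def wrap(func: Extractor) -> Extractor:
        for ext in extensions:
            EXTRACTORS[ext.lower()] = func
        return func
    return wrap


@register(".gif")
def gif_metadata(path: Path) -> Dict[str, str]:
    """Read the GIF version and logical screen size from the file header."""
    with path.open("rb") as fh:
        header = fh.read(10)
    if len(header) < 10 or header[:6] not in (b"GIF87a", b"GIF89a"):
        return {}  # not a recognizable GIF header
    return {
        "version": header[3:6].decode("ascii"),
        "width": str(int.from_bytes(header[6:8], "little")),
        "height": str(int.from_bytes(header[8:10], "little")),
    }


def extract(path: Path) -> Dict[str, str]:
    """Collect generic file metadata, then add plug-in output if one matches."""
    meta = {"filename": path.name, "size": str(path.stat().st_size)}
    extractor = EXTRACTORS.get(path.suffix.lower())
    if extractor is not None:
        meta.update(extractor(path))
    return meta


def to_text(meta: Dict[str, str]) -> str:
    """Render metadata as simple 'name: value' lines."""
    return "\n".join(f"{key}: {value}" for key, value in meta.items())
```

Under these assumptions, supporting a new format only requires writing another function and decorating it with `register`; the dispatch code in `extract` stays unchanged.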


Subjects: Metadata; Data mining
Language
English
Publication date
June 2008
Current location
IA Collections: navalpostgraduateschoollibrary; fedlink
Accession number
automatedmetadat109454057
Source
Internet Archive identifier: automatedmetadat109454057
https://archive.org/download/automatedmetadat109454057/automatedmetadat109454057.pdf
Permission
Approved for public release, distribution unlimited

Licensing

Public domain
This work is in the public domain in the United States because it is a work prepared by an officer or employee of the United States Government as part of that person’s official duties under the terms of Title 17, Chapter 1, Section 105 of the US Code. Note: This only applies to original works of the Federal Government and not to the work of any individual U.S. state, territory, commonwealth, county, municipality, or any other subdivision. This template also does not apply to postage stamp designs published by the United States Postal Service since 1978. (See § 313.6(C)(1) of Compendium of U.S. Copyright Office Practices). It also does not apply to certain US coins; see The US Mint Terms of Use.

Items portrayed in this file

MIME type
application/pdf
Checksum (SHA-1)
b996cecc9b1a7f4323c20848ce2221ce2c011dc1
Data size
322,761 bytes
Height
1,650 pixels
Width
1,275 pixels

File history

Date/Time: 22:08, 14 July 2020 (current)
Dimensions: 1,275 × 1,650, 84 pages (315 KB)
Comment: FEDLINK - United States Federal Collection automatedmetadat109454057 (User talk:Fæ/IA books#Fork8) (batch 1993-2020 #8754)