Single-instance storage
This article needs additional citations for verification. (October 2007) |
Single-instance storage (SIS) is a system's ability to take multiple copies of content and replace them by a single shared copy. It is a means to eliminate data duplication and to increase efficiency. SIS is frequently implemented in
Concept
In the case of an
When used in conjunction with backup software, single-instance storage can reduce the quantity of archive media required since it avoids storing duplicate copies of the same file. Often identical files are installed on multiple computers, for example operating system files. With single-instance storage, only one copy of a file is written to the backup media therefore reducing space. This becomes more important when the storage is offsite and on cloud storage such as Amazon S3. In such cases, it has been reported that deduplication can help reduce the costs of storage, costs of bandwidth and backup windows by up to 10:1.[2]
ISO CD/DVD image files can be optimized to use SIS to reduce the size of a CD/DVD compilation (if there are enough duplicated files) to make it fit into smaller media.
SIS is related to system wide file duplication search and multiple file instance detection tools such as the P2P application
Microsoft
SIS was introduced with the
The file-based Windows Imaging Format introduced in Windows Vista also supported single-instance storage. Single-instance storage was a feature of Microsoft Exchange Server since version 4.0 and is also present in Microsoft's Windows Home Server. It is deduplicating attachments only in Exchange 2007 and was dropped completely in Microsoft Exchange Server 2010.[5] Microsoft announced Windows Storage Server 2008 (WSS2008)[6] with Single Instance Storage on June 1, 2009, and states this feature is not available on Windows Server 2008.[6]
The feature is officially deprecated since Windows Server 2012, when a new, more powerful chunk-based data deduplication mechanism was introduced. It allows files with similar content to be deduplicated as long as they have stretches of identical data. This mechanism is more powerful than SIS.[7] Since Windows Server 2019, the feature is fully supported on ReFS.[8]
See also
References
- ^ Explaining deduplication rates and single-instance storage to clients. George Crump, Storage Switzerland
- ^ Deduplication + Amazon S3 will save you time and money. White Paper: Published June 2008
- ^ a b Douceur, John (JD); Goebel, David; Corbin, Scott; Bolosky, Bill (August 2000). "Single Instance Storage in Windows 2000" (PDF). Microsoft Research. Microsoft Research and Balder Technology Group.
- ^ Single Instance Storage in Microsoft Windows Storage Server 2003 R2 Archived 2007-01-04 at the Wayback Machine: Technical White Paper: Published May 2006
- ^ [1] The Exchange Team Blog, Microsoft Corp.
- ^ a b Windows Storage Server 2008 at Microsoft
- ^ FileCAB-Team (10 April 2019). "Introduction to Data Deduplication in Windows Server 2012". Microsoft Tech Community.
- ^ "Data Deduplication interoperability". docs.microsoft.com. 29 March 2022.