Archive site

In web archiving, an archive site is a website that stores information on webpages from the past for anyone to view.

Common techniques

Two common techniques for archiving websites are using a web crawler or soliciting user submissions:

Using a
robots.txt
).

User submissions: While it can be difficult to start user submission services due to potentially low rates of user submissions, this system can yield some of the best results. By crawling web pages one is only able to obtain the information the public has chosen to post online; however, potential content providers may not bother to post certain information, assuming no one would be interested in it, because they lack a proper venue in which to post it, or because of copyright concerns.^[1] However, users who see someone wants their information may be more apt to submit it.

Examples

Google Groups

On 12 February 2001,
Deja.com and turned it into their Google Groups service.^[2] They allow users to search old discussions with Google's search technology, while still allowing users to post to the mailing lists
.

Internet Archive

The Internet Archive is building a compendium of websites and digital media. Starting in 1996, the Archive has been employing a web crawler to build up their database. It is one of the best known archive sites.

NBCUniversal Archives

NBCUniversal Archives offer access to exclusive content from NBCUniversal and its subsidiaries. Their NBCUniversal Archives website provides easy viewing of past and recent news clips, and it is a prime example of a news archive.^[3]

Nextpoint

Nextpoint offers an automated cloud-based, SaaS for marketing, compliance, and litigation related needs including electronic discovery.

PANDORA Archive

PANDORA (
Pandora Archive), founded in 1996 by the National Library of Australia
, stands for Preserving and Accessing Networked Documentary Resources of Australia, which encapsulates their mission. They provide a long-term catalog of select online publications and web sites authored by Australians or that are of an Australian topic. They employ their PANDAS (PANDORA Digital Archiving System) when building their catalog.

textfiles.com

bulletin board systems
(BBS) of his youth and to document other people's experiences on the bulletin board systems.

See also

Internet portal

Internet Archive

Pandora Archive

WebCite

Web archiving

References

doi:10.1045/march2012-niu1
.

^ "Google Acquires Usenet Discussion Service and Significant Assets from Deja.com". 12 February 2001.

^ NBCUniversal Archives

Retrieved from "https://en.wikipedia.org/w/index.php?title=Archive_site&oldid=1215594469"

[1] doi:10.1045/march2012-niu1
.

[2] "Google Acquires Usenet Discussion Service and Significant Assets from Deja.com". 12 February 2001.

[3] NBCUniversal Archives

[1]

[2]

[3]