Australian Web Archive
The Australian Web Archive (AWA) is an publicly available
History of the three components
The PANDORA service started archiving websites in October 1996.[6]
In 2005, the NLA started archiving annual snapshots of the entire Australian web domain (URLs with the suffix. ".au"[4]),[7] collected via large crawl harvests.[8] Later, the earliest websites from the .au web domain, dating back to 1996, were obtained from the Internet Archive. In 2019 this content was first made publicly accessible through Trove.[9]
The PANDORA infrastructure, which works well for a selective small scale archiving, does not adapt to large scale "bulk harvesting" of web content, so a new technical system had to be developed whereby a web archiving service which would integrate the delivery of archived websites within a live website interface delivering the archived websites seamlessly to the user, which is difficult to achieve technically.[10]
AGWA
Australian Government websites are Commonwealth records, and are therefore publications to be managed in accordance with the Archives Act 1983.[11]
The Australian Government Web Archive (AGWA) consists of bulk archiving of
The AGWA meets the preservation and retention requirements for websites as "retain as national archives" (RNA) material under the Archives Act; however
As of early 2015, the AGWA included content dating from 2005, which amounted to about 144 million files occupying 15
Amalgamation
In 2017, the AGWA and the PANDORA archive were amalgamated with the other web archive collections, to form the Trove web archive collection.[9] After further development and the creation of the Australia Web Archive, government websites archived via AGWA and now included in AWA can still be searched separately using the "Advanced Search" option.[9]
Description of AWA
A web archive is described by the NLA as a "collection of snapshots of websites captured while they are accessible on the web, and then preserved in a static copy". The collection archived in the AWA is "relevant to the cultural, social, political, research and commercial life and activities of Australia and Australians". It collects web material via both scheduled archiving of selected websites and publications as well as some ad hoc harvesting relating to significant events.[9]
As of March 2019, when it began, AWA already contained around 600
The archive is fully searchable, based on a combination of techniques used by the developers. Each team created a unique and complex
There is a "Limit to the gov.au web domain" option before searching,[15] and government websites archived via AGWA can still be searched separately using the "Advanced Search" option.[9] Other options in Advanced Search are to limit by timespan of the snapshots, domain and file type.[16]
With many of the earlier websites from the 1990s now lost, mainly because of the frequent change of web platforms, the Australian Web Archive is a significant initiative that will help to save current and future web pages, especially Australian content.
Asia/Pacific websites
Websites in the
See also
References
- ^ "Preserving and Accessing Networked DOcumentary Resources of Australia". Pandora Archive. Retrieved 30 April 2020.
- ^ "Archived websites". National Library of Australia. 23 March 2020. Retrieved 30 April 2020.
- ^ Koerbin, Paul (11 February 2015). "The Australian Government Web Archive". National Library of Australia. Archived from the original on 30 April 2020. Retrieved 30 April 2020.
- ^ a b c Bruns, Axel (14 March 2019). "The Australian Web Archive is a momentous achievement – but things will get harder from here". The Conversation. Retrieved 30 April 2020.
- ^ a b c d Nott, George (11 March 2019). "National Library launches 'enormous' archive of Australia's Internet". Computerworld. Retrieved 6 May 2020.
- ^ "History and Achievements". PANDORA. 18 February 2009. Retrieved 6 May 2020.
- ^ McKenzie, Amelia (12 March 2019). "Preserving Australia's Web History:The beginning of the Australian Web Archive". National Library of Australia. Retrieved 6 May 2020.
- ^ "Archived websites (1996 – now)". Trove. Retrieved 6 May 2020.
- ^ a b c d e f g "About the Australian Web Archive". Trove Help Centre. Archived from the original on 17 March 2020. Retrieved 8 May 2020.
- ^ a b c Koerbin, Paul (11 February 2015). "The Australian Government Web Archive: Collecting the government's online documentary heritage goes large scale". National Library of Australia. Archived from the original on 1 May 2020. Retrieved 6 May 2020.
- ^ a b "Archiving Australian Government websites". National Archives of Australia. Retrieved 8 May 2020.
- ^ "Archived websites". National Library of Australia. 7 December 2018. Retrieved 6 May 2020.
- ^ NOTE: AWA help page says 400 tb, 8 billion records
- ^ "Check Out Australia's Web Archive". Southern Phone. 11 April 2019. Retrieved 8 May 2020.
- ^ "Australian Web Archive". Trove. Retrieved 8 May 2020.
- ^ "Australian Web Archive - Advanced Search". Trove. Retrieved 8 May 2020.
- ^ "Archived websites". National Library of Australia. 23 March 2020. Retrieved 8 May 2020.