Department Blog

Health Sciences Libraries

Change in NSF Reporting Rules

January 10th, 2013 by Deanna Johnson

Web Archiving Service

February 13th, 2012 by Mary Wood


The Web Archiving Service, from the California Digital Library, enables librarians, archivists and researchers to capture, curate and preserve websites and web‐published materials.  WAS makes it easy to build web archives, with scheduling and other tools to help manage your archive. You control public access to your archives and can configure the appearance and navigation of each archive. We also provide collection development consultation and help desk support for web archiving questions.

WAS collection stats & trends:  January 2012

82 archives actively collected
1,704 sites archived
1.2 terabytes of content added
UC Irvine archived 495 campus websites
WRCA archived 488 California water agency and resource sites


Library of Congress …..Web Archiving

In 2000, the Library of Congress established a pilot project to collect and preserve websites. A multidisciplinary team of Library staff studied methods to evaluate, select, collect, catalog, provide access to, and preserve these materials for future generations of researchers.

The Library has developed thematic web archives on such topics as the United States National Elections, the Iraq War, and the events of September 11. More about these collections plus many other available collections can be found at the Library of Congress Public Web Archives website.

In July 2003, the Library and the national libraries of Australia, Canada, Denmark, Finland, France, Iceland, Italy, Norway, Sweden, the British Library (UK), and the Internet Archive (USA) acknowledged the importance of international collaboration for preserving Internet content for future generations and formed the International Internet Preservation Consortium (IIPC).

In 2004, the Library’s Office of Strategic Initiatives created a Web Archiving Team to support the goal of managing and sustaining at-risk digital content. The team is charged with building a Library-wide understanding and technical infrastructure for capturing web content. The team, in collaboration with a variety of Library staff, and national and international partners, is identifying policy issues, establishing best practices and building tools to collect and preserve web content.

The 2005 Hurricane Katrina and Rita web archive collaboration included work by the Library of Congress,  Internet Archive, California Digital Library, and other similar institutions who nominated news, personal, relief and government websites to be captured and archived.

As of January 2012, the Library has collected about 285 terabytes of web archive data; view public web archive collections here.

Library of Congress catalogers at their desks, Library of Congress (circa 1920)