CyberCemetery, Spring 2007

03.22.2007

 

archived websites, coming in summer 2007

  • Iraq Study Group
  • Return to Flight Task Group
  • Commission on the Future of Higher Education
  • The Independent Counsel’s Investigation of the President (Starr Commission)
  • Select Bipartisan Committee to Investigate the Preparation for and Response to Hurricane Katrina

 

archiving software

  • Heritrix
  • from the Heritrix website http://crawler.archive.org/
    • “Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project. Heritrix (sometimes spelled heretrix, or misspelled or missaid as heratrix/heritix/ heretix/heratix) is an archaic word for heiress (woman who inherits). Since our crawler seeks to collect and preserve the digital artifacts of our culture for the benefit of future researchers and generations, this name seemed apt.”
  • open-source
  • written in Java
  • stores the web resources in an Arc file


Page Information

  • 1 year ago [history]
  • View page source
  • You're not logged in
  • No tags yet learn more

Wiki Information

Recent PBwiki Blog Posts