Computational Research Use of Web Archive Data
The ALEXANDRIA project
Web Archives for Longitudinal Knowledge
Global Event and Trend Archive Research & Integrated Digital Event Archiving
Project: PoliInformatics & The .gov Web Domain Analysis
Tempas: Historical Web Archive Search and Information Retrieval
Researcher: Alexandria Project, University of Hannover
Description: Research and development for information retrieval and text and URL search for
web archive collections. Funding: European Commission.
References: http://tempas.l3s.de/v2/index
Storytelling for Summarizing Collections in Web Archives
NetLab & Historical Internet Data for the Arts and Humanities
Researcher: Research Infrastructure for the Study of Archived Web Materials Network
Description: Conducting a number of research-driven projects to contribute to the establishment,
test and development of a research infrastructure for the study of online as well as archived
internet materials. Also working in concert with a number of national libraries, as well as
Internet Archive, to support the study of national web domains.
Funding: European Commission | References: http://www.netlab.dk/ & http://resaw.eu/
An Archival Reconstruction of the Former Yugoslav Top Level Domain
The colors of the national Web: visual data analysis of the historical Yugoslav Web domain
Webverse
World Wide Web of Humanities
Researcher: Oxford Internet Institute
Description: Establishing a framework for e-Humanities research using available open source
tools and technologies and archived web content to create novel research interfaces to the first of
many, scholarly, e-Humanities web collections.
Funding: Joint Information Systems Committee & National Endowment for the Humanities
References: http://wwwoh-access.archive.org/wwwoh/about.htm
Using Archival Resources to Conduct Data-Intensive Internet Research & Newspapers and the long-term implications of hyperlinking
Researcher: Matt Weber, School of Information & Communication, Rutgers University
Description: The BCC-SBE Collaborative Research Project, "Using Archival Resources to
Conduct Data-Intensive Internet Research," has three goals: (1) to build a community of scholars
focused on tackling next-generation questions of Internet research through the use of archival
digital data; (2) to create sample databases and develop a prototype research tool,
HistoryTracker, using data from the Internet Archive, a library of Web pages from the World
Wide Web; and (3) to maintain an active community of scholars using the cutting-edge
community platform HUBzero. Funding: National Science Foundation
References: http://matthewsweber.com/current-research/nsf-internet-archive/ &
http://wwwconference.org/proceedings/www2014/companion/p1031.pdf
GifCollider
Researcher: Brian D. Davison, Professor of Computer Science & Engineering, Lehigh University
Description: Investigating efficient and effective approaches to store, index, and retrieve web
content from large-scale historical archives. The temporal content and structure of the archives
are mined to exploit temporal characteristics that can improve search result ranking.
Funding: National Science Foundation
References: http://wume.cse.lehigh.edu/projects/archive/