Wayback is loading resource in ArchivalUrl mode before date redirection
ACC-104
Automatically add trailing slash to replayURIPrefix configuration, if omitted.
ACC-98
LiveWeb fetching code is not correctly shutting down ARCWriterPool when tomcat shuts down
ACC-105
Wayback should ppeek at cotnent missing HTTP headers to guess at the type.
ACC-101
Wayback explicitly uses log4j
ACC-87
datespec flags are not forwarded to date redirects
ACC-88
Wayback calculates current year at JVM startup. Should be recalculated for each request.
ACC-90
"Bad Content Exception" for warc/revisit records
ACC-126
Exception with host Regex for server-side rewriting
ACC-89
source forge disclaimer
ACC-136
indexer should handle WARC "resource" records
ACC-94
Wayback does not handle UTF-16 correctly
ACC-59
enable Archival URL datespec flags
ACC-50
Wayback marks up text/javascript docs as if they were HTML when the Content-Type is text/html
ACC-48
Relative URLs before BASE href tag must be resolved against page URL or Content-Base HTTP header not original BASE href value
ACC-51
obscure AccessPoint bean name huge deployment hurdle
ACC-107
Return special response header from Wayback to confirm proxy mode is working
ACC-79
Parse javascript embedded in HTML files, and in external .js files and rewrite absolute URLs inside
ACC-54
Non JS timeline is shown in all frames of a frameset.
ACC-23
Advanced search localization needs to be updated
ACC-47
Broken https URLs in Proxy Mode
ACC-57
version of arc-indexer to handle bad records in ARC files
ACC-85
Only 1 Indexing Thread and 1 Merging Thread starts
ACC-8
Where is org.archive.wayback.resourcestore.http.FileLocationDBClient?
ACC-66
Wayback website not generated in UTF-8
ACC-29
Wayback .jsps do not output UTF-8
ACC-18
Administrator manual doesn't explain how to add ARCs to your installation
ACC-19
text in comment in bin/cdx-indexer is wrong
ACC-119
sourceforge hosted wayback documentation has broken links wherever named anchors occur
ACC-44
Wayback does not time out requests to backend nodes
ACC-38
Distributed index and dynamic index need updating
ACC-42
Wayback not rewriting xml dtd and xsl references
ACC-52
ArchivalUrl Server side rewriting adds content to pages with Framesets, breaking the pages
ACC-65
set up an access oracle for all access projects
ACC-69
Wayback replay issue
ACC-108
Improve timeline "Optimal" scaling / usability
ACC-4
set arc proxy in multimachine
ACC-22
locdata205-bu (ia400313)
ACC-68
source jars
ACC-99
live web leakage
ACC-27
access-control - update heritrix-commons dependency
ACC-102
testing JIRA
ACC-61
some @import url's in <style> section of html not being rewritten
ACC-16
Date ranges not appied in URL prefix queries.
ACC-58
Indexers need to extract Meta "no archive" tags
ACC-41
Wayback does not correctly replay chunk-encoded transfers
ACC-9
Clean up initialization of RequestParsers
ACC-39
accept https:// scheme in request urls
ACC-32
Make Wayback search results available in RSS format.
ACC-56
Wayback ROOT context to catch server-relative links
ACC-6
issue 1 of 81

Wayback is loading resource in ArchivalUrl mode before date redirection

Description

This is quite wasteful, buy may require a core interface change to correct. Currently, the ReplayRendererSelector is checking the DateMismatchSelector, at render time, to enforce the "redirect browser to actual date" so the location bar reflects the correct date. At this point - render time - wayback has already located the file, and downloaded at least the header for the record. This means wayback is:
1) locating and downloading the W/ARC record header
2) determining the date of the record is not what the user asked for
3) redirecting the user to the actual date
4) doing another query, followed by another locating and downloading of the W/ARC record header
5) finally showing the record to the user

#1 is a significant amount of work, and could be short-circuited, by adding a "pre-download" callback to the ReplayRendererSelector interface..

Environment

None

Status

Assignee

Kenji Nagahashi

Reporter

Brad Tofel

Labels

None

Group Assignee

None

ZendeskID

None

Estimated Difficulty

None

Actual Difficulty

None

Components

Sprint

None

Affects versions

Wayback-1.4.2

Due date

2010/08/09

Priority

Major
Configure