Issues

Add option to omit storing of content in segment
WAX-34
Add URL canonicalization to pageranker
WAX-33
500 error - java.lang.NegativeArraySizeException
WAX-32
contrib/archive/README.txt needs clarifications
WAX-31
NutchWAX requires very long timeouts on remotely hosted ARC files
WAX-30
NutchWAX home page issue tracker still points to sf.net
WAX-29
Investigate malformed URL report during date-adder
WAX-28
Sensible output when requesting a page of results past the end
WAX-27
Add XML elements containing all search URL params for self-link generation
WAX-26
Add utility/tool to dump unique values of a field in an index.
WAX-25
DateAdder fails due to uncaught exception in URL canonicalization
WAX-24
Add a "field setter" filter to set a field to a static value in the Lucene document during indexing.
WAX-23
Various code clean-ups based on code review using PMD tool.
WAX-22
Allow for blank lines and comment lines in manifest file.
WAX-21
Bug in exacturl query
WAX-20
Add strict/loose option to DateAdder for revisit lines with extra data on end
WAX-19
Add reading of archive files from DFS
WAX-18
More aggressive collapsing by site in search results
WAX-17
Option to skip ARC record import based on HTTP status code of content
WAX-16
Investigate why reading content from archive file uses such small chunks
WAX-14
Add DFS read/write support to DateAdder
WAX-13
Add metadata field "fileoffset"
WAX-12
Change metadata field name in search results from "arcname" to "filename"
WAX-11
Add "exacturl" metadata field to indexing so it can be searched as-is, not parsed/tokenized like the "url" field.
WAX-10
Entire file not imported
WAX-9