Issues

nutchwax-0.13/src/java/org/archive/nutchwax/imagesearch/DocIndexer.java:309: error: method filter in class IndexingFilters cannot be applied to given types
WAX-83
Nutch HTML parser infinite loop.
WAX-82
Corrupt script tag at end of page causes HTML parser infinite loop.
WAX-81
Mime-type detection infinite loop due to control character in DOCTYPE declaration.
WAX-80
Extract HTML meta tags for 'description' and 'keywords' and add to segment.
WAX-79
HTML noindex and nofollow enforced in HTMLParser?
WAX-78
JDK6u23 breaks GzippedInputStream & W/ARCReaders with different GZIP handling
WAX-77
Slow parsing
WAX-76
Hacks to use with Hadoop-0.20 from Cloudera
WAX-75
Add support for storing fields in compressed form.
WAX-74
Change default value of searcher.fieldcache in nutch-site.xml to 'false'
WAX-73
Simplify build system to copy NW files into Nutch dirs and use Nutch build.xml
WAX-72
NutchWAX-required libraries not included in nutch-1.0.job
WAX-71
Cannot use rsync URLs, no handler for rsync protocol.
WAX-70
Class not found when importing within a Hadoop MR job.
WAX-69
Compatibility with {index+segment}s created by NutchWAX 0.10.
WAX-68
Nutch OpenOffice parser does not pass along metadata.
WAX-67
Index documents without crawldb or linkdb.
WAX-66
Some odd-ball characters display as '?' in search results.
WAX-65
Research sorting feature for NutchWAX
WAX-64
LengthNormUpdater returning error code if no fields in index have norms is inconvenient.
WAX-63
Add ability to configure HTTP headers to support caching.
WAX-62
Change mime-type of OpenSearch XML response from text/xml to application/xml.
WAX-61
DateAdder should have an option to determine if norms should be used.
WAX-60
Wrong log() function used in PageRankScoringFilter.
WAX-59