Not exactly a Heritrix bug per se, but I wanted to track this issue.
We are unable to crawl the URL: https://netfiles.uiuc.edu/akachi2/home
We get the following error:
This seems to be more of a Java issue than a Heritrix issue; I am also unable to fetch the file using the tika-app jar:
I don't know if there is anything to be done here, but I thought I'd report the issue.
Environment: Ubuntu Quantal (12.10), Java 6 and 7
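
To check whether the failure reproduces outside both Heritrix and Tika, something like the following plain-JDK fetch might help narrow it down. This is only a sketch: the class name FetchCheck and the method describeFetch are made up for illustration, the 10-second timeouts are arbitrary, and I have not confirmed what it prints for the URL above. It uses only java.net classes and avoids Java 7 syntax, so it should compile on both Java 6 and 7.

```java
import java.net.HttpURLConnection;
import java.net.URL;

public class FetchCheck {

    /**
     * Attempts a GET on the given address and returns a one-line summary
     * instead of throwing, so the exception class is easy to copy into a report.
     * (Hypothetical helper, not part of Heritrix or Tika.)
     */
    static String describeFetch(String address) {
        try {
            URL url = new URL(address);
            HttpURLConnection conn = (HttpURLConnection) url.openConnection();
            conn.setConnectTimeout(10000); // milliseconds, arbitrary choice
            conn.setReadTimeout(10000);
            try {
                // getResponseCode() opens the connection and performs the request.
                return "RESPONSE: HTTP " + conn.getResponseCode();
            } finally {
                conn.disconnect();
            }
        } catch (Exception e) {
            // Covers malformed URLs, I/O errors, and handshake failures alike.
            return "FAILED: " + e.getClass().getName() + ": " + e.getMessage();
        }
    }

    public static void main(String[] args) {
        String address = args.length > 0
                ? args[0]
                : "https://netfiles.uiuc.edu/akachi2/home";
        System.out.println(describeFetch(address));
    }
}
```

Compile with javac FetchCheck.java and run with java FetchCheck, optionally passing a different URL as the first argument. If this fails with the same exception as the crawler, that would support the idea that the problem sits in the JDK rather than in Heritrix.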