I'm writing this to know whether Hertrix is supporting Japanese Full Space or not.
When target URLs include Japanese Full Space ('E3 80 80' in UTF 8), Heritrix seems escape it as '%3000'. As a result, Heritrix cannot access and collect the page because it uses '%3000' instead of '%E3%80%80' in the escaped URL. This escapce process seems to be done in org.archive.net.UURIFactory#escapeWhitespace.
Is this because Heritrix don't support URLs which inculdes Japanese characters so far? If so, I would like to know if there is any concrete plan to support Japanese characters in the future.