Are URLs including 'Japanese Full Space' supported?

Description

I'm writing this to know whether Hertrix is supporting Japanese Full Space or not.

When target URLs include Japanese Full Space ('E3 80 80' in UTF 8), Heritrix seems escape it as '%3000'. As a result, Heritrix cannot access and collect the page because it uses '%3000' instead of '%E3%80%80' in the escaped URL. This escapce process seems to be done in org.archive.net.UURIFactory#escapeWhitespace.

Is this because Heritrix don't support URLs which inculdes Japanese characters so far? If so, I would like to know if there is any concrete plan to support Japanese characters in the future.

Environment

None

Status

Assignee

Unassigned

Reporter

Masahiro Shimada

Labels

None

Group Assignee

None

ZendeskID

None

Estimated Difficulty

None

Actual Difficulty

None

Priority

Minor
Configure