getDigest on ARCRecordMetadata has undocumented issues.

Description

When iterating through an ARC file using heritrix-commons, the getDigest method of the ARCRecordMetadata has some undocumented issues.

First, the record must have been read completely before the digest string is generated.

Second, if the client processes the HTTP headers itself, by using the read(...) methodes of the ARCRecord, the digest is calculated on the whole entry (including headers) and not the payload.

Environment

None

Status

Assignee

Unassigned

Reporter

Erik Hetzner

Labels

None

Group Assignee

None

ZendeskID

None

Estimated Difficulty

None

Actual Difficulty

None

Affects versions

Heritrix 3.1.1

Priority

Major
Configure