Do not expand list of outlinks automatically in ArchiveJSONViewLoader()

Description

When loading outlinks into a Pig script via ArchiveJSONViewLoader(), the outlinks for each URL are automatically flattened out so that for N outlinks you get N tuples in your relation. That is, the result is the cross product of the URL and the outlinks.

Maybe it's better to leave the outlinks as a Pig Bag and let the script-writer FLATTEN() them if desired. By leaving them in the bag, the group of outlinks for a particular URL can be processed w/o having to re-group them.

Environment

None

Status

Assignee

Brad Tofel

Reporter

Aaron Binns

Labels

None

Group Assignee

None

ZendeskID

None

Estimated Difficulty

None

Actual Difficulty

None

Components

Priority

Major
Configure