SHA1 Payload-Digest should use base 32 and not base 16 #213

Open
thomasegense opened this issue Nov 1, 2020 · 0 comments

@thomasegense

I know that the WARC specification does allow base 16 (https://iipc.github.io/warc-specifications/specifications/warc-format/warc-1.1/#warc-payload-digest). But since most other WARC tools produce base 32 SHA1 digests, it would be beneficial for the community if Brozzler used the same encoding.

The following tools all produce base32:
Heritrix
wget
Webrecorder

This was discovered when the warc-indexer in the webarchive-discovery project (https://github.com/ukwa/webarchive-discovery/tree/master/warc-indexer) reported payload errors while indexing WARC files generated with Brozzler.
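
To illustrate the difference, here is a minimal Python sketch using only the standard library. It is not taken from Brozzler or any of the tools above; the function names `sha1_digest_labels` and `base16_to_base32` are hypothetical, and the `sha1:` prefix is assumed to be the label form written into the digest header. Both encodings carry the same 20 digest bytes, so one can be derived from the other without re-reading the payload.

```python
import base64
import hashlib


def sha1_digest_labels(payload: bytes) -> tuple[str, str]:
    """Return (base16, base32) labelled SHA1 digests for the same payload."""
    digest = hashlib.sha1(payload).digest()  # 20 raw bytes
    base16 = "sha1:" + digest.hex()  # 40 hex characters
    # 20 bytes are exactly 160 bits, so base32 gives 32 characters, no padding.
    base32 = "sha1:" + base64.b32encode(digest).decode("ascii")
    return base16, base32


def base16_to_base32(value: str) -> str:
    """Convert a labelled base16 digest (assumed 'algo:hex') to base32."""
    algo, _, hex_part = value.partition(":")
    raw = bytes.fromhex(hex_part)
    return f"{algo}:{base64.b32encode(raw).decode('ascii')}"


if __name__ == "__main__":
    b16, b32 = sha1_digest_labels(b"example payload")
    print(b16)
    print(b32)
    print(base16_to_base32(b16) == b32)
```

A consumer such as an indexer could apply something like `base16_to_base32` to normalize digests before comparing them, though emitting base 32 at write time would avoid the mismatch in the first place.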

@thomasegense changed the title from "SHA1 Payload-Digest should use base 16 and not base 32" to "SHA1 Payload-Digest should use base 32 and not base 16" on Nov 1, 2020