Should record IDs be GUIDs? #242

msheller · 2022-09-02T18:03:15Z

msheller
Sep 2, 2022
Maintainer

Currently, a registered dataset is given an auto-incremented ID, so the ID is unique only within a single server. This means our filesystem could hold multiple entries for the exact same data if a user registered a dataset with the same prep logic on two different servers.

Alternatively, dataset IDs could be GUIDs, so that the same registration could be submitted to multiple servers and the datasets kept in a server-agnostic folder. To support this, the client could run the same generated-UID check that the server runs when it tries to prevent duplicate entries. That way, the client can catch duplicates even when registrations are split across servers. (Does the client perhaps already do this to prevent erroneous computation if it tries to prepare the same dataset the same way multiple times?)
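
To make the idea concrete, here is a minimal sketch of a server-agnostic ID, assuming the ID can be derived deterministically from a digest of the raw data plus an identifier for the prep logic. The names `DATASET_NAMESPACE`, `generated_uid`, `dataset_guid`, and `is_duplicate` are hypothetical and not part of the existing codebase; the server's actual generated-UID scheme may differ.

```python
import hashlib
import uuid

# Hypothetical fixed namespace: any UUID shared by every client and server works.
DATASET_NAMESPACE = uuid.uuid5(uuid.NAMESPACE_DNS, "datasets.example.org")


def generated_uid(data_hash: str, prep_id: str) -> str:
    """Combine the raw-data digest and the prep logic identifier into one digest.

    Assumption: these two inputs fully determine the prepared dataset.
    """
    payload = f"{data_hash}:{prep_id}".encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


def dataset_guid(data_hash: str, prep_id: str) -> uuid.UUID:
    """Derive a name-based (version 5) UUID, identical on every machine."""
    return uuid.uuid5(DATASET_NAMESPACE, generated_uid(data_hash, prep_id))


def is_duplicate(guid: uuid.UUID, known_guids: set) -> bool:
    """Client-side check: has this exact dataset already been prepared or registered?"""
    return guid in known_guids
```

Because the ID depends only on the inputs, two servers (or a client and a server) compute the same value for the same data and prep logic, so the ID can double as the name of a server-agnostic folder. A random `uuid.uuid4()` would also be globally unique, but it would not detect that two registrations describe the same dataset.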

hasan7n · 2023-11-22T13:49:43Z

hasan7n
Nov 22, 2023
Maintainer

Converted to an issue.

