[Esip-preserve] Possible Workaround for data identity non-uniqueness?

alicebarkstrom at frontier.com alicebarkstrom at frontier.com
Wed Oct 13 12:05:11 EDT 2010


Briefly - the assertion should come with a practical test that could
be verified by a user or an auditor.

Also, the resolver of the UUID's should probably be responsible for
noting the alternatives - just as mirroring sites do.

Bruce B.
----- Original Message -----
From: "Christopher S. Lynnes (GSFC-6102)" <christopher.s.lynnes at nasa.gov>
To: "Curt Tilmes (GSFC-6145)" <curt.tilmes at nasa.gov>
Cc: esip-preserve at lists.esipfed.org
Sent: Wednesday, October 13, 2010 8:56:23 AM
Subject: [Esip-preserve] Possible Workaround for data identity	non-uniqueness?

I agree with Curt's assessment that the canonicalization has practical problems for data that has been reformatted in a way that does not affect the content.

Is there perhaps a workaround where the reformatting agent simply asserts that they are equivalent?  That is, to add a metadata attribute that says, "this file is scientifically equivalent to this other file (e.g., identified by uuid)"?  

On Oct 13, 2010, at 8:33 AM, Curt Tilmes wrote:

> You can argue that coming up with a C(x) canonicalization isn't
> practical for our data (I won't even disagree :-) I sure don't want to
> do it myself), but your paper doesn't present that argument, or even
> address the point.  Your conclusion simply assumes it is true.
> 
> As Altman demonstrates for his field, it is certainly conceivable.
> 
> I'm also not certain that we have to develop something that "applies
> to all Earth science data" to be useful.  Perhaps we can come up with
> something reasonable for a subset, for example, annotated files in one
> of the self-describing formats (HDF/NetCDF/etc.) where the annotations
> can contribute to the canonicalization process (i.e. you tag text
> fields with a property that says "case-insensitive canonicalization of
> this field will maintain scientific equivalence"

--
Dr. Christopher Lynnes    NASA/GSFC, Code 610.2, Greenbelt, MD 20771
Phone: 301-614-5185

_______________________________________________
Esip-preserve mailing list
Esip-preserve at lists.esipfed.org
http://www.lists.esipfed.org/mailman/listinfo/esip-preserve


More information about the Esip-preserve mailing list