[Gzz] Address meanings, not contents! (Re: Storm blocks and metadata)

gzz-dev

From:	Reto Bachmann-Gmuer
Subject:	[Gzz] Address meanings, not contents! (Re: Storm blocks and metadata)
Date:	Thu, 27 Mar 2003 17:10:15 +0100

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hi Benja

> It is necessary for the interpretation of the data we get; and it's usually easy to agree on (people won't too often assign different MIME types to the same bytes). One thing about content hashes is, when two people put the same file into a hash-based system, they will use the same identifier for it. With MIME types, that's still pretty much true; with more elaborate metadata, it isn't.

I certainly wouldn't argue to put even more metadata in the URI.

> Using the same identifier is important for queries like, "Which documents include this image?" If the three documents that use the image use three different kinds of IDs for it (because they refer to three different kinds of metadata), you're out of luck.

In the common-sense meaning of the question "Which documents include this image?", "this image" is not defined by the sequence of bytes that make up a specific JPEG version of it, but rather by a specific visual representation of a thing. Giving a URI to the image itself (in this encoding-independent, common-sense meaning) and referencing that URI rather than the URI of the byte sequence wherever possible allows answering queries that are closer to our real-world understanding of things. What is concrete for us is fairly abstract for the computer: computers deal with abstractions over the raw data to get to the stuff non-mathematicians can deal with, and this "abstraction process" has to be pushed further to get to the semantic web. By the way, the MIME type isn't so unambiguous either: a text using only a restricted set of characters may be encoded to the same sequence of bytes using different encodings.
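
To make the last point concrete, here is a minimal Python sketch (not part of the original thread). It assumes SHA-1 as a stand-in for whatever hash a block system might use; the sample strings are made up. An ASCII-only text encodes to the same bytes under several charsets, so neither the bytes nor their content hash pin down one encoding.

```python
import hashlib

text = "Address meanings, not contents!"  # ASCII-only sample text

# Encode the same text with three different charsets.
encodings = ["ascii", "latin-1", "utf-8"]
blocks = {name: text.encode(name) for name in encodings}

# Hash each byte sequence (SHA-1 is an assumption made for this sketch).
digests = {name: hashlib.sha1(data).hexdigest() for name, data in blocks.items()}

# The byte sequences, and therefore the digests, are all identical.
same_bytes = len(set(blocks.values())) == 1
same_digest = len(set(digests.values())) == 1
print(same_bytes, same_digest)

# As soon as a non-ASCII character appears, the encodings diverge.
differs = "Gmür".encode("latin-1") != "Gmür".encode("utf-8")
print(differs)
```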

(...)
> > Higher-level applications should not use block URIs anyway but deal with an abstraction representing the content (like HTTP URLs should).
> You mean as in, with content negotiation applied? You use a single URI which maps to different representations of the same resource?

You name it, the *same* resource. (But each representation is also a resource itself.)
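
A rough sketch of what this buys for the "Which documents include this image?" query, in Python. All data below is hypothetical: the `urn:content-hash:` values are placeholders, and the two dictionaries stand in for whatever store would hold these relations.

```python
# Hypothetical mapping from an abstract image URI to its byte-level representations.
representations = {
    "urn:urn-5:G7Fj": {"urn:content-hash:jpeg-bytes", "urn:content-hash:png-bytes"},
}

# Hypothetical documents, each recording the URI it uses to include the image.
documents = {
    "doc-a": "urn:content-hash:jpeg-bytes",
    "doc-b": "urn:content-hash:png-bytes",
    "doc-c": "urn:urn-5:G7Fj",
}

def documents_including(image_uri):
    """Documents that reference the given URI or any of its known representations."""
    targets = {image_uri} | representations.get(image_uri, set())
    return sorted(doc for doc, ref in documents.items() if ref in targets)

# Asking about the abstract resource finds every document; asking about
# one byte sequence only finds the documents that happen to use those bytes.
print(documents_including("urn:urn-5:G7Fj"))
print(documents_including("urn:content-hash:jpeg-bytes"))
```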

> > An example to be more explicit:
> > <urn:urn-5:G7Fj> <DC:title> "Ulisses"
> > <urn:urn-5:G7Fj> <DC:description> "bla bli"
> This, for example, I would not include here. :-) Firstly, it is something I would want to be versioned independently: if I change the description of an image, that should not create a new version of the image.

Surely not! Where I used literals in the examples, one could use a URI representing the meaning of "bla bli"; an attribute value of this URI would then be a URI for the English expression of that meaning, an attribute of this URI would be a URI representing this expression spoken by John, and an attribute of this URI would be a Storm block of bytes with the MP3 encoding of it.

I think you need a generic versioning system for RDF statements rather than for the data: a later statement must have a means to put an earlier statement out of the graph (while the older one should still be accessible, in the style of the reification "I used to believe (s p v)").
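
The chain of URIs described above (meaning, English expression, spoken expression, byte block) could be sketched as follows. This is only an illustration: every URI and predicate name here is invented, and plain tuples stand in for RDF statements.

```python
# Hypothetical URIs and predicate names, for illustration only.
triples = [
    ("urn:urn-5:meaning", "ex:expressedInEnglishAs", "urn:urn-5:english"),
    ("urn:urn-5:english", "ex:spokenByJohnAs", "urn:urn-5:recording"),
    ("urn:urn-5:recording", "ex:mp3Block", "urn:content-hash:EXAMPLE"),
]

def resolve_to_block(start, triples):
    """Follow the chain from an abstract URI down to a content-hash URI.

    Returns None if the chain ends, or loops, before reaching a block.
    """
    next_of = {s: o for s, _, o in triples}
    current, seen = start, set()
    while not current.startswith("urn:content-hash:"):
        if current in seen or current not in next_of:
            return None
        seen.add(current)
        current = next_of[current]
    return current

print(resolve_to_block("urn:urn-5:meaning", triples))
```

Each level stays addressable on its own, so an application can reference the most abstract URI that still means what it wants to say.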

> Secondly, I don't see a reason why the URI of the image would need to refer to this.

Me neither ;-). There must be a misunderstanding here.

> Thirdly, I don't think that when a file is put into the system -- and thus given its identifier -- is necessarily the time to create this kind of metadata. It would seem to hold up the task at hand. Rather, I'd like to be able to add it later on, and maybe someone else can do that even better than me -- like a librarian who has a scientific background in giving metadata about stuff.

Of course. Mechanisms of the application should probably add some metadata that give the user a chance to find the data later, but there should always be the possibility to enter a new version of the metadata.
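
One possible shape for such metadata versioning, as a hedged Python sketch: an append-only log in which a later statement with the same subject and predicate supersedes the earlier one, while the older statement stays readable. The class name, method names, and the "same subject and predicate" rule are assumptions made for this example, not something the thread specifies.

```python
from dataclasses import dataclass, field

@dataclass
class StatementStore:
    """Append-only log of (subject, predicate, value) statements."""
    log: list = field(default_factory=list)

    def assert_statement(self, subject, predicate, value):
        # Nothing is ever deleted; newer entries simply come later in the log.
        self.log.append((subject, predicate, value))

    def current(self, subject, predicate):
        """Latest value asserted for (subject, predicate), or None."""
        for s, p, v in reversed(self.log):
            if (s, p) == (subject, predicate):
                return v
        return None

    def history(self, subject, predicate):
        """Every value ever asserted, oldest first ("I used to believe ...")."""
        return [v for s, p, v in self.log if (s, p) == (subject, predicate)]

store = StatementStore()
store.assert_statement("urn:urn-5:G7Fj", "DC:description", "bla bli")
store.assert_statement("urn:urn-5:G7Fj", "DC:description", "a better description")
print(store.current("urn:urn-5:G7Fj", "DC:description"))
print(store.history("urn:urn-5:G7Fj", "DC:description"))
```

Changing the description here never touches the image block, so the image itself does not get a new version.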

(...)
> > In this example, applications should reference "urn:urn-5:G7Fj" (which does not have a MIME type) rather than "urn:content-hash:Dj&/fjkZRT68" (which has a MIME type in a specific context) wherever possible; in many cases a higher abstraction, "urn:urn-5:lG5d", can be used.
> Um, using a urn-5 doesn't work since it's just a random number -- if we use just a random number, we cannot check whether the data we may retrieve from a p2p network is really what the person making the reference wanted us to see. We would need to use "urn:foo:ref:[blah]", which would be the above RDF data, from which we could then get the specific representation.

The urn-5 URIs are intended to reference a certain concept/idea/meaning/topic; people are free to associate attributes with existing URIs. They may be subject to change, like terms in natural language are. If somebody wants to use a term in a specific sense, she has to make this explicit, maybe using digital signatures, but more often I think a key-free trust system (http://www.w3.org/2002/03/key-free-trust.html) is not only enough, but better adapted to "fuzzy" trust levels in a P2P network.
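
The difference between the two kinds of identifier can be sketched in Python. The hex SHA-1 form of the `urn:content-hash:` URI is an assumption for this example (the thread does not define the encoding), and both helper functions are hypothetical.

```python
import hashlib

def content_hash_uri(data: bytes) -> str:
    """Build a content-hash URI for a block (hex SHA-1 is assumed here)."""
    return "urn:content-hash:" + hashlib.sha1(data).hexdigest()

def verify_block(uri: str, data: bytes) -> bool:
    """Check that retrieved bytes really are the block the URI names."""
    return uri == content_hash_uri(data)

original = b"bytes of one specific JPEG encoding"
uri = content_hash_uri(original)

# A peer that returns the right bytes passes; a spoofed block does not.
print(verify_block(uri, original))
print(verify_block(uri, b"something a malicious peer sent instead"))
```

A urn-5 identifier carries nothing to recompute, so statements attached to it can only be trusted through some other mechanism, such as signatures or a key-free trust scheme.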

> > While you can only deficiently use HTTP to serve a block,
>
> Why?

The only HTTP header you can send back is the length and, if you put it in the URI, the content type; most HTTP features are unused.

> > you could serve the URIs of both abstractions (urn:urn-5:G7Fj and urn:urn-5:lG5d) directly using HTTP 1.1 features.
> (Again, you'd have to use hashes, or you could be arbitrarily spoofed.)

(Again. No good networking without trust mechanisms ;-)

(...)
> > > > And how do you split the metadata in blocks
> > > Well, depends very much on the application. How do you split metadata into files? :-)
> > Not at all ;-). The splitting into files is rudimentarily represented metadata; if you use RDF, the filesystem is a legacy application.
> Um, but if you put metadata on an http server, you split it too?

My approach would be to split the data just in time. To make it accessible over HTTP, for a standard request the server could return all the statements where a specific URI occurs, or only those where it is the subject. An extended request could contain the level of expansion requested.
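
A minimal Python sketch of that just-in-time selection, under stated assumptions: statements are plain tuples, the function name and its `subject_only` and `depth` parameters are invented for illustration, and "level of expansion" is read as the number of hops to follow from the requested URI.

```python
def statements_about(triples, uri, subject_only=False, depth=1):
    """Select the statements that mention `uri`, expanded `depth` levels.

    With subject_only=True, only statements whose subject matches are kept.
    """
    selected = []
    frontier, visited = {uri}, set()
    for _ in range(depth):
        next_frontier = set()
        for s, p, o in triples:
            hit = s in frontier or (not subject_only and o in frontier)
            if hit and (s, p, o) not in selected:
                selected.append((s, p, o))
                next_frontier.update({s, o})
        visited |= frontier
        frontier = next_frontier - visited
        if not frontier:
            break
    return selected

# Hypothetical statements; the "ex:" predicates are made up for this example.
triples = [
    ("urn:urn-5:G7Fj", "DC:title", "Ulisses"),
    ("urn:urn-5:G7Fj", "ex:representation", "urn:content-hash:abc"),
    ("urn:content-hash:abc", "ex:mimeType", "image/jpeg"),
    ("urn:urn-5:lG5d", "ex:includes", "urn:urn-5:G7Fj"),
]

print(statements_about(triples, "urn:urn-5:G7Fj", subject_only=True))
print(statements_about(triples, "urn:urn-5:G7Fj", subject_only=True, depth=2))
```

The split is computed per request from one graph, so nothing has to be cut into fixed blocks or files in advance.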

(...)


Cheers,
Reto
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.0.7 (Darwin)

iD8DBQE+gyJtD1pReGFYfq4RAgiFAKCEEvE6v/NwTl1ebjge5YPx9UAtqACgqXvF
RpcbVqiDuvMrGt9ReDMGZLI=
=TRAL
-----END PGP SIGNATURE-----
