
Re: [Gzz] Raw pools?


From: Benja Fallenstein
Subject: Re: [Gzz] Raw pools?
Date: Sun, 10 Nov 2002 20:00:26 +0100
User-agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.1) Gecko/20020913 Debian/1.1-1

Tuomas Lukka wrote:

> On Sun, Nov 10, 2002 at 02:39:58PM +0100, Benja Fallenstein wrote:
>> Tuomas Lukka wrote:

>>> An idea, related to the "canonical blocks":
>>> Maybe we should label some pools "raw", i.e. no header, just a block
>>> of binary data. That way, we could be compatible with other
>>> content-based systems for externally obtained data.


>> That means we cannot move blocks from there to non-raw pools. Also,
>> we'd have to guess the type.

> Yes. But I think that's acceptable for the intended purpose:
> allowing us to use material that we can't distribute.
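
To make the difference concrete, here is a rough sketch of the two
identification schemes (present-day Python; the function names and the
exact header serialization -- a single CRLF-terminated Content-Type
line -- are illustrative assumptions, not anything fixed by the spec):

    import hashlib

    def raw_id(data: bytes) -> str:
        # "Raw pool" scheme: the id is the SHA-1 of the bytes alone.
        # Nothing records the media type; a reader has to guess it.
        return hashlib.sha1(data).hexdigest()

    def block_id(content_type: str, body: bytes) -> str:
        # Storm scheme: a MIME-style header travels with the body and
        # is covered by the hash, so the type can never be separated
        # from the data. (Header layout here is illustrative only.)
        header = ("Content-Type: %s\r\n\r\n" % content_type).encode("ascii")
        return hashlib.sha1(header + body).hexdigest()

The same body gets different ids under the two schemes, which is why
blocks could not simply move between raw and non-raw pools.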


One of the important principles of Storm, as I see it, is what I call the "persistency commitment" (which I should peg ;-) ):

*The specification guarantees that all blocks created according to it remain readable indefinitely-- in fifty years, or two hundred-- by any conforming implementation. This extends to all future versions of this specification, meaning that ALL future versions of the specification will be able to read ALL blocks created according to the rules in any previous version.*

Since all future implementations will have to support all 'features' we put in now, we have to be extremely careful: any garbage we put in now has to be carried along indefinitely. This is relevant in two ways:

- If we support "raw pools" now, all future versions of Storm will need to support raw pools; otherwise, the blocks could no longer be referenced. That would violate the commitment.
- Storm would cease to be a mapping from ids to blocks of binary data *with metadata, at least a Content-Type*. This change would not be reversible, since reverting it would itself violate the commitment.

I want Storm to be really simple at heart; provided that you can still run or reproduce the application-layer functionality above the Storm layer, it should be possible to keep a collection of blocks in *any* Storm pool implementation and revive it in fifty years or whenever. I expect pool implementations to be quite different in fifty years-- but I require them to represent, in some way, a binary block of bytes with a MIME header containing at least a content type. As long as they do that, I can copy the blocks holding the notes I have now without ever losing data. The implementations can be manifold-- for example, I could store the blocks in OceanStore, highly replicated to avoid lossage, or in compressed format on a CD I have at home-- and because all those implementations are required to support the same block format, I'm guaranteed that I can move blocks from one to another freely. Thus I'm able to archive my blocks even if the underlying implementations change drastically.
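
As a sketch of what that guarantee buys (hypothetical pool classes in
Python; the real implementations are of course nothing like this):

    import hashlib, os

    def check(bid, data):
        # Every conforming pool enforces the same invariant: a block's
        # id is the SHA-1 of its complete bytes, header plus body.
        if hashlib.sha1(data).hexdigest() != bid:
            raise ValueError("block does not match its id")

    class DictPool:
        # In-memory pool, e.g. a cache.
        def __init__(self):
            self.blocks = {}
        def put(self, bid, data):
            check(bid, data)
            self.blocks[bid] = data
        def get(self, bid):
            data = self.blocks[bid]
            check(bid, data)
            return data
        def ids(self):
            return list(self.blocks)

    class DirPool:
        # One file per block in a directory, e.g. on that CD.
        def __init__(self, path):
            self.path = path
        def put(self, bid, data):
            check(bid, data)
            with open(os.path.join(self.path, bid), "wb") as f:
                f.write(data)
        def get(self, bid):
            with open(os.path.join(self.path, bid), "rb") as f:
                data = f.read()
            check(bid, data)
            return data

    def archive(src, dst):
        # Moving a collection between implementations is a plain copy;
        # the ids stay valid because the block format stays the same.
        for bid in src.ids():
            dst.put(bid, src.get(bid))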

Thus, I don't agree that any extension to the Storm specification *can* be decided in the context of a single intended use (such as distributing references to copyrighted blocks). To create a special type of pool which does not allow blocks to be moved to other types of pools would either break the property I mentioned-- stuff in Storm should be freely movable between pool implementations-- or it would require every implementation (in a directory, on an HTTP server, in OceanStore, in Circle, ...) to have a 'raw' and a 'non-raw' flavor. So saying 'it's ok for this purpose' doesn't work.

[Disclaimer: Of course, it may well be that Storm won't survive even ten years. The point is to try and publish the results, so that somebody designing a protocol that does last that long can learn from it.]

> E.g. for xupdf, this would be vital for other people to be able
> to use the demo.


We have the canonical blocks (with just the Content-Type header); since you have to call a program to put something inside Storm anyway (unless you're going to calculate the SHA-1 hash yourself), I don't see the difference it would make at this point in time.
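
For instance, the whole of "calling a program" to put a file into Storm
amounts to something like this (sketch; storm_put is a made-up name,
and the canonical header layout is assumed as above):

    import hashlib

    def storm_put(content_type, body, pool):
        # A canonical block has exactly one header line, the
        # Content-Type, so the same data stored with the same type
        # always yields the same block and therefore the same id.
        header = "Content-Type: %s\r\n\r\n" % content_type
        block = header.encode("ascii") + body
        bid = hashlib.sha1(block).hexdigest()
        pool.put(bid, block)
        return bid

Given any pool object with a put() like the ones sketched earlier,
that's all the tool needs to do; two people who run it on the same file
with the same type get the same id.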

> How vehemently opposed are you?


As the above shows, very. ;-) I don't think the single purpose here is worth the change.

>> *snip my proposal*

>> (Note that the body chunks will *not* be Storm blocks, even though
>> they're referred to by an SHA-1 hash. It is therefore illegal to
>> refer to a body chunk through a Storm URI.)

> So these would, essentially, be blocks in raw pools?


No, since they would not be blocks. :-) But they would have the property you're searching for: they could be queried through a content distribution network that uses a plain SHA-1 hash for the identification of files.

The difference from your proposal above is that this one:

a) does not break the property that blocks can be moved freely from any pool to any pool (BUT it does require pool implementations to support both the old-style and the new-style id checking mechanisms);

b) does not break the assumption that all Storm blocks have a content type (so you neither need to guess it nor give hints in the context of a Storm URI, which I fear would happen otherwise); and

c) has applicability far beyond the special case you were talking about.

Let me put it like this: It has broad enough applicability that I'm thinking this *might* just be good enough to warrant the burden on future implementations.
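
Since the proposal itself is snipped above, the following sketch
necessarily guesses at its shape: an id that commits to the header and
the body chunk separately, so that checking it means parsing the block,
and so that the body chunk alone can be served by a network that only
knows plain SHA-1 hashes.

    import hashlib

    def split_block(data):
        # New-style checking has to parse the block: find the blank
        # line ending the MIME header, then hash the parts separately.
        sep = data.find(b"\r\n\r\n")
        if sep < 0:
            raise ValueError("not a block: no header/body separator")
        return data[:sep + 4], data[sep + 4:]

    def check_new_style(bid, data):
        # bid is assumed here to be a (header hash, body hash) pair.
        header, body = split_block(data)
        return bid == (hashlib.sha1(header).hexdigest(),
                       hashlib.sha1(body).hexdigest())

The second component is what a plain content-distribution network
would look up, while a Storm implementation verifies both halves--
hence the extra parsing cost mentioned below.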

>> On the downside, there would be more computation in id checking,
>> since we'd also need to parse the header.

> Very small price to pay.


Ok.

- Benja




