Modified: lucene/java/trunk/docs/fileformats.html
URL: http://svn.apache.org/viewvc/lucene/java/trunk/docs/fileformats.html?...
==============================================================================
--- lucene/java/trunk/docs/fileformats.html (original)
+++ lucene/java/trunk/docs/fileformats.html Wed Dec 13 02:08:28 2006
@@ -1346,7 +1346,8 @@
 <a name="N104B4"></a><a name="Compound Files"></a>
 <h3 class="boxed">Compound Files</h3>
 <p>Starting with Lucene 1.4 the compound file format became default. This
- is simply a container for all files described in the next section.</p>
+ is simply a container for all files described in the next section
+ (except for the .del file).</p>
 <p>Compound (.cfs) --> FileCount, <DataOffset, FileName>
 <sup>FileCount</sup>
 ,
@@ -2068,12 +2069,23 @@
 <a name="N1073C"></a><a name="Deleted Documents"></a>
 <h3 class="boxed">Deleted Documents</h3>
 <p>The .del file is
- optional, and only exists when a segment contains deletions:
+ optional, and only exists when a segment contains deletions.
 </p>
-<p>Deletions
+<p>Although per-segment, this file is maintained exterior to compound segment files.
+ </p>
+<p>
+
+<b>Pre-2.1:</b>
+ Deletions
 (.del) --> ByteCount,BitCount,Bits
 </p>
-<p>ByteSize,BitCount -->
+<p>
+
+<b>2.1 and above:</b>
+ Deletions
+ (.del) --> [Format],ByteCount,BitCount, Bits | DGaps (depending on Format)
+ </p>
+<p>Format,ByteSize,BitCount -->
 Uint32
 </p>
 <p>Bits -->
@@ -2081,6 +2093,20 @@
 <sup>ByteCount</sup>
 </p>
+<p>DGaps -->
+ <DGap,NonzeroByte>
+ <sup>NonzeroBytesCount</sup>
+
+</p>
+<p>DGap -->
+ VInt
+ </p>
+<p>NonzeroByte -->
+ Byte
+ </p>
+<p>Format
+ is Optional. -1 indicates DGaps. Non-negative value indicates Bits, and that Format is excluded.
+ </p>
 <p>ByteCount
 indicates the number of bytes in Bits. It is typically (SegSize/8)+1.
@@ -2096,10 +2122,23 @@
 Bits contains two bytes, 0x00 and 0x02, then document 9 is marked as deleted.
 </p>
+<p>DGaps
+ represents sparse bit-vectors more efficiently than Bits.
+ It is made of DGaps on indexes of nonzero bytes in Bits,
+ and the nonzero bytes themselves. The number of nonzero bytes
+ in Bits (NonzeroBytesCount) is not stored.
+ </p>
+<p>For example,
+ if there are 8000 bits and only bits 10,12,32 are set,
+ DGaps would be used:
+ </p>
+<p>
+ (VInt) 1 , (byte) 20 , (VInt) 3 , (Byte) 1
+ </p>
 </div>
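For readers who want to check the DGaps example in the diff above by hand, here is a minimal Java sketch of the encoding as the new text describes it: for each nonzero byte of Bits, write the gap from the previous nonzero byte's index as a VInt, then the byte itself. The class and method names (DGapsSketch, encode, decode, writeVInt) are hypothetical and are not taken from the Lucene sources. The sketch covers only the DGaps payload, not the Format, ByteCount and BitCount header fields. The decoder simply reads until its input buffer ends, which is an assumption made for illustration, since the text only says that NonzeroBytesCount is not stored.

```java
import java.io.ByteArrayOutputStream;
import java.util.Arrays;

// Hypothetical illustration of the DGaps layout; not Lucene's own code.
class DGapsSketch {

    // Writes a non-negative int as a VInt: seven bits per byte, low-order
    // group first, high bit set on every byte except the last.
    static void writeVInt(ByteArrayOutputStream out, int value) {
        while ((value & ~0x7F) != 0) {
            out.write((value & 0x7F) | 0x80);
            value >>>= 7;
        }
        out.write(value);
    }

    // Encodes Bits as <DGap, NonzeroByte> pairs. The first gap is measured
    // from index 0, matching the example in the diff.
    static byte[] encode(byte[] bits) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        int last = 0;
        for (int i = 0; i < bits.length; i++) {
            if (bits[i] != 0) {
                writeVInt(out, i - last);
                out.write(bits[i]);
                last = i;
            }
        }
        return out.toByteArray();
    }

    // Rebuilds Bits from the pairs. Assumption: decoding stops when the
    // buffer is exhausted; a real reader needs another stopping rule.
    static byte[] decode(byte[] dgaps, int byteCount) {
        byte[] bits = new byte[byteCount];
        int pos = 0;
        int index = 0;
        while (pos < dgaps.length) {
            int gap = 0;
            int shift = 0;
            int b;
            do {
                b = dgaps[pos++] & 0xFF;
                gap |= (b & 0x7F) << shift;
                shift += 7;
            } while ((b & 0x80) != 0);
            index += gap;
            bits[index] = dgaps[pos++];
        }
        return bits;
    }

    public static void main(String[] args) {
        byte[] bits = new byte[1000]; // 8000 bits
        for (int bit : new int[] {10, 12, 32}) {
            bits[bit >> 3] |= (byte) (1 << (bit & 7));
        }
        byte[] encoded = encode(bits);
        System.out.println(Arrays.toString(encoded)); // prints [1, 20, 3, 1]
        System.out.println(Arrays.equals(decode(encoded, bits.length), bits)); // prints true
    }
}
```

Bits 10 and 12 both fall in byte 1 (0x04 | 0x10 = 20) and bit 32 is the lowest bit of byte 4, so the four encoded bytes stand in for the 1000 bytes that the plain Bits form would need.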