[Bizgres-general] A bitmap index access method is available in
bizgres CVS tip
Mark Kirkwood
markir at paradise.net.nz
Wed Dec 21 21:57:03 GMT 2005
Simon Riggs wrote:
>
> Real data is seldom uniformly randomly distributed when there are more
> than a few values. The most frequently occurring natural distribution is
> Zipf. With that, as we add more distinct values the frequency of
> occurrence of those additional values reduces and therefore the
> compressibility of the additional bitmaps increases also.
> Think about
> how big the bitmap for a value would be if only 1 row had that value.
>
Err - for a conventional bitmap it's always the same, so matter how many
rows actually have the value (e.g. for 10e7 rows like the example):
bitmap for 1 value size = 10000000/8 ~ 1.1Mb
Now clearly the confusion here is because I have not really got my head
around the Bizgres bitmap implementation yet (tho, Jie has provided some
pointers, and I'm reading the code today). If you have a nice
description of the algorithm you used, then that would be much appreciated!
Cheers
Mark
More information about the Bizgres-general
mailing list