A method is proposed for dealing with nonuniform data distributions in database organizations in order to estimate the expected number of blocks containing the tuples requested by a query. When tuples with equal attribute value are not uniformly distributed Over the blocks of secondary memory that store the relation, a clustering effect is observed. This can be detected by means of a single parameter, (the clustering factor), which can be stored in the system catalog. The method can be applied to uniform data distributions as well, since it is shown that a uniform distribution can be viewed as a particular instance of a class of clustered distributions. In this case the proposed method allows considerable reduction of the number of computational steps needed to compute the estimated result. © 1993 IEEE
Block Access Estimation for Clustered Data / Ciaccia P.. - In: IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING. - ISSN 1041-4347. - STAMPA. - 5:4(1993), pp. 712-718. [10.1109/69.234782]
Block Access Estimation for Clustered Data
Ciaccia P.
1993
Abstract
A method is proposed for dealing with nonuniform data distributions in database organizations in order to estimate the expected number of blocks containing the tuples requested by a query. When tuples with equal attribute value are not uniformly distributed Over the blocks of secondary memory that store the relation, a clustering effect is observed. This can be detected by means of a single parameter, (the clustering factor), which can be stored in the system catalog. The method can be applied to uniform data distributions as well, since it is shown that a uniform distribution can be viewed as a particular instance of a class of clustered distributions. In this case the proposed method allows considerable reduction of the number of computational steps needed to compute the estimated result. © 1993 IEEEI documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.