| |
In DATESO (2004), pp. 27-37.
Abstract
In this paper pivoting M-tree (PM-tree) is introduced, a metric access method combining M-tree with the pivot-based approach. While in M-tree a metric region is represented by a hyper-sphere, in PMtree the shape of a metric region is determined as an intersection of the hyper-sphere and a set of hyper-rings. The set of hyper-rings for each metric region is related to a fixed set of pivot objects. As a consequence, the shape of a metric region bounds the indexed objects more tightly ...
|
| |
In DASFAA, Vol. 3453 (2005), pp. 398-409.
Abstract
In this paper, we propose a novel high-dimensional index method, the BM+-tree, to support efficient processing of similarity search queries in high-dimensional spaces. The main idea of the proposed index is to improve data partitioning efficiency in a high-dimensional space by using a rotary binary hyperplane, which further partitions a subspace and can also take advantage of the twin node concept used in the M+-tree. Compared with the key dimension concept in the M+-tree, the binary hyperplane is more effective in ...
|
| |
In DASFAA, Vol. 3453 (2005), pp. 385-397.
Abstract
Indexing high dimensional datasets has attracted extensive attention from many researchers in the last decade. Since R-tree type of index structures are known as suffering curse of dimensionality problems, Pyramid-tree type of index structures, which are based on the B-tree, have been proposed to break the curse of dimensionality. However, for high dimensional data, the number of pyramids is often insufficient to discriminate data points when the number of dimensions is high. Its effectiveness degrades dramatically with the increase of dimensionality. ...
|
| |
In ADC (2003), pp. 161-168.
Abstract
In this paper, we propose a new metric index, called M+-tree, which is a tree dynamically organized for large datasets in metric spaces. The proposed M+-tree takes full advantages of M-tree and MVP-tree, with a new concept called key dimension, which effectively reduces response time for similarity search. The main idea behind the key dimension is to make the fanout of tree larger by partitioning a subspace further into two subspaces, called twin-nodes. We can double the filtering effectiveness by utilizing ...
|
| |
In ICDT, Vol. 2572 (2003), pp. 143-157.
Abstract
We propose a new indexing scheme, called the CRB-tree, for efficiently answering range-aggregate queries. The range-aggregate problem is defined as follows: Given a set of weighted points in R , compute the aggregate of weights of points that lie inside a d-dimensional query rectangle. In this paper we focus on range-COUNT, SUM, AVG aggregates. First, we develop an indexing scheme for answering two-dimensional range-COUNT queries that uses O(N=B) disk blocks and answers a query in... ...
|
| |
In VLDB '01: Proceedings of the 27th International Conference on Very Large Data Bases (2001), pp. 421-430.
Note (first note only)
iDistance
|
| |
Lecture Notes in Computer Science In EDBT, Vol. 1777 (2000), pp. 51-65.
Abstract
In this paper we present the Slim-tree, a dynamic tree for organizing metric datasets in pages of fixed size. The Slim-tree uses the "fat-factor" which provides a simple way to quantify the degree of overlap between the nodes in a metric tree. It is well-known that the degree of overlap directly affects the query performance of index structures. There are many suggestions to reduce overlap in multidimensional index structures, but the Slim-tree is the first metric structure explicitly... ...
|
| |
In WAIM '00: Proceedings of the First International Conference on Web-Age Information Management (2000), pp. 356-373.
Abstract
One of the common query patterns is to find approximate matches to a given query object in a large database. This kind of query processing is referred as similarity search in a metric space. In this paper, we propose a new metric index MB+tree, called Metric B+tree, which supports near neighbour searching in a generic metric space. MB+tree is aimed at reducing both the number of I/O accesses and the number of distance calculations for similarity search in large databases, while allowing dynamic data updates. ...
|
| |
In ICDE '00: Proceedings of the 16th International Conference on Data Engineering (2000)
Note (first note only)
IQ-Tree
|
| |
In PODS '00: Proceedings of the nineteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems (2000), pp. 166-174.
Note (first note only)
iMinMax
|
| |
VLDB Journal, Vol. 9, No. 2. (2000), pp. 154-173.
Abstract
For some multimedia applications, it has been found that domain objects cannot be represented as feature vectors in a multidimensional space. Instead, pair-wise distances between data objects are the only input. To support content-based retrieval, one approach maps each object to a k-dimensional (k-d) point and tries to preserve the distances among the points. Then existing spatial access index methods such as the R-trees and KD-trees can support fast searching on the resulting k-d points. ... ...
|
| |
In Proc. of the 26th International Conference on Very Large Data Bases (VLDB) (September 2000), pp. 516-526.
Abstract
We propose a novel index structure, the A-tree (Approximation tree), for the similarity search of high-dimensional data. The basic idea of the A-tree is the introduction of Virtual Bounding Rectangles (VBRs) which contain and approximate MBRs or data objects. VBRs can be represented rather compactly, and thus a#ect the tree configuration both quantitatively and qualitatively. First, since tree nodes can contain a large number of VBR entries, fanout becomes large, which leads to fast search.... ...
|
| |
In ICDE '99: Proceedings of the 15th International Conference on Data Engineering (1999)
|
| |
In Proc. 24th Int. Conf. Very Large Data Bases, VLDB (FebruaryApril--FebruaryJuly~ 1998), pp. 194-205.
Abstract
For similarity search in high-dimensional vector spaces (or `HDVSs'), researchers have proposed a number of new methods (or adaptations of existing methods) based, in the main, on data-space partitioning. However, the performance of these methods generally degrades as dimensionality increases. Although this phenomenon---known as the `dimensional curse'---is well known, little or no quantitative analysis of the phenomenon is available. In this paper, we provide a detailed analysis of... ...
Note (first note only)
VA-file
|
| |
SIGMOD Rec., Vol. 27, No. 2. (June 1998), pp. 142-153.
|
| |
In SIGMOD '97: Proceedings of the 1997 ACM SIGMOD international conference on Management of data, Vol. 26, No. 2. (June 1997), pp. 369-380.
|
| |
ACM Transactions on Database Systems, Vol. 24, No. 3. (1999), pp. 361-404.
Abstract
In many database applications, one of the common queries is to find approximate matches to a given query item from a collection of data items. For example, given an image database, one may want to retrieve all images that are similar to a given query image. Distance based index structures are proposed for applications where the distance computations between objects of the data domain are expensive (such as high dimensional data), and the distance function used is metric. In this paper, ...
Note (first note only)
MVP-Tree
|
| |
In The VLDB Journal (1997), pp. 426-435.
Abstract
A new access method, called M-tree, is proposed to organize and search large data sets from a generic "metric space", i.e. where object proximity is only defined by a distance function satisfying the positivity, symmetry, and triangle inequality postulates. We detail algorithms for insertion of objects and split management, which keep the M-tree always balanced - several heuristic split alternatives are considered and experimentally evaluated. Algorithms for similarity (range and k-nearest neighbors) queries are also described. Results from extensive experimentation with ...
|
| |
In VLDB'96 (1996), pp. 28-39.
Abstract
In this paper, we propose a new method for indexing large amounts of point and spatial data in high-dimensional space. An analysis shows that index structures such as the R*-tree are not adequate for indexing high-dimensional data sets. The major problem of R-tree-based index structures is the overlap of the bounding boxes in the directory, which increases with growing dimension. To avoid this problem, we introduce a new organization of the directory which uses a split algorithm minimizing overlap and the ...
|
| |
In Storage and Retrieval for Image and Video Databases (SPIE) (1996), pp. 62-73.
Abstract
Efficient indexing support is essential to allow content-based image and video databases using similaritybased retrieval to scale to large databases (tens of thousands up to millions of images). In this paper, we take an in depth look at this problem. One of the major difficulties in solving this problem is the high dimension (6-100) of the feature vectors that are used to represent objects. We provide an overview of the work in computational geometry on this problem and highlight the results... ...
Note (first note only)
VAMSplit R-tree
|
| |
In ICDE '96: Proceedings of the Twelfth International Conference on Data Engineering (1996), pp. 516-523.
|
| |
The VLDB Journal, Vol. 3, No. 4. (October 1994), pp. 517-542.
|
| |
In SODA '93: Proceedings of the fourth annual ACM-SIAM Symposium on Discrete algorithms (1993), pp. 311-321.
Abstract
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references. ...
Note (first note only)
VP-Tree
|
| |
SIGMOD Rec. In SIGMOD '90: Proceedings of the 1990 ACM SIGMOD international conference on Management of data, Vol. 19, No. 2. (June 1990), pp. 322-331.
Abstract
The R-tree, one of the most popular access methods for rectangles, is based on the heuristic optimization of the area of the enclosing rectangle in each inner node. By running numerous experiments in a standardized testbed under highly varying data, queries and operations, we were able to design the R * -tree which incorporates a combined optimization of area, margin and overlap of each enclosing rectangle in the directory. Using our standardized testbed in an exhaustive performance comparison, it turned out ...
|
| |
In VLDB '89: Proceedings of the 15th international conference on Very large data bases (1989), pp. 45-53.
|
| |
In The VLDB Journal (1987), pp. 507-518.
Abstract
The problem of indexing multidimensional objects is considered. First, a classification of existing methods is given along with a discussion of the major issues involved in multidimensional data indexing. Second, a variation to Guttman's R-trees (R -trees) that avoids overlapping rectangles in intermediate nodes of the tree is introduced. Algorithms for searching, updating, initial packing and reorganization of the structure are discussed in detail. Finally, we provide analytical results... ...
|
| |
In SIGMOD '84: Proceedings of the 1984 ACM SIGMOD international conference on Management of data, Vol. 14, No. 2. (June 1984), pp. 47-57.
Abstract
In order to handle spatial data efficiently, as required in computer aided design and geo-data applications, a database system needs an index mechanism that will help it retrieve data items quickly according to their spatial locations However, traditional indexing methods are not well suited to data objects of non-zero size located m multi-dimensional spaces In this paper we describe a dynamic index structure called an R-tree which meets this need, and give algorithms for searching and updating it. We present the ...
|
| |
|
| |
In IDEAS '01: Proceedings of the International Database Engineering \& Applications Symposium (2001), pp. 58-67.
|
| |
ACM Comput. Surv., Vol. 24, No. 1. (March 1992), pp. 63-113.
|
| |
In SIGMOD '89: Proceedings of the 1989 ACM SIGMOD international conference on Management of data, Vol. 18, No. 2. (June 1989), pp. 110-121.
|
| |
ACM Trans. Database Syst., Vol. 8, No. 3. (September 1983), pp. 324-353.
|
| |
Nature, Vol. 410 (26 April 2001)
|
| |
In 12th Scandinavian Conference on Image Analysis (1 January 2001)
Abstract
Content-based image retrieval (CBIR) is a new but widely adopted method for finding images from vast and unannotated image databases. In CBIR images are indexed on the basis of low-level features, such as color, texture, and shape, that can automatically be derived from the visual content of the images. The operation of a CBIR system can be seen as a series of more or less independent processing stages. As there exists multiple choices for each of these stages, a multitude of ...
|