DDR: an index method for large time-series datasets

No Thumbnail Available
File version
Author(s)
An, Jiyuan
Chen, Yi-Ping Phoebe
Chen, Hanxiong
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
2005
Size
File type(s)
Location
License
Abstract

The tree index structure is a traditional method for searching similar data in large datasets. It is based on the presupposition that most sub-trees are pruned in the searching process. As a result, the number of page accesses is reduced. However, time-series datasets generally have a very high dimensionality. Because of the so-called dimensionality curse, the pruning effectiveness is reduced in high dimensionality. Consequently, the tree index structure is not a suitable method for time-series datasets. In this paper, we propose a two-phase (filtering and refinement) method for searching time-series datasets. In the filtering step, a quantizing time-series is used to construct a compact file which is scanned for filtering out irrelevant. A small set of candidates is translated to the second step for refinement. In this step, we introduce an effective index compression method named grid-based datawise dimensionality reduction (DRR) which attempts to preserve the characteristics of the time-series. An experimental comparison with existing techniques demonstrates the utility of our approach.

Journal Title

Information Systems

Conference Title
Book Title
Edition
Volume

30

Issue

5

Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement
Item Access Status
Note
Access the data
Related item(s)
Subject

Data Structures

Information Systems

Persistent link to this record
Citation
Collections