ATTENTION: The software behind KU ScholarWorks is being upgraded to a new version. Starting July 15th, users will not be able to log in to the system, add items, nor make any changes until the new version is in place at the end of July. Searching for articles and opening files will continue to work while the system is being updated. If you have any questions, please contact Marianne Reed at .

Show simple item record

dc.contributor.authorGupta, Ankur
dc.contributor.authorHon, Wing-Kai
dc.contributor.authorShah, Rahul
dc.contributor.authorVitter, Jeffrey Scott
dc.identifier.citationA. Gupta, W.-K. Hon, R. Shah, and J. S. Vitter. “Compressed Data Structures: Dictionaries and the Data-Aware Measures,” Theoretical Computer Science, 387(3), November 2007, 313–331. An extended abstract appears in Proceedings of the 2006 IEEE Data Compression Conference (DCC ’06), Snowbird, UT, March 2006, 213–222.
dc.description.abstractWe propose measures for compressed data structures, in which space usage is mea- sured in a data-aware manner. In particular, we consider the fundamental dictionary problem on set data, where the task is to construct a data structure to represent a set S of n items out of a universe U = f0; : : : ; u 􀀀 1g and support various queries on S. We use a well-known data-aware measure for set data called gap to bound the space of our data structures. We describe a novel dictionary structure taking gap+O(n log(u=n)= log n)+O(n log log(u=n)) bits. Under the RAM model, our dictionary supports membership, rank, select, and prede- cessor queries in nearly optimal time, matching the time bound of Andersson and Thorup's predecessor structure [AT00], while simultaneously improving upon their space usage. Our dictionary structure uses exactly gap bits in the leading term (i.e., the constant factor is 1) and answers queries in near-optimal time. When seen from the worst case perspective, we present the rst O(n log(u=n))-bit dictionary structure which supports these queries in near- optimal time under RAM model. We also build a dictionary which requires the same space and supports membership, select, and partial rank queries even more quickly in O(log log n) time. To the best of our knowledge, this is the rst of a kind result which achieves data-aware space usage and retains near-optimal time.
dc.titleCompressed Data Structures: Dictionaries and the Data-Aware Measures
kusw.kuauthorVitter, Jeffrey Scott
kusw.oaversionScholarly/refereed, author accepted manuscript
kusw.oapolicyThis item meets KU Open Access policy criteria.

Files in this item


This item appears in the following Collection(s)

Show simple item record