Predecessor problem

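In computer science, the predecessor problem is the problem of maintaining a set of items so that, given an element, one can efficiently find the element that precedes or succeeds it in sorted order. Data structures used to solve the problem include balanced binary search trees, van Emde Boas trees, and fusion trees. In the static predecessor problem, the set of elements does not change, but in the dynamic predecessor problem, insertions into and deletions from the set are allowed.^[1]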

The predecessor problem is a simple case of the nearest neighbor problem, and data structures that solve it have applications in problems like integer sorting.

Definition

The problem consists of maintaining a set $S$ drawn from a universe of $U$ integers. Each of these integers can be stored in a machine word of size $w$, implying that $U\leq 2^{w}$. Data structures that solve the problem support these operations:^[2]

predecessor(x), which returns the largest element in $S$ less than or equal to $x$
successor(x), which returns the smallest element in $S$ greater than or equal to $x$

In addition, data structures which solve the dynamic version of the problem also support these operations:

insert(x), which adds $x$ to the set $S$
delete(x), which removes $x$ from the set $S$
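
As a minimal sketch of this interface (not one of the data structures discussed in this article), the four operations can be implemented on a sorted Python list with the standard `bisect` module. The class name `SortedSetPredecessor` is an assumption made for illustration. Queries take $O(\log n)$ time by binary search, but updates take $O(n)$ time because list elements must be shifted.

```python
from bisect import bisect_left, bisect_right


class SortedSetPredecessor:
    """Illustrative baseline: a set of integers kept as a sorted list."""

    def __init__(self, items=()):
        self._keys = sorted(set(items))

    def predecessor(self, x):
        """Return the largest element <= x, or None if there is none."""
        i = bisect_right(self._keys, x)  # number of keys <= x
        return self._keys[i - 1] if i > 0 else None

    def successor(self, x):
        """Return the smallest element >= x, or None if there is none."""
        i = bisect_left(self._keys, x)  # index of the first key >= x
        return self._keys[i] if i < len(self._keys) else None

    def insert(self, x):
        """Add x to the set (no effect if x is already present)."""
        i = bisect_left(self._keys, x)
        if i == len(self._keys) or self._keys[i] != x:
            self._keys.insert(i, x)

    def delete(self, x):
        """Remove x from the set (no effect if x is absent)."""
        i = bisect_left(self._keys, x)
        if i < len(self._keys) and self._keys[i] == x:
            del self._keys[i]
```

For example, with the set {3, 9, 14}, `predecessor(10)` returns 9 and `successor(10)` returns 14, while `predecessor(2)` returns `None` because no element is less than or equal to 2.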

The problem is typically analyzed in a transdichotomous model of computation such as word RAM.

Data structures

One simple solution to this problem is to use a balanced binary search tree, which achieves a running time of $O(\log n)$ for predecessor queries. The Van Emde Boas tree achieves a query time of $O(\log \log U)$, but requires $O(U)$ space.^[1] Dan Willard proposed an improvement on this space usage with the x-fast trie, which requires $O(n\log U)$ space and the same query time, and the more complicated y-fast trie, which only requires $O(n)$ space.^[3] Fusion trees, introduced by Michael Fredman and Willard, achieve $O(\log _{w}n)$ query time and $O(n)$ space for predecessor queries for the static problem.^[4] The dynamic problem has been solved using exponential trees with $O(\log _{w}n+\log \log n)$ query time,^[5] and with expected time $O(\log _{w}n)$ using hashing.^[6]
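
The idea behind the x-fast trie's $O(\log \log U)$ query time can be sketched as a binary search over prefix lengths. The code below is a simplified, static illustration rather than Willard's full structure. The class name `XFastSketch` is hypothetical. Storing the minimum and maximum key under each prefix, and keeping a sorted array with a rank table, are assumptions that stand in for the descendant pointers and linked list of the real trie. Queries are assumed to satisfy $0 \leq x < 2^{w}$.

```python
class XFastSketch:
    """Static, simplified x-fast-trie-style index over w-bit integers."""

    def __init__(self, keys, w):
        self.w = w
        self.keys = sorted(set(keys))
        # Position of each key in sorted order (stands in for a linked list).
        self.rank = {k: i for i, k in enumerate(self.keys)}
        # levels[d] maps every d-bit prefix that occurs in the set to the
        # (smallest, largest) key having that prefix: O(n * w) entries.
        self.levels = [dict() for _ in range(w + 1)]
        for k in self.keys:
            for d in range(w + 1):
                p = k >> (w - d)
                lo, hi = self.levels[d].get(p, (k, k))
                self.levels[d][p] = (min(lo, k), max(hi, k))

    def predecessor(self, x):
        """Return the largest key <= x, or None if there is none."""
        if not self.keys:
            return None
        # Binary search for the longest prefix of x present in the trie.
        # Presence is monotone in the prefix length, so O(log w) lookups
        # suffice; the empty prefix (length 0) is always present.
        lo, hi = 0, self.w
        while lo < hi:
            mid = (lo + hi + 1) // 2
            if (x >> (self.w - mid)) in self.levels[mid]:
                lo = mid
            else:
                hi = mid - 1
        if lo == self.w:
            return x  # x itself is in the set
        sub_min, sub_max = self.levels[lo][x >> (self.w - lo)]
        next_bit = (x >> (self.w - lo - 1)) & 1
        if next_bit == 1:
            # Every key under this node branches to 0 where x branches to 1,
            # so all of them are smaller than x; the largest is the answer.
            return sub_max
        # Otherwise every key under this node is larger than x, so sub_min is
        # the successor of x and the answer is its left neighbour.
        i = self.rank[sub_min]
        return self.keys[i - 1] if i > 0 else None
```

With keys {3, 9, 14} and `w = 4`, `predecessor(10)` returns 9 and `predecessor(12)` also returns 9, found with a number of hash lookups logarithmic in `w`. The table of prefixes is what accounts for the $O(n\log U)$ space.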

Mathematical properties
There have been a number of papers proving lower bounds on the predecessor problem. For example, it has been shown that for all values of $w$, there exists a value of $n$ for which the query time is (in Big Omega notation) $\Omega \left({\tfrac {\log w}{\log \log w}}\right)$ , and similarly, for all values of $n$ , there exists a value of $w$ such that the query time is $\Omega \left({\sqrt {\tfrac {\log n}{\log \log n}}}\right)$ .^[1] Other proofs of lower bounds use the notion of communication complexity.
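
To get a feel for how slowly these two lower-bound expressions grow, the following sketch evaluates them for a few sample parameters. The function names are hypothetical, and base-2 logarithms are an assumption; the $\Omega$ notation hides constant factors, so the numbers indicate growth rates only, not actual probe counts.

```python
import math


def lower_bound_in_w(w):
    """Evaluate log w / log log w (base-2 logs assumed), for w > 2."""
    return math.log2(w) / math.log2(math.log2(w))


def lower_bound_in_n(n):
    """Evaluate sqrt(log n / log log n) (base-2 logs assumed), for n > 2."""
    return math.sqrt(math.log2(n) / math.log2(math.log2(n)))


if __name__ == "__main__":
    for w in (16, 64, 1024):
        print(f"w = {w}: log w / log log w = {lower_bound_in_w(w):.3f}")
    for exponent in (10, 20, 40):
        n = 2 ** exponent
        print(f"n = 2^{exponent}: sqrt(log n / log log n) = {lower_bound_in_n(n):.3f}")
```

For $w = 64$ the first expression is $6/\log_2 6 \approx 2.32$, and for $n = 2^{20}$ the second is $\sqrt{20/\log_2 20} \approx 2.15$.
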
For the static predecessor problem, Mihai Pătrașcu and Mikkel Thorup showed the following lower bound for the optimal search time, in the cell-probe model:^[7]
$\Omega \left(\min \left\{\log _{w}n,\ \lg {\frac {\ell -\lg n}{a}},\ {\frac {\lg {\frac {\ell }{a}}}{\lg \left({\frac {a}{\lg n}}\cdot \lg {\frac {\ell }{a}}\right)}},\ {\frac {\lg {\frac {\ell }{a}}}{\lg \left(\lg {\frac {\ell }{a}}{\big /}\lg {\frac {\lg n}{a}}\right)}}\right\}\right)$
where the RAM has word length $w$ , the set contains $n$ integers of $\ell$ bits each and is represented in the RAM using $S$ words of space (here $S$ denotes the space used, not the set itself), and $a=\lg {\frac {S}{n}}+\lg w$ .
In the case where $w=\ell =\gamma \lg n$ for $\gamma >1$ and $S=n\cdot \lg ^{O(1)}n$ , the optimal search time is $\Theta (\lg \ell )$ , and the van Emde Boas tree achieves this bound.^[7]

See also
Integer sorting
y-fast trie
Fusion tree
References

[1] S2CID 1991980.

[2] Rahman, Naila; Cole, Richard; Raman, Rajeev (17 August 2001). Optimized Predecessor Data Structures for Internal Memory (PDF). International Workshop on Algorithm Engineering. pp. 67–78.

[3] doi:10.1016/0020-0190(83)90075-3.

[4] Fredman, Michael; Willard, Dan (1990). "Blasting through the information theoretic barrier with fusion trees". Symposium on Theory of Computing: 1–7.

[5] S2CID 8175703.

[6] MR 1469229.

[7] S2CID 1232.
