With random pivot selection, quicksort runs in expected O(n log n) time for all inputs. (The expectation is over the random numbers used to pick the pivots, and doesn't involve the input at all.) Algorithms like this are called "Las Vegas" algorithms.
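To make that concrete, here's a minimal sketch of Las Vegas randomized quicksort (the function name and the list-comprehension style are just illustrative): the output is always correctly sorted; only the running time depends on the random pivot choices.

```python
import random

def quicksort(xs):
    # Las Vegas: correctness never depends on the coin flips,
    # only the running time does.
    if len(xs) <= 1:
        return list(xs)
    pivot = random.choice(xs)  # the expectation is over this choice, not the input
    less = [x for x in xs if x < pivot]
    equal = [x for x in xs if x == pivot]
    greater = [x for x in xs if x > pivot]
    return quicksort(less) + equal + quicksort(greater)
```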
Yes. And if your adversarial input generator is good enough, it should work even in that case. (Of course, any input is "the worst case" if all inputs have the same expected time.)
Ahh, ok: it takes O(n log n) total to do all the selections, but each individual call is only O(l) in the length l of the partition being split. I was thinking it was the same n on every call, rather than the same total n at each depth. Err, nm, it's hard to put into words.
PS: It still doubles the running time, but constant factors don't matter in big-O notation.
You can get away with less than the median. For example, a pivot guaranteed to lie between the 25th and 75th percentiles is good enough, and one can be found faster.
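A cheap heuristic in this spirit is median-of-three (this is a sketch, not the guaranteed-percentile method the comment alludes to): the median of three random elements lands in the middle half with probability 11/16 on a random permutation, versus 1/2 for a single random pick.

```python
import random

def median_of_three(xs):
    # Heuristic pivot: median of three randomly chosen elements.
    # Not guaranteed to be in the 25th-75th percentile range,
    # but it's there more often than a single random element.
    a, b, c = random.choice(xs), random.choice(xs), random.choice(xs)
    return sorted([a, b, c])[1]
```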
Personally, I really like merge sort. It makes few assumptions, it's stable, it works well with sorted inputs, it can be split across multiple CPUs, and it works well with sequential input streams. Radix sort tends to be faster, especially for strings, but it doesn't work for many data structures without adapting the algorithm. E.g., radix sorting doubles requires bit manipulation of the data.
Also, one can always select the median in (deterministic) linear time, so quicksort need never degenerate to O(n^2).
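The deterministic linear-time selection being referred to is the median-of-medians (BFPRT) algorithm; here's a compact sketch of it. Using it to pick quicksort's pivot bounds the worst case at O(n log n), though the constant factors make it slow in practice.

```python
def select(xs, k):
    # Median-of-medians: deterministic linear-time selection of the
    # k-th smallest element (0-indexed).
    if len(xs) <= 5:
        return sorted(xs)[k]
    # Median of each group of 5, then recurse for the median of those medians.
    medians = [sorted(xs[i:i + 5])[len(xs[i:i + 5]) // 2]
               for i in range(0, len(xs), 5)]
    pivot = select(medians, len(medians) // 2)
    # Pivot is guaranteed to be between the 30th and 70th percentiles,
    # which is what makes the recursion linear overall.
    lows = [x for x in xs if x < pivot]
    pivots = [x for x in xs if x == pivot]
    if k < len(lows):
        return select(lows, k)
    if k < len(lows) + len(pivots):
        return pivot
    highs = [x for x in xs if x > pivot]
    return select(highs, k - len(lows) - len(pivots))
```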