Hacker News new | past | comments | ask | show | jobs | submit login

This kind of almost sorted data is easily synthesised with any degree of already-sorted-ness.



Certainly! But then how do you decide what degree of already-sorted-ness fairly represents real world data?


You don’t have to agree on any one degree. You can set up a giant matrix with lengths on one axis, and the sortedness on the other.

Each cell can have a color indicating if TimSort beats, say, some other hybrid of MergeSort; green shades suggest TimSort is winning, red shades suggest the contender is winning.

For each cell, do multiple runs with those sortedness/length parameters and pick the average.


By collecting a lot of real world data?




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: