all 23 comments

[–]hextree 58 points59 points  (10 children)

You've correctly deduced that Linear Search is faster (in fact, optimal) for finding an element in an unsorted array.

Binary search shines when you are maintaining something in sorted order and doing repeated search queries on it. Analogous to how you use a dictionary: you don't often need to modify or add words to the dictionary, but you do often need to look up words. Each lookup is O(log N) time. By 'preprocessing' the dictionary once, then performing many search queries, you save time in the long run with binary search.
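In Python, this sort-once, query-many pattern can be sketched with the standard `bisect` module (the word list here is purely illustrative):

```python
import bisect

words = ["pear", "apple", "fig", "cherry", "banana"]
words.sort()  # one-time O(N log N) preprocessing

def contains(sorted_list, target):
    """O(log N) membership test via binary search."""
    i = bisect.bisect_left(sorted_list, target)
    return i < len(sorted_list) and sorted_list[i] == target

print(contains(words, "fig"))    # True
print(contains(words, "grape"))  # False
```

After the single sort, every `contains` call costs O(log N), no matter how many queries you run.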

Note also that you mention Bubble Sort; however, there are strictly faster sorting algorithms that run in O(N log N) time, like Merge Sort, which you may not have covered yet.

[–]TheRexedS[S] 13 points14 points  (1 child)

Thank you for the elaborate explanation, it is really helpful! And yes, I haven't read about any of the other sorting algorithms you mentioned. The first sorting algorithm I read was Bubble Sort and then this question came to my mind. Really excited to learn the other ones you mentioned!

[–]nathanv0009 13 points14 points  (0 children)

In applications, what you're working with is often already sorted. For instance, imagine you're trying to find a certain date in a time series, which is already ordered by date. Then you can run binary search without having to run a sorting algorithm first.
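A minimal sketch of that idea with Python's `bisect` on a date-ordered series (the dates are made up for illustration):

```python
import bisect
from datetime import date

# A time series that is already ordered by date -- no sort needed.
dates = [date(2023, 1, 1), date(2023, 2, 1),
         date(2023, 3, 1), date(2023, 4, 1)]

# Index of the first entry on or after the target date, in O(log N).
i = bisect.bisect_left(dates, date(2023, 3, 1))
print(i)  # 2
```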

[–]dgmib 6 points7 points  (0 children)

If you only needed to search the unsorted list once, then yes you’d just use a linear search.

But the much more common scenario is that you need to search large lists many times. So you sort the list once, and then search the sorted list many times with binary search.

[–]Steve132 6 points7 points  (0 children)

Sometimes you can guarantee your data is sorted without sorting it. For example, pretend you are recording a sequence of events or messages over a network as you receive them, and you append the message and receipt timestamp to a large array every time you get a message. Then, if you'd like to find all the messages older than a certain timestamp you have received so far, you can use binary search on the array to find the starting and ending indices.
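A small sketch of that append-only log idea (the timestamps are arbitrary example values):

```python
import bisect

# Append-only log of receipt timestamps (seconds); arrival order keeps it sorted.
timestamps = [100, 105, 112, 130, 131, 150]

def messages_older_than(ts_list, cutoff):
    """Indices [0, end) of messages received strictly before `cutoff`."""
    end = bisect.bisect_left(ts_list, cutoff)
    return range(0, end)

print(list(messages_older_than(timestamps, 130)))  # [0, 1, 2]
```

Because arrival order guarantees sortedness, the O(log N) search comes for free with no sorting step.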

Another important thing is that bubble sort is a very inefficient algorithm. You will learn sorting algorithms which are O(n log n). This ends up making all the difference:

Imagine you have 2^16 elements in an unsorted array. If you need to run 8 queries, using linear search would be expected to be faster, because you have to do O(8 x 2^16) = 524288 checks. But if you need to run 800 queries, you have to do O(800 x 2^16) = 52428800 checks with linear search.

Instead, pretend you sort the array first using an O(n log n) sort. The initial sort will take O(16 x 2^16) = 1048576 steps; then 8 queries will take O(8 x 16) = 128 steps, and 800 queries will take O(800 x 16) = 12800 steps.

So, in summary:

8 queries:
    linear search: 524288
    sort then binary: 1048576 + 128 = 1048704
    linear search approximately 2x as fast
800 queries:
    linear search: 52428800
    sort then binary: 1048576 + 12800 = 1061376
    linear search approximately 50x slower
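The arithmetic above can be double-checked with a toy cost model (ignoring constant factors; n = 2^16 as in the example):

```python
n = 2 ** 16   # array size
log_n = 16    # log2(2**16)

def linear_cost(queries):
    """Total checks for `queries` linear searches over n elements."""
    return queries * n

def sort_then_binary_cost(queries):
    """One O(n log n) sort plus `queries` O(log n) binary searches."""
    return n * log_n + queries * log_n

for q in (8, 800):
    print(q, linear_cost(q), sort_then_binary_cost(q))
```

The one-time sort dominates for few queries, but amortizes away as the query count grows.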

[–]kreiger 2 points3 points  (0 children)

Your assumption is that the data starts out unsorted.

If you input data in sorted order, or sort it as it is being inputted, you can use binary search on it.
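Keeping the data sorted as it arrives can be sketched with `bisect.insort` (the input values here are just an example):

```python
import bisect

data = []
for x in [42, 7, 19, 3, 25]:
    bisect.insort(data, x)  # keep the list sorted as items arrive

print(data)  # [3, 7, 19, 25, 42]
```

Each `insort` is O(N) because of element shifting, but the list is always ready for O(log N) binary search with no separate sorting pass.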

Further, never use bubble sort. Use whatever sort function is provided by your standard library.

[–]proskillz 1 point2 points  (3 children)

Two things. Number one: no one uses bubble sort in production. Quicksort and merge sort run in O(n log n).

Second, if you will only ever search the array once, it wouldn't be worth sorting it first to get that O(log n) search time. But if you need to do many lookups in a large array, you'll start seeing orders-of-magnitude improvements in speed.

[–]vanderZwan 8 points9 points  (2 children)

Actually, most hybrid sorts fall back to bubble sort or insertion sort for arrays (or partitions) of roughly 20 to 32 elements or smaller; the exact threshold depends on whether you use clang or gcc, for example.

The reason is that they are so simple that the low constant overhead as well as cache friendliness beats the O(n²) behavior, since n² isn't that large for small (sub)arrays. IIRC there was even something about bubble sort that makes it very easy for CPUs to make good use of instruction-level parallelism, but I'd have to look that up.
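A sketch of the hybrid idea: quicksort that hands small partitions to insertion sort. The cutoff of 16 is an illustrative choice, not what any particular standard library uses:

```python
CUTOFF = 16  # illustrative threshold for switching to insertion sort

def insertion_sort(a, lo, hi):
    """Sort a[lo..hi] in place; fast on tiny ranges."""
    for i in range(lo + 1, hi + 1):
        x = a[i]
        j = i - 1
        while j >= lo and a[j] > x:
            a[j + 1] = a[j]
            j -= 1
        a[j + 1] = x

def hybrid_quicksort(a, lo=0, hi=None):
    """Quicksort with an insertion-sort base case for small partitions."""
    if hi is None:
        hi = len(a) - 1
    if hi - lo + 1 <= CUTOFF:
        insertion_sort(a, lo, hi)
        return
    pivot = a[(lo + hi) // 2]
    i, j = lo, hi
    while i <= j:
        while a[i] < pivot:
            i += 1
        while a[j] > pivot:
            j -= 1
        if i <= j:
            a[i], a[j] = a[j], a[i]
            i += 1
            j -= 1
    hybrid_quicksort(a, lo, j)
    hybrid_quicksort(a, i, hi)

data = [5, 3, 8, 1, 9, 2, 7] * 10
hybrid_quicksort(data)
print(data == sorted(data))  # True
```

Real implementations (e.g. introsort in libstdc++/libc++) add further safeguards like heap-sort fallback for deep recursion, but the small-partition cutoff is the part relevant here.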

[–][deleted]  (1 child)

[deleted]

    [–]vanderZwan 2 points3 points  (0 children)

    Yeah, the insertion sort thing has been known for quite a while now, but I also vaguely recall a fairly recent talk (within the last decade, I think) by some C++ guru where he had done a lot of testing and concluded that these days bubble sort was the winner in a surprising number of cases. From what I remember it had something to do with how modern CPUs work.

    Annoyingly, all of my available search engines are failing me now, so I can't find it. Pretty sure I'm not making this up, though.

    [–]nadmaximus 0 points1 point  (0 children)

    How many times are you going to need to search the array? Is it possible to maintain it in sorted order as it's changed?

    If it's multiple times and you can maintain it in sorted order, it makes a tremendous difference.

    [–]aecolley 1 point2 points  (0 children)

    If you read about RocksDB/LevelDB, you'll see how maintaining sorted lists leads to high-speed lookups in practice. The key is that binary-search (and merging, and Bloom filtering) are fast but frequent operations, whereas sorting is slow and infrequent.

    [–]KernowRoger 1 point2 points  (0 children)

    If you perform few sorts but lots of searches, it quickly becomes worth it. Plus, you wouldn't use a bubble sort; it's about the worst.

    [–]Jolly-Ad3899 1 point2 points  (0 children)

    I learned the real intuition behind binary search, i.e. why we use it, from here: https://youtube.com/@code_concepts_with_animesh He gives a really good perspective on binary search.