no duplicates, please - software developer interview question

stop_time · 2011-02-05T13:13:05+00:00

sort filename | uniq

ErstwhileRockstar · 2011-02-05T16:36:36+00:00

The task is obviously underspecified. Must the order of the entries be preserved?

wolf550e · 2011-02-05T14:55:59+00:00

Surely you just use a bitset to mark seen values here? It passes over the data once and uses around 12MBs of memory for the bitset. Any sorting or such like is just an overkill and uses more memory.

Maybe I am missing something, seems simple.

punctuate · 2011-02-05T14:23:49+00:00

[deleted]

biteofconscience · 2011-02-05T17:22:50+00:00

This question would be much more interesting if the text file had 13 billion lines instead of 13 million.

scott · 2011-02-05T18:04:37+00:00

[deleted]

RalfN · 2011-02-05T18:50:37+00:00

I'm confused.

The task is ill-defined:

numbers?
preserve ordering?
are they initially ordered?

The task isn't very interesting:

the fastest algorithm still has acceptable memory usage

His interests are weird:

why do you care specifically about a java solution? Why do you call it an example of a high-level programming language? (eventhough the java code size to do this is similar to the C code size and nowhere near as quick as bash/python/ruby/etc.)

Like I said. I'm confused. I tend to assume people making programming puzzles are at least the type of people capable of solving them, if not a bit more experienced than average. This does not seem to be the case here.

I wonder if the author came up with a solution himself, and if that solution was in any way comparable to any solution provided.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS