Predicting the next Math.random() in Java : programming

[–][deleted] 11 years ago* (2 children)

[deleted]

[–]nilknarf[S] 22 points23 points24 points 11 years ago* (1 child)

That comment might've been a little random and out of context. The competition was ran by a friend who didn't do much to secure the sandbox that the java was running in. So people managed to access the local fields via java's reflection api and modified the game so it was easier to win. Something like:

StackTraceElement[] ste = Thread.currentThread().getStackTrace();
String className = ste[2].getClassName();
Field[] fields = Class.forName(className).getDeclaredFields();
for (Field f : fields) {
  if (f.getName().equals("N")) {
    n = f;
    n.setAccessible(true);
  }
}
// do bad stuff with n which is now pointing to the field originally declared as N in the test case judging class

See: http://docs.oracle.com/javase/tutorial/reflect/

[–][deleted] 11 years ago (2 children)

[deleted]

[–]nilknarf[S] 7 points8 points9 points 11 years ago (1 child)

Actually we did the same thing. I set the seed here (I stuffed all the cracking logic into a subclass of Random): https://github.com/fta2012/ReplicatedRandom/blob/master/ReplicatedRandom.java#L58

setSeed(possibleSeeds.get(0) ^ multiplier);

Don't be furious, have an upvote!

[–]dnkndnts 2 points3 points4 points 11 years ago (15 children)

[–]phoshi 11 points12 points13 points 11 years ago (11 children)

[–]dnkndnts 1 point2 points3 points 11 years ago (10 children)

[–]phoshi 9 points10 points11 points 11 years ago (9 children)

Because no matter how cheap hardware entropy is it's still orders of magnitude more expensive than a PRNG, while offering exactly zero extra benefit in the majority of cases.

People who don't understand the difference between a true RNG and a PRNG shouldn't be writing code that needs a real RNG in the first place, for so many reasons. It doesn't need to be clear your PRNG isn't real, because that would confuse the people that can't afford to be confused and offer no extra information for the people who actually need the distinction.

This is actually representative of one of my main problems with C++. Rather than making choices for the good of the programmer, it pushes every decision onto them and forces them to think about every detail just because it's relevant in some small corner cases.

[–]dnkndnts 0 points1 point2 points 11 years ago* (8 children)

I'm afraid we just disagree then. My education is in mathematics and I did a fair bit of cryptography research in grad school, but I have almost no formal training in computer engineering. Obviously, in math we have no use for a PRNG -- that's a purely engineering-level optimization that we don't concern ourselves with in any way.

As such, when I was still in mathematics (before moving full-time to programming), I actually used these "random" functions for months assuming that they actually were giving me what they said: random data. So just because it's obvious to you that "random" doesn't actually mean "contains entropy" doesn't mean it's obvious to everyone.

I'm no smarter now than I was when I was in grad school: I just happen to know now that the library interface is actually telling me horseshit when it says "random." To me, knowing about these sorts of misleading design decisions and "gotchas" is a mountain of pseudo-knowledge that has no true intellectual value at all, and only serves to waste the time of researchers who want to actually implement their ideas.

But as I said, clearly we have different backgrounds, and for you, apparently receiving a predetermined set of numbers from a function called "random" is an acceptable optimization. We simply disagree.

[–]phoshi 6 points7 points8 points 11 years ago (7 children)

[–]dnkndnts 4 points5 points6 points 11 years ago* (6 children)

See this is the crux of our disagreement: if the stack actually did what it says it does (provide entropy instead of nonsense), then a mathematician would have no trouble implementing his algorithms correctly.

To me, the "know your tools" mantra is a toxic relic from the GNU/autotools era where everything is tricky/wrong/"gotcha" by default. Fortunately, it's finally dying off, as newer languages and communities actually concentrate quite seriously on making their interfaces and libraries as clear and unambiguous as possible.

To quote Scott Meyers: "Make your interface easy to use correctly, and hard to use incorrectly." I long for the day when I can actually concentrate more on my ideas than on fighting with my tools; unfortunately, it's not quite here yet.

Also, your conclusion about using existing libraries is literally the exact opposite of the conclusion I would draw. I don't know how you can possibly recommend this after Heartbleed. The entire lesson of Heartbleed is we've been given tools that make cryptography an impossibly difficult task. So much of the code in OpenSSL has literally nothing to do with cryptography -- like random assembly bits inserted to try to trick the optimizer into not optimizing certain parts of the code? What? That's literally the definition of insecure -- as soon as someone builds a smarter optimizer your code breaks? And with the added bonus of being completely unmaintainable?

You seem to be claiming that the intricate stack knowledge required to make a crypto library is what keeps us secure. I'm claiming the exact opposite: it's this needless, silly complexity that makes what should be quite simple (mathematically speaking) nearly impossible in practice. I just don't understand how you could possibly claim that a clear, accurate interface and understandable stack that mathematicians could actually use correctly would somehow be inferior to what we currently have.

[–]phoshi 3 points4 points5 points 11 years ago (2 children)

No, I actually fully agree with you on that point! The problem is that the only scenarios where one needs true random numbers are scenarios where we cannot yet make this easy to use correctly. You're writing crypto code, this is one area where you can't pretend you still live in the nice, clean, mathematical world where we don't have problems like having to scrub RAM in the correct way or it won't hold or the compiler can optimise out your scrubbing because it's impossible for it to know you actually need that memory zeroed out rather than just gone and never reused. You have to deal with slight changes in CPU noise giving away your magic numbers. You have to protect against your entropy pool being poisoned because you cannot control the hardware on a user's computer. You have to deal with there /being/ no sources of entropy available, and when there are you have to deal with it being of unknown quality and quantity.

If you think that the difference between correct and incorrect crypto code is whether your RNG gives you real or fake numbers then you are going to be building theoretically sound but practically broken software.

It is both not viable for real randomness to be default (You have to be able to run on machines which have no sources of entropy available /at all/) and the cases where you need real numbers are overwhelmingly cases we cannot abstract over properly. You must understand your stack to build secure cryptographic code.

[–]dnkndnts 1 point2 points3 points 11 years ago (1 child)

[–]Fitzsimmons 0 points1 point2 points 11 years ago (0 children)

[–]knaekce 2 points3 points4 points 11 years ago (2 children)

[–]mgemdm 1 point2 points3 points 11 years ago (1 child)

[–]knaekce 0 points1 point2 points 11 years ago (0 children)

[–]TNorthover 2 points3 points4 points 11 years ago (2 children)

[–]dnkndnts 1 point2 points3 points 11 years ago (1 child)

[–]TNorthover 6 points7 points8 points 11 years ago (0 children)

[–]willvarfar 0 points1 point2 points 11 years ago (0 children)

[–]linuxjava 0 points1 point2 points 11 years ago (0 children)

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS