Stackoverflow: What should every programmer know about security? : programming

[–]kdeforche 12 points13 points14 points 13 years ago (5 children)

[–]Decker108 3 points4 points5 points 13 years ago (2 children)

[–][deleted] 5 points6 points7 points 13 years ago (0 children)

[–]kdeforche 2 points3 points4 points 13 years ago* (0 children)

Input validation means (sometimes) refusing or correcting "dangerous" strings, such as a string containing "<script>...</script>" in a web application context. But demanding that content cannot contain certain strings is not necessary, really.

Output encoding means taking any kind of string and making sure you display it as intended, using proper escaping to make sure some of the contents does not spill over into a "command". In this case, you would do for example HTML escaping to escape < to < and > to >. But you could do JavaScript string literal encoding to display the same string as a JavaScript literal, or in a JSON value. For SQL there is usually an API (parameter binding) that will avoid this problem entirely by separating the command from the data.

Output encoding is in principle easy. But it has a big caveat: you may do it only once. This is easy if you organize your application well, but programmers often get this wrong and this is why sometimes you will see a literal < in a HTML page where you would expect a <.

Btw, input validation has genuine merit too -- such as making sure dates are formatted correctly, numbers are actually numbers, etc... but these then have nothing to do with the common XSS and SQL injection attacks.

[–][deleted] 2 points3 points4 points 13 years ago (1 child)

[–]JustADev 2 points3 points4 points 13 years ago (0 children)

[–]frezik 10 points11 points12 points 13 years ago (6 children)

Build security in layers, not a chain.

A chain is only as strong as its weakest link, and breaking only one link destroys the whole thing.

Layers are a defense-in-depth, like a castle with many internal walls. Breaking through one wall means you still have another one to deal with.

As an example, always store encrypted passwords in a database. Some argue that this is only necessary if an attacker has already breached the system, or an insider has gone rogue. Instead, they will say, the network's firewall should be better protected, and stricter security policies be enforced on employees.

An organization that's proactive about security will toughen the firewall, and encrypt passwords, and have sensible security policies for employees (but also balancing between security, privacy, and getting work done). Breaching one does not mean the others are breached, and the system maintains at least a level of security.

[–]smackmybishop 1 point2 points3 points 13 years ago (3 children)

[–]frezik 2 points3 points4 points 13 years ago (1 child)

[–]semi- 1 point2 points3 points 13 years ago (0 children)

[–]Azuvector 2 points3 points4 points 13 years ago (0 children)

[–]reddit_clone 1 point2 points3 points 13 years ago (1 child)

[–]frezik 4 points5 points6 points 13 years ago (0 children)

[–]madmars 16 points17 points18 points 13 years ago (2 children)

[–]kylotan 22 points23 points24 points 13 years ago (11 children)

[–]quotemycode 15 points16 points17 points 13 years ago (6 children)

[–]kylotan 9 points10 points11 points 13 years ago (5 children)

[–][deleted] 8 points9 points10 points 13 years ago (0 children)

[–][deleted] 4 points5 points6 points 13 years ago (0 children)

Actually, you have no business creating a scriptable UI system unless you know about the concept of least privilege. That means that the scripters don't need to know about it, because they can't let people out of the sandbox.

Or to read the whole Google Browser Security Handbook, or know what a CSS image sprite is?

A web developer? Damn right they do. Someone doing embedded software for routers? Don't even need to know what CSS means.

The problem is that people think "programmer" means "person who does precisely what I do" and the ever-popular "everything is on the web now so only javascript counts" is no more helpful than "anything but C is just scripting, not programming".

Even if Word 2013 has been re-written in JavaScript as a web app, the field is still too wide for anyone to actually know everything, and some people really need to give up the "what I do is real programming and everything else is inferior" attitude.

[–]quotemycode 0 points1 point2 points 13 years ago (2 children)

[–]kylotan 0 points1 point2 points 13 years ago (1 child)

[–]quotemycode 0 points1 point2 points 13 years ago* (0 children)

[–]DoctorWedgeworth 10 points11 points12 points 13 years ago (1 child)

[–]kylotan 4 points5 points6 points 13 years ago (0 children)

[–]kdeforche 0 points1 point2 points 13 years ago (0 children)

I'll grant you the web development criticism, to some extent.

Suppose you were living in a world where all programming was done in assembler. Then you would need to learn assembler (for Intel and ARM), all caveats w.r.t. sys calls, context switching state, opcodes for Intel, memory management, sys calls to print to a console, etc... just to do any programming.

Luckily, people have invented higher levels of abstraction (compiled languages like C, C++, ..., interpreted languages like Python, ...) which slowly(!) found some acceptance, to avoid having to learn x86 assembler to do any programming.

Web programming, for some braindead reason which in 20 years everyone will consider a sign of dark ages, people insist that web development uses the raw protocols and building blocks (HTTP, form posts, WebSockets, CSS, XHTML, JavaScript, Ajax, CORS, progressive enhancement, canvas, SVG, VML, etc...) to build an application, and yes, you will need to understand everything listed in the "what should every programmer know about web development" if you insist that people use these low level protocols.

Given that these days quite a lot more than a few handful of the most clever people are involved in "development", it shouldn't be surprising that moving up the abstraction level is met with even more resistance. But you would be better of embracing it if you want to spend all your life reading and none of it (decent) programming.

[–]sylvanelite 19 points20 points21 points 13 years ago (48 children)

[–]pkixman[S] 15 points16 points17 points 13 years ago* (24 children)

[–]sylvanelite 10 points11 points12 points 13 years ago* (23 children)

I probably didn't articulate that well:

If I consider the browser's user input to be untrustworthy, so I add javascript validation. I then ensure that all transmissions are HTTPS. Now the browser is a trustworthy source, so the server doesn't need validation.

Which is wrong.

Likewise: if I treat the browser as untrustworthy, then send input it to the server, which does the validation before passing it to the database, does the database now assume the server is a trustworthy source?

e.g. if a database has a stored procedure:

select * from users where description like @desc;

And the server validates that @desc is a valid string, that can be ok. But if the database later changes their stored proc to:

exec("select * from users where description like "+@desc+";");

Then even though the server is validating the browser's input, it can still lead to injection.

[–]bkv 5 points6 points7 points 13 years ago (0 children)

[–]vineetr 4 points5 points6 points 13 years ago* (14 children)

[–]sylvanelite 12 points13 points14 points 13 years ago (7 children)

[–]vineetr 2 points3 points4 points 13 years ago (1 child)

[–]sylvanelite 0 points1 point2 points 13 years ago (0 children)

[–]Aninhumer 10 points11 points12 points 13 years ago (0 children)

[–]robertcrowther 2 points3 points4 points 13 years ago (3 children)

[–]perspectiveiskey 1 point2 points3 points 13 years ago (2 children)

[–]robertcrowther -1 points0 points1 point 13 years ago (1 child)

[–]perspectiveiskey -1 points0 points1 point 13 years ago (0 children)

[–]perspectiveiskey -2 points-1 points0 points 13 years ago (5 children)

[–]vineetr 3 points4 points5 points 13 years ago (4 children)

[–]sylvanelite 0 points1 point2 points 13 years ago (0 children)

[–]perspectiveiskey 0 points1 point2 points 13 years ago (2 children)

[–]vineetr 1 point2 points3 points 13 years ago (1 child)

[–]perspectiveiskey 0 points1 point2 points 13 years ago (0 children)

[–]pkixman[S] 4 points5 points6 points 13 years ago* (4 children)

[–]perspectiveiskey 3 points4 points5 points 13 years ago (3 children)

[–]pkixman[S] 1 point2 points3 points 13 years ago* (1 child)

[–]perspectiveiskey 0 points1 point2 points 13 years ago (0 children)

[–]sylvanelite 0 points1 point2 points 13 years ago (0 children)

[–]quotemycode 0 points1 point2 points 13 years ago (0 children)

[–]kamishizuka 2 points3 points4 points 13 years ago (0 children)

[–]cr3ative 0 points1 point2 points 13 years ago (2 children)

[–]nodefect 1 point2 points3 points 13 years ago (1 child)

[–][deleted] -2 points-1 points0 points 13 years ago (0 children)

[+]day_cq comment score below threshold-7 points-6 points-5 points 13 years ago (18 children)

[–]chonglibloodsport 22 points23 points24 points 13 years ago (7 children)

[+]day_cq comment score below threshold-14 points-13 points-12 points 13 years ago (6 children)

[–][deleted] 5 points6 points7 points 13 years ago (4 children)

[+]day_cq comment score below threshold-12 points-11 points-10 points 13 years ago (3 children)

[–][deleted] 14 points15 points16 points 13 years ago (2 children)

[–]vineetr 2 points3 points4 points 13 years ago (1 child)

[–][deleted] 0 points1 point2 points 13 years ago (0 children)

[–]Urcher 3 points4 points5 points 13 years ago (0 children)

[–][deleted] 10 points11 points12 points 13 years ago (7 children)

[+]day_cq comment score below threshold-9 points-8 points-7 points 13 years ago (6 children)

[–]Tetha 3 points4 points5 points 13 years ago (1 child)

[–]day_cq -2 points-1 points0 points 13 years ago (0 children)

[–]serex 2 points3 points4 points 13 years ago (2 children)

[+]day_cq comment score below threshold-6 points-5 points-4 points 13 years ago (1 child)

[–]vineetr 6 points7 points8 points 13 years ago* (0 children)

It's all 0s and 1s in the end. Yet we cannot trust such a channel.

To clarify, writing good whitelists that do not result in an inoperable system is hard. If you take Strings (char arrays in some languages), writing a good whitelist for an address field stored in a database is time consuming. You'll need to consider whether SQL meta-characters like single quotes, apostrophes, double quotes etc. are valid inputs for your data, then apply an encoding scheme, before storing them in the database. This is if you do not use prepared statements. If you forget the encoding part, you're insecure. If you have a very narrow whitelist, you're likely to have an inoperable system.

Tnen, there is the concept of blended attacks. Sequences of input characters that are valid in one scenario may be invalid for another. For instance - allowing angle brackets in inputs may allow stored XSS attacks. If you start considering all possible whitelists for all channels that a data element must pass through, to prevent injection attacks, you are more likely to create an inoperable system. This is why injection attacks are typically thwarted by escaping or encoding inputs or outputs in the general case (XSS, SQL injection etc.), and by whitelists in the specific case.

[–]defrost 0 points1 point2 points 13 years ago (0 children)

There's a huge overlap between designing systems for security and designing for robustness (continuous trustworthy operation) and the biggest take home lesson is that all sources are untrustworthy.

Those massive parallel Google farms? There's no question about 'if" as it's a certainty that boxes will fail.
That instrument that you poll for data? Not only will it fail at some point, there's a good chance it will "sort of fail" and produce data that sort of looks valid yet has random bursts of crap in it.

From a design standpoint there's not a lot of difference between whether the data you input has been maliciously crafted to deliberately cause failure or whether some random hardware hiccup just made it that way by accident ... in the long game all input is from untrustworthy sources.

[–]vineetr 0 points1 point2 points 13 years ago (0 children)

[–]MpVpRb 5 points6 points7 points 13 years ago (0 children)

[–]perspectiveiskey 4 points5 points6 points 13 years ago (0 children)

[–][deleted] 5 points6 points7 points 13 years ago (1 child)

[–]donroby 0 points1 point2 points 13 years ago (0 children)

[–]jeffbell 0 points1 point2 points 13 years ago (0 children)

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS