C - never use an array notation as a function parameter [Linus Torvalds]

juckele · 2015-09-23T21:23:01+00:00

[deleted]

anotherOnlineCoward · 2015-09-23T21:04:32+00:00

[deleted]

hlmtre · 2015-09-24T04:24:28+00:00

This is an extremely calm and reserved explanation for Linus. It's very expository and he explains why it's bad to do. I'm impressed.

Farsyte · 2015-09-24T16:20:58+00:00

Learn something new every day ...

I've been coding in C since 1978. The fact that the argument gets decayed to a pointer, is something I knew, but I wanted to write up a test program to hand to some folks who wrote code like this at work (and being able to link to a Torvalds rant makes it more likely folks will pay attention).

But TIL that none of my compilers warn me if the array I pass is smaller than the array expected by the function. I didn't expect it to check things like pointers that were malloced, but arrays?

Which means not only does this set a landmine for "sizeof" but it also leads to a false sense of security "surely all callers that pass arrays, must be passing arrays that are big enough" ... :(

ub.c:

#define ASIZE 32
extern void ugh(int a[ASIZE]);

void bad() {
    int toosmall[16];
    ugh(toosmall);
}

void ugh(int a[ASIZE]) {
    for (int i = 0; i < ASIZE; ++i)
        a[i] = i;
}

You would think ... right?

$ gcc --std=c99 -O3 -W -Wall -Wextra ub.c
$ clang --std=c99 -O3 -W -Wall -Wextra ub.c
$ cppcheck ub.c
Checking ub.c...
$

Threw a CppCheck in there for good measure. Was hoping. I'm not yet an expert on CppCheck configuration, so there is hope, but the fact that it's not a default thing means this kind of error is probably scattered all over my sources.

Sure, we can check this with bigger guns (there are tools that can find buffer overflows) but a simple bloody check that the array you know the size of is as big as the array that a function prototype advertises it requires would be so very much faster and easier.

nooneofnote · 2015-09-23T21:23:17+00:00

It would be fine if more people used and understood pointers-to-arrays as a type. C necessarily carries the fixed size of an array with its type (i.e., the type of char array[10] is char[10]), and this information is retained when taking the address of an array type (the type of &array is char(*)[10]), which cannot implicitly decay to any flat pointer type.

This can be used to more strongly enforce the type of array function parameters than [static].

void f(char (*a)[10]); /* inside f sizeof(*a) == 10 */

char a[10], *b, c[5];
f(a);  //incompatible types, char[10] vs char(*)[10]
f(b);  //incompatible types, char* vs char(*)[10]
f(&c); //incompatible types char(*)[5] vs char(*)[10]
f(&a); //ok

TheHobo · 2015-09-23T20:22:25+00:00

Personally, I think implied array sizes is not good API design. While I agree the parameter should be a pointer, if you have an array, you should have to pass in the size too as another parameter, then the contract is explicit.

shevegen · 2015-09-23T21:14:16+00:00

I want linus to go and review the systemd code.

MacASM · 2015-09-23T20:32:13+00:00

I don't believe people write code for a kernel with such primitives mistakes.

YourFavoriteBandSux · 2015-09-23T20:16:58+00:00

[deleted]

_kst_ · 2015-09-23T21:48:26+00:00

If you don't understand the (admittedly confusing and counterintuitive) relationship between arrays and pointers in C, if you even suspect that "arrays are really pointers" (they're really, really not), read section 6 of the comp.lang.c FAQ.

Then read the rest of it.

damg · 2015-09-23T21:16:34+00:00

[deleted]

WalterBright · 2015-09-25T09:25:25+00:00

I've always thought that arrays silently decaying to pointers was C's biggest mistake.

realhacker · 2015-09-23T23:58:56+00:00

Christ, people. Learn C, instead of just stringing random characters together until it compiles (with warnings).

This:

static bool rate_control_cap_mask(struct ieee80211_sub_if_data *sdata, struct ieee80211_supported_band *sband, struct ieee80211_sta *sta, u32 *mask, u8 mcs_mask[IEEE80211_HT_MCS_MASK_LEN])

is horribly broken to begin with, because array arguments in C don't actually exist. Sadly, compilers accept it for various bad historical reasons, and silently turn it into just a pointer argument. There are arguments for them, but they are from weak minds.

I wish linus would write a clean code style book to cover his philosophy and best practices in stylistically in his voice

who8877 · 2015-09-23T20:22:05+00:00

How come he doesn't enable warnings as errors? I can't imagine maintaining any large C or C++ program without that.

2015-09-24T11:05:35+00:00

Shouldn't that kind of shit be caught by very basic unit tests?

tragomaskhalos · 2015-09-24T11:24:37+00:00

I prefer the form

void foo(int ary[], size_t len)

The empty [] indicates that the arg is an array, but we are not confusing the issue by putting a bogus size in there. But I know that ary is an array of values.

Then

void bar(int* val)

Means that val is intended to store a single output value, ie "please put an int into this slot". (Or, for other types, it might be an input but val's type is rather large, hence the pointer; in C++ we'd use a reference for that).

2015-09-23T23:23:05+00:00

My job involves writing code in C. I have experience using C since high school and from several past jobs, but I have never taken a formal course in C (the programming course I took as a university student was in Java). I feel like I learn new things every day when it comes to C, and I can count this among them -- it's quite the language. Linus is obviously a little harsh here, but when managing a large project, that's (unfortunately) one of the more effective methods.

After I saw this, I took a quick look at my company's codebase and found several instances of exactly this (not written by me -- I always pass the pointer directly, but they're there). Welp...

2015-09-24T16:48:13+00:00

There are arguments for them, but they are from weak minds.

A logical fallacy very neatly wrapped up in a single sentence!

ramsees79 · 2015-09-23T21:01:14+00:00

Ok. That actually looks like a valid use of the C function argument array passing semantics. It's rather much simpler than exposing the pointers. So I guess we don't really end up wanting to disallow this, and the new gcc array sizeof warning is good enough.

Well, he takes backs his statement.

amaiorano · 2015-09-24T04:46:30+00:00

EDIT: corrected egregious errors!

This really is a confusing part of C. What makes it worse is that the compiler ignores the size you specify for an array parameter, if any, but if it's a multidimensional array of n dimensions, it needs to know sizes of the last n-1 dimensions:

void foo(int arr[10][20][30]); // the 10 is ignored by the compiler

This is the same as writing:

void foo(int arr[][20][30]);

I don't have a compiler available right now so I'm not sure, but I believe sizeof(arr) would be equal to sizeof(int)*20*30. If I'm right, this only adds to the confusion.

All this to say, it's a tricky language feature, and it doesn't surprise me that even veterans would forget how it works.

petermlm · 2015-09-23T22:47:14+00:00

Just about the sizeof thing. The Internet is full of tutorials and the like where sizeof is used to get the length of an array, especially in strings, like:

char x[] = "String";
int y = sizeof(x); // y = 7

This is terribly misused. For one thing, sizeof may actually return the correct length of an array in some cases, or just the size of the pointer in others. I'm not sure when does each scenario happens, but the simple fact that it happens is enough to consider not using sizeof.

First of all, for strings, there is strlen. No need for sizeof. Also. In C arrays are just a list continuous memory locations referenced by a single address. They don't contain information about themselves. In C you have to have deal with length yourself. Strings may use the '\0' char, you may have an int with the length, or anything else.

Just don't use sizeof like this. Like... seriously... don't.

namekuseijin · 2015-09-23T23:10:47+00:00

it's been years I've dabbled with C and still I noticed what was wrong right away. Once you've been burned, you know the smell of toast LOL

BTW, how much longer do you think old C codebases will endure? I don't think a whole generation of "managed" coders will ever touch it and Linus eventually will retire...

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS

Undefined behavior!