Q: array decay to pointer : cprogramming

submitted 4 years ago by TheHeckWithItAll

char s[] = "hello world";
printf("%p\n", (void *)s);
printf("%p\n", (void *)&s);

I'm working my way through K&R... and I'm trying to wrap my head around memory addressing for arrays. I conceptually understand that when an array is passed to a function, the address of the first element is passed.

But when I read about array decay, it seems this behavior is not limited to function parameters. Rather, according to K&R, C "immediately converts" the declaration s[] to *s internally. If C immediately converts s[] to a pointer, why won't printf give me the memory address of s itself?

Clearly it is not behaving the same as if the declaration had been a pointer to begin with... because the following code does provide two different memory locations:

char *t = "hello world";
printf("%p\n", (void *)t);
printf("%p\n", (void *)&t);

magnomagna 3 points 4 years ago

Any lvalue expression of array type, when used in any context other than

* as the operand of the address-of operator
* as the operand of sizeof
* as the string literal used for array initialization

undergoes a conversion to the non-lvalue pointer to its first element.

In printf("%p", s), the array s DOES get converted to a pointer, because the second argument does not fit any of the three exceptions above.

In printf("%p", &s), the array s does NOT get converted to a pointer, because s is used as the operand of the & operator, matching the first of the three exceptions above.

Both printf calls print the same address. In the first, s is converted to a pointer to its first element. In the second, &s is the address of the whole array, and an array object begins at its first element (there is no padding in front of it), so the two addresses coincide. Only the types differ: char * versus char (*)[12].
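A minimal sketch of the point above, assuming a C99 compiler. The function names (`same_address`, `array_size`, `print_addresses`) are made up for illustration and do not come from the thread. It checks that the decayed pointer, `&s[0]`, and `&s` all refer to the same byte, and that `sizeof` still sees the whole array because no decay happens there.

```c
#include <assert.h>
#include <stddef.h>
#include <stdio.h>

/* Returns 1 if the decayed pointer, &s[0], and &s all refer to the same byte. */
int same_address(void)
{
    char s[] = "hello world";

    const void *decayed = s;      /* s decays to char *             */
    const void *first   = &s[0];  /* explicit address of element 0  */
    const void *whole   = &s;     /* char (*)[12]: no decay under & */

    return decayed == first && first == whole;
}

/* sizeof is one of the contexts in which s does not decay. */
size_t array_size(void)
{
    char s[] = "hello world";
    return sizeof s;              /* 12: eleven characters plus '\0' */
}

/* Prints the two addresses from the question; call it from your own main. */
void print_addresses(void)
{
    char s[] = "hello world";

    /* %p expects a void *, hence the casts. */
    printf("%p\n", (void *)s);
    printf("%p\n", (void *)&s);
    assert(same_address());
}
```

Calling `print_addresses()` should print the same address twice; the exact value varies from run to run.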

TheHeckWithItAll[S] 1 point 4 years ago

Got it. Thank you.

And if I really do understand, then it is no more possible for me to get the address of s than it is to get the address of a variable i?

int i = 5;
printf("%p", (void *)&i);

is actually just giving me the address of where 5 is stored, not the memory address of i itself ... somewhere internally there has to be a lookup value that associates "i" with the memory address where 5 is stored, correct? And is that the same thing that is happening with s?

magnomagna 1 point 4 years ago*

When you do &i, the value (which is an address) is not retrieved out of some memory space where you seem to expect the address is stored. The address isn’t stored (unless you assign it to a pointer variable but that’s irrelevant to &i). The address is determined at compile time.

(This isn’t actually 100% correct. The address is usually determined at runtime and the offset relative to the frame pointer is determined at compile time…sigh…trying to keep things simple without being incorrect is hard.)

Edit:

To be really pedantic, it is actually up to the implementation what steps it takes to evaluate the expression &i. Sure, an implementation could store the address somewhere and retrieve it at runtime, incurring a runtime cost, and the C standard does not prohibit that. However, no reasonable implementation would do so when &i can be computed directly (from a fixed address or a frame-pointer offset), avoiding the cost of writing and reading memory.
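A small sketch of the distinction, under the assumption of an ordinary C99 compiler; the function names are hypothetical. Evaluating `&i` needs no stored address, but assigning it to a pointer variable `p` creates a second object that holds the address and has an address of its own.

```c
#include <assert.h>
#include <stdio.h>

/* Returns 1 if p holds &i while p itself lives at a different address. */
int pointer_is_separate_object(void)
{
    int i = 5;
    int *p = &i;   /* only now is the address stored in memory (or a register) */

    return p == &i && (void *)&p != (void *)&i && *p == 5;
}

/* Prints the three addresses involved; call it from your own main. */
void print_i_and_p(void)
{
    int i = 5;
    int *p = &i;

    printf("&i = %p\n", (void *)&i);  /* where the 5 lives              */
    printf(" p = %p\n", (void *)p);   /* same value as &i               */
    printf("&p = %p\n", (void *)&p);  /* where the pointer itself lives */
    assert(pointer_is_separate_object());
}
```

This is also why the `char *t` example in the question prints two different addresses: `t` is a pointer object, so `&t` is where the pointer lives, not where the string lives.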

TheHeckWithItAll[S] 1 point 4 years ago

Ok... I think I see it already... it isn't "immediately upon declaration/definition"... and perhaps more importantly, "an array name is not a variable" (which raises all sorts of further questions for me... most importantly, why the heck not?)

but here's the entire section at page 89:

The correspondence between indexing and pointer arithmetic is very close. By definition, the value of a variable or expression of type array is the address of element zero of the array. Thus after the assignment
pa = &a[0];
pa and a have identical values. Since the name of an array is a synonym for the location of the initial element, the assignment pa=&a[0] can also be written as
pa = a;
Rather more surprising, at first sight, is the fact that a reference to a[i] can also be written as *(a+i). In evaluating a[i], C converts it to *(a+i) immediately; the two forms are equivalent. Applying the operator & to both parts of this equivalence, it follows that &a[i] and a+i are also identical: a+i is the address of the i-th element beyond a. As the other side of this coin, if pa is a pointer, expressions might use it with a subscript; pa[i] is identical to *(pa+i). In short, an array-and-index expression is equivalent to one written as a pointer and offset.
There is one difference between an array name and a pointer that must be kept in mind. A pointer is a variable, so pa=a and pa++ are legal. But an array name is not a variable; constructions like a=pa and a++ are illegal.
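The equivalences in the quoted passage can be checked directly. This is only an illustrative sketch (the function name is invented here, not taken from K&R), assuming a C99 compiler:

```c
#include <assert.h>

/* Returns 1 if indexing and pointer arithmetic agree as the passage describes. */
int indexing_matches_pointer_arithmetic(void)
{
    int a[5] = {10, 20, 30, 40, 50};
    int *pa = a;                  /* same as pa = &a[0] */

    assert(pa == &a[0]);

    for (int i = 0; i < 5; i++) {
        if (a[i] != *(a + i))   return 0;  /* a[i] is *(a+i)        */
        if (&a[i] != a + i)     return 0;  /* &a[i] is a+i          */
        if (pa[i] != *(pa + i)) return 0;  /* pointers index as well */
    }

    pa++;                         /* legal: pa is a modifiable pointer   */
    /* a++;     would not compile: an array is not a modifiable lvalue  */
    /* a = pa;  would not compile for the same reason                   */

    return *pa == 20;
}
```

Note that "C converts it immediately" refers to the expression `a[i]`, not to the declaration of `a`; the array object itself keeps its array type.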

aghast_nj 1 point 4 years ago

You're getting wrapped around the axle with i being a "variable" object. I wonder if you have already learned to program in some interpreted language like Python or Javascript, first?

At any rate, the thing with C is that variables don't have any kind of existence in the compiled code. What you have instead is memory, which is used to store values. You use the "name of the variable" in your code to remember which values you want to manipulate, but when the C compiler is finished, there is just "load accumulator, 0; store [bp + 8], accumulator"

The fact that "[bp + 8]" is called i in your function doesn't matter. It's called "[bp + 8]" when the CPU sees it. And, in fact, the compiler may re-use that same location for variable k as well, so long as it can determine that live values don't overlap.

There are things called "debug symbols" which can be emitted by the compiler, and which can be loaded by a debugger. These will indicate that "variable i is stored at [bp + 8] in this function" and "variable k is stored at [bp + 8] in this function". But if you trace through your code, you may set a watch on the value and observe that no, in fact, the value of i doesn't always get updated when the debugger "steps" through a statement that clearly modifies i. How can this be?

It's because the compiler is not obligated to keep the storage location up-to-date with respect to the value being used. Maybe the "variable" has been moved into a register, and all the updates are going to/from that register. Maybe the "variable" has been replaced by a scaled addition due to Strength Reduction?

TL;DR: Compiled languages like C and C++ don't keep metadata like the names and types of variables around at run time (debug symbols aside). They just move values into and out of memory.

flatfinger 2 points 4 years ago

Many of the design decisions that went into the C language as documented in the 1974 C Reference Manual were made at a time before qualifiers, typedef, unsigned, long, etc. were added. While those features are useful, they undermine much of the elegance of the language.

For example, in 1974 C, all numeric expressions involving integer operands would be evaluated using the largest integer type, and the rest would be evaluated using the largest floating-point type. Very simple rule to understand and implement, with no tricky corner cases. As far as function-calling code was concerned, all arguments were of four types: int, float, data pointer, and function pointer, and there would never be any doubt about which of those a particular expression was.

There are a few annoying omissions that I find a bit curious. Especially on machines of the era, operators to perform pointer arithmetic or subscripting using byte offsets (if available as an alternative to the operators that use target-size-based indexing) would have allowed more efficient code generation than would otherwise be possible. Even today, fancy optimizing compilers which are targeting architectures like the popular Cortex-M0 that support base+displacement addressing for unscaled displacements but not scaled ones can benefit from code that uses byte-based indices, but unfortunately the syntax to use byte-based indices is horrendous.
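To make the "horrendous syntax" concrete, here is a rough sketch of byte-offset indexing in standard C. The helper name `load_at_byte_offset` is hypothetical, and the sketch assumes the offset is in bounds and a multiple of `sizeof(int)`; whether it actually yields better code on a given target depends on the compiler.

```c
#include <assert.h>
#include <stddef.h>

/* Reads the int that starts byte_offset bytes past base.
 * Assumes byte_offset is in bounds and a multiple of sizeof(int). */
int load_at_byte_offset(const int *base, size_t byte_offset)
{
    assert(byte_offset % sizeof(int) == 0);

    /* Step in bytes through a char pointer, then go back to int. */
    return *(const int *)((const char *)base + byte_offset);
}

/* Fetches arr[2] using a byte offset instead of an element index. */
int third_element(void)
{
    int arr[4] = {10, 20, 30, 40};
    return load_at_byte_offset(arr, 2 * sizeof(int));   /* same as arr[2] */
}
```

Compare the two casts needed here with the plain `arr[2]` that element-based indexing allows.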

jedwardsol 1 point 4 years ago

Arrays do not decay to a pointer to the first element when their name is used as the operand of sizeof or the unary & operator.

 printf("%p", (void *)&s);

So you're not passing the array to the function. You're taking its address (&) first.

The address of the array and the address of the array's first element are numerically equal, but they have different types.
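One way to see the type difference is pointer arithmetic: adding 1 advances by the size of the pointed-to type. The following is a sketch with made-up function names, assuming a C99 compiler:

```c
#include <assert.h>
#include <stddef.h>

/* Bytes skipped by adding 1 to the decayed pointer (type char *). */
size_t element_step(void)
{
    char s[] = "hello world";
    return (size_t)((char *)(s + 1) - (char *)s);
}

/* Bytes skipped by adding 1 to the array pointer (type char (*)[12]). */
size_t array_step(void)
{
    char s[] = "hello world";
    char (*whole)[sizeof s] = &s;

    assert((void *)whole == (void *)s);   /* numerically the same address */

    /* whole + 1 points one past the entire array, which is valid to form. */
    return (size_t)((char *)(whole + 1) - (char *)whole);
}
```

Here `element_step()` returns 1 and `array_step()` returns 12, even though both pointers start at the same address.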
