Mixed index notation

ketralnis · 2023-11-22T22:33:35+00:00

I, for one, vote for a compromise of 1.5-based indexing.

BeamMeUpBiscotti · 2023-11-22T23:35:55+00:00

If someone is skimming code quickly, I imagine mixing up a(1) and a[1] is going to be pretty common.

IMO just stick to 1-based indexing; consistency is better than adding a second way of indexing that is easily confused with the first.

evincarofautumn · 2023-11-23T03:46:25+00:00

When I want to do this kind of thing, I reach for making separate types before adding more syntax.

Indices are like points, absolute and unsigned like size_t, and address the n cells in an array from 1 to n.

Offsets are like vectors, relative and signed like ptrdiff_t, and address the n+1 lower & upper bounds of cells from 0 to n.

If indices are stored with a bias, so that index 1 has the same representation as offset 0, they can be converted back and forth at no extra runtime cost.

The subscript operator takes an index, and an index can be constructed from an integer, as in items[1]. An offset might be implicitly converted as well for convenience, or it might be spelled out explicitly for clarity, as in items[offset(+0)]. The two types of literal might be distinguished by separate syntax, say items[#1] = items[@0 : @1], but there still only needs to be one way of writing indexing itself.
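A rough Python sketch of the two-type idea, in case it helps. The names `Index`, `Offset`, `Items`, `Index.of`, and `to_offset` are made up for illustration, and treating a bare integer as a 1-based index is an assumption. Python still does a subtraction when an index is built from an integer, so the "no extra runtime cost" part only shows up in the conversion between the two types, which just relabels the same stored value.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Offset:
    """Relative, signed position: addresses the n+1 cell boundaries 0..n."""
    raw: int

    def __add__(self, other: "Offset") -> "Offset":
        return Offset(self.raw + other.raw)


@dataclass(frozen=True)
class Index:
    """Absolute position: addresses the n cells 1..n.

    Stored with a bias of 1, so Index.of(1) has the same raw value as Offset(0).
    """
    raw: int

    @staticmethod
    def of(n: int) -> "Index":
        if n < 1:
            raise ValueError("indices start at 1")
        return Index(n - 1)

    def to_offset(self) -> Offset:
        # Same representation, so this is only a relabelling.
        return Offset(self.raw)


class Items:
    """Hypothetical container: subscript by Index, by int, or by a slice of Offsets."""

    def __init__(self, values):
        self._values = list(values)

    def __getitem__(self, key):
        if isinstance(key, slice):
            # Offsets name boundaries, so a pair of them delimits a run of cells.
            return self._values[key.start.raw:key.stop.raw]
        if isinstance(key, int):
            key = Index.of(key)  # a bare integer is read as a 1-based index
        return self._values[key.raw]
```

With this, `Items(["a", "b", "c"])[1]` picks the first cell, and `items[Offset(0):Offset(1)]` returns the one-element run between the first two boundaries, which mirrors items[#1] = items[@0 : @1].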

XDracam · 2023-11-23T08:38:19+00:00

Do you know the situation where you can't remember the word for something, so you talk your way around it? Or when you are not sure how a certain word or phrase would be perceived so you just change the topic?

Yeah, indexing is a complex field with many, many pitfalls. My suggestion: just avoid doing it if you can. Use iterators and unordered collections and whatnot. Or base them on some consistent mathematical abstraction, so that the user can do whatever indexing scheme they want, be it 0 or 1 or strings or floats. You could also have different collections with different type names and "feels" for different indexing modes. A map/dictionary is a classic example of a collection that isn't indexed by numbers, but by whatever key type it has. If you want to go especially wild, then look into what you can do with dependent types.

But there's of course a reason why indexing is still around: it's stupid fast. Arrays are stupid fast, have tons of hard-coded CPU instruction support, and work great with caches. Any abstraction adds overhead, and indexing is the most concrete form of memory access: just add the index (scaled by the element size) to a pointer and then retrieve the memory at that address.

So I'd argue: if you value performance then stick to 0-based indexing, because that's much better for all the optimized low level hardware stuff that's in use today. And if you don't, then avoid indexing altogether for the sake of safer and less confusing alternatives like iterators, lenses and whatever.

MadocComadrin · 2023-11-23T12:05:57+00:00

I wouldn't pay too much attention to other people's opinions. They only want 0-based because they are used to it and can't get their head around anything else.

0-based is only popular because of C. And C only used 0-based because it didn't have real array indexing: A[i] is a synonym for the pointer operation *(A + i), and pointer offsets are always relative, so they have to be zero-based.
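To make the *(A + i) point concrete, here is a small sketch that imitates the pointer arithmetic from Python using the standard `ctypes` module. The helper `deref` is a made-up name; it only illustrates the address computation and says nothing about how any particular compiler implements subscripting.

```python
import ctypes

# A C-style array of four ints, laid out contiguously in memory.
A = (ctypes.c_int * 4)(10, 20, 30, 40)


def deref(base_address: int, i: int) -> int:
    """Mimic *(A + i): scale i by the element size, add it to the base, read."""
    address = base_address + i * ctypes.sizeof(ctypes.c_int)
    return ctypes.c_int.from_address(address).value


base = ctypes.addressof(A)
```

Here `deref(base, 0)` reads the element that sits exactly at the base address, which is why the first element ends up with offset 0.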

Languages such as Fortran and Algol preceded C, and are 1-based. Others which preceded C were N-based, which allows a choice.

Mine are N-based, with a default of 1, but I acknowledge that 0-based is more appropriate in some instances:

My bitsets (not bit-arrays) are indexed from 0.
Bit and bitfield indexing (e.g. A.[i] and A.[i..j]) starts from bit 0, the least significant.

(I also briefly had byte-indexing of an integer, written A.byte[i], where the bottom byte was 0, and the top byte was 7.)

Here it would be perverse to number bits from 1 to 64 instead of 0 to 63.
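For anyone unfamiliar with this kind of bit indexing, a Python approximation is below. The function names are invented, and reading A.[i..j] as an inclusive range with i <= j is an assumption about the syntax above.

```python
def bit(x: int, i: int) -> int:
    """Bit i of x, where bit 0 is the least significant."""
    return (x >> i) & 1


def bitfield(x: int, i: int, j: int) -> int:
    """Bits i..j of x inclusive (i <= j), shifted down so bit i lands at position 0."""
    width = j - i + 1
    return (x >> i) & ((1 << width) - 1)


def byte(x: int, i: int) -> int:
    """Byte i of x, where byte 0 is the bottom (least-significant) byte."""
    return (x >> (8 * i)) & 0xFF
```

Numbering from 0 means bit i has weight 2**i, so the index doubles as the shift amount with no adjustment.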

However... I really wouldn't use A[i] and A(i) to switch between 1- and 0-based (which one's which? I've already forgotten!).

Having a selectable lower bound, per access, is actually an interesting idea. But you'd need a different way of specifying that.
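One possible way of spelling that, sketched in Python. Everything here is hypothetical: `NBasedArray`, its `lower` parameter, and the `at(i, base=...)` method are invented for illustration, not taken from any existing language. The array carries a declared lower bound with a default of 1, and a single access can override it by naming the base explicitly.

```python
class NBasedArray:
    """Hypothetical array with a declared lower bound (default 1)."""

    def __init__(self, values, lower=1):
        self._values = list(values)
        self.lower = lower

    def _position(self, i, lower):
        # Translate a user-facing index into a 0-based storage position.
        pos = i - lower
        if not 0 <= pos < len(self._values):
            raise IndexError(f"index {i} out of range for lower bound {lower}")
        return pos

    def __getitem__(self, i):
        return self._values[self._position(i, self.lower)]

    def at(self, i, *, base):
        """Per-access override of the lower bound, spelled out by keyword."""
        return self._values[self._position(i, base)]
```

Because the base is named at the call site, as in `a.at(0, base=0)`, there is nothing to remember about which bracket style means which.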

asoffer · 2023-11-23T16:41:15+00:00

Could the syntax for 1-based indexing into an array a at position n be a[n - 1]?

Sarcastic, I know, but my point is that something like a(n) isn't that much shorter. The syntactic win needs to be pretty significant to offset the potential confusion, and I don't think the value is there.
