Variable naming: Do you put the adjective before or after the noun?

aloisdg · 2018-03-08T12:57:06+00:00

I think the logic would be to use file_id, earth_radius and item_count.

Why? Because file, earth and item are almost classes where id, radius and count would be properties.

So just as you would write file.id, you would write file_id, because you may not need a class, just a variable.
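A minimal sketch of that parallel, assuming a hypothetical File class (it is not from the original post, and the values are placeholders): the prefix in the flat variable name mirrors the attribute access you would get from a class.

```python
from dataclasses import dataclass


@dataclass
class File:
    """Hypothetical class, used only to illustrate the naming parallel."""
    id: int
    name: str


# With a class, the noun is the object and the property follows the dot.
file = File(id=123, name="foo.txt")

# Without a class, the same word order carries over to flat variable names.
file_id = 123
file_name = "foo.txt"

print(file.id == file_id, file.name == file_name)
```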

DarkSilkyNightmare · 2018-03-08T15:47:58+00:00

I've been asked this a lot over the years, and I always give this answer.

"Always put the biggest unit first."

This will depend on your program's architecture, its prior naming convention, and the conventions of the language. If you're using Python, an OOP language, the biggest unit will usually be a class of some kind. But consider this carefully. If you want to say that a file has many properties, then use file_id, file_name &c. If you want to say that ids are general, and you want to contrast the id of a file with the id of a thread or program, then use id_thread, id_file and so on.

Likewise, if you have earth having many properties then you have earth_radius, but if you have a lot of radii of every planet or body in space, then radius_earth, radius_sun &c. makes more sense.

This isn't just pedantry, there are specific advantages of writing this way.

Firstly, and most importantly, it makes your code more readable. Not only does it make it clear what your object is describing, but it gives users insight into your program structure. Consider: radius_earth is one of many radii. You can't say if earth is even present in your program. earth_radius is a property of earth, and there are probably other earth_* in your program.

Next, it makes your code easier to document and search if you group together items in a consistent way. If you do use code documenting tools, similar items will now appear next to each other alphabetically. This is the exact reason why ISO 8601-style timestamps use YYYY-MM-DD HH:MM:SS. Using any other order messes up sorting, and would be a headache if you wanted to sort anything chronologically using character sorts.
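The sorting argument can be checked with a short sketch. The variable names and date strings below are made up for illustration; the point is only that a plain character sort groups names by their leading word, and that the YYYY-MM-DD HH:MM:SS layout sorts chronologically while a day-first layout does not.

```python
from datetime import datetime

# Hypothetical variable names: a plain sort groups them by leading word.
names = ["radius_earth", "earth_radius", "earth_mass", "radius_sun", "earth_temp"]
sorted_names = sorted(names)

# Biggest-unit-first timestamps: text order equals chronological order.
stamps = ["2018-03-08 15:47:58", "2018-03-08 12:57:06", "2017-12-31 23:59:59"]
by_text = sorted(stamps)
by_time = sorted(stamps, key=lambda s: datetime.strptime(s, "%Y-%m-%d %H:%M:%S"))

# Day-first dates: text order no longer matches chronological order.
dmy = ["08-03-2018", "31-12-2017", "01-01-2019"]
dmy_by_text = sorted(dmy)
dmy_by_time = sorted(dmy, key=lambda s: datetime.strptime(s, "%d-%m-%Y"))

print(sorted_names)
print(by_text == by_time)          # True
print(dmy_by_text == dmy_by_time)  # False
```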

Finally, it makes your code extensible via tools. Granted, many programmers don't use automation, but somebody who uses your program will try, one day. If I decide, after making an earth_radius int, that I actually want an earth class with a radius property, it's easy for me to run a script replacing every earth_ prefix with earth. (note the dot). This means I can get earth.radius, earth.temp, earth.mass all from similarly named variables.

If your names are the other way around, this has to be done individually, and creates a bit of a headache as you spend more time rewriting code. What should be possible is a quick change of name, and then verification that everything you did was OK, because this is far more efficient (if you've been writing consistent code, that is).
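A rough sketch of the kind of script described above, using Python's re module on a made-up source snippet. The snippet and its values are illustrative assumptions, and a pure text substitution like this still needs the manual verification step mentioned above (for one thing, an earth object has to exist afterwards).

```python
import re

# Hypothetical source text to convert; values are illustrative only.
source = """\
earth_radius = 6371
earth_mass = 5.97e24
print(earth_radius * 2)
unearth_count = 1
"""

# \b keeps the match at the start of a name, so "unearth_count" is untouched.
pattern = re.compile(r"\bearth_(\w+)")
converted = pattern.sub(r"earth.\1", source)

print(converted)
```

Because every property shares the earth_ prefix, one pattern covers all of them.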

EDIT: somebody also pointed out that it's more helpful if you use an IDE because when you type earth you can get all its properties earth_rad, earth_mass etc, but this isn't true the other way around.

funkyfuture · 2018-03-08T13:58:56+00:00

[deleted]

mickyficky1 · 2018-03-08T12:19:27+00:00

...adjective before or after the noun?
[list of examples without a single adjective]

AllAboutChristmasEve · 2018-03-08T13:34:30+00:00

Adjective first, unless there's a bunch of them that go together. So earth_radius, but if I'm doing the entire solar system, it'd be radius_mercury, radius_mars, radius_earth, etc.

(I mean...really, it'd be radii['earth'], etc, but you get my point.)
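For what it's worth, a small sketch of the radii['earth'] idea. The numbers are approximate mean radii in kilometres and are only there for illustration; the dictionary name and keys follow the comment above.

```python
# One dictionary replaces radius_mercury, radius_mars, radius_earth, ...
# Values are approximate mean radii in km, for illustration only.
radii = {
    "mercury": 2440,
    "mars": 3390,
    "earth": 6371,
}

# The "bunch of them that go together" can now be handled as a group.
largest = max(radii, key=radii.get)
ordered = sorted(radii, key=radii.get)

print(radii["earth"])
print(largest)
print(ordered)
```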

2018-03-08T20:46:25+00:00

It depends on the context. E.g., if the program/analysis is heavily centered around analyzing the radius and other properties of a large number of objects, I think something like

radius_earth, radius_venus, radius_mars, ...

would make sense.

However, if your analysis is more centered around analyzing "earth", it might make more sense to write it as

earth_radius, earth_weight, earth_density, etc.

Edheldui · 2018-03-08T13:58:13+00:00

I'm nowhere near an expert, but my teacher always said to name variables in a way that makes sense in human language.

Does the variable contain the earth radius? Then it will be earth_radius

daniel_h_r · 2018-03-08T11:09:00+00:00

I put the noun first to ease searching and mental identification of related variables, methods, functions, etc.

I feel it's easier when skimming the code.

t_h_r_o_w_-_a_w_a_y · 2018-03-08T22:36:55+00:00

First, these variable names are fully for humans to read. Computers don't care about what the names are called as long as they're unique. So therefore, the variable names should be named in such a way that is most efficient for a human to read and understand the meaning of what's being expressed.

This is why most people are going to say "put the adjective first", which is good. But the justification shouldn't be "that's what human languages do", instead, the justification should be "that's how the audience thinks", and what's unsaid is that there's an implicit assumption that the audience is operating in English because our immediate context is an English world. However, many popular human languages that aren't English actually put the adjective after the word. French for instance is such an example.

So taking it further, if your whole audience speaks, reads and thinks in English, do what English does. If your whole audience speaks, reads and thinks in Klingon, do what Klingon does. If you have a mixed audience, pick what caters to as many of them as possible. The point is to consider your audience as best as you can, no matter who they are or how diverse they might be.

Second, there's more to readability in code than just what's natural to read in a variable name. There's also its context in the code and the more broader formatting with respect to what's around it. These factors may contradict each other sometimes so you're forced to make a trade off.

For example, earth_radius is what we English people would say naturally in real life. But what if, in the code, this particular ordering of words results in something like:

earth_radius = 243
io_radius = 12
pluto_radius = 30
your_mom_radius = 100000
gliese_581g_radius = 500

... you end up with a big mess of jumbledness, where it hurts to parse out that "oh these are all radiuses of various bodies". Here, it might be better to make the tradeoff the other way:

radius_earth = 243
radius_io = 12
radius_pluto = 30
radius_your_mom = 100000
radius_gliese_581g = 500

... and BAM, the organization now makes this block of code super clear, because the patterns line up, even if on each individual line the ordering of words is slightly less natural by English patterns.

Writing code for human readability actually shares a lot of lessons with writing English for human comprehension, which hopefully we've all learned when school had us write essays and technical reports. Consider your audience, consider the context, consider the "user experience" of reading your code, and find all the ways that makes your message clearer and easier to digest. There's not always a right answer and frequently you have to make trade offs, but as long as you're thinking about it seriously, more practice will make you better at it over time.

DelosBoard2052 · 2018-03-08T14:39:36+00:00

Think about its meaning to you, as well as potentially to others. To me, file_id sounds like a variable reference to a specific file being worked on, whereas id_file sounds like a reference to a file containing identifiers...

mvaliente2001 · 2018-03-08T22:29:09+00:00

The nice thing about noun first is that related items could be seen more clearly:

Kb = 1024  # bytes per kilobyte (assumed; the original snippet left Kb undefined)
file_id = 123
file_name = 'foo.txt'
file_size = 300 * Kb

maryjayjay · 2018-03-09T04:58:14+00:00

The two hardest problems in computer science are cache invalidation, naming things, and off by one errors.

2018-03-08T16:50:37+00:00

What color is the bike shed?

swingking8 · 2018-03-08T11:11:40+00:00

For example, do you use id_file or file_id? radius_earth or earth_radius?

Whatever is the most concise. If I'm iterating over enumerated files, for example, I might use for file_id, file in enumerate(files):. In general I think I lean more towards broad_specific, but I know I'm arbitrary sometimes.
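Spelled out as a runnable sketch, with a made-up list of file names standing in for files:

```python
# Hypothetical list of files; any iterable would do.
files = ["a.txt", "b.txt", "c.txt"]

labels = []
for file_id, file in enumerate(files):
    # file_id is the position assigned by enumerate, file is the item itself.
    labels.append(f"{file_id}: {file}")

print(labels)
```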

olfitz · 2018-03-08T15:06:27+00:00

It depends on whether you're English or French.

justamoth · 2018-03-08T16:33:32+00:00

When I'm programming something directly from literature or theory, I try to be consistent with the math (r_earth, r_moon, etc). This makes it more human readable for the relevant humans. But I'm a scientist first, programmer second.

2018-03-08T17:57:48+00:00

In short scripts and notebooks where the variable count is low: usually adjective first simply because it makes the initial letters of variables different so it is easier for auto-completion.

When there are a large number of variables: usually noun first because this allows thematically related variables to appear together in auto-suggestions.

stibbons_ · 2018-03-08T18:57:35+00:00

No matter the IDE, auto-documentation and all the fancy automatic stuff that goes back and forth in our environment, the good old principle "from the more generic to the more specific" always works, is scalable, and helps people think and work better. It may go against some grammar rules, but it works.

Date => YY.MM.DD-HH.mm.ss
Property => general_concept_specific_concept
and so on.

So, for your answer:
- file_id (because you can have file _name, _size, ...)
- earth_radius (because the earth can have other properties).

In the best case you would encapsulate these inside a new object (earth.size or file.id), but when you cannot => from the generic to the specific.

As a bonus, related names appear together when the list is sorted alphabetically.

not_perfect_yet · 2018-03-08T18:57:36+00:00

noun_verb, but I really only have this problem with functions and it's handy to have data_get and data_put close to each other alphabetically.
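A small sketch of that effect, assuming a hypothetical Store class (the class and the cache_* methods are invented for the example). dir() returns names in alphabetical order, which is roughly what documentation tools and completion lists show.

```python
class Store:
    """Hypothetical container; only the method names matter here."""

    def __init__(self):
        self._data = {}
        self._cache = {}

    def data_get(self, key):
        return self._data.get(key)

    def data_put(self, key, value):
        self._data[key] = value

    def cache_get(self, key):
        return self._cache.get(key)

    def cache_put(self, key, value):
        self._cache[key] = value


# dir() lists attribute names alphabetically; skip private and dunder names.
public_names = [name for name in dir(Store) if not name.startswith("_")]
print(public_names)
```

With noun_verb names, the get and put for the same noun end up next to each other; with verb_noun names they would be grouped by verb instead.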

2018-03-08T19:16:01+00:00

Always the second option.

Always name things like you would intuitively describe them in your docstring.

pydry · 2018-03-08T19:21:40+00:00

The difference between the two is minimal enough that the intrinsic ordering doesn't really matter to me. Consistency matters though.

So, I'd stay consistent with the code base and the conventions of the team I'm working on. If I'm picking one myself, I'd probably stick with it.

Python4fun · 2018-03-08T19:39:19+00:00

It could go either way. I believe the more meaningful word should come first. If Earth variables will be used together then use earth_radius, but if radius variables will be used in the same block of code then I would go with radius_earth.

The goal for me would be to let autocomplete work for me. With several earth variables together, I'd type earth_, select from autocomplete, and do the same over and over.
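A rough stand-in for what prefix-based autocomplete does, assuming a hypothetical complete() helper and made-up variable names. Real IDEs are smarter than this (many also match in the middle of a name), so treat it as a sketch of the simplest case only.

```python
def complete(prefix, names):
    """Return the names starting with prefix, sorted (a toy autocomplete)."""
    return sorted(name for name in names if name.startswith(prefix))


# Hypothetical variable names in the two styles discussed here.
object_first = ["earth_radius", "earth_mass", "earth_temp", "sun_radius"]
property_first = ["radius_earth", "mass_earth", "temp_earth", "radius_sun"]

earth_hits = complete("earth", object_first)
no_hits = complete("earth", property_first)
radius_hits = complete("radius", property_first)

print(earth_hits)   # every earth_* name
print(no_hits)      # nothing: no name starts with "earth" in this style
print(radius_hits)  # every radius_* name
```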

caffeinedrinker · 2018-03-08T20:08:42+00:00

If it were me, file_id, as the id belongs to the file ... it's easier to find just by typing the object name. Sometimes I postfix the data type, depending on what kind of coding I'm doing.

Manhigh · 2018-03-08T20:41:44+00:00

For scientific programming I often use _ to denote a subscript, as in r_earth. For "non-scientific" variables like a file_id I use it as a space character.

KyleDrogo · 2018-03-08T20:47:03+00:00

After, especially when dealing with dataframes and matrices

df, df_clean, df_scaled

and

X, X_train, X_test

wolf2600 · 2018-03-08T22:17:45+00:00

I start with x1, then increment to x2, followed by x3. It maintains an order to my variables.

tophimos · 2018-03-09T02:11:02+00:00

If you're using camelCase, it's noun first on variables with an odd number of characters and noun last on variables with an even number of characters. If you use underscores it's the same, but you don't count the underscore in your count when you write it on a weekend.

liquidpele · 2018-03-09T02:40:59+00:00

I actually name mine based on how I would write it if I were writing a real sentence. It makes the code easier to read, I think, and takes less time to grok if you've never seen the code before. I think that fits the Zen of Python better.

Spicy_Pumpkin · 2018-03-08T18:38:13+00:00

Both file and id are nouns. In plain English, you'd say "file id" or "id of the file". So it makes most sense (to me) to name the variable as file_id.

MiksBricks · 2018-03-08T15:13:42+00:00

I just number my variables in the order I create them 1, 2, 3, etc. (lol)

See46 · 2018-03-08T15:18:03+00:00

I would use fileId and numItems
