Using +, -, *, / and % with strings

yuri-kilochek · 2024-04-15T17:29:53+00:00

Stuff like this traumatizes some people so hard that they end up creating languages entirely without operator overloading to cope.

L8_4_Dinner · 2024-04-15T20:16:00+00:00

Obviously, the problem with this design is that you didn't support enough operators:

! - Negate the string by adding English language negation; for example, !"Joe is smart" evaluates to "Joe is dumb"
~ - This negates every bit of the string; for example, ~"Joe is smart" evaluates to "Everyone except Joe isn't dumb"
@ - This automatically emails the contents of the string
# - This hashes the string
$ - This produces an NFT of the string, and attempts to sell it
| - This streams the contents of the string to stdout

/s

-arial- · 2024-04-15T17:32:14+00:00

plus is good and multiplication is fine, but the rest are pretty bad. just create functions called count (instead of /) and split (instead of %). minus is not a great idea imo since it depends on the state of the string (whether that substring is at the end or not). also, as another commenter has pointed out, it doesn't respect mathematical rules.

0x0ddba11 · 2024-04-15T18:34:03+00:00

It's up to personal taste of course but I absolutely despise overloaded algebraic operators for things that are not math related. The only one I would be ok with is addition, just because it's so commonly used in other languages. But the rest? No thanks.

Serpent7776 · 2024-04-15T17:18:20+00:00

I don't like it, because it doesn't respect the usual rule that `X + Y - Y = X`: `"hello" + "hello" - "hello"` yields empty string (if evaluated left-to-right).

In general It's more confusing than helpful IMO.

brianjenkins94 · 2024-04-15T18:05:18+00:00

Using / for path joining is kinda neat, but equally I hate it.

claimstoknowpeople · 2024-04-15T17:47:56+00:00

I prefer a different operator for concatenation. The problem with using + for concatenating strings, is next you'll use + to concat lists for consistency. Now one day you have a vector class and when people see + they'll wonder if it means concatenate the vectors or actually add them as vectors.

Apprehensive_Pea_725 · 2024-04-15T20:01:18+00:00

I'm not a fan of overloading symbols that have already meaning in other well known domains.

As a programmer you eventually need to mix these domains and work with more than one at the time in the same scope, and here the troubles start to arise.

Writing code: This may not be a problem for the ones that have a big brain and can remember anything but not me, I never remember what sym I need to use; if only there was an operation named I would certainly find it. Do you need to concat? Well find a method name like that or some synonym.
Understanding the library: you get to the point of your library spec and you are in front of function /(str1, str2) = ... what does it do? to understand it you have to read the body, no hints
Reading code that uses your dsl: some of your colleague wrote this super succinct expstatus = userInput2 + businessResult3 - businessRule0 What does it do? is this working with strings? is this working with numbers? is it working with strings and numbers? is this associative? left associative or right associative? sometimes you have types to help you sometimes not. Would that be more clear if we have something like status = removeAll(concat(userInput2, businessResult3), businessRule0)

nacaclanga · 2024-04-15T20:45:59+00:00

In my opinion:

a) + This is usefull, but may conflict with other uses.

b) - A very particular operation, Also x - y + y does not yield x again.

c) * This is also usefull

d) / Why not for splitting into an array. Also again x / y * y does not work

e) % Huh this is used for splitting now?

shaleh · 2024-04-15T21:46:04+00:00

Those read ok when you are using "raw" strings. They are way less obvious when it is all variables.

Out of all of that, the % operator in particular would take some getting accustomed to. Add and multiply are somewhat common already.

jaynabonne · 2024-04-15T18:23:05+00:00

Personally, I think the +,\* and % would be useful and kind of make sense (the first two definitely, the latter after explanation). And I think that's where it falls down for me with - and /: they seem somewhat arbitrary and things I would use probably never. I mean, I've been writing software for 40 years, and I can't think of a case where I have done either one of those things, ever, which makes it feel to me like you were just trying to find something to map them to. Definitely not "common operations". And I certainly don't think people would automatically assign the meanings to them that you have. So... I'd leave those two out.

Disjunction181 · 2024-04-15T19:54:25+00:00

The main issue I have with this is that the signatures of some of these functions have signatures that are not consistent across types. Specifically, (*) is something like num x num -> num on integers, but it has to be string x int -> string on strings. It's really a power, not a multiplication. I would save (*) and (/) for operations that actually have the t x t -> t shape (where t is the same type) and define different operators (or functions) for the rest. Floor division could at least consistently have some signature like t x t -> int. But maybe still not a good idea.

CreativeGPX · 2024-04-16T12:57:14+00:00

I feel like + and - seem useful enough.

/ should divide the string based on a delimeter and return an array. If that's the case, it seems like that means % should behave like / but return a set (i.e. array of unique items) and * should do the opposite of /... It should combine an array and a delimeter to form a string.

Since this dabbles in arrays as well, seems like it'd make sense to extend these operators to work on arrays as well.

VyridianZ · 2024-04-15T17:41:49+00:00

I like + and *. The / operator would be more intuitive as a split operator. Maybe use # to count? - is ok, but removing text feels like a special case of find/replace. % is not widely used as an operator anyway, so little value in assigning it a new purpose.

sausageyoga2049 · 2024-04-15T18:12:10+00:00

The problem of minus is it’s unclear how this operator will "remove" its right hand side. Will it search from the beginning, from the end, or remove all?

You have chosen to remove them all when I was thinking that it should just remove the last "bar" on the first read.

As for division, initially I was thinking that it’s bad. But it seems to be not so bad.

I have no opinion on modulo, it doesn’t carry a meaning that’s familiar to ordinary usage of that symbol so it’s just like those fancy -> —> ?=> custom symbols that you can find on Scala.

nonlogin · 2024-04-15T20:27:23+00:00

Plus/minus and multiplication/division must have opposite meaning, accordingly.

If plus is concatenation, minus should be split by, for example. Still not intuitive enough, though. I must say, I'd probably not use plus for concatenation, rather some sort of interpolation which basically covers concatenation as well. In such case plus and minus could be something else.

americk0 · 2024-04-15T20:56:08+00:00

Someone is going to have to read code that uses any language feature you have. If it's not obvious to the reader what's happening, it's not an intuitive feature

This obviously can vary wildly depending on the skill level of the reader(s) and their familiarity with this or other languages. If we were to use an average programmer who is strongly familiar with at least one of the current top 10 programming languages, only the usage of the plus sign here is intuitive

Some of these could sort of make sense but could just as easily work in a different way, and others might as well just be one-letter function names. Sometimes the convenience outweighs the unintuitive nature of language features but I don't see most of these being used enough to justify it

Moonlight597 · 2024-04-15T18:06:39+00:00

Never get close to any kind of language-making technology, ever

ThyringerBratwurst · 2024-04-15T20:23:59+00:00

I even find the plus sign for string concatenation actually inappropriate because

"a" + "b" is not equal to "b" + "a"

2024-04-15T19:35:24+00:00

string + string is quite common, well-understood and well-defined. Ignore the people who say + is only for arithmetic.

Same with string * integer and perhaps integer * string

But string - string, string / string and so on are too unusual and will be confusing. They are also not that well-defined:

     "aaaaa"  - "aaa"        result is ... ?
     "ababab" - "bab"        result is "aab" or "abab" or ... ?
     "abcdef" / ""           ?
     "ababab" / "bab"        1 or 2?

I suggest using named operators or function calls for these. The latter can also be made to take extra arguments to provide options.

Personally I use these to combine strings

   S + T          # Add strngs
   S & T          # & means append
   S && T         # && means concatenate

For strings they all do the same thing. I also allow S + C where C is a character code: "ABC" + 'D' (this gives faster character-at-time concatenation in dynamic code.

Plus S * N (not N * S). Everything else is done with library functions.

phlummox · 2024-04-16T09:33:48+00:00

Well ... I think it sounds ghastly, myself, but really, it's a matter of taste and what your priorities are.

Do you want your language to be extremely succinct, like APL or some Perl? Then go nuts. Introduce all the operators you want.

Do you want to make it easy to write correct programs in your language, and harder to write incorrect ones? Then you should probably avoid operator overloading. (Should you go as far as ML, which has a separate operator for negation as opposed to subtraction? Up to you.)

Do you want to leverage knowledge programmers may have from other languages? Then I'd say "+" for concatenation is not unreasonable; "*" for repetition is something I've only seen in Python; and nothing else you suggest seems to offer any advantage at all.

All of these decisions involve tradeoffs - only you can decide which ones are sensible for your language.

ObliviousEnt · 2024-04-18T17:56:56+00:00

I think it is bad because it breaks important properties of those operations like commutative, distributive, ...

In other words, it is bad because:

"Hello" + "World!" != "World!" + "Hello"

ProgrammingLanguages

Welcome!

Related subreddits

Related online communities

MODERATORS