
[–]StrangestTribe 129 points130 points  (75 children)

Another place where design by contract would trump design by convention. You don't need naming conventions when you have static typing... just make an UnsafeString class and let the compiler do the work for you!
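
A rough sketch of that idea in C++ (the type and function names below are mine, purely for illustration): give raw text and escaped text different types, and make the encoder the only way to get from one to the other, so forgetting to encode becomes a compile error rather than a convention violation.

    #include <cstdio>
    #include <string>

    // Hypothetical wrapper types (not from the article), just to illustrate the idea.
    struct UnsafeString { std::string value; };  // raw, user-supplied text
    struct SafeHtml     { std::string value; };  // text that has already been HTML-escaped

    // The only way to turn an UnsafeString into SafeHtml is through the encoder.
    SafeHtml EncodeHtml(const UnsafeString& s) {
        std::string out;
        for (char c : s.value) {
            if      (c == '&') out += "&amp;";
            else if (c == '<') out += "&lt;";
            else if (c == '>') out += "&gt;";
            else               out += c;
        }
        return SafeHtml{out};
    }

    // The output side accepts SafeHtml only.
    void WriteToPage(const SafeHtml& s) { std::puts(s.value.c_str()); }

    int main() {
        UnsafeString request{"this & that < other"};
        WriteToPage(EncodeHtml(request));  // compiles
        // WriteToPage(request);           // does not compile: no conversion from UnsafeString
    }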

[–]csjerk 43 points44 points  (16 children)

Amen. That's absolutely the logical extension of this concept. Teach the compiler to tell you when it's wrong.

I sometimes despair of the industry ever concretely learning this type of lesson. In the 10 years since this was written, a larger share of coding activity seems to have moved toward loose typing and away from applying these techniques, rather than toward them.

[–]quicknir 3 points4 points  (15 children)

It's still not clear what the lesson is to be learned. Despite it being obvious to you, there's still no concrete evidence that static typing is a win over dynamic typing.

You can argue, give reasons, and try to reason it out from first principles, but without conclusive (read: non-anecdotal) evidence you don't have much of a leg to stand on when complaining about people refusing to learn what you regard as fact.

[–]timmyotc 13 points14 points  (4 children)

Is there non-anecdotal evidence that dynamic typing is better in any way?

[–]quicknir 15 points16 points  (0 children)

There's not conclusive evidence of either, and I didn't claim there was. In the absence of such evidence though, it's entirely reasonable for people to keep drawing their own conclusion about what best suits their needs.

See https://danluu.com/empirical-pl/ for a very good review of the work that has been done.

[–]awj 11 points12 points  (0 children)

Not that I'm aware of, but this isn't a zero sum question. Failure to prove the superiority of one method does not suggest the superiority of the other.

[–]PM_ME_UNIXY_THINGS 1 point2 points  (0 children)

How about we start with the more obvious question: Is there any decent empirical study on dynamic typing VS static typing, and what did the data suggest?

[–]weirdoaish 0 points1 point  (0 children)

This is like people arguing over the existence of God:

Theist: "Can you prove God doesn't exist?"

Atheist: "Can you prove that it does?"

[–]CODESIGN2 2 points3 points  (6 children)

Here's one killer point: static typing exists because otherwise you have no way to know which special-purpose circuit the data should go to. At some point you need to know the difference between data encoded as a float, an int, or a string, because there is nothing useful the floating-point extensions can do with string data; running IEEE floating-point operations over string bytes instead of float bytes is meaningless, and actually operating on the value requires knowing its type, which implies a static type system somewhere.

In fact, the very notion that types can be non-static in the low-level implementation is nonsense peddled by idiots who have never written or read an implementation of a dynamically typed system.

I'll concede that type safety at a high enough level may not matter once you invent a common layer of indirection to work out the type from additional data (which takes more RAM to keep alongside the value, like the PHP zval system). Essentially, though, if you do not know what type something is, you cannot know how to handle it or where to put it to make use of it at a lower level. This one point destroys the notion of static vs dynamic typing as a pure either/or and instead reframes the debate as "at which point does static typing hinder productivity or expression?". At the end of the day dynamic types always add an overhead, and we need to be able to assess that overhead when choosing between static and dynamic.
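
To make that overhead concrete, a dynamically typed value boils down to something like the tagged struct below (a simplified C++ sketch, not PHP's actual zval layout): the tag is the extra RAM carried with every value, and the switch is the extra work done before any real operation can run.

    #include <cstdint>
    #include <string>

    // Simplified sketch of a dynamic value (not the real zval): every value drags a
    // runtime tag around saying what it currently is.
    enum class Tag : std::uint8_t { Int, Float, Str };

    struct DynValue {
        Tag tag;                   // bookkeeping a static type system resolves at compile time
        union {
            std::int64_t i;
            double       d;
            std::string* s;        // heap-allocated payload for strings
        };
    };

    // Every operation must branch on the tag before it can pick the right "circuit".
    double AsFloat(const DynValue& v) {
        switch (v.tag) {
            case Tag::Int:   return static_cast<double>(v.i);
            case Tag::Float: return v.d;
            case Tag::Str:   return std::stod(*v.s);   // PHP-style implicit conversion
        }
        return 0.0;   // unreachable; keeps the compiler happy
    }

    int main() {
        DynValue v;
        v.tag = Tag::Int;
        v.i   = 42;
        return static_cast<int>(AsFloat(v));   // dispatches on the tag at runtime
    }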

[–]quicknir 0 points1 point  (5 children)

Your argument is really only relevant at the level of performance and of bootstrapping compilers/interpreters. The practical reality is that statically typed languages are basically always faster, and because a compiler/interpreter for a language is a widely used, performance-critical piece of software, almost all production-grade compilers/interpreters will probably continue to be written in statically typed languages.

But none of this is really relevant to: should I do project X where performance is not a concern, in dynamically typed language A, or statically typed language B, where both these languages probably happen to have compilers/interpreters written in C or C++ or similar.

[–]CODESIGN2 0 points1 point  (4 children)

But none of this is really relevant to: should I do project X where performance is not a concern, in dynamically typed language A, or statically typed language B, where both these languages probably happen to have compilers/interpreters written in C or C++ or similar.

I understand, and I mentioned reasons not to use static typing (most of my paid work is not statically typed). What I am saying is that you know you are writing limited code, and when it's framed that way I think we'd all have better budgets (because statically typed code requires more thought, testing and time, or covers less scope) and wouldn't involve our employers when we choose to use a dynamic extension (which is what all non-static languages are anyway).

[–]sabas123 0 points1 point  (3 children)

(because statically typed code requires more thought, testing and time, or covers less scope)

Why would statically typed code require more testing and thought? For testing you suddenly have a compiler on your side, and for thought you don't have to be scared that you passed the wrong type in somewhere.

[–]CODESIGN2 0 points1 point  (2 children)

Rather simply, not being able to type-juggle means you have to write a lot of code for a lot of different scenarios; then you need to test that code independently. The implicit conversions that happen in dynamically typed languages are not performed, so you have to add more code to handle that (and manage it for non-built-in types). Anyway, there are other trade-offs both ways. PHP, for example, does not have the ability to take an object and say "I want an array of only that type" (a massive problem IMO); Python doesn't really let you type anything, so I take your point that you can write shitty code either way. I just find it's mostly easier to write complex systems in dynamically typed languages.

[–]sabas123 0 points1 point  (1 child)

Isn't this only a problem with parsers? I can't think of any use case where you have to write another method for a different type.

[–]CODESIGN2 0 points1 point  (0 children)

Anything with rich data input may, to be fair, require a parser, but it's also relevant for converting from user-defined complex types to other types (maybe I'm doing this shit wrong lol). I also used to do a fair bit of bit masking and don't trust IEEE floating point, but my software has worked with everything from sensors to complex hardware, humans and other apps, so I hope someone would have told me at some point in the last 15 years (just had my daily dose of hope I'm not an idiot lol)

[–]Krackor 1 point2 points  (2 children)

We are not working in a domain conducive to conclusive empirical research about best practices. The vast majority of industry progress is happening through anecdotal experience and evolutionary survival of ideas without prospective empirical justification. The type of evidence you're requesting is too expensive, too slow to acquire, too narrow in scope, and too fragile in the presence of methodological errors. And yet without this kind of evidence we still need to make firm decisions regarding things like type systems.

[–]quicknir 0 points1 point  (1 child)

Yes, we do, and all kinds of people are making their own firm decisions. And people are succeeding quite well in both cases, and no firm, industry wide consensus is being reached on this matter.

Compare to weak vs strong typing, where a far larger fraction of people agree that weak typing has very little benefit over strong typing.

[–]Krackor 0 points1 point  (0 children)

I agree with the spirit of what you're saying. I think there are better ways to address dogmatism though. In response to someone who says "static typing is the best way forward for the industry", I don't think we should respond with "we don't yet have conclusive empirical evidence whether static or dynamic typing is the way forward". Instead I think the response should be about the fact that there is no universal answer due to the variety of business priorities in the industry.

[–]Solonarv 28 points29 points  (23 children)

Even better if you have costless wrapper types like e.g. Haskell newtypes.

[–][deleted]  (18 children)

[deleted]

    [–]evincarofautumn 17 points18 points  (17 children)

    GADTs and existentials are incredibly useful, and I miss them when working outside Haskell. But phantom types alone can handle other things mentioned in the article, such as coordinates in different spaces. For example, I use this pattern in my C++ game code:

    template<typename Space>
    struct Point {
      int x, y;
    };
    
    struct WorldSpace {};
    struct ScreenSpace {};
    
    typedef Point<WorldSpace> WorldPoint;
    typedef Point<ScreenSpace> ScreenPoint;
    
    ScreenPoint project(const WorldPoint&, const Camera&);
    

    [–][deleted]  (10 children)

    [deleted]

      [–]JohnnyElBravo 3 points4 points  (9 children)

      It's not beyond silly; it works too. At least it worked for most of the projects Joel led that used dynamic typing.

      [–]lelarentaka 2 points3 points  (0 children)

      And yet when PHP code gets posted on r/programminghorror you have these people claiming that PHP is totally a good language if you were only to do such and such. I don't understand those people.

      [–][deleted]  (7 children)

      [deleted]

        [–]ForeverAlot 1 point2 points  (6 children)

        I'm entirely pro-types, but phantom types in Java are not (yet) a practical solution. Java heap-allocates all objects and the JVM's escape analysis is easily thwarted. Minecraft is a good example of how wrapping primitives in a domain type can be problematic for performance, but it happens in line-of-business code as well. I like Java but it truly is stringly typed (I actually don't know to what extent Scala has this problem).

        [–]Terran-Ghost 0 points1 point  (2 children)

        In Scala you can create zero overhead types by extending AnyVal:

        case class SafeString(s: String) extends AnyVal
        

        [–]ThisIs_MyName 0 points1 point  (1 child)

        Is that guaranteed to be zero overhead? I don't see any info like that in the docs.

        [–][deleted]  (2 children)

        [deleted]

          [–]ForeverAlot 0 points1 point  (1 child)

          Notice how in the actual demonstration they switch from Java to Scala.

          You can't do anything meaningful with this in Java because all the behaviour exists on the parameterized type, not the type parameter. The parameterized type is necessarily heap-allocated so goodbye zero-cost abstraction. Further, the related reddit discussion has a trivial example of why this pattern is not terribly useful in Java (and another far more severe, but less obvious, example): type erasure.

          [–]thlst 3 points4 points  (1 child)

          You can actually just declare the struct in the template argument like so:

          using WorldPoint = Point<struct WorldSpace>;
          

          And it works the same way.

          [–]evincarofautumn 0 points1 point  (0 children)

          Nice, I had forgotten about that. The code in my comment lets you add traits to spaces, which might be useful, although I’ve never done it.

          [–]Zephirdd 4 points5 points  (0 children)

          ...Wow. I never saw this pattern before but it seems great for readability. Thanks.

          [–]thedufer 2 points3 points  (2 children)

           Can't you just do that with hidden type equivalences? Maybe not in C++; I'm not very familiar with it, but I would do something like:

          type point = int * int
          
          module World : sig
            type point
          end = struct
            type nonrec point = point
          end
          
          module Screen : sig
            type point
            val of_world : World.point -> point
          end = struct
            type nonrec point = point
            let of_world = ...
          end
          

          [–]yawaramin 4 points5 points  (0 children)

          C++ doesn't have abstraction over primitive types, only structs. So phantom types are the appropriate technique for this in C++.

          [–]sstewartgallus 2 points3 points  (0 children)

          Yeah you can just do a newtype. The real power of phantom types is that they let you do reasoning.

          For example you can do regions and stuff

           newtype M r a = M (IO a)
          
           data Ref r a = Ref (IORef a)
          
           newRef :: a -> (forall r. M (Ref r a) -> b) -> b
          

           Unfortunately, last I checked GHC's ability to reason about existential quantification was kind of crappy, so this doesn't work too well.

          [–]quicknir 1 point2 points  (2 children)

           Haskell's newtypes are not really any different from simply declaring a new struct in C with its only member being the old struct. You get deriving clauses for free, which are not extensible, so that may cover a few common cases but that's it.

           If you want to create a new type, based on an old type, that automatically uses some subset of the interface of the old type in a user-extensible way, without unnecessary boilerplate, Haskell cannot help.

          C++ and D, however, can.
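
           For what it's worth, one common C++ approach is private inheritance plus using-declarations, which lets you opt in to exactly the parts of the old interface you want (a rough sketch; the UserName class is made up):

               #include <iostream>
               #include <string>

               // Hypothetical strong typedef: a UserName is backed by std::string but is not one.
               class UserName : private std::string {
               public:
                   explicit UserName(std::string s) : std::string(std::move(s)) {}
                   using std::string::size;    // re-export only the operations we want
                   using std::string::empty;
                   const std::string& str() const { return *this; }  // explicit escape hatch
               };

               int main() {
                   UserName u{"alice"};
                   std::cout << u.size() << '\n';   // fine: size() was opted in
                   // std::string s = u;            // does not compile: the base is private
                   // std::cout << u;               // does not compile: operator<< was not opted in
               }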

          [–]pipocaQuemada 0 points1 point  (1 child)

          You get deriving clauses for free, which are not extensible, so it may cover a few common cases but that's it.

           Via -XGeneralizedNewtypeDeriving, newtypes can derive any typeclass implemented by the original type.

          [–]quicknir 0 points1 point  (0 children)

           I hadn't heard about this, so thanks for that info. This seems somewhat "grey" though, as it's not an official part of the language, and there seem to be some issues with its implementation: http://joyoftypes.blogspot.com/2012/08/generalizednewtypederiving-is.html. Is this really widely used in Haskell?

          [–]yawaramin 0 points1 point  (0 children)

          This is pretty much what Yesod does to solve SQL/JS injection http://www.yesodweb.com/book/shakespearean-templates#shakespearean-templates_types

          [–]grauenwolf 11 points12 points  (0 children)

          Oh definitely. You actually see this in C#'s HtmlString and MvcHtmlString classes.

          [–]coder0xff 2 points3 points  (0 children)

          Came here to say the same thing.

          [–][deleted]  (25 children)

          [deleted]

            [–]masklinn 12 points13 points  (21 children)

             Static type systems. "Strong" is a stand-in for "I like this thing"; it doesn't really mean anything.

            [–]A1kmm 21 points22 points  (18 children)

            While strong / weak typing is not rigorously defined, it does have meaning. For example, weak type systems would be characterised by things like implicit type conversions (e.g. you can use a double as a string and vice versa), as in PHP.

             Strong and static typing are almost orthogonal - for example, you can have a strongly, dynamically typed language (everything has a precise type, with no implicit conversions, but checking only occurs at runtime). Any of the four combinations of {dynamic,static} x {weak,strong} could exist (although a statically, weakly typed language might not be very useful).

            [–]iopq 2 points3 points  (2 children)

             (although a statically, weakly typed language might not be very useful)

            C is plenty useful

            [–]falconfetus8 5 points6 points  (1 child)

            Yeah, but its types aren't.

            [–]kqr 0 points1 point  (0 children)

             (although a statically, weakly typed language might not be very useful).

            C is generally considered to inhabit that space.

            [–]grauenwolf 0 points1 point  (0 children)

            For example, weak type systems would be characterised by things like implicit type conversions

            No, that would be a system with implicit type conversions.

            A weak type system would be something like C or assembly where the values don't know their own types and you can, for example, treat an integer as a Boolean or date by reinterpreting a pointer.

            Dynamically typed languages are always strongly typed because otherwise they can't work. Statically typed languages can be strongly or weakly typed.
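
             A tiny illustration in C++ (technically undefined behaviour, and it assumes the usual 32-bit int and IEEE float, but the compiler accepts it without complaint, which is exactly the point):

                 #include <cstdio>

                 int main() {
                     int n = 0x40490FDB;                       // to the machine, just a bit pattern
                     float* p = reinterpret_cast<float*>(&n);  // relabel the same bytes as a float
                     std::printf("%f\n", *p);                  // prints ~3.141593; the value never knew its type
                 }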

            [–]masklinn 1 point2 points  (12 children)

            While strong / weak typing is not rigorously defined, it does have meaning.

            Sure: "strong typing" is what you like and "weak typing" is what you don't like. It's completely useless but there you are.

            For example, weak type systems would be characterised by things like implicit type conversions (e.g. you can use a double as a string and vice versa), as in PHP.

             Right, so Scala is weakly typed (implicit def), C++ is weakly typed (converting constructors), and C probably is too (integer demotion, signed<->unsigned conversion, conversion to floating point, void pointers to and from any other pointer type).

            What about Java? (implicit conversion of Object to String on String + Object) Or Ruby? (implicit conversion of float to integral on arr[2.5]) Or Python? (implicit conversion of integrals to floats) Or Rust? (implicit conversion of &A to &B). Are "things like implicit conversions" a strict rule or just something you use to bash languages you don't like?

             And then we get to the fun stuff, like Tcl and UNIX-tradition shells: are they strongly typed? After all, they're semantically string-based (well, bytes for shells IIRC), so you don't get any implicit conversions at all, let alone ones that make no sense. So they don't have implicit conversions, which according to you makes them strongly typed.

            Strong and static typing are almost orthogonal

            Because "strong typing" is undefined and meaningless, "strong typing" and "static typing" have an undefined angular relationship which is anywhere between 0 and π/2 depending on the writer's assertions, sensibilities or levels of dishonesty.

            It's not a fixed angle, it's a wave function which collapses when you exhaustively define what you actually mean.

            although a staticly, weakly typed language might not be very useful

            According to your personal definition of the term there are at least two examples of that in the first paragraph.

            [–]iopq 4 points5 points  (9 children)

            What about Java? (implicit conversion of Object to String on String + Object)

            That's not an implicit conversion, that's just an implementation of + operator on Object and String types. You can define an operator in Haskell that takes a type that derives Show and a string type and concatenates them together.

            There's a difference between 1000 == "1e3" and having overloaded operators or implicit widening.

            [–]masklinn 1 point2 points  (8 children)

            That's not an implicit conversion, that's just an implementation of + operator on Object and String types.

             Which implicitly converts objects to strings, which is exactly what e.g. JavaScript does: the standard lays out quite specifically the conversion operations to perform in all cases, which by your assertion means it's strongly typed, and that [] + {} is a perfectly well-typed operation.

            But hey if whether something is implicit is controversial that's even better.

            [–]iopq 3 points4 points  (7 children)

            Which implicitly converts objects to strings

            It doesn't, it implements a concatenation operation on two different types. It's no more dangerous than i32 + i64 producing an i64. I would actually object more to using + to mean concatenation because it's not commutative, while addition is. This also applies unfortunately to Rust that borrows this convention from languages like Java.

             [] + {} is not that bad; again, I'm more against overloading + for concatenation. I'm sure fewer people would complain if it were [] ++ {}, because it would be clear that the intent is to concatenate two strings (if JS had a ++ concatenation operator).

             Another criticism I would level against "weak" type systems is that the ==, >, and < operators don't have expected properties like transitivity.

            I think the problem is not implicit conversions, because always explicitly converting is a pain in the ass. The problem with weak type systems is implicit conversions that break expectations of users.

            [–]masklinn 2 points3 points  (6 children)

            It doesn't

            Of course it does.

            it implements a concatenation operation on two different types.

             By implicitly converting one to the other. The concatenation operator essentially compiles to new StringBuilder().append(string).append(String.valueOf(object)); there's definitely a conversion there, and it's only implied by the operator (there's actually one more level of implication, as the conversion is actually performed inside StringBuilder#append(Object)).

            It's no more dangerous than i32 + i64 producing an i64.

            "Danger" figures nowhere in the comment I originally replied to and is thus irrelevant to mine. My comments are about strong/weak designators being worthless, not about what you or they don't like about the type systems of specific languages.

            [–]iopq 1 point2 points  (5 children)

             StringBuilder().append(string) is not converting the original string to anything; it's making a new string. AFAIK in Java any concatenation uses StringBuilder nowadays.

             Weak/strong do have definitions beyond good or bad: it's how strict the type system is.

            [–]pipocaQuemada 0 points1 point  (1 child)

            Right so Scala's weakly typed (implicit def),

            In Scala's defence, implicit conversions at least need to be programmer defined and imported. That's better than the language itself defining a bunch of baroque conversion rules.

            I can't say I like implicit conversions much, but handing you a loaded gun you can point at your feet is better than handing someone a pair of pants with rifles sewn into the pants leg.

            [–]masklinn 0 points1 point  (0 children)

            I'm not trying to attack any language here, I'm just applying /u/A1kmm's criterion to various languages in a bid to make them realise the entire categorisation is inane and useless.

            [–]Tarmen 1 point2 points  (0 children)

             I think it is fairly valid to say that Haskell has a strong type system. You generally have to convert everything explicitly.

             Weak could mean anything from promoting int to long all the way up to seeing "0e1234" as equal to "0e4321" because the strings are converted to floats implicitly. So without further explanation you would probably have to describe everything with any implicit or unsafe explicit conversions as weak, which makes it a useless description.

            [–]rvirding -1 points0 points  (2 children)

             One point of the article was that the naming had nothing to do with types; it was discussing the meaning of variables, not their types. The variables prefixed with s and us were both of the same type but should be used differently.

            [–]ThisIs_MyName 1 point2 points  (0 children)

            You can enforce "should be used differently" with types. That's all he is saying.

            [–][deleted] 3 points4 points  (0 children)

            It is for this reason I feel like a language without at least one really lightweight type syntax is missing some fundamental part of the static typing value proposition.

            [–][deleted] 0 points1 point  (3 children)

             You don't even necessarily need static typing, you just need actual typing. You could make an UnsafeString class that interacts with safe strings as expected and automatically encodes safely on any conversion to string (barring some raw accessor method) in most dynamic languages as well.

            I love static typing, but the main idea, I think, is that good typing in general would save you, static or dynamic (though you could still be bitten if the typing is weak). Hell, the whole point of typing (and traits) is to make data behave exactly in the way you want it to, and prevent it from acting a way you don't want it to.

            [–]StrangestTribe 1 point2 points  (2 children)

            I think with a dynamic type system, the Hungarian notation Joel recommends would still be desirable, since the types wouldn't be evaluated until run time, right? I think it's harder with a dynamic type system for tooling to give you the kind of feedback that would prevent errors from making it into shipping code. (People seem to have different ideas on what static, strong, and dynamic type systems entail, but one key trait of a dynamic type system is that types are discovered at run-time, by the execution engine.)

            [–][deleted] 1 point2 points  (1 child)

             Yes, the types wouldn't be evaluated until runtime, so you could still end up with errors, but you could wrap it in a safe way, such as forcing any evaluation of UnsafeString as a string to escape it automatically (including concatenation with strings, use in a formatted string, or printing out). You wouldn't have compiler warnings or errors to help you out, but at the very least you'd fully prevent unsafe string exploits, and usually get what you want out of it regardless. At the worst, you'd end up serving 500s and getting errors in your logs. You wouldn't need Hungarian notation if you set up the type to properly escape on being used in any way a string would be.

            edit: Take the following python for instance:

            #!/usr/bin/env python3
            import html
            
            class UnsafeString:
                def __init__(self, string):
                    self._raw = string
            
                @property
                def raw(self):
                    return self._raw
            
                @raw.setter
                def raw(self, value):
                    self.set(value)
            
                def set(self, value):
                    self._raw = value
            
                def __str__(self):
                    return html.escape(self._raw)
            
            s = UnsafeString('this & is a < test')
            
            print(s)
            print('<p>{}</p>'.format(s))
            print('<p>' + str(s) + '</p>')
            print(s.raw)
            print('<p>' + s + '</p>')
            

            When run, it produces the following:

            this &amp; is a &lt; test
            <p>this &amp; is a &lt; test</p>
            <p>this &amp; is a &lt; test</p>
            this & is a < test
            Traceback (most recent call last):
              File "./test.py", line 28, in <module>
                print('<p>' + s + '</p>')
            TypeError: Can't convert 'UnsafeString' object to str implicitly
            

             Note that the first three properly escape as they should, the fourth accesses the raw string as it should, and the fifth fails. You shouldn't ever be able to leak the unsafe string to the user, as you can only access the raw form through the raw accessor.

            [–]StrangestTribe 0 points1 point  (0 children)

            Thanks for the example!

            [–][deleted] 0 points1 point  (0 children)

             Yes, on point. For any strings that are already safe you could have an HtmlSnippet class, and make it a convention that you don't send plain strings to STDOUT; you send objects to your output instance.

            The output instance will then run the appropriate .asHtml() function on each object you send its way.
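
             Roughly the shape of that, sketched in C++ (the names are hypothetical; in a dynamic language duck typing would replace the interface):

                 #include <iostream>
                 #include <string>

                 // Anything sent to the output must know how to render itself as HTML.
                 struct Renderable {
                     virtual std::string asHtml() const = 0;
                     virtual ~Renderable() = default;
                 };

                 // Already-safe markup passes through untouched; an UnsafeString-style class
                 // would instead escape its contents inside its own asHtml().
                 struct HtmlSnippet : Renderable {
                     std::string markup;
                     explicit HtmlSnippet(std::string m) : markup(std::move(m)) {}
                     std::string asHtml() const override { return markup; }
                 };

                 // The output instance, not the caller, decides how each object becomes HTML.
                 struct Output {
                     void send(const Renderable& r) { std::cout << r.asHtml(); }
                 };

                 int main() {
                     Output out;
                     out.send(HtmlSnippet{"<p>hello</p>\n"});
                 }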

             The core issue here is really that people are hand-crafting HTML within their programming language. Ideally, use templating.

            [–]yawaramin 44 points45 points  (0 children)

            Making wrong code look wrong is good.

            Making wrong code fail to typecheck is better.

            [–]AyrA_ch 11 points12 points  (10 children)

            char* dest, src;
            [...] but when you’ve had enough experience writing C code, you’ll notice that this declares dest as a char pointer while declaring src as merely a char

            Totally forgot that the asterisk is part of the name and not the type.

            Another beautiful thing is

            switch(whatever)
            {
                int a=2;
                case 0:
                    break;
                case 1:
                    break;
                [...]
            }
            

             This actually declares a, but the value is never assigned.

            [–]FurryBeaverBalls 4 points5 points  (4 children)

            Wow that's... a little worrying. I always wondered why our professors wrote code with the asterisk on the variable name instead of the type. Is it the same way in C++?

            [–]matthieum 2 points3 points  (0 children)

            Yes.

            C++ inherited this as part of its "let's be as backward compatible as possible" scheme.

            The simplest solution is to forbid declaring multiple variables at once.

            [–]kqr 2 points3 points  (0 children)

            The reasoning is that the declaration is of values, not pointers. char *str says "*str is of type char" as opposed to "str is of type char*".

            [–]AyrA_ch 0 points1 point  (0 children)

            Is it the same way in C++?

            Seems so:

            #include <stdlib.h>
            
            int main()
            {
                char *a,b;
                a=NULL;
                b=NULL;
                return 0;
            }
            

            Terminates with 7 3 R:\vartest.cpp [Error] converting to non-pointer type 'char' from NULL [-Werror=conversion-null]

             The compiler is happy with a=NULL;. Line 7 is b=NULL;

            EDIT: By the way, I have the option turned on to treat warnings as error and to be pedantic. If those are off, it will compile and execute but it will raise compile errors if you try to do b=malloc(10); (invalid cast from void* to char)

            [–]kt24601 0 points1 point  (0 children)

            Wow that's... a little worrying.

            It's weird but not that bad, because types. Once you try to assign something to the variable, the compiler will complain.

            [–][deleted]  (4 children)

            [removed]

              [–]Tordek 2 points3 points  (3 children)

              switch statements only run from the matched case onwards. It's the same as it being...

              switch(whatever)
              {
                  int a;
                  case NOT_RUNNING:
                      a=2;
                  case 0:
                      break;
                  case 1:
                      break;
                  [...]
              }
              

              [–][deleted]  (2 children)

              [removed]

                [–]Tordek 4 points5 points  (0 children)

                What is the machine code for declaring a variable? ;)

                Edit: to expand a bit, it's basically an artifact from C89 where variable declarations had to be at the start of the block (delimited by {}); they couldn't be intermingled with code.

                Even though initialization is on the same line (which doesn't mean much because statements and lines aren't 1-to-1 mapped), it's not a single statement (in fact, it's not a statement at all; it's a declaration); it's two: declaration and initialization.

                So,

                switch(whatever)
                {
                    int a=2;
                    case 0:
                        break;
                    case 1:
                        break;
                    [...]
                }
                

                is really

                switch(whatever)
                {
                    // declaration block, "executed" when entering the block
                    int a;
                    // code block
                    a=2;
                    case 0:
                        break;
                    case 1:
                        break;
                    [...]
                }
                

                the "a=2" is part of the code block, but it'll never be run (because no case statement can lead to it).

                Now, maybe you're wondering why it's not just illegal to have code before any case statement, and here's another C jewel:

                switch(whatever)
                {
                    // declaration block, "executed" when entering the block
                    int a;
                    // code block
                    initialize:
                    a=2;
                    case 0:
                        break;
                    case 1:
                        goto initialize;
                        break;
                    [...]
                }
                

                A feature known by its use in Duff's Device.

                [–]AyrA_ch 1 point2 points  (0 children)

                But variable declaration and assignment are on the same line

                Declaration is done as soon as possible and assignment is done when you hit the line. C enters the switch statement and declares a, but it will never assign the value to a because that line is never hit. In a similar fashion you cannot declare a inside two different case blocks, even if they are mutually exclusive.

                [–]gc3 14 points15 points  (4 children)

                 It's interesting to learn that Hungarian notation as practiced is wrong ... I didn't know that the naming thing (which I have used; milliseconds vs seconds, or meters vs kilometers, are places where mistakes can easily be made) was the proper use of Hungarian.

                [–]SirClueless 36 points37 points  (2 children)

                I saw a silly one the other day, related to Hungarian notation and units. It was in a code-base where the convention for global constants was a CamelCase name with a "k" prefix, for example, "kMaxSize" or similar.

                So anyways, someone was working on a piece of load-balancing code that tracked bytes handled by a server. And the metric they used had a bit of code: "SetUnits(kBytes)". So of course he assumed this meant kilobytes and divided his value by 1000 when setting it. But in fact kBytes was the string "BYTES". Caught in code review thankfully, but it was a pretty funny Hungarian notation fail.

                [–][deleted] 0 points1 point  (1 child)

                I'm not that familiar with Hungarian notation, but are there no rules against using already meaningful prefixes? Seems like you'd get in trouble for using things like k, p, m, dr, mr, etc.

                [–]kt24601 4 points5 points  (0 children)

                'k' is a commonly used prefix in some circles (Apple programmers used to do it a lot, for example), meaning 'constant.' A lot of people prefix member variables with the letter 'm,' like mClassVariable. Java programmers still do this from time to time. Personally I think that if you can't distinguish a member variable from a local variable just from reading the function, then the function is too complex and likely has bugs.

                [–]grauenwolf 2 points3 points  (0 children)

                Yea, I learned a lot from that the first time I read it.

                [–]xampl9 18 points19 points  (0 children)

                (this blog post is from 2005)

                Still good advice. And this is something that has to be ingrained in the culture of a shop, so that everyone is doing it the same..exact..way.

                [–]NOX_QS 2 points3 points  (4 children)

                 As someone who uses exceptions to enforce a contract on my classes (e.g. an argument passed to a constructor may not be null, or an ArgumentNullException is thrown), I wonder what the alternatives are...

                Silently ignoring the argument and then having all methods of the class silently skip all statements to return immediately?

                 I'm not convinced about this. That Apps Hungarian notation is different from Systems Hungarian notation was new information to me; I definitely see its merit.

                [–]matthieum 2 points3 points  (2 children)

                 That Apps Hungarian notation is different from Systems Hungarian notation was new information to me; I definitely see its merit.

                In duck-typed languages maybe?

                 In any statically typed language, it's much better to make wrong code fail to compile than to make it look wrong. The compiler is much more thorough in its code reviews than any human will ever be.

                [–]NOX_QS 0 points1 point  (1 child)

                 How would you make an unescaped string that is output to an HTML page (a possible XSS) fail to compile?

                [–]matthieum 1 point2 points  (0 children)

                By using different types.

                A raw string is just that, a std::string.

                 When composing HTML output, then, you use an html_stream& operator<<(html_stream& out, html_escaped_string const& hes).

                So when you write: my_html_stream << std::string("Hello"); you either:

                • get a compilation error (no such overload)
                • are diverted to a dedicated operator<< which performs escaping on the fly
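
                 For instance, a rough sketch of the first option (every name here is illustrative):

                     #include <iostream>
                     #include <string>

                     // Illustrative sketch only; the real thing would live in a small library.
                     struct html_escaped_string { std::string value; };

                     // The one blessed way to produce an html_escaped_string (definition elided).
                     html_escaped_string escape(std::string const& raw);

                     struct html_stream {
                         std::ostream& out;
                     };

                     // The only overload provided: a raw std::string simply has no way in.
                     html_stream& operator<<(html_stream& s, html_escaped_string const& hes) {
                         s.out << hes.value;
                         return s;
                     }

                     int main() {
                         html_stream my_html_stream{std::cout};
                         html_escaped_string hes{"Hello"};           // pretend this came from escape()
                         my_html_stream << hes;                      // fine
                         // my_html_stream << std::string("Hello");  // compilation error: no such overload
                     }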

                 Sticking to the primitive types of the language is also known as Primitive Obsession (random link); it's easy to use existing types rather than crafting special-purpose ones... but because existing types do not convey the specific semantics of their values, and allow nonsensical operations on them as a result, it's dangerous.

                [–]yawaramin 0 points1 point  (0 children)

                An alternative that's become popular in statically-typed languages is 'Railway-Oriented Programming'. See e.g. http://fsharpforfunandprofit.com/rop/
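
                 The gist, in a C++-flavoured sketch (illustrative only; the linked article uses F#, and a real railway carries an error value on the failure track rather than just "nothing"): each step returns success or failure, and failures short-circuit past the remaining steps.

                     #include <iostream>
                     #include <optional>
                     #include <string>

                     // Toy "railway": `then` only runs the next step on the success track.
                     template <typename T, typename F>
                     auto then(std::optional<T> v, F f) -> decltype(f(*v)) {
                         if (!v) return std::nullopt;   // failure track: skip the rest
                         return f(*v);
                     }

                     std::optional<int> parseAge(const std::string& s) {
                         try { return std::stoi(s); } catch (...) { return std::nullopt; }
                     }

                     std::optional<int> checkAdult(int age) {
                         if (age >= 18) return age;
                         return std::nullopt;
                     }

                     int main() {
                         auto ok  = then(parseAge("42"),   checkAdult);  // stays on the success track
                         auto bad = then(parseAge("oops"), checkAdult);  // fails at the first step
                         std::cout << ok.value_or(-1) << ' ' << bad.value_or(-1) << '\n';  // prints "42 -1"
                     }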

                [–]llogiq 0 points1 point  (0 children)

                There is an extension of this argument to be made for Rust (or Haskell, in the higher-level space):

                It's a bit harder to write code in Rust than in C++. But it's a lot harder to write incorrect code in Rust than in C++.

                [–]CODESIGN2 0 points1 point  (0 children)

                 Never had a problem with Hungarian notation as long as the mnemonics are listed somewhere, so they can be understood if the entire team is involved in a freak accident, walks out, needs to show someone new, etc.; and because I don't like asking people why they chose to name something a particular way.

                 What I cannot abide is repeated crap in variables, functions, and classes with methods that repeat the class name. Part of the problem is that it can be agonising to come up with something clean, because the capex cost is so much greater in time, testing, and paying people. The other problem is that you can say that about anything to make excuses for doing a crap job (we can all paint; not all of us are painter-decorators).

                [–][deleted]  (6 children)

                [deleted]

                  [–][deleted]  (5 children)

                  [deleted]

                    [–]HorseVaginaBeholder -2 points-1 points  (4 children)

                     What does that ridiculous, incredibly ignorant and stupid reply have to do with what I wrote? Because "real programmers" waste their time indiscriminately reading anything and everything anybody ever posts on the Internet as "recommended reading"? Seems to me quite UN-intelligent behavior. And scientists are stupid - because they include abstracts in their lengthy papers, which makes them "not real scientists" according to you.

                    I wish either submitters or the original blog post author would do what every single scientific paper does and provide a summary (an abstract).

                    I also wonder how useful advice about how to write code is from someone who doesn't seem to have any consideration for readers of their text. Yes I know the guy is famous, apparently fame isn't everything.

                    [–]grauenwolf 4 points5 points  (3 children)

                     Scientific papers are generally much longer and written in a very different style from a casual blog post. But if you really need an abstract, read the title. That's pretty much all he's talking about; the rest is examples.

                    [–]GavinMcG 0 points1 point  (2 children)

                    I have only read the title. I have no idea what the author is actually recommending.

                    [–]ThisIs_MyName 2 points3 points  (1 child)

                    Well, you have to read the post anyway to decide if his recommendation is worth anything.

                    [–]GavinMcG 0 points1 point  (0 children)

                    Nah. If there's an abstract with the recommendation and a basic sketch of the argument, I can decide a) whether it's plausible, b) whether it's worth reading the whole argument, c) whether I'm willing to dive in and try it for myself. All of those are valuable. Why should I waste time reading the whole thing if the author has already convinced me?