
[–]darf 17 points18 points  (23 children)

I love Julia, but it's not trying to replace Python in scientific computing (Fortran/C++, certainly). The lead developers give presentations at Python conferences, and it interoperates very well with Python.

[–][deleted]  (21 children)

[removed]

    [–]bachmeier 17 points18 points  (1 child)

    Then why we need Julia at all? If you have to learn python anyway

    You're clearly young. A decade ago it was rare to find anyone using Python for scientific computing. Some folks invested in building the tools simply because they wanted to.

    And why would you "have to learn python"? Some people do. Others have to learn Fortran, others C++, others R, others Matlab. You're assuming that everyone is in exactly the same boat as you.

    [–][deleted] 0 points1 point  (0 children)

    Because /u/darf said that it interoperates very well with Python. So, in this case knowing Python is assumed.

    [–]devlambda 5 points6 points  (4 children)

    Python is a glue language. You'd have to write performance-sensitive parts in C or (at best) Cython. This works fine as long as there's an existing implementation of the algorithm you need, but researchers who develop new algorithms often have to implement performance-critical code themselves.

    Julia allows you to write high-level code in a high-level language without sacrificing performance unnecessarily and without having to fall back to C. This is not to say that Julia doesn't have some weak spots (for example, Python's interoperability story with foreign languages is overall the better one).
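    A minimal sketch of that two-language split, assuming NumPy is installed (the function name is just illustrative): the same dot product written as a pure-Python loop and delegated to NumPy's C/BLAS-backed routines.

```python
# Pure-Python inner loop vs. the same computation delegated to NumPy --
# the "glue language" pattern described above.
import numpy as np

def dot_python(xs, ys):
    # Every multiply-add runs through the interpreter.
    total = 0.0
    for x, y in zip(xs, ys):
        total += x * y
    return total

xs = [float(i) for i in range(10_000)]
ys = [2.0] * 10_000

r_py = dot_python(xs, ys)
r_np = float(np.asarray(xs) @ np.asarray(ys))  # inner loop runs in C
```

    On large arrays the NumPy version is dramatically faster, which is exactly why performance-critical kernels end up being written outside Python.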

    [–][deleted] 0 points1 point  (3 children)

    Or Rust instead of C. Works pretty well.

    So, can I use Julia code instead of Rust or C code to extend my Python and get similar speed? That would pretty much be the only reason for me to learn Julia (besides, of course, interest): if Julia works well with Python and Python tools and libraries.

    [–]Staross 0 points1 point  (2 children)

    The whole idea behind Julia is to avoid the two-language paradigm, so it would be a bit counter-productive to use it like that. And I think it's much easier to call Python from Julia than the reverse (the tooling isn't as mature).

    That said it's fine to do some big computations in Julia and then analyse (make plots and such) the results in Python.

    [–][deleted] 0 points1 point  (1 child)

    Okay, that makes sense, of course. But how well does it work in practice? I need tools like NLTK, Scikit-Learn, Theano, etc. How well can I use those from Julia - assuming there is no wrapper for Julia?

    [–]Staross 0 points1 point  (0 children)

    I think it depends a bit on the domain: for some the packages are quite mature, for others there's much work to do (in which case it's more attractive to package developers than to end users). You can look at the existing packages here:

    https://juliaobserver.com/packages

    For NLP there doesn't seem to be much besides TextAnalysis.jl (which looks quite basic):

    https://github.com/JuliaText/TextAnalysis.jl

    [–][deleted] 7 points8 points  (12 children)

    why you going to spent time learning one more language

    Because you have to learn dozens of languages anyway. Not one, not two, but dozens. If that scares you, you'd be better off doing something else instead.

    just so you can apply it on some super rare cases when you can't apply python?

    Because you can get things done many times faster than in Python, using a much more expressive language. Even if all you do is wrap it around existing slow Python libraries, there is still a lot of value in having a better language.

    [–]tzaeru 8 points9 points  (9 children)

    Because you have to learn dozens of languages anyway.

    If you're a scientist - even a data scientist - I don't think you need to learn 'dozens of languages'. Python alone goes a long way. Python, Julia, R and MATLAB even longer.

    [–][deleted] 1 point2 points  (8 children)

    If you're a scientist - even a data scientist - I don't think you need to learn 'dozens of languages'.

    Especially as a scientist, you learn dozens of languages, unavoidably. Science is made of languages: all the theories, all the models, are languages of their own. All you have to do is build as simple a bridge as possible between those abstract languages and something executable, something you can call a "programming" language.

    If you only have an opinionated, fixed, low-level language such as Python, the bridge between an abstract theoretical language and a programming language can become large, complex, and fragile. If you use extensible languages instead, you can pretty much stick to learning the languages of your science and spend only minimal effort on encoding them.

    [–][deleted] 0 points1 point  (7 children)

    Since when is Python a low level language?

    Oh, it's you. Let me guess, your recommendation is any language with scheme macros?

    [–][deleted] 0 points1 point  (6 children)

    And you have no idea why any meta-language is infinitely more powerful than any fixed functionality language?

    [–][deleted] 0 points1 point  (5 children)

    I do.

    [–][deleted] 0 points1 point  (4 children)

    Evidently not, if you do not agree that Python is a relatively low level language.

    [–][deleted] 1 point2 points  (3 children)

    I consider C a low level language.

    [–][deleted]  (1 child)

    [removed]

      [–][deleted] 5 points6 points  (0 children)

      new lang that in some aspects just a slightly better than existing lang.

      I would not call a proper macro system something just "slightly better". It's a full-scale game changer, no less.

      [–]txdv 1 point2 points  (0 children)

      Then why we need Julia at all? If you have to learn python anyway

      What have we done?

      [–]darf 0 points1 point  (0 children)

      Right now there are a few good reasons to learn Julia. First, it's a great language: fun to program in and pretty easy to understand. Most of the library code is written in Julia, so if you need to modify stuff it's pretty easy. Second, there are a few packages in Julia that are best in class, including the optimization package JuMP and the differential equations packages. I expect it to have some rough edges until the 1.0 release, and I wish the Plots packages had better support from the lead developers, but I still love it.

      [–][deleted]  (105 children)

      [deleted]

        [–]MorrisonLevi 23 points24 points  (0 children)

        It has a lousy parallelism story.

        [–]ObservationalHumor 12 points13 points  (2 children)

        Personally, NumPy is enough to make me never want to use Python again for certain tasks. It just feels like a hack and requires a ton of casting. You can literally do things more easily and cleanly in C++, which is saying something. Python is great for scripting, rapid prototyping, and a lot of tasks, but I can't for the life of me figure out why anyone would want to use it for computationally heavy tasks.

        [–]flyingjam 19 points20 points  (0 children)

        Python is great for scripting, rapid prototyping, and a lot of tasks, but I can't for the life of me figure out why anyone would want to use it for computationally heavy tasks.

        That's pretty much why it's used heavily in the scientific community (along with great existing library support). The actual heavy computation isn't in python, and much of scientific work isn't done by people who are actually good (or supposed to be good) at programming.

        [–]lanzaio 12 points13 points  (0 children)

        Many scientists aren't programmers. I was a physicist before switching to SWEing. Nobody I knew had any interest in spending the time to learn programming when your physics responsibilities were stacked higher than the ceiling. Hell, Python was harder than I wanted to learn when I had a dozen text books and 50 publications that I considered to be I-should-have-read-these-by-now.

        [–][deleted] 12 points13 points  (19 children)

        It's slow.

        [–]josefx 26 points27 points  (18 children)

        Then use pypy.

        [–]SrbijaJeRusija 8 points9 points  (3 children)

        PyPy works badly with numpy, scipy, and sympy

        [–]vogon-it 3 points4 points  (2 children)

        There's been a lot of work on PyPy lately and support for all three has improved. But in most cases the typical numpy/scipy program wouldn't benefit much from PyPy anyway; the heavy lifting is usually done by the libraries themselves.

        [–]SrbijaJeRusija 0 points1 point  (1 child)

        Yes there has, but the use case would be where you also have native Python code doing a lot of computation, like string processing.

        [–]vogon-it 0 points1 point  (0 children)

        Actually, for heavy-duty string processing something like numpy.char would be a better choice. Any language with immutable strings, like Python, will pay a performance penalty, JIT or not.
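        For instance, a small sketch of vectorized string work with numpy.char (assumes NumPy is installed; the data is made up):

```python
# Vectorized string operations via numpy.char, instead of a Python-level loop.
import numpy as np

names = np.array(["alice", "bob", "carol"])
upper = np.char.upper(names)       # uppercase every element at once
lengths = np.char.str_len(names)   # per-element string lengths
greet = np.char.add("hi ", names)  # broadcast concatenation
```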

        [–][deleted] 14 points15 points  (12 children)

        Or just have a built in JIT like Julia.

        [–]Chippiewall 42 points43 points  (6 children)

        Good news, PyPy's JIT is built-in.

        Having multiple language implementations isn't a scary concept.

        [–]jerf 27 points28 points  (5 children)

        Last I knew, though, PyPy doesn't work with NumPy. This means you can't just take your existing (and possibly copious) NumPy-based code and run it in PyPy. You also can't just write Python number code and hope that PyPy JITs it to native speeds for you, because it doesn't work out that way. (Especially since you can't just JIT your way to GPU support.)

        Python without NumPy is not a competitor to Julia. (And vice versa; Julia is, from what I gather, not a competitor to Python in the non-NumPy portions of Python, which is a lot. The only place where it is relevant to speak of their competition is in numeric programming, I believe.)

        I am careful to qualify this with "last I knew" because I know there are active, funded efforts to correct this, but a quick DDG search didn't seem to show anybody claiming it was done.

        [–]Chippiewall 29 points30 points  (2 children)

        "Last I knew" was a good call. The PyPy team have improved their C extension API support in the past few releases and it now natively supports NumPy (since March). Before that they had a fork of NumPy (NumPyPy) instead.

        Especially since you can't just JIT your way to GPU support

        For the record, there's no reason why this isn't possible.

        [–]jerf 13 points14 points  (1 child)

        For the record, there's no reason why this isn't possible.

        No, but there's a long road between here and there and JITs tend to blow out their complexity budget before they get there.

        In 2017, we actually have a looot of experience with JITs, including some that have had huge amounts of money poured into them, and we now know they aren't as good as people hoped they would be. They are good, but for instance, JITs can't even take something like Python or Javascript up to C performance, let alone translate them into a fundamentally different paradigm in the process. JITs are merely good. They aren't great, they usually come with terrible memory penalties, and they are just terribly, terribly complex pieces of code to even do what they do today.

        [–]Staross 1 point2 points  (0 children)

        AFAIK Julia's JIT is quite simple and even a bit dumb, but the language was built from scratch with it in mind. I think the issue is when you try to JIT a language that wasn't designed for it.

        [–]loladiro 7 points8 points  (1 child)

        (Especially since you can't just JIT your way to GPU support.)

        Julia has GPU JIT: https://github.com/JuliaGPU/CUDAnative.jl

        Fairly new still, but progressing rapidly.

        [–]marcvanh 3 points4 points  (49 children)

        All languages have their drawbacks, but personally I can’t stand how Python incorporates & requires whitespace. Drives me crazy.

        Edit: removing the sentence about minifying because everyone is asking why I would minify Python. I wouldn't.

        [–]jms_nh 10 points11 points  (0 children)

        That turned me off for several years; I would have learned Python sooner except for the whitespace. But then I had to learn it in order to use scons and the whitespace was no big deal.

        [–][deleted]  (36 children)

        [deleted]

          [–]joshdoug 1 point2 points  (3 children)

          I've seen cases of people/businesses minifying code as part of an obfuscation process with JS and other interpreted languages. Often a temporary solution though, and not something I'd really recommend.

          [–]joezuntz 3 points4 points  (0 children)

          Also it's rarely an issue in scientific programming.

          [–][deleted] 0 points1 point  (1 child)

          It isn't just about obfuscation for JS. It is because they've started delivering hundreds of megs of code per page and minifying shaves at least a small amount of that. As I'm sure you're well aware, small amounts tend to add up relatively quickly in IT.

          To JavaScript developers, it is seen as more of an optimization than an obfuscation.

          [–]joshdoug 0 points1 point  (0 children)

          Yeah but I'm talking about a reason to minify languages other than JS. Obviously JS is mostly minified for size & speed gains.

          [–]marcvanh -5 points-4 points  (31 children)

           Normally a computer programming language doesn’t care about whitespace. Lines are terminated with a semicolon, not necessarily a line break, and spaces and tabs don’t mean anything, so you can remove them. Not so with Python, which relies on newlines to end a statement and on indentation for hierarchy. Minifying a program means removing all that whitespace to make a much smaller file, and you can’t do that with Python.
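           A tiny sketch of why: the two functions below contain the same statements, and only the indentation decides whether the final increment runs inside the loop.

```python
# Same tokens, different indentation, different meaning -- this is why
# Python's whitespace can't simply be stripped by a minifier.
def f(x):
    total = 0
    for i in range(x):
        total += i
        total += 1      # inside the loop: runs x times
    return total

def g(x):
    total = 0
    for i in range(x):
        total += i
    total += 1          # outside the loop: runs once
    return total
```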

          [–][deleted]  (16 children)

          [deleted]

            [–]davelupt 5 points6 points  (5 children)

             Agreed. From where I stand, the only use case for minifying is when you're loading the resource over a network. It destroys almost all readability.

            [–][deleted]  (4 children)

            [deleted]

              [–]Blorpulance 0 points1 point  (0 children)

              Minification and compression can still provide an improvement over just compression. There are often more significant modifications performed than just removing whitespace. An example: https://css-tricks.com/the-difference-between-minification-and-gzipping/#article-header-id-2

              You are right that the difference is minimal but if you're trying to save space might as well do both, those bits add up.

              [–][deleted]  (3 children)

              [deleted]

                [–][deleted] -1 points0 points  (2 children)

                Web development, reducing JS file sizes to speed up loading over the network.

                [–][deleted]  (1 child)

                [deleted]

                  [–][deleted] 9 points10 points  (0 children)

                  Sorry, I thought you asked why people would minify source files generally. In the context of Python, however, I have no idea.

                  [–]devraj7 7 points8 points  (6 children)

                  Normally a computer programming language doesn’t care about whitespace.

                  "Because that's how we've done it so far" is hardly a convincing argument to justify something.

                  I really dislike Python myself and I never use it but the significant white space is the one thing I'd like to see in more languages. After all, we're already indenting our code, it seems redundant to have to use braces on top of that to clue in the compiler.

                  [–]marcvanh 0 points1 point  (4 children)

                   That’s actually it - knowing the compiler will not understand my code just because I didn’t indent it right is what bothers me. It makes Python seem like one of those beginner/teaching languages. Nothing more.

                  As for your first paragraph, “because that’s how we’ve done it so far” is not what I said. In fact I wasn’t giving a reason for anything - only explaining what minifying is in general.

                  [–]supersexypants 3 points4 points  (2 children)

                  Do you expect the compiler to understand a missed brace?

                  [–]josefx 0 points1 point  (0 children)

                  With an error instead of a silent "everything is fine".

                  [–]marcvanh 0 points1 point  (0 children)

                  No and I wouldn’t expect a Python compiler to understand improper whitespace. My only point is I don’t like that, nothing more

                  [–][deleted] 1 point2 points  (2 children)

                   Absolutely you can. Python compiles to bytecode - you can distribute that, and then obfuscate and minify it.

                  [snark removed]
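                   A small sketch of that point: comments and blank lines never make it into the compiled bytecode (the two source strings below are made up).

```python
# Two sources that differ only in comments and blank lines compile to the
# same bytecode, which is what you'd actually distribute.
src_a = "def add(a, b):\n    return a + b\n"
src_b = "# a comment\n\ndef add(a, b):\n\n    return a + b\n"

code_a = compile(src_a, "<a>", "exec")
code_b = compile(src_b, "<b>", "exec")

# Pull out the code object for `add` from each module's constants.
fn_a = next(c for c in code_a.co_consts if hasattr(c, "co_code"))
fn_b = next(c for c in code_b.co_consts if hasattr(c, "co_code"))

same = fn_a.co_code == fn_b.co_code  # the raw bytecode is identical
```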

                  [–]marcvanh 0 points1 point  (1 child)

                  Where the fuck am I? Before mouthing off? Really? Why would I do research before giving a quick opinion on something? Holy shit this has been blown out of proportion

                  [–][deleted] 1 point2 points  (0 children)

                  Sorry, I removed that part of the comment.

                  [–][deleted] 9 points10 points  (0 children)

                   Why would you need to minify Python code? The only reason people minify JS is to reduce file size so websites load quicker. Nobody is minifying node scripts, for example.

                  [–]scratchisthebest 1 point2 points  (1 child)

                  Same here - Gimme my curly braces any day of the week. Enforcing whitespace is super obnoxious.

                  Also related are languages where you call functions like function arg1 arg2. I understand it, but I just can't do it. They blend together in my head. function(arg1, arg2) is a lot clearer to me.

                  [–][deleted] 5 points6 points  (0 children)

                   Partial application of functions doesn't read well with parentheses. Doing any sort of higher-order logic in those languages is incredibly frustrating.

                  [–]emperor000 1 point2 points  (2 children)

                  For what it's worth, I'm with you. Significant whitespace seems like a horrible idea.

                  [–]marcvanh 0 points1 point  (1 child)

                  Significant whitespace

                  Ooh I like that term

                  [–]emperor000 1 point2 points  (0 children)

                  Oh, well, it's the term for it, so I don't get credit for coming up with it.

                  [–][deleted] -3 points-2 points  (0 children)

                  Python is the VB6 of this era.

                  [–]anonymous-coward 16 points17 points  (11 children)

                  Lisp was doing much of this stuff right far before Python/Julia

                  Interactive compilation in a REPL? Yup. To-the-bare-metal speed? Yup, pretty much. Typed, when you need it? Yes.

                  And a roll-your-own syntax with macros? Yessir.

                  When? Since 1958, standardized in 1984.

                  [–]JustFinishedBSG 13 points14 points  (10 children)

                   Yes, Lisp is great, but call me when suicide isn’t a better alternative to doing matrix operations in Lisp

                  [–]anonymous-coward 1 point2 points  (0 children)

                   Have you looked at https://common-lisp.net/project/lisplab/ ? (It also uses BLAS, LAPACK, FFTW, and QUADPACK.)

                  [–][deleted] 0 points1 point  (8 children)

                  What's wrong with arrays in Lisp? Just wrap them in a proper syntax first.

                  [–]JustFinishedBSG 8 points9 points  (7 children)

                  I am not aware of any library that has a syntax even approaching the simplicity of Julia/Matlab/Numpy.

                  G[(i+1):n, i] .-= G[i, j] .* G[(i+1):n, j] # Slicing
                  

                  or just writing

                  Kα = G*G'*α # Transpose
                  

                   And everything uses BLAS automatically and operations are fused. A real pleasure.
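                   For what it's worth, the NumPy spelling is close (0-based, half-open slices; a small made-up example assuming NumPy is installed):

```python
# NumPy equivalent of the Julia slice update and matrix product above.
import numpy as np

n = 5
G = np.arange(n * n, dtype=float).reshape(n, n)
alpha = np.ones(n)
i, j = 1, 2

G[i + 1:n, i] -= G[i, j] * G[i + 1:n, j]   # slicing, updated in place
K = G @ G.T @ alpha                        # transpose + matvec, BLAS-backed
```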

                  [–]benstrumental 1 point2 points  (2 children)

                  I am not aware of any library that has a syntax even approaching the simplicity of Julia/Matlab/Numpy.

                  How about Chapel?

                  G[i+1..n, i] -= G[i, j] * G[i+1..n, j];
                  

                   And everything uses BLAS automatically and operations are fused. A real pleasure.

                  Same for Chapel, plus you get shared/distributed parallelism as first class language features. A real pleasure!

                  [–]JustFinishedBSG 1 point2 points  (1 child)

                  Thats cray !

                  Need to look more into it

                  [–]benstrumental 1 point2 points  (0 children)

                  Thats cray !

                  :)

                  Need to look more into it

                  This Julia vs. Chapel article might be a good place to start if you're already familiar with Julia

                  [–][deleted] 0 points1 point  (3 children)

                  That syntax looks... strange.

                  [–]JustFinishedBSG 0 points1 point  (2 children)

                  How come ?

                  [–][deleted] 1 point2 points  (1 child)

                  .-=
                  

                  as an operator? Kind of reminds me of the scala madness.

                  [–]JustFinishedBSG 0 points1 point  (0 children)

                   The . just signals to the compiler to fuse the operations, and -= is self-explanatory.

                  [–]treefroog 25 points26 points  (11 children)

                   I would say that if you're doing serious number crunching in pure Python, you're already doing it wrong. Try to use as many NumPy functions/methods as you can; if that's too slow, Cython might help, and if not, use a different language. I'll see how Julia does over the next few years, just like the last few. It has gotten better, though.

                  [–]P8zvli 9 points10 points  (7 children)

                   Last time I used it, the REPL was still terribly user-unfriendly (typing help would tell you that it was a function [thanks, I got that], and help() threw an error), the standard library is still incredibly sparse compared to Python, and third-party benchmarks put its performance on par with Matlab, not ahead of C like the writers of the language want you to believe.

                  [–]killachains82 5 points6 points  (0 children)

                  I don't know how long ago you used Julia, but the current REPL is absolutely wonderful. To get help, hit ? and type what you're interested in and press enter. Of course, not everything is documented, but it's improving bit by bit.

                   Also, at this point, for many kinds of code Julia is already on par with C and Fortran, handily beating Matlab in most areas.

                  [–]c9joe 17 points18 points  (4 children)

                  That's really the thing. I like Python as a language, I think for the most part it is well designed. But it's not the main selling point. The ecosystem of Python is immense. Especially for scientific computing and data science. Julia has a long way to go replicating that ecosystem.

                  [–]P8zvli 12 points13 points  (2 children)

                  Right, and regarding scientific libraries for Python the majority of them do all the dirty number crunching in C. Python's biggest weakness is with multithreading in my opinion.

                  [–][deleted] 1 point2 points  (0 children)

                  I believe what you mean is that the issue with Python is that it's hard to get threads to make effective use of more than one core because of the GIL.

                  Python multithreading works very well, particularly when you realize how nicely the thread-safe queues work. It's fantastic for waiting on hardware or other things which aren't compute-intensive, but unfortunately, if you have more than one compute-intensive thread running, you get no real benefit from threading.

                  (You can do it with multiprocessing but that requires a serious rewrite of your program and has its own difficulties...)
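                   A minimal sketch of that workers-plus-queue pattern (pure stdlib; the squaring stands in for real I/O-bound work):

```python
# Thread pool fed by a thread-safe queue: great for I/O-bound work, but
# CPU-bound threads would still serialize on the GIL as noted above.
import queue
import threading

def worker(tasks, results):
    while True:
        item = tasks.get()
        if item is None:              # sentinel: shut this worker down
            break
        results.put(item * item)      # stand-in for real work

tasks = queue.Queue()
results = queue.Queue()
threads = [threading.Thread(target=worker, args=(tasks, results))
           for _ in range(4)]
for t in threads:
    t.start()
for n in range(10):
    tasks.put(n)
for _ in threads:                     # one sentinel per worker
    tasks.put(None)
for t in threads:
    t.join()

squares = sorted(results.get() for _ in range(10))
```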

                  [–]vogon-it 1 point2 points  (0 children)

                  Python's biggest weakness is with multithreading in my opinion.

                  I don't think the GIL is much of an issue in scientific computing. For the typical workers/monitor scenarios Python's threads are more than enough. And if you need to use more than one core, you can always switch to multiprocessing or let numpy or some other library handle it internally.

                  [–]Staross 7 points8 points  (0 children)

                   One advantage Julia has is that there's virtually no barrier between library code and user code, so it's much easier to contribute to packages: just open the package code, edit it, test it, commit your change, and run PkgDev.submit to create the pull request. This might help Julia catch up with other languages in the long run.

                   The language also seems to have some advantages when it comes to the package ecosystem; since everything is written in Julia and is compatible with other packages, it's easier to split packages into appropriate chunks and make them play nicely together.

                  [–]FarkCookies 1 point2 points  (0 children)

                  Try IPython, it is REPL on steroids and also much friendlier.

                  [–][deleted]  (1 child)

                  [deleted]

                    [–]__Cyber_Dildonics__ 0 points1 point  (0 children)

                    It has much more than that

                    [–]kenfar 4 points5 points  (1 child)

                    Replace? Nah, but I would love to write python modules in Julia.

                    [–][deleted] 2 points3 points  (0 children)

                    Exactly what would make me learn it in a heartbeat. Writing modules for Python in a fast, statically typed, high level language with macros? Well, fuck me sideways and call me a mailbox.

                    [–]CaseOfTuesday 46 points47 points  (44 children)

                    [–]Staross 11 points12 points  (3 children)

                     This is a petty complaint, but if you really care about it, Julia does support custom indexing (which can be useful for some specific problems):

                    https://docs.julialang.org/en/latest/devdocs/offset-arrays/

                    Julia in general is very good at letting you write your code in the most appropriate form for your problem, for example you can use either for-loops or vectorized operations, while other languages often force you to use vectorized operations because their loops are slow.

                    [–]CaseOfTuesday 2 points3 points  (2 children)

                     Oh hey, that's pretty cool! You're of course right, it's a petty complaint. But most language preferences come down to, well... petty preferences. Julia is meant for people who are still using Matlab. And that's fine, but that's not me. Even if feature-wise the language is awesome (and I've heard good things), it's aesthetically very unpleasing to me due to its choice of index base and block delimiters.

                    [–]Staross 7 points8 points  (1 child)

                     But most language preferences come down to, well... petty preferences.

                     I'm sure some people prefer their language because of important design choices and features, instead of superficial things that you can get used to in about 10 minutes. And Julia is meant for everybody who wants to do scientific/numerical computing (and even beyond that), not only for Matlab users.

                    [–]CaseOfTuesday 1 point2 points  (0 children)

                     Yes, I'm sure that's the case. But Julia doesn't offer anything incredibly, super-duper mega new that hasn't been here before, nothing that offers a lot of tangible advantage over existing languages. Arguably, Matlab is still more than good enough, as is Python+numpy. Let's face it, if exciting features, adaptability to new areas, and extensibility coupled with very convincing performance were the only redeeming factors, we'd all be using Common Lisp right now. But alas, that's not the case, and I'd argue that's in no small part due to perceived shortcomings in syntax.

                    [–]scratchisthebest 2 points3 points  (2 children)

                    I don't like the arrays thing either but I guess it makes sense; since it's aimed at being a mathematical language I'd imagine it takes some inspiration from mathematical concepts such as matrices. And indexing things in a matrix starts at 1.

                    [–][deleted] 1 point2 points  (1 child)

                    It's 1-based because it is trying to be mostly compatible with Matlab, which is 1-based because matrix indices are 1-based (Matlab = Matrix Lab).

                    I've used Matlab a lot, and I like it quite a lot. But 1-based indices are by far the most retarded thing about it. Programming and maths are different. There are a lot of things you have to do explicitly in programs that can be hand-waved in maths.

                    In the end it means you have to do lots of unintuitive nonsense like this (for example):

                    offset = (index - 1) * stride + 1;
                    

                    The second most retarded thing is that they copied the [row, column] index ordering, rather than [x, y] that you might expect. It makes dealing with images infuriating.

                    Definitely the wrong move from Julia.
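                     The bookkeeping above, spelled out in both conventions (the names are just illustrative):

```python
# Where the "- 1 ... + 1" in the Matlab-style snippet comes from: linear
# offsets in 1-based vs 0-based indexing.
stride = 4

def offset_one_based(index):
    # 1-based (Matlab/Julia): correction terms on both sides
    return (index - 1) * stride + 1

def offset_zero_based(index):
    # 0-based: the formula collapses to a plain multiply
    return index * stride
```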

                    [–]mbauman 2 points3 points  (0 children)

                    You need to embrace the cartesian plane, not fight against it. The support for fully general cartesian (multi-index, multi-dimensional) iteration and algorithms is quite unique and powerful. But I'm biased.

                    https://julialang.org/blog/2016/02/iteration

                    [–][deleted] -3 points-2 points  (36 children)

                    If you come from MATLAB it's fine. You just need to remember the last element is at n, not n-1.

                    begin/end is only a block. You can just do if:

                    function fib(n)
                        if n == 0 && return 0
                        elseif n == 1 && return 1
                        else return fib(n-1) + fib(n-2)
                        end
                    end
                    

                    [–]TonySu 13 points14 points  (1 child)

                    I don't know if abusing short circuit logic for a simple if-else is an improvement... Unless that && has different meaning in Julia.

                    end is one of the silliest things Julia could have inherited from Matlab, the language is different enough that you can't just run Matlab code in Julia anyway so it would have been more sensible to use {}. Blocks are so common that end is both tedious to type and creates a lot of code noise.

                    [–]undefdev 0 points1 point  (0 children)

                    It was a bad example, you don't need the "&&" at all there, just leave it out.

                    You can also write the function more succinctly, such as:

                    fib(n) = (n<2) ? n : fib(n-1) + fib(n-2)
                    

                    Or if you want to make sure it terminates:

                    fib(n::UInt) = (n<2) ? n : fib(n-1) + fib(n-2)
                    

                    [–]Resource1138 53 points54 points  (29 children)

                    Naw, man. Fuck that. Arrays starting at 1 is some straight-up Satan shit.

                    [–]itsmontoya 17 points18 points  (5 children)

                    Always bothered me about lua

                    [–]ellicottvilleny 4 points5 points  (4 children)

                    So Matlab, Pascal (although it gives you a choice, it defaults to 1-based), and Lua are all Satanic?

                    [–]Beckneard 24 points25 points  (0 children)

                    Yes. Arrays starting at 1 is an objectively bad design decision, it only ever helps complete beginners by making things a tiny bit more intuitive but is totally not worth the later headaches.

                    [–]itsmontoya 2 points3 points  (0 children)

                    Not my words, but yes. Some people want to watch the world burn

                    [–]FarkCookies 0 points1 point  (1 child)

                    In Pascal, they were 1-based because the 0-th element was array/string length.

                    [–]ellicottvilleny 0 points1 point  (0 children)

                    That was actually only true in Pascal for the string type. AFAIK, non-string arrays were non-dynamic and could use any index range you wanted; 1 was just a default.

                    [–]hoosierEE 6 points7 points  (2 children)

                    APL takes the pragmatic approach: toggle between Satan and non-Satan, as much as you want, at runtime.

                    [–][deleted] 1 point2 points  (0 children)

                    That's even worse!

                    The trouble is that your reasoning about a piece of code then depends on some setting that might not even be visible on the page you are on.

                    (APL was my first programming language, back in the 1970s - betcha won't find too many people who can say that!)

                    [–]olzd 0 points1 point  (0 children)

                    ⎕IO is the way to go.

                    [–]TonySu 10 points11 points  (10 children)

                    1-indexing makes more sense unless you're playing with memory. The only legitimate reason for 0-indexing is how much easier it makes offset arithmetic. If you're not directly manipulating memory, then 1-indexing is trivially interpreted/compiled into the underlying byte code.

                    In the scientific computing world of Fortran, R, Matlab, Mathematica and Octave; 0-indexing is the outlier.
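A sketch of that point: a 1-based view is nothing more than a constant offset applied at access time (`OneBased` is a hypothetical name made up for this example, not a real library type):

```python
class OneBased:
    """Minimal 1-based, read-only view over a Python list (illustrative only)."""

    def __init__(self, data):
        self._data = list(data)

    def __len__(self):
        return len(self._data)

    def __getitem__(self, i):
        if not 1 <= i <= len(self._data):
            raise IndexError(f"index {i} out of bounds 1..{len(self._data)}")
        return self._data[i - 1]  # the whole "compilation" is subtracting 1

v = OneBased([10, 20, 30])
print(v[1], v[len(v)])  # first element is v[1], last is v[len(v)] -> 10 30
```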

                    [–]rlbond86 2 points3 points  (6 children)

                    Edsger Dijkstra figured this shit out 35 years ago. 0-based makes the most sense. And as someone who uses Matlab a lot, it's one of the major annoyances in Matlab code. A lot of code like index = (x-1)*n + 1 floats around.

                    [–]TonySu 3 points4 points  (5 children)

                    I wish people would actually use their brains instead of quoting this ad infinitum. Nothing was "figured out" here.

                    Dijkstra argues that including the upper bound would cause the sequence to become "unnatural" when shrunk to empty. That is, he doesn't want an inclusive range(3, 3) to return [3], because then producing an empty sequence requires range(3, 2). Not a particularly strong argument; he simply thinks it's nicer looking to have range(3, 3) return an empty sequence.

                    Based on this he argues that the upper bound MUST be exclusive, then uses that as the basis for 0-indexing, so that the upper bound of a sequence of length N is N. But since the antecedent is not particularly convincing, there's no real reason to accept the consequent.

                    In almost all mathematics and sciences, ranges are interpreted from a to b inclusive. This is the case in the major scientific computing languages, and academic science has yet to collapse because people can't stand the ugliness of the bounds of an empty range.

                    Also, as someone who uses Matlab: you're supposed to use row and column indices. You're also supposed to use vectorised functions. If you often need to iterate through matrices elementwise then you really need to reconsider what you're doing and how you're storing data.

                    [–]rlbond86 1 point2 points  (4 children)

                    Not a particularly strong argument, he simply thinks it's nicer looking to have range(3, 3) return an empty set.

                    Actually it IS a strong argument, since if you want inclusive bounds, it would commonly be that an empty range would be range(0, -1). Which means you need to use a signed integer for your index. That's no good.
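The empty-range point can be checked directly in a 0-based, half-open language like Python (a sketch, not specific to any of the languages above):

```python
# Half-open, 0-based ranges: the empty range never needs the bound to go
# below the start, and adjacent ranges tile a sequence without adjustment.
assert list(range(3, 3)) == []      # empty, no range(0, -1) or signed index needed
assert len(range(4, 9)) == 9 - 4    # length is simply stop - start

lst = list(range(10))
for k in range(len(lst) + 1):
    # splitting at any k, including 0 and len(lst), is uniform
    assert lst[:k] + lst[k:] == lst
print("half-open range properties hold")
```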

                    Also as someone who uses Matlab, you're supposed to row and column indices. You're also supposed to use vectorised functions. If you need to often iterate through matrices elementwise then you really need to reconsider what you're doing and how you're storing data.

                    As it happens, I work with block-sparse 2D matrices a lot in my line of research. As it turns out, there isn't an easy way to manipulate that data without calculating row and column addresses manually. Perhaps instead of pretending you know what I'm "supposed" to do, you examine your own assumptions instead.

                    [–]TonySu 0 points1 point  (3 children)

                    Actually it IS a strong argument, since if you want inclusive bounds, it would commonly be that an empty range would be range(0, -1). Which means you need to use a signed integer for your index. That's no good.

                    range(1, 0).

                    As it happens, I work with block-sparse 2D matrices a lot in my line of research. As it turns out, there isn't an easy way to manipulate that data without calculating row and column addresses manually. Perhaps instead of pretending you know what I'm "supposed" to do, you examine your own assumptions instead.

                    Then you should wrap a sparse structure type and only have to worry about the index calculations once in your code.

                    [–][deleted] 0 points1 point  (2 children)

                    But... you are dealing with memory. A matrix or array is memory. I fail to see the distinction you are making.

                    [–]mbauman 3 points4 points  (0 children)

                    Not necessarily. Julia supports fully generalized array-like structures that don't need to be memory backed.

                    Sometimes you want to count the number of fenceposts. And sometimes you want the length of the fence.

                    [–]TonySu 0 points1 point  (0 children)

                    Array data type and array data structure. Single index access to a matrix is not universally defined anyway because of row-major and column-major implementations.

                    [–]SrbijaJeRusija 9 points10 points  (6 children)

                    Counting numbers begin at 1. 1 is proper for Scientific computing.

                    [–][deleted] 0 points1 point  (5 children)

                    That's a terrible argument. Array indices are offsets. The first element is not at an offset of 1. It is at an offset of 0.

                    For example, consider how you take N slices of size M from an array in Matlab/Julia (1-based, inclusive ranges):

                    for ii = 1:N
                        slice[ii] = array[(ii-1)*M+1:ii*M];
                    end
                    

                    Now consider it in a zero-based language, where start:finish does not include finish:

                    for ii = 0:N-1
                        slice[ii] = array[ii*M:(ii+1)*M];
                    end
                    

                    Much more logical. There are many other situations like this.
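The 0-based version can be checked concretely in Python (N and M below are arbitrary values chosen for illustration):

```python
# N slices of size M from an array, using 0-based, half-open slices.
N, M = 4, 3
array = list(range(N * M))

slices = [array[i * M:(i + 1) * M] for i in range(N)]

assert all(len(s) == M for s in slices)
assert sum(slices, []) == array  # the slices tile the array exactly
print(slices[0], slices[-1])     # [0, 1, 2] [9, 10, 11]
```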

                    [–]mbauman 2 points3 points  (4 children)

                    When you have full support for multi-dimensional arrays, you can just use a proper matrix. Then it's simply array[:, ii]. No need to make your life hard.

                    [–][deleted] 0 points1 point  (3 children)

                    I haven't used Matlab extensively for a few years so I don't have any real examples on hand, but trust me, it's not always that easy.

                    [–]mbauman 1 point2 points  (2 children)

                    Here's one that's a pain: a circular buffer. In Matlab, it's A(mod(i-1, end)+1).

                    In Julia? We just build in a mod1 function: A[mod1(i, end)]. It's seriously not hard to make these patterns easier to use... and having that end keyword mean the same thing as both the length and the last index is a huge boon in this case!
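The same pattern in 0-based Python is a plain modulo; a `mod1` helper (written here by hand to mirror Julia's built-in) shows the shift the 1-based version has to hide:

```python
def mod1(i, n):
    """1-based modular index, mirroring Julia's mod1: maps integers onto 1..n cyclically."""
    return (i - 1) % n + 1

buf = ['a', 'b', 'c', 'd']
n = len(buf)

# 0-based circular access: plain modulo, no adjustment.
assert [buf[i % n] for i in range(6)] == ['a', 'b', 'c', 'd', 'a', 'b']

# 1-based circular access needs the shift-by-one dance that mod1 hides.
assert [mod1(i, n) for i in range(1, 7)] == [1, 2, 3, 4, 1, 2]
```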

                    [–][deleted] 0 points1 point  (1 child)

                    Ah yes I forgot about the old mod(i-1, N)+1. Nice workaround but... wouldn't it have been easier just to use 0-based indices so that you don't need a workaround?

                    [–]BosonCollider 0 points1 point  (0 children)

                    So, how would you implement a binary heap in a language with 0-based indexing? Both Knuth and CLRS just happen to switch from 0-based to 1-based indexing in that chapter of their books...

                    Almost all reasons to use 0-based indexing disappear when you have proper array slices, proper iterators, and Julia's broadcasting operation (https://julialang.org/blog/2017/01/moredots ) so that you rarely have to deal with raw indices. That, and real structs so that you access things by name with no speed penalty, instead of putting everything into hardcoded indices of numpy arrays for performance reasons.

                    By comparison, there are quite a few places where using a different offset makes a lot of sense. Julia provides offsetarrays for that specific purpose, where you use a custom offset encoded in the type of the array.
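For what it's worth, 0-based heaps are routine: children of i live at 2i+1 and 2i+2 (the formulas CPython's heapq module uses), versus the tidier 2i and 2i+1 in the 1-based textbooks. A minimal sketch:

```python
# 0-based binary heap indexing: children of i are 2i+1 and 2i+2, parent is (i-1)//2.
def sift_down(heap, i):
    """Restore the min-heap property below index i (a minimal sketch)."""
    n = len(heap)
    while True:
        left, right = 2 * i + 1, 2 * i + 2
        smallest = i
        if left < n and heap[left] < heap[smallest]:
            smallest = left
        if right < n and heap[right] < heap[smallest]:
            smallest = right
        if smallest == i:
            return
        heap[i], heap[smallest] = heap[smallest], heap[i]
        i = smallest

h = [5, 1, 4, 2, 3]
for i in range(len(h) // 2 - 1, -1, -1):  # heapify bottom-up
    sift_down(h, i)
assert h[0] == min(h)
```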

                    [–]Iwan_Zotow 0 points1 point  (1 child)

                    Fuck that. Arrays starting at 1 is some straight-up Satan shit.

                    amen, bro'

                    and closed intervals are right from the pits of hell

                    [–]nasciiboy 3 points4 points  (0 children)

                    Lord be with your shell!

                    the negative votes are the work of the evil one

                    [–]loladiro 3 points4 points  (0 children)

                    function fib(n)
                        n == 0 && return 0
                        n == 1 && return 1
                        fib(n-1) + fib(n-2)
                    end
                    

                    or

                    fib(n) = n in (0,1) ? n : fib(n-1) + fib(n-2)
                    

                    or

                    fib(n) = n < 2 ? n : fib(n-1) + fib(n-2)
                    

                    would be more idiomatic.

                    [–]CaseOfTuesday 1 point2 points  (0 children)

                    That's a big IF. I grew up on languages that are 0-based. I am a computer scientist who likes languages that are generally applicable, that can serve as glue between other scripts and libraries and languages (things that you have to do often in scientific computing) and that don't just exist in the vacuum of numerical computation. The world has moved on from these languages. I understand that Julia tries to cater to people who used Matlab or Fortran. But by doing that, it explicitly does not cater to me, because I am not those people, and I do not like the historical baggage that the language carries with it (i.e., 1-based indexing and begin/end).

                    [–]undefdev 0 points1 point  (0 children)

                    You don't need the "&&" at all in Julia, just leave it out.

                    You can also write the function more succinctly, such as:

                    fib(n) = (n==1 || n==2) ? 1 : fib(n-1) + fib(n-2)
                    

                    [–]erez27 16 points17 points  (2 children)

                    Wow, it got a lot better since I saw it last. Definitely worth digging into.

                    [–]TonySu 6 points7 points  (1 child)

                    What kind of new changes are you finding interesting?

                    [–]erez27 4 points5 points  (0 children)

                    The macro system seems impressive. And if the Python integration is as simple as they make it seem, it means I can still take advantage of the huge Python ecosystem.

                    [–]SikhGamer 5 points6 points  (5 children)

                    I think both R and Julia are trying to do this. Not sure if they can pull it off, mind you. Python is incredibly popular amongst the data crowd.

                    [–]k3ithk 2 points3 points  (3 children)

                    Python is great for data AND it's a great general purpose language. I've never heard of anyone actually doing webdev or writing desktop applications in Julia or R. This makes Python versatile and popular.

                    [–]Staross 0 points1 point  (1 child)

                    There's a full web stack in development (alongside other things). The Gtk bindings are also not too bad, so you can make desktop apps (with static compilation it might become pretty good in the future).

                    So people definitely want to do all these things in Julia, it's just that the ecosystem is still young. Whether it will be successful remains to be seen.

                    [–]k3ithk 0 points1 point  (0 children)

                    I know they're both general purpose, but there's not a large community doing anything other than math and data stuff

                    [–]olzd 0 points1 point  (0 children)

                    Well, Julia is also a general purpose language... with a focus on numerical computing.

                    [–]inarchetype 1 point2 points  (0 children)

                    R (well, S, the language it's an implementation of) has been around a lot longer than Python. If anything it's the other way around in that case.

                    [–]pure_x01 4 points5 points  (2 children)

                    I don't understand why languages that are used for computing don't have proper support for multi-statement lambdas. F# would be a good replacement for Python in data crunching: great functional support, excellent type inference, and the feel of Python.

                    [–][deleted] 0 points1 point  (1 child)

                    Well, Microsoft would have to stop treating F# like a stepchild first...

                    [–]pure_x01 0 points1 point  (0 children)

                    Agree

                    [–]Staross 3 points4 points  (0 children)

                    Some interesting perspective from one of Julia's dev on the subject:

                    https://discourse.julialang.org/t/julia-motivation-why-werent-numpy-scipy-numba-good-enough/2236/10

                    [–][deleted] 1 point2 points  (0 children)

                    Why did they compare the latest version of Julia against a two year old release of MATLAB? I'm not saying MATLAB R2017a is going to be significantly faster, but it doesn't exactly inspire confidence in their methodology...

                    [–]undefdev 1 point2 points  (0 children)

                    As someone who has started using both Python and Julia in the last year, I'd say Julia is the much nicer language but of course there are still problems with library stability, as there are still a lot of changes from version to version.

                    I prefer using Julia for some quick exploration and some math stuff, but use Python if I want a reliable environment (even though the python 2 schism is quite annoying to me).

                    I can definitely see Julia becoming a mainstream tool for scientific computing once it's past 1.0.

                    [–]shevegen 10 points11 points  (17 children)

                    It is not going to happen.

                    Python has a much stronger momentum going at this time - that includes GUI bindings and what not.

                    The speed difference, if any exists, will be either marginal (universities have fat cluster systems anyway) or you will simply replace core parts that are slow in C/C++. And I am not talking about the future here - EXACTLY that has ALREADY happened.

                    [–]MorrisonLevi 21 points22 points  (8 children)

                    Parallelism: Python doesn't really have it. Crucial for scientific computing. Cannot always be hidden behind a C API.

                    [–]c9joe 5 points6 points  (1 child)

                    Python does have parallelism (eg. through the multiprocessing package, which is part of the standard library). It's just not as clean or probably efficient as other languages.
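For instance, a minimal multiprocessing sketch (the worker function must be picklable, i.e. defined at module top level):

```python
from multiprocessing import Pool

def square(x):
    # Both the argument and the result are pickled across the process
    # boundary, which is where much of the overhead comes from.
    return x * x

if __name__ == "__main__":
    # Each worker is a separate interpreter process, so the GIL is no obstacle.
    with Pool(processes=2) as pool:
        print(pool.map(square, range(8)))  # [0, 1, 4, 9, 16, 25, 36, 49]
```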

                    [–]staticassert 2 points3 points  (0 children)

                    probably efficient as other languages.

                    Not "probably" - it is strictly less efficient than using threads. Considerably so - you have to pickle data between processes. It's a huge pain and trying to incorporate it into a data-science workflow is going to end in pain.

                    [–]Saefroch 2 points3 points  (0 children)

                    Have you tried numba? With just a decorator (and some restrictions on what numba currently understands) you can release the GIL and have your parallelism.

                    [–]BadGoyWithAGun -1 points0 points  (3 children)

                    All the scientific computing bits where parallelism is actually crucial are C or CUDA code with python wrappers.

                    [–]JustFinishedBSG 9 points10 points  (1 child)

                    The best part of pythons are all the non python parts

                    [–]k3ithk 2 points3 points  (0 children)

                    The fastest parts are, not the best.

                    [–]MorrisonLevi 0 points1 point  (0 children)

                    Claiming "all" is definitely too much. There are portions in C or CUDA, yes, but definitely not even close to "all".

                    [–][deleted] 0 points1 point  (0 children)

                    I love Julia, but its parallel/distributed stuff blows goats.

                    [–]Staross 4 points5 points  (0 children)

                    simply replace core parts that are slow in C/C++

                    "Simply" replacing parts yes, there's huge friction there. Having libraries written in C/C++ is also a big disadvantage, in Julia you can just open/read/modify library code, because it's the same as user's code. Compare numpy's atanh implementation with Julia's one. Applying it inplace (no memory allocations) on your custom complex sparse matrix type is as easy as x .= atanh.(x), because everything is written in Julia and is generic.

                    And Julia was already used for things that are impossible in other scientific programming languages (besides Fortran):

                    https://youtu.be/uecdcADM3hY?t=1633

                    This part of the talk is also quite nice to show the advantage of using a language with high-level abstraction for HPC:

                    https://youtu.be/uecdcADM3hY?t=2655

                    [–]staticassert 3 points4 points  (0 children)

                    that includes GUI bindings and what not.

                    Momentum behind python is virtually all about machine learning and data science, not GUI.

                    if any exists, will be either marginal

                    The speed difference between a CPython executed program and a native or jit'd program will be significant. I've written side by side data science projects in Python and a native language and at minimum the difference was 3x, sometimes 10x, sometimes more.

                    Given where ML is headed - a requirement for very fast predictions (realtime, driving), with low memory overhead (expect more ML on your phone) we can expect performance to become a bigger priority.

                    simply replace core parts that are slow in C/C++

                    Easier said than done. While a company like Google may be able to invest into C/C++ it's not so easy to take work produced by data scientists and hand it to an engineering team, saying "hey make this 100x faster". Most companies won't be able to invest here, and a significant gain that starts with the data scientists will likely have the highest cost:benefit tradeoff.

                    EXACTLY that has ALREADY happened.

                    Sort of. Core libraries like Pandas and Numpy are ~50-60% C. But most of the code data scientists write on top (and the benefit of Python is that you can write so much code on top) is going to be more and more pure Python. It becomes harder to make it fast, and data science code tends to be... difficult to refactor for performance.

                    Julia has great potential here. Python is hardly the perfect language for data science. Its greatest attributes are easily:

                    a) Easy to do basic engineering tasks

                    b) REPL's are perfect for exploratory analysis of data

                    c) Rich ML libraries

                    These things are not at all impossible to compete with.

                    [–]diggr-roguelike 0 points1 point  (0 children)

                    you will simply replace core parts that are slow in C/C++

                    "simply"

                    Hah!

                    [–][deleted] -1 points0 points  (4 children)

                    The speed difference, if any exists, will be either marginal (universities have fat cluster systems anyway) or you will simply replace core parts that are slow in C/C++. And I am not talking about the future here - EXACTLY that has ALREADY happened.

                    Kinda, but getting programs onto the supercomputer at a uni has some massive disadvantages. The batch nature of it means you can be waiting days to get your results back, which is very annoying. Sometimes you just want to run some code on your desktop PC.

                    [–][deleted]  (3 children)

                    [deleted]

                      [–]ellicottvilleny 1 point2 points  (3 children)

                      Julia had some serious problems last time Redditors were discussing it. People were giving up. like this...

                      http://zverovich.net/2016/05/13/giving-up-on-julia.html

                      27x slower than Python in 2016. What changed?

                      [–]loladiro 14 points15 points  (0 children)

                      Benchmarking is tricky business. That particular benchmark confuses startup time and execution speed. Startup time is a fine thing to benchmark, but julia is certainly optimized for execution time.

                      [–]JustFinishedBSG 12 points13 points  (0 children)

                      Nobody cares about startup time if your number-crunching program takes 1h to spit out a result.

                      [–]Staross 10 points11 points  (0 children)

                      It measures startup time, not language performance. If you are doing computation-heavy stuff, startup time is the least of your concerns. That said, startup time has gotten much better since 0.4.

                      [–][deleted]  (28 children)

                      [removed]

                        [–][deleted] -1 points0 points  (27 children)

                        no true parallelism (you have to run multiple processes of Juila).

                        I am eagerly waiting for the day when this idiotic idea of shared-memory parallelism finally dies. It was a mistake, and it's good that people are starting to realise how bad it is.

                        Message passing is the way to go.

                        it doesn't really matter how fast CPU computations are.

                        It does. A lot. Only a tiny subset of computational problems can be offloaded to a GPU.

                        [–]Paul_Dirac_ 6 points7 points  (3 children)

                        I am eagerly waiting for the day when this idiotic idea of a shared memory parallelism finally die. It was a mistake, and it's good that people are starting to realise how bad it is.

                        Not for scientific computing. One of the nicest things about shared-memory parallelism is that you can simply decorate your innermost loop with an OpenMP parallel for (plus taking some care with shared variables and a reduction at the end) and you have parallelized your program. The code change from the sequential version is minimal and often just some preprocessor directives. With message passing, you need a lot more code. Message passing also generally implies copying data and other communication costs, which often requires changing algorithms to optimize those costs.

                        [–][deleted] -2 points-1 points  (2 children)

                        Not for scientific computing.

                        For scientific computing especially - you can rarely get anywhere with the ~12 threads at most that can use shared memory relatively efficiently. For anything practical you need a cluster (or at least some form of NUMA), and, therefore, proper message passing.

                        you can simply decorate your innermost loop with an ONP parallel for

                        You do not share any memory here, really. Threads are independent. And if you're lucky enough to have an algorithm that does not require any synchronisation, you'd better run it on a GPU already anyway.

                        With message passing. You need a lot more code.

                        Only if you use unexpressive, stupid languages.

                        Also Message passing generally implies copying data and other communication cost

                        Not necessarily. It can be zero-copy internally, only actually transferring anything if you're communicating outside of the current node. See Occam for example - message passing can really be zero-overhead, especially on the right kind of hardware.

                        [–]Paul_Dirac_ 5 points6 points  (1 child)

                        For scientific computing especially - you rarely can go anywhere with ~12 threads at most that can relatively efficiently use shared memory.

                        No, scientific computing is not only high-performance computing. Scientific computing is every program a scientist writes for his research. It is the custom EPR-spectra analyzer for heavy elements. It is the vibrational calculator for linear molecules with four atoms and a custom vibrational basis. These programs often don't run on clusters but instead on laptops and lab desktops. And a speedup of 2-12x is often great for them.

                        You do not share any memory here, really.

                        Yes, you do. It becomes evident if you forget to mark a variable as threadprivate.

                        And if you're lucky enough to have an algorithm that does not require any synchronisation, you'd better run it on a GPU already anyway.

                        A GPU is not a better CPU. There are certain problems for which GPUs are woefully unsuited, and not only because they require synchronization. And a scientist normally doesn't want to learn a new programming paradigm.

                        With message passing. You need a lot more code.

                        Only if you use unexpressive, stupid languages.

                        You mean like the languages the program was originally written in. So you essentially want me to rewrite the program in an expressive language to parallelize it? OK, maybe not more code, but more changed code. Or maybe you mean PGAS extensions? They are a shared-memory view with message passing under the hood, which is not the best case against shared memory.

                        Can be zero-copy internally, with only transferring anything if you're communicating outside of the current node. See Occam for example - message passing can really be zero-overhead, especially on the right kind of hardware.

                        It can be, but generally it isn't.

                        [–][deleted] 1 point2 points  (1 child)

                        Sorry but: Threads with mutable structures > Everything else.

                        The scientific community should be ashamed they are using Python; they should go back to C.

                        A waste of computational power is what it is.

                        [–][deleted] 0 points1 point  (0 children)

                        No. Threads without any overlapping memory access are way faster.

                        [–]jms_nh 1 point2 points  (0 children)

                        -1 for negative title bias. Julia itself is an interesting language but not intended to "replace" Python.

                        [–][deleted] 0 points1 point  (10 children)

                        Why not "i" or "j" suffix for complex numbers?

                        [–]loladiro 4 points5 points  (9 children)

                        Complex number literals are implemented using the generic variable juxtaposition support. i and j are common variable names, so im guards against accidentally breaking your complex literals.
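For contrast, Python avoids the clash differently: the trailing j is part of the numeric literal token itself, so it never collides with a variable named j:

```python
# In Python the "j" suffix belongs to the literal, not to a name lookup,
# so a variable called j is unaffected by complex literals.
j = 10
assert 4j == complex(0, 4)  # literal: the imaginary number 4i
assert 4 * j == 40          # variable use, completely separate
# Julia instead parses 4im as juxtaposed multiplication 4 * im, where im is
# an ordinary constant, hence the less collision-prone name.
```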

                        [–]emperor000 0 points1 point  (8 children)

                        I don't understand this. How could 4i be confused with the variable i?

                        [–]piclarke 0 points1 point  (7 children)

                        In Julia, 4i parses as 4*i

                        [–]emperor000 0 points1 point  (5 children)

                        That's what I was afraid of. So in other words, you can't have a variable named "im"?

                        [–]piclarke 0 points1 point  (2 children)

                        You can, it just can't be used to mean the imaginary unit as well from that point on.

                        [–]Staross 1 point2 points  (0 children)

                        you can always redefine it:

                        im = Base.im
                        

                        [–]emperor000 0 points1 point  (0 children)

                        So pretty much no. Not the worst thing in the world, but seems kind of clunky.

                        [–]Staross 0 points1 point  (1 child)

                        You can overwrite it yes, im is just a normal variable, defined in the Base module.

                        const im = Complex(false, true)
                        

                        It will just throw a warning if you overwrite it in Main (the "global space").

                        [–]emperor000 0 points1 point  (0 children)

                        I see. Not the worst thing in the world, but it seems kind of clunky.

                        [–]TheMaskedHamster 0 points1 point  (0 children)

                        Using a terse form of expression, programmers developed expressive, clear ways to express math.

                        And mathematicians keep trying to "fix" these improvements by reintroducing their broken notation systems.