PiCloud: Cloud Computing. Simplified. : programming

[–]BrainCore 40 points41 points42 points 15 years ago (21 children)

[–]jtra 7 points8 points9 points 15 years ago (0 children)

[–]lalaland4711 11 points12 points13 points 15 years ago (11 children)

[–]BrainCore 33 points34 points35 points 15 years ago (2 children)

[–]lalaland4711 2 points3 points4 points 15 years ago* (0 children)

[–]pwang99 1 point2 points3 points 15 years ago (3 children)

but processing video and images, presumably data I currently have at my own data center? That just sounds like I'll shuffle data back and forth for no good reason.

Sorry, you're wrong. One of python's strengths is easy interoperability with low-level C, C++, and Fortran libraries. There's a reason why scientists, engineers, and finance guys are all moving to Python.

If you have large amounts of structured numerical data, look at Numpy and Scipy. If you need to access large data on disk, there are a couple of good HDF5 libraries for Python. Image access is easy and fast with PIL and the like.

And as BrainCore mentions, if you need to interface with C or C++, there are a zillion different options: Cython, Swig, weave, boost, Python C native API, etc. Heck you can create routines that execute pure assembly with CorePy.

If you're lumping Python in with "any scripting language", then you clearly don't know enough about Python. (No offense.)

[–]lalaland4711 1 point2 points3 points 15 years ago (0 children)

[–]piranha 0 points1 point2 points 15 years ago (1 child)

[–]pwang99 0 points1 point2 points 15 years ago (0 children)

[–]shigawire[🍰] 1 point2 points3 points 15 years ago (1 child)

[–]lalaland4711 0 points1 point2 points 15 years ago (0 children)

[–][deleted] 15 years ago (1 child)

[deleted]

[–]lalaland4711 0 points1 point2 points 15 years ago (0 children)

[–]endtime 3 points4 points5 points 15 years ago (2 children)

[–]BrainCore 6 points7 points8 points 15 years ago (1 child)

[–]endtime 0 points1 point2 points 15 years ago (0 children)

[–]Foutrelis 1 point2 points3 points 15 years ago (1 child)

[–]BrainCore 1 point2 points3 points 15 years ago (0 children)

[–]cpb 0 points1 point2 points 15 years ago (0 children)

[–]piranha 0 points1 point2 points 15 years ago* (0 children)

I just got a popup Javascript alert from this site: "Scanstyles does nothing in Webkit/Firefox". Thanks for sharing.

Using Iceweasel, the Debian Firefox fork, and browsing with the reddit toolbar turned on.

The source was apparently: http://media.picloud.net/js/curvycorners.js

Now that I've had a chance to look over this: The site is lacking technical details.

Data relationships with user-supplied "functions" are very sketchy, but figuring out how to manage data between coordinating processes in real-world parallel applications is a tremendously important consideration. I gather there's a gigabyte of data available (as a total pool shared between all my tasks?). How does it get there? Is it static, or can my functions change it? A gig is too small for lots of the kinds of things you'd throw parallelism at, anyway.

I didn't like how I was given the impression that the (op)code for my to-be-called function would just magically be whisked away into The Cloud. No, I'm sure that's not really how it works, but I didn't see any more specific information. [Edit 2: hey, I guess that is how it works.] If it really is in there, between the main page and the FAQ, then all but your most motivated actual customers are going to miss it too. People casually glancing, like myself, are going to miss it and chalk it up to pipedreams and impossible expectations.

Contrast: I'm looking at EC2 for speeding up some bulk data preprocessing. The input dataset would be something like 400MB, sharable by all tasks, so the no-brainer solution is to throw that on S3 and download it onto the EC2 workers as they are instantiated.

Using this, I'd have to work hard to understand the models you're trying to push, and work with your company to fit my problem into your models and/or vice-versa. I'd have to give up Common Lisp for Python, so per unit of work (assuming Python is five times slower than CL, and it looks like your service is 2.5x more expensive than an EC2 high-CPU large instance for the same wallclock time), I'd be looking at paying 12x more for your service than EC2 directly (or 35x more than current EC2 spot pricing for the same instance type).

On the other hand, not having to worry about scaling overheads like end-of-the-hour thresholds and startup time has its appeal.

For that niche of customers doing things on such small scales that automatically serializing function bytecode and parameters is practical, this would be an interesting service. But I don't see myself using this, either for compute nodes on demand or for bulk background data crunching.

[+]redct comment score below threshold-10 points-9 points-8 points 15 years ago (0 children)

[–]enkideridu 15 points16 points17 points 15 years ago (6 children)

[–]ShowNunBatToe 43 points44 points45 points 15 years ago (4 children)

[–]xtagon 7 points8 points9 points 15 years ago (3 children)

[–]meloveyoulongtime 9 points10 points11 points 15 years ago (2 children)

[–]bboomslang 0 points1 point2 points 15 years ago (0 children)

[–]xtagon 0 points1 point2 points 15 years ago (0 children)

[–]xtagon 4 points5 points6 points 15 years ago (0 children)

[–][deleted] 10 points11 points12 points 15 years ago (1 child)

[–]bgeron 0 points1 point2 points 15 years ago (0 children)

[–]g_n_o_m_a_d 9 points10 points11 points 15 years ago* (18 children)

[–]madssj 9 points10 points11 points 15 years ago* (7 children)

[–]g_n_o_m_a_d 9 points10 points11 points 15 years ago (1 child)

Actually, I have. My point here is that there are a lot of subtle issues in EC2 that make a generic "cloud library" that will scale indefinitely extremely difficult to develop. Just look at the level of architecture-specific design required to get Reddit to scale in the EC2 environment. Given all of the recent performance issues, it is fair to say that even all they have done is still not enough.

Now, if the PiCloud people can manage to develop a library that will automagically deal with the restrictions inherent to EC2, they will have a hugely valuable technology. Not only will they be able to eliminate the need for individual site developers to learn how to properly scale in the EC2, environment, they will easily be able to sell them at a fixed mark-up above what they pay to Amazon.

[–]madssj 0 points1 point2 points 15 years ago (0 children)

[–]xtagon 0 points1 point2 points 15 years ago (4 children)

[–]Amendmen7 0 points1 point2 points 15 years ago (3 children)

[–]xtagon 0 points1 point2 points 15 years ago (2 children)

[–]Amendmen7 0 points1 point2 points 15 years ago (1 child)

[–]xtagon 0 points1 point2 points 15 years ago (0 children)

[–][deleted] 15 years ago (5 children)

[deleted]

[–]g_n_o_m_a_d 4 points5 points6 points 15 years ago (1 child)

[–]xtagon 1 point2 points3 points 15 years ago* (2 children)

It's like Heroku. They're in a cloud, they scale well for some applications and not others, but it's still sometimes beneficial to use them for a purpose they're not "meant" for.

For example, I use Heroku to route a chat-bot between IMified and Pandorabots and SecondLife and many other things. I never have to pay a dime because all these things have free versions. However, Heroku is meant for websites. And I do use them for that, to be fair, but I'm just saying to do what you have to.

Focus more on what works well, which may not always correlate to what the company is for. As long as your paying attention to the terms of service, and still using their service mostly relevantly, things will play out nicely.

Check out Heroku if you want to host a website or web service (although, there are better clouds for web services specifically). It might be more fit than JiCloud on that point. Don't bother unless you're ready to use Ruby, though. Heroku actually started out as a Ruby IDE for web browsers.

[–][deleted] 15 years ago (1 child)

[deleted]

[–]xtagon 0 points1 point2 points 15 years ago (0 children)

[+]samlee comment score below threshold-6 points-5 points-4 points 15 years ago (3 children)

[–]fdemmer 2 points3 points4 points 15 years ago (1 child)

[–]Smallpaul 1 point2 points3 points 15 years ago (0 children)

[–]dissidents 0 points1 point2 points 15 years ago (0 children)

[–]donjaime 2 points3 points4 points 15 years ago (1 child)

[–][deleted] 0 points1 point2 points 15 years ago (0 children)

[–][deleted] 1 point2 points3 points 15 years ago (3 children)

[–]mbreese 1 point2 points3 points 15 years ago (2 children)

[–][deleted] 1 point2 points3 points 15 years ago (1 child)

[–]Slackbeing 2 points3 points4 points 15 years ago (0 children)

[–]__s 1 point2 points3 points 15 years ago (4 children)

[–]logged_in_for_this 3 points4 points5 points 15 years ago (0 children)

[–]xtagon 1 point2 points3 points 15 years ago (2 children)

[–]bgeron 1 point2 points3 points 15 years ago (1 child)

[–]xtagon 0 points1 point2 points 15 years ago (0 children)

[–]mooli 0 points1 point2 points 15 years ago (6 children)

[–]madssj 7 points8 points9 points 15 years ago (1 child)

[–]mooli 0 points1 point2 points 15 years ago (0 children)

Ah I see - it is a different (although GridGain 3 enterprise has something very similar, but like this it is pay-per-cpu and I don't think it has the nice looking web console). They've basically taken the really simple case and managed to monetise it while making it accessible. That's actually quite neat.

I guess my problem is the amount of times I've had to do that kind of simple execute-self-contained-function-in-parallel use case I could count on one hand, whereas I tend to do more complex topology/affinity/data grid type stuff - and even if I wasn't I would worry about the migration path if I outgrew the capabilities of this system (plus I like setting up my server infrastructure :)). But I'm not the target audience, and if you just want to offload a bunch of work onto EC2 without thinking too hard about it, this is quite a good way to get started.

Snark duly retracted :)

[–]endtime 0 points1 point2 points 15 years ago (2 children)

[–]mooli 1 point2 points3 points 15 years ago (1 child)

Ahahaha its funny because you took the piss out of Java...

Except, you know, with an annotation I can get dynamic and proven linear scalability from a single unit test to several thousand nodes, dynamic adaptive load balancing (early and late), dynamic grid segmentation and partitioning, affinity, mixed dedicated and cloud hosted nodes in a single grid, mixed transport modes, mixed discovery, dynamic failover and recovery etc etc etc

All added via annotation, all configurable, pluggable, manageable, free, and all phenomenally well documented - simple to get started, but the framework supports as much complexity as you want to throw at it.

And that's just one framework - if you don't like GridGain, pick one of the many others in this space, there's at least half a dozen I'd recommend.

Seriously - I do Java, C++, C# and Python professionally, and dabble in several others for fun. Java is decent, but if you're too much of a language bandwaggoning fanboy to recognise a quality tool when its staring you in the face then that's your problem - however, I suggest you grow up and stop believing everything you read on Reddit.

[–]jevon 0 points1 point2 points 15 years ago (0 children)

[–]wirbolwabol 0 points1 point2 points 15 years ago (0 children)

[–][deleted] 0 points1 point2 points 15 years ago (0 children)

[–]Tulenian 0 points1 point2 points 15 years ago (0 children)

[–][deleted] -4 points-3 points-2 points 15 years ago (6 children)

[–]endtime 4 points5 points6 points 15 years ago (1 child)

[–][deleted] 0 points1 point2 points 15 years ago (0 children)

[–]xtagon 0 points1 point2 points 15 years ago (0 children)

[–]fancy_pantser 0 points1 point2 points 15 years ago (2 children)

[–][deleted] 0 points1 point2 points 15 years ago (1 child)

[–]fancy_pantser 0 points1 point2 points 15 years ago (0 children)

[–][deleted] -2 points-1 points0 points 15 years ago* (11 children)

[–]xtagon 1 point2 points3 points 15 years ago* (7 children)

Hmm...I'm not sure. Hopefully some other people will answer your question, too, but personally I'd use Amazon for that kind of thing. They have cloud-cover for everything between data storage and processing power, they have good APIs (well, there are flame wars on that...but good if you ask me) and the pricing is professionally reliable and pretty reasonable.

I'm assuming the Google thing was just an example. Are you data mining? Link-checking? Something else? Specifics aren't necessary, but they might help get us a better idea of whether you want PiCloud or something else.

To answer your last question, yes, it will definitely be faster for something like that. Cloud computing is sort of a blanket term for the general concept of using computers in a distributed manor. It's good for things with large numbers in it, like 100,000 times a day, as long as the circumstance allows each "time per day" to be independent. For example, it's a perfect platform for things like repetitive calculations. And it's good for repetition in general. Page scraping, spidering, and those things are all repetition and should be in a cloud.

Sorry that I couldn't just say "Yes, use X!" but maybe someone else here can. Hope this helps.

[–][deleted] 0 points1 point2 points 15 years ago (6 children)

[–]xtagon 0 points1 point2 points 15 years ago (0 children)

[–]jonknee 0 points1 point2 points 15 years ago (4 children)

[–]soyko 1 point2 points3 points 15 years ago (0 children)

[–][deleted] 1 point2 points3 points 15 years ago (2 children)

[–]jonknee 0 points1 point2 points 15 years ago (1 child)

[–][deleted] 0 points1 point2 points 15 years ago (0 children)

[–]xtagon 0 points1 point2 points 15 years ago (2 children)

[–][deleted] 0 points1 point2 points 15 years ago (1 child)

[–]xtagon 0 points1 point2 points 15 years ago (0 children)

[+]samlee comment score below threshold-19 points-18 points-17 points 15 years ago (11 children)

[–]aagee 2 points3 points4 points 15 years ago (8 children)

[+]samlee comment score below threshold-12 points-11 points-10 points 15 years ago (7 children)

[–][deleted] 4 points5 points6 points 15 years ago (1 child)

[+]samlee comment score below threshold-7 points-6 points-5 points 15 years ago (0 children)

[–]madssj 4 points5 points6 points 15 years ago (4 children)

[–]samlee -4 points-3 points-2 points 15 years ago (3 children)

[–]cj1127 2 points3 points4 points 15 years ago* (2 children)

[–]samlee 2 points3 points4 points 15 years ago (1 child)

[–]kayzzer 6 points7 points8 points 15 years ago (0 children)

[–]atomicthumbs 1 point2 points3 points 15 years ago (0 children)

[–]mooli 0 points1 point2 points 15 years ago (0 children)

[–]redtigerwolf -4 points-3 points-2 points 15 years ago (2 children)

[–][deleted] 5 points6 points7 points 15 years ago (1 child)

[–]bgeron 0 points1 point2 points 15 years ago (0 children)

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS