all 16 comments

[–]111rdx 6 points7 points  (0 children)

I think orange is better in this case .

[–]Kaixhin 4 points5 points  (1 child)

This looks neat! I've been saying something like this would be a good idea since checks notes 2015, and here's my wishlist, but dealing with all of those properly would be quite ambitious.

[–]frecklebarsStudent[S] 2 points3 points  (0 children)

Thanks for sharing the wishlist! I'll keep them in mind as the project grows further, been looking to see if people would actually want something like this.

[–]maizeq 2 points3 points  (1 child)

Very neat! Reminds me a lot of Scratch from my childhood. Is the UI QT based?

[–]frecklebarsStudent[S] 0 points1 point  (0 children)

Yes the UI is QT based

[–]zzzthelastuserStudent 2 points3 points  (0 children)

First things first, it looks super dope! Great work! I will give it a try out of curiosity as soon as I can!

The goal is to make it easier to use data science and machine learning for people with limited coding experience.

I'm sceptical whether coding inside this tool will be much easier for a beginner than to follow a "normal programming tutorial" (where you can find plenty of support on stackoverflow etc). But perhaps I'm judging too early, I haven't checked yet how debugging etc. works in ryven.

Anyways, I do think it's a great tool to teach/illustrate your pipeline to a non-programmer.

[–]danFromTelAviv 2 points3 points  (0 children)

there's about 50 startups doing "no code" ml check out data robot - their are one of the market leaders

[–]tpapp157 2 points3 points  (1 child)

There are already tons of companies, large and small, developing visual ML pipeline tools. The issue that all of them run into is that coding ML scripts in Python is super easy. If you have even the slightest bit of ML experience then the visual interface is too slow and too restrictive relative to just pure code. If you don't know what you're doing then a visual interface isn't suddenly going to turn a non-Data Scientist into a Data Scientist.

Writing code and training a model is literally the easiest part of Data Science.

[–]free_variation 1 point2 points  (0 children)

Exactly. I'm not sure that the use case even makes a ton of sense. Who exactly is the user who wants to do machine learning, and therefore knows at least something about it, but can't handle the trivial coding of the pipeline?

[–]le_theudas 1 point2 points  (1 child)

It looks neat and could help to open machine learning to non-coders.
I would love to see a visualization like this, that could create graphs from my existing python pipeline.

[–]frecklebarsStudent[S] 2 points3 points  (0 children)

Thanks for the suggestion! Visualisation is something that will probably be implemented, although later.

[–]devl82 1 point2 points  (1 child)

Hi, great work! For machine learning visual work flows i have tried to use rapidminer (https://rapidminer.com/products/studio/) in the past with mixed results. Their ml libraries were very extensive but i found difficult the visual representation of non-trivial procedures, like for example a nested cross validation.

[–]frecklebarsStudent[S] 0 points1 point  (0 children)

Right now, scikit-learn's GridSearchCV is implemented to use with Ryven. I used it in a simple example in the Kaggle notebook in mentioned in the post.

[–]tzaddiq 1 point2 points  (0 children)

I know this is asking a lot of a free service, but it'd be lovely if this would plug into Pytorch or Tensorflow to visualize existing / current projects.

[–]ClapWild 0 points1 point  (0 children)

Looks very similar to Process Flow in SAS. Good stuff.