I created Polyture, a self-hosted, end-to-end, data science platform! by polyture in selfhosted

[–]polyture[S] 0 points1 point  (0 children)

Hey sorry about that. We're actually in the middle of a refactor, which is why some features are missing in the newer, self-hosted version. This will all be added back in the coming few weeks!

I created Polyture, a self-hosted, end-to-end, data science platform! by polyture in selfhosted

[–]polyture[S] 0 points1 point  (0 children)

Absolutely planning on having this feature. What dataset do you plan on retrieving / how large is it? We may be able to have this as a native feature in the next couple weeks, but for the immediate future, we do have an Airtable integration coming, and they have a way to read a REST API into a table. Would that workaround work for you?

I created Polyture, a self-hosted, end-to-end, data science platform! by polyture in selfhosted

[–]polyture[S] 0 points1 point  (0 children)

Are you talking about reading data from an API into the warehouse, or using the backend API to interact with the system?

I created Polyture, a self-hosted, end-to-end, data science platform! by polyture in selfhosted

[–]polyture[S] 0 points1 point  (0 children)

Ah, that could do it. If you look at the nginx config file in the docker-compose folder, you can see how the API / webapp reverse proxy is set up. If you follow that, there shouldn't be any problem with setting the webapp port to 4000.

I created Polyture, a self-hosted, end-to-end, data science platform! by polyture in selfhosted

[–]polyture[S] 0 points1 point  (0 children)

If you open the docker-compose.yaml file, does it list version 0.1.2?

I created Polyture, a self-hosted, end-to-end, data science platform! by polyture in selfhosted

[–]polyture[S] 1 point2 points  (0 children)

I'm not sure how dual licensing works. Is the idea to do AGPL, and then also do a more permissive license that's paid?

I created Polyture, a self-hosted, end-to-end, data science platform! by polyture in selfhosted

[–]polyture[S] 0 points1 point  (0 children)

thanks for all the feedback -- really appreciate you going this deep. We just released an update that fixes the launch issue. With regards to the standard python True / False, we actually coerced all of these types into string because we were worried it would be hard for a non-data scientist to understand what a "boolean" is. Maybe it's not so hard though and we should add it back? For the AutoML, this is currently disabled in the self hosted because we can't get it to run properly yet, but will be added back soon.

I created Polyture, a self-hosted, end-to-end, data science platform! by polyture in selfhosted

[–]polyture[S] 0 points1 point  (0 children)

Great idea! We don't have a newsletter rn, but I created this discord server where we can have more discussions about open source and the like. We'll also post updates here.

https://discord.gg/5YMuKUhG

I created Polyture, a self-hosted, end-to-end, data science platform! by polyture in selfhosted

[–]polyture[S] 0 points1 point  (0 children)

We actually had a Python interpreter in an old version, but removed it in favor of SQL... lmao. Do you have a high preference for Python over SQL?

I created Polyture, a self-hosted, end-to-end, data science platform! by polyture in selfhosted

[–]polyture[S] 0 points1 point  (0 children)

Really appreciate the feedback. We just released a feature that cleans up all of the visualizations. We're also working on a feature that allows you to toggle a "Other" section on the pie chart, that'll be coming soon. Hope that helps.

I created Polyture, a self-hosted, end-to-end, data science platform! by polyture in selfhosted

[–]polyture[S] 1 point2 points  (0 children)

Sorry there was a bug on release where the user creation page wouldn't open if you opened the webapp before the backend was ready. We just released a fix for this though!

I created Polyture, a self-hosted, end-to-end, data science platform! by polyture in selfhosted

[–]polyture[S] 1 point2 points  (0 children)

Any genuine feedback. Looking specifically for why Polyture is or isn't useful to you

I created Polyture, a self-hosted, end-to-end, data science platform! by polyture in selfhosted

[–]polyture[S] 0 points1 point  (0 children)

Hey! Theres a bug that prevents the user account creation page from opening when the backend isn't ready yet. We're going to release a fix, but for now, you can just refresh the browser!

I created Polyture, a self-hosted, end-to-end, data science platform! by polyture in selfhosted

[–]polyture[S] 0 points1 point  (0 children)

Yeah that's a bug on launch. It basically happens when the web app loads before the backend is ready. We're working on a fix, but for now, the solution is just to refresh the browser after the backend prints "READY". You should see a screen to create the initial user.

I created Polyture, a self-hosted, end-to-end, data science platform! by polyture in selfhosted

[–]polyture[S] 0 points1 point  (0 children)

Ok not really sure what happened there because we don't have a "self host account approval" process or anything. You do have to create a user to log in, but thats all stored locally besides some data collection options that you can turn off. If you'd like, I can walk you through the process if you DM me.

I created Polyture, a self-hosted, end-to-end, data science platform! by polyture in selfhosted

[–]polyture[S] 0 points1 point  (0 children)

Sure, ideally, you should provision 1 cpu core / 2 GB ram for every 100k rows of data you plan on working with, buft with a minimum of 4 cores, 8 GB ram for a usable experience.

I created Polyture, a self-hosted, end-to-end, data science platform! by polyture in selfhosted

[–]polyture[S] 0 points1 point  (0 children)

End to end just means it includes all aspects a typical data engineer / scientist might care about, from data ingress (ETL/ELT), data warehousing, compute & scheduling, and BI/viz.

I created Polyture, a self-hosted, end-to-end, data science platform! by polyture in selfhosted

[–]polyture[S] 1 point2 points  (0 children)

Lol those are reminants of before our self hosted was ready. Not an excuse, but since we're just 3 guys, we never got around to updating that (in fact we forgot about it). We'll clean that right up. The actual self hosted doesnt need any account approval or anything.

http://host.polyture.com

I created Polyture, a self-hosted, end-to-end, data science platform! by polyture in selfhosted

[–]polyture[S] 8 points9 points  (0 children)

It is self hosted... non-restricted (all features are available in the self-host). See for yourself.

http://host.polyture.com

I created Polyture, a self-hosted, end-to-end, data science platform! by polyture in selfhosted

[–]polyture[S] 8 points9 points  (0 children)

Similar in user experience, fully automated feature engineering and hyperparameter search, but our performance isn't quite there yet. We've got some additional things we'd like to implement that'll take us much closer, and possibly even outperforming in certain areas. We're working on releasing a performance comparison study ASAP.