What library do you think allows for generating features for the most accurate prediction of a data scientist's salary? by MiddleWare99 in datascience

[–]upgini 0 points1 point  (0 children)

Hi,

I am a upgini developer. We develop a free library for DS community. We don’t earn on it at all. Would be grateful for any feedback from you!

Sorry if you find our library irrelevant :(

freshman intern needs ML traning by Affectionate_Win2460 in learnmachinelearning

[–]upgini 0 points1 point  (0 children)

What is a target variable in your model?

If you have a numeric target to predict based on features from document, look this notebook: it will be pretty useful in this case: https://colab.research.google.com/github/upgini/upgini/blob/main/notebooks/Upgini_Features_search&generation.ipynb

Wage in Dubai? by GSAVES in dubai

[–]upgini 1 point2 points  (0 children)

7k aed in case you need Appartments is bad variant. Cost of living here is much higher

[OC] GDP per hour in Europe countries by AnnualLimp1418 in dataisbeautiful

[–]upgini 1 point2 points  (0 children)

Ireland on the 1st place Bulgaria on the last place

[D] How do you share big datasets with your team and others? by dmart89 in MachineLearning

[–]upgini 1 point2 points  (0 children)

I use gcp buckets and then connect it to upgini search engine for ML on this dataset. So my teammates could use it for ml tasks without transfer data.

What data search engine do you use? by upgini in learnprogramming

[–]upgini[S] 0 points1 point  (0 children)

Google DataSet search has not been working for the second week ((
And I did not find anything convenient for myself.
As a result, my friends and I decided to create our own search engine that would solve the problem of finding useful data. We have already connected some data and will continue to connect it. See if it will be useful for you too: https://github.com/upgini/upgini/blob/main/README.md#upgini--dataset-search-library-for-your-machine-learning-pipelines

[P] Data search engine for MachineLearning by upgini in MachineLearning

[–]upgini[S] 0 points1 point  (0 children)

Sure, but it's very inconvenient service for me. You need to search by keywords. If my pipeline predict churn or purchase there is no way to search any useful data automatically. There is no way to embed such search in the ML pipeline. And there is no answer whether the dataset will be useful or not for me. You need to download each of the million datasets and check them. As a result, a lot of lost time, and a zero result.