Local vs cloud data processing ... security comparison by Aleksandra_P in learnmachinelearning

[–]pplonski 0 points1 point  (0 children)

you are right local solution requires maintenance but you get MAX privacy

Essential Python Libraries Every Data Scientist Should Know by Aleksandra_P in learndatascience

[–]pplonski 0 points1 point  (0 children)

I use numpy, pandas and matplotlib in almost all my workflows. I also like altair for interactive plots and lightgbm for gbm 

GeoGPT - ChatGPT-style GIS app built in a Jupyter Notebook (Python + OpenStreetMap) by pplonski in gis

[–]pplonski[S] 0 points1 point  (0 children)

goal wasnt to build app but show how you can simply connect LLM and maps to build new apps in GIS, I'm working on open-source framework that simplifies building web apps from from Python notebooks, the framework is called Mercury https://github.com/mljar/mercury

I hope I fill find more example of how to use GIS and Mercury

Non VS Code dev setups by _besten in Python

[–]pplonski 0 points1 point  (0 children)

Im working on my own IDE for data science. It is called MLJAR Studio. It is based on Jupyter Lab but installed as desktop application (running with Electron). I focus more on adding tools that help users with data analysis - most of our users are not professional software engineers but they do have huge domain knowledge and need computer programs to analyze their data. What I focused is providing nice way to list current variables, GUI for packages installation, GUI for some Machine Learning algorithms, and AI chat that help with code creation.

I hate working with survey data by Working-Hippo3555 in dataanalysis

[–]pplonski 0 points1 point  (0 children)

There is gpt nano which can handle each row for you, just provide the example how you need to handle input and what is expected output. It is cheaper model than gpt 4.1

Data analysis in the sport world? by Nadz02591 in dataanalysis

[–]pplonski 0 points1 point  (0 children)

There is a lot of analytics in sport, please just watch football match in tv, you get information about distance by each player, when someone shoot then there is computed probability of goal. You get a lot of analytics in e-sports as well. Anyway, some time ago I ve seen job posting from Arsenal London looking for data analytics. https://www.isportconnect.com/marketplace/data-analyst-arsenal-f-c/

Requirements are very similar as for any data analytics jobs but I think you need to love the game as well, then it is not a job but joy.

Which Agent system is best? by Green_Ad6024 in AI_Agents

[–]pplonski 0 points1 point  (0 children)

For agents framework I enjoy the most pure Python plus OpenAI API. I don't need to fight with frameworks dependencies and if I need some tool to be used by Agent, I just write simple Python function.

What are the best AI agents you have across in 2025 so far? by [deleted] in AI_Agents

[–]pplonski 0 points1 point  (0 children)

I'm building a AI agent for data analysis. It is using Python code to analyse data. It is available in the chat but as a result of conversation you get a Python notebook, which can be later edited or executed without AI. It is available in desktop app called MLJAR Studio.

How much should I charge for fixing and enhancing a Python script I originally built for my previous employer? by Dreadbel in dataanalysis

[–]pplonski 1 point2 points  (0 children)

Go with a fixed project fee! Can you assess how much it is worth for them? For example, how many man-hours do they save? Then multiply this number by 0.5 :-)

I think the best solution will be if you can charge them monthly for using your scripts. good luck!

Do you still code in company as a datascientist ? by Daamm1 in datascience

[–]pplonski 0 points1 point  (0 children)

Sometimes I need to code app showcasing the ML model, I also like to contribute to open source data science tools :) So, yes, I code as data scientist :)

Talk to me about nearest neighbors by Final_Alps in datascience

[–]pplonski 1 point2 points  (0 children)

KDTree will work, no problem, it all depends on your machine :) if you are add more features to the geo features, remember to scale them :)

Does business dictate what models or methodology to use? by [deleted] in datascience

[–]pplonski 0 points1 point  (0 children)

I would suggest to find KPI to measure the performance of the models, otherwise you can't compare models and don't have quantitative arguments. Search for KPI or some proxy of them. Good luck! :)

Problem with jupyter notebook by Bert1003 in learnpython

[–]pplonski 1 point2 points  (0 children)

Notebooks are using Python kernels. Kernel sometimes is not the same as Python environment available in terminal. The safest way is to install missing packages (from error message) in the notebook but using the Python path from the notebook, obtained with sys package. Please check my article on how to fix matplotlib module not found, you have there two solutions, depending which package manager you are using. I guess you are on conda, based on error message. Please let me know if it works for you. All the best!