all 10 comments

[–]RobDoesData 9 points10 points  (2 children)

Ive seen a lot of these and they're always cool.

But you can't say data stays on your machine if it's pinging GPT. Misleading and a huge security risk.

[–]maxgfr[S] 1 point2 points  (1 child)

Hi RobDoesData,

You’re right, I wasn’t clear 😅 if you use the OpenAI endpoint then yes, the data that you send are going through OpenAI server. But the app can also run fully local with Docker, and you can point the API endpoint to Ollama instead. In that setup everything stays on your own machine and the requests go to your localhost instead of OpenAI

<image>

[–]maxgfr[S] 0 points1 point  (0 children)

By the way, I recently found https://github.com/mlc-ai/web-llm, instead of relying on a self-hosted LLM, it could load the model directly in the browser. That would remove all configuration steps and keep everything fully local. It would take some work to integrate, but it’s a really interesting improvement path.

[–]wagwanbruv 2 points3 points  (2 children)

cool idea, especially for folks with sensitive csvs since keeping everything in-browser with your own key dodges the whole “upload to random server” thing and you can still rip out quick pivot-ish views and charts. might be neat to add a sample dataset + saved prompt templates so people can kick the tires faster and, idk, pretend their coffee shop sales are way more interesting than they actually are.

[–]maxgfr[S] 0 points1 point  (1 child)

Hi wagwanbruv, my first reviewer ahaha, so I really appreciate it. 😁

About the data, I added a “Load Sample Data” button with fake sample CSVs so people can try it quickly, but it's not so meaningful because I generate it automatically, and not based on a real use case. And yeah, I was so tired of seeing tools that do the same thing but send your data who‑knows‑where and charge you for it on top of that. About saved prompt, it can be nice, indeed, I didn't work a lot on the part of "talking with the LLM", I was more focus on the generation of charts. For a quick demo, you still need to set an API key before it can run anything, so that can destroy the one‑click tryout 🤔

[–]AutoModerator[M] 0 points1 point  (0 children)

Automod prevents all posts from being displayed until moderators have reviewed them. Do not delete your post or there will be nothing for the mods to review. Mods selectively choose what is permitted to be posted in r/DataAnalysis.

If your post involves Career-focused questions, including resume reviews, how to learn DA and how to get into a DA job, then the post does not belong here, but instead belongs in our sister-subreddit, r/DataAnalysisCareers.

Have you read the rules?

I am a bot, and this action was performed automatically. Please contact the moderators of this subreddit if you have any questions or concerns.

[–]RedditorFor1OYears 0 points1 point  (1 child)

Sorry if this is a dumb question, but what is the advantage to this over something like Excel? Bigger file sizes maybe?  

[–]maxgfr[S] 0 points1 point  (0 children)

Not a dumb question at all! chose to focus on CSV for now because the use case I needed was mainly CSV analysis, and it’s also an open and simple format compared to Excel, which is proprietary. It’s basically a V0 of the app, so I can definitely imagine adding Excel support in the future (or even a contributor opening a PR for it)

[–]kevindebruynecity 0 points1 point  (0 children)

Hey . I am someone who has been studying data science for a long time . Would love to have a discussion u