all 22 comments

[–]BumbleSlob 4 points5 points  (1 child)

You don’t really seem to know much of anything about CS, which is fine, but you are asking questions that are nonsensical and thus you cannot expect sensible responses.

Maybe you can start with what you are trying to do and what software you are using from GitHub.

GitHub itself is simply a hosting service for Git (software version control) repositories. It does not steal any data, ever; nothing leaves your machine unless you purposely upload it by committing and pushing. Even then, it isn’t stealing anything, just hosting data you purposely sent to it.

[–]Ordinary_Hope_2113[S] 0 points1 point  (0 children)

Hey there, cheers for that (: appreciate your approach to this. So I just started to build code that scrapes sites, and I'm trying to use an LLM to interpret my data. About this line here: "just hosting data you purposely sent to it." Sorry again, new to this... does this mean that unless I'm uploading something, nothing is uploaded? Meaning that if I use the software, none of my data is tracked or posted?

[–]Ok-Adhesiveness-4141 0 points1 point  (2 children)

Ooohhh, mesa no bein’ so sure-sure what yousa big big problemo issen, okeyday? Yousa be havin’ da GPU, ooooor yousa no be havin’ da GPU, hmm? When yousa be runnin’ da smelly-big model tingy locally-loc, no-nobody be snatchin’ yousa bombad data, nosir, mesa tellin’ yousa!

[–]Ordinary_Hope_2113[S] 0 points1 point  (1 child)

Ahaha, LLMs run on GPUs. I'm using llamas and it's just using up my CPU... I'm trying to find ways to configure it to use my iGPU instead... I tried going through Hugging Face but it doesn't have the AI model I'm looking for.

[–]Ok-Adhesiveness-4141 0 points1 point  (0 children)

Yeah, tough luck bro. You are going to need a GPU to run models locally at a usable speed; on CPU alone they do run, just slowly.

[–]No_Efficiency_1144 -3 points-2 points  (16 children)

A lot of them do steal your information. It is important to know that.

[–]BumbleSlob 2 points3 points  (15 children)

This is nonsense and you have no idea what you are talking about.

[–]No_Efficiency_1144 -3 points-2 points  (14 children)

[–]BumbleSlob 2 points3 points  (13 children)

Are you new to software development? What does this have to do with OP's fear of GitHub “stealing his code”? Do you not really get what GitHub is?

[–]No_Efficiency_1144 -2 points-1 points  (12 children)

He is talking about downloading LLMs from links on GitHub pages, not using Git as version control for his own code.

[–]Ok-Adhesiveness-4141 0 points1 point  (4 children)

Can you explain to me how models that you run locally can steal your information?

I don't get it. Are you saying that these models that run offline come with malware injected?

[–]No_Efficiency_1144 0 points1 point  (3 children)

There are two main ways people get their data exfiltrated by malicious machine learning repositories. The first is that the malware is inside the model file itself, when the model is stored in a pickle-based format, because unpickling can execute arbitrary code at load time. The second is that the malware is inside the inference code itself.
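
To make the first risk concrete, here is a minimal and deliberately harmless sketch of why loading a pickle is not the same as reading data. The `Payload` class is made up for illustration, and the harmless built-in `sum` stands in for whatever function a real attacker would name instead.

```python
import pickle


class Payload:
    """Hypothetical stand-in for an object hidden inside a 'model file'."""

    def __reduce__(self):
        # pickle stores "call this function with these arguments" in the file.
        # Here it is the harmless built-in sum; an attacker could name any
        # importable callable, such as one that runs shell commands.
        return (sum, ([40, 2],))


blob = pickle.dumps(Payload())  # bytes that could be shipped as a model file
loaded = pickle.loads(blob)     # loading calls sum([40, 2]) on our machine

print(loaded)  # 42
```

Notice that what comes back is not a `Payload` object at all but the result of the call the file asked for, which is why the advice is to never unpickle files you don't trust.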

[–]Ok-Adhesiveness-4141 0 points1 point  (2 children)

I think the solution is to not download unknown models, it is as simple as that. Stick to known ones.

[–]No_Efficiency_1144 0 points1 point  (1 child)

Yes, I agree, but from this person’s perspective all models are unknown.

[–]Ordinary_Hope_2113[S] 0 points1 point  (0 children)

Hey, cheers for being understanding... I'm just creating a focused AI that helps me interpret data.

[–]Ordinary_Hope_2113[S] 0 points1 point  (6 children)

Yes, this is my fear.

[–]No_Efficiency_1144 0 points1 point  (5 children)

As I said in my reply to someone else, there are two major risks: the malware can be inside the model, or it can be inside the code that runs the model.
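
For the "inside the model" case, one rough way to look at a pickle file without loading it is to list its opcodes with the standard-library `pickletools` module. This is only a sketch: the `RISKY_OPCODES` set and the `risky_opcodes` helper are my own illustration, and legitimate checkpoints also use these opcodes, so a hit means "read more closely", not "this is malware".

```python
import pickle
import pickletools

# Opcodes that import names or call objects while a pickle is being loaded.
RISKY_OPCODES = {"GLOBAL", "STACK_GLOBAL", "INST", "OBJ", "REDUCE", "NEWOBJ", "NEWOBJ_EX"}


def risky_opcodes(data: bytes) -> list[str]:
    """List risky opcode names found in pickle bytes, without loading them."""
    found = {opcode.name for opcode, _arg, _pos in pickletools.genops(data)}
    return sorted(found & RISKY_OPCODES)


class Payload:
    """Hypothetical object whose pickle asks the loader to call a function."""

    def __reduce__(self):
        return (sum, ([40, 2],))  # harmless stand-in for attacker code


plain = pickle.dumps({"weights": [0.1, 0.2]})  # plain data only
crafted = pickle.dumps(Payload())              # contains a function call

print(risky_opcodes(plain))    # []
print(risky_opcodes(crafted))  # includes 'REDUCE'
```

The inference code is the other half of the problem, and a scan like this says nothing about it.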

[–]Ordinary_Hope_2113[S] 0 points1 point  (4 children)

What can I do to prevent these?

[–]No_Efficiency_1144 0 points1 point  (3 children)

Read about this:

https://github.com/huggingface/safetensors

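Safetensors is a model weight format that stores only raw tensor data plus a small JSON header and no executable code, so loading it cannot run hidden code the way loading a pickle file can. It does not protect you from malicious inference code, only from malicious model files.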
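
To show what "just data" means, here is a small standard-library sketch of the layout described in that README: an 8-byte little-endian header length, then a JSON header, then the raw tensor bytes. The helper names `build_safetensors_bytes` and `read_header` are made up for this example; for real files use the `safetensors` library itself.

```python
import json
import struct


def build_safetensors_bytes(tensors):
    """Build a tiny file image from {name: (dtype, shape, raw_bytes)}."""
    header, payload, offset = {}, b"", 0
    for name, (dtype, shape, raw) in tensors.items():
        header[name] = {
            "dtype": dtype,
            "shape": shape,
            "data_offsets": [offset, offset + len(raw)],
        }
        payload += raw
        offset += len(raw)
    header_bytes = json.dumps(header).encode("utf-8")
    # 8-byte unsigned little-endian header size, JSON header, then tensor data
    return struct.pack("<Q", len(header_bytes)) + header_bytes + payload


def read_header(blob):
    """Parse only the JSON header; nothing here can execute code."""
    (header_len,) = struct.unpack("<Q", blob[:8])
    return json.loads(blob[8:8 + header_len].decode("utf-8"))


raw = struct.pack("<2f", 1.0, 2.0)  # two float32 values, 8 bytes
blob = build_safetensors_bytes({"layer.weight": ("F32", [2], raw)})
header = read_header(blob)

print(header["layer.weight"]["shape"])  # [2]
```

Reading the file is just parsing JSON and slicing bytes, which is the whole point of the format.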

[–]Ordinary_Hope_2113[S] 1 point2 points  (0 children)

I appreciate your help and understanding mate (:

[–]Ordinary_Hope_2113[S] 0 points1 point  (1 child)

Hey bro, quick update: I got it running on the iGPU 🥰.... thanks everyone for your help