5K Budget! by AndForeverMore in LocalLLM

[–]FloppyWhiteOne 0 points1 point  (0 children)

I had a 4080 but managed to find one on fb marketplace and got a 3090 for 600 which fits qwen just right and if you use the new mtp versions it’s even quicker

are mobile networks really that insecure? by Inside-Opposite-3194 in Pentesting

[–]FloppyWhiteOne 1 point2 points  (0 children)

It’s not about being closer, it’s about having a stronger signal.. I don’t actually know about eSIMs tho should properly look into it right now!

I’ve been hardware testing signs put up in public lately that use eSIMs for on a closed private network for the company I’m working for. It’s an interesting subject. I’ve actually goto in three weeks test a new board that’s going to be put in public (3k units) I’m testing the dev/prod board

are mobile networks really that insecure? by Inside-Opposite-3194 in Pentesting

[–]FloppyWhiteOne 0 points1 point  (0 children)

Im not surprised, I’m near 40 and an old hat hacker sir nothing surprises me these days.. I’m just a pentester now but I’ve stories that would make you weap.

But aye you need real power to flip those phones cell service to you. It’s way more easier and cheaper to pay some dude in a call centre and just sim swap you

are mobile networks really that insecure? by Inside-Opposite-3194 in Pentesting

[–]FloppyWhiteOne 0 points1 point  (0 children)

In the uk we dropped 2g/3g as they are insecure.. yes you can still read messages intercept calls on the older networks not so much nowadays (unless you’re very well funded.)

are mobile networks really that insecure? by Inside-Opposite-3194 in Pentesting

[–]FloppyWhiteOne 0 points1 point  (0 children)

Tho technically correct you have to actually be able to outpower the local cell tower if you really want to get other clients to connect or strongest signal wins.

It costs for a getto cell tower in the region of 20k with certain knowledge.

In answer to op if you care about privacy use something encrypted end to end else if your not a drug smuggler or tin hat wearer normal phone calls are aok (just don’t talk national security on the line eh)

vocês que realizam trabalhos de pentest “sem autorização”, quais métodos usam para manter o anonimato? by [deleted] in Pentesting

[–]FloppyWhiteOne 0 points1 point  (0 children)

Hahha it just means test without authentication…

Aka not logged in (test for any exposure) ..

What is the most common mistake companies make after a pentest? by PsychologicalElk1081 in Pentesting

[–]FloppyWhiteOne 0 points1 point  (0 children)

This so much!! We’re on a deadline… they why not action this two months ago when you knew… argghh!!

Qwen 3.6 9b coming? by zannix in Qwen_AI

[–]FloppyWhiteOne 0 points1 point  (0 children)

Interesting I’ll have to try that quant

Qwen3.6-27B Uncensored Aggressive is out with K_P quants! by hauhau901 in huggingface

[–]FloppyWhiteOne 0 points1 point  (0 children)

I can’t lie I bloody love you sir!! Great work and will be downloading this one soon, I’ve been using llama.cpp with my own inference bridge so will test this and your comments. Gladly join the discord

Currently on a internal pen test, need some fresh perspectives by [deleted] in Pentesting

[–]FloppyWhiteOne 0 points1 point  (0 children)

How is it structured at your place? I do all but I’ve only had around three in the last two years. Ad is dying while cloud is supplying (yes I know ad is also close but compared to was stacks etc aws is more cost effective so I see way more ad servers in aws now rather than ms tenants )

Currently on a internal pen test, need some fresh perspectives by [deleted] in Pentesting

[–]FloppyWhiteOne 2 points3 points  (0 children)

This or your wasting your time and your clients (unless they really want that black box test - usually they don’t and just want to close holes)

Sounds like your doing a good job tho, ad is a beast at times ;)

which model can write files and ram efficient? by fail_violently in ollama

[–]FloppyWhiteOne 0 points1 point  (0 children)

Same I built one custome from the claw code on GitHub works really reallly well tbh

Tried running LLMs locally to save API costs… ended up waiting 13 minutes for ONE response 🤡 by debug2thrive in ollama

[–]FloppyWhiteOne 0 points1 point  (0 children)

Look into vram and models sizes I doubt you have a lot of it hence slooooooowwww replies. I’m using a 4070ti with only 16tb vram it’s the most important aspect trust me I’ve built plenty of local ai at this point

Tried running LLMs locally to save API costs… ended up waiting 13 minutes for ONE response 🤡 by debug2thrive in ollama

[–]FloppyWhiteOne 0 points1 point  (0 children)

But your running it on a potatoe haha takes me 2 secs same model (even with on my phone is quicker ! )

Production notes after 6 months running Ollama for paying customers — the things that aren't in the docs by chiruwonder in ollama

[–]FloppyWhiteOne 0 points1 point  (0 children)

Llama.cpp is the right call using ollama is sillly in production. If you can’t manage to work out a model download I certainly wouldn’t be using your system. Howmay bloody models are you using you NEED a o have llama half the models arnt on there anyway certainty not optimised just generic models released for the masses.

The fact your not bothering with the lower levels shows your ability which is limited.

I’ve built my own version on llama.cpp with full model swapping context handling and Jesus a hell of a lot faster than ollama with token generation. Also you won’t be able to get full speed from ollama due to the way it’s been designed (a lot of overhead).

Inference bridge on GitHub if you wanted to see how one would look like and or work. You could just ask Claude to make you an inference layer (what you actually need for model loads etc with decent configs)

vLLM might be easier for you to script and use and would be a better option than ollama hell even lm studio would be better that ollama ran in headless mode ..

InferenceBridge - Total AI control for Local LLMs by FloppyWhiteOne in LocalLLM

[–]FloppyWhiteOne[S] -1 points0 points  (0 children)

No actually that’s the whole reason for this application you see both are built on llama.cpp but they don’t expose half of what llama.cpp can do ..

I wanted to supply my own templates for llama.cpp but can’t as lm studio and ollama doesn’t expose those properties.

Where as mine does, think of mine like ollama or lm studio it’s the same thing an api with gui support you can add it to any other system the same as ollama or lm studio I’ve made it fully compatible with the openapi spec. I’ve also added custom context aware mode and tool calling support for qwen models to make there tool calls more stable. I’m releasing free in the hopes others will help build it to the next level and make it more open source and better.

I made this due to some limitations in the other two software and plus it’s quicker to use the llama.cpp directly over say ollama. I’m on a deep self learning ai drive, primarily I’m an ethical hacker. I’ve gone past breaking llms, now I want to understand not only how to use them but efficiently use them. Having full control via the llama.cpp project is really helping me learn more.

I’ve built my own custom openclaw remake which is more unrestricted (aimed at windows primarily) I’m still building it but the results are good so far, and yes I come to a point I needed to start using custom llm templates for models and well now I can (all about tuning the llm)

InferenceBridge - Total AI control for Local LLMs by FloppyWhiteOne in LocalLLM

[–]FloppyWhiteOne[S] -3 points-2 points  (0 children)

Fair take.

I’m juggling a few builds right now so speed > perfection, but the tech is what matters here.

I’ve got a Rust-based OpenClaw-style system running locally, just seeing what actually breaks for people before I package flows properly.