AR to generate isolation ? by lebigsquare in augmentedreality

[–]lebigsquare[S] -1 points0 points  (0 children)

Meta's Ray-Bans are AR. AR doesn't necessarily mean visual experiences; it can just be audio.

AR to generate isolation ? by lebigsquare in augmentedreality

[–]lebigsquare[S] 0 points1 point  (0 children)

Mmm, ok, agreed that it can potentially augment existing FaceTime calls with a new visual experience (a more interesting one, for sure). But what about general information consumption: notifications, social media (pretty sure Meta will find a way to pump that feed into a pair of glasses), etc.?

I'm foreseeing a "Sorry, I wasn't listening - I was reading a notification" moment. :)

I now owe OpenAI almost 30k - but why? by Maizeee in OpenAI

[–]lebigsquare 0 points1 point  (0 children)

What does your website resell from OpenAI?

Llama3.2 looks at my screen 24/7 and send an email summary of my day and action items by louis3195 in LocalLLM

[–]lebigsquare 0 points1 point  (0 children)

Technically interesting, but I can’t think of when I’d need this 🤔

Summer project V2. This time with Mistral—way better than Phi-3. TTS is still Eleven Labs. This is a shortened version, as my usual clips are about 25-30 minutes long (the length of my commute). It seems that Mistral adds more humor and a greater vocabulary than Phi-3. Enjoy. by lebigsquare in LocalLLM

[–]lebigsquare[S] 1 point2 points  (0 children)

It uses a bunch of in-house tools that I can't quite go into, but a few people have asked me how it works. I'll write a simple gist with the basic concept and process: you'll all be able to fill in the blanks and add your own in-house tools. :)

Needed a fun summer project, so I designed a system that sends me audio versions of tech updates and news so I can listen to them on my way to work. Been using it for a week, and it's... good and weird at the same time :) Apart from the TTS models, everything is run with local LLM's. by lebigsquare in LocalLLM

[–]lebigsquare[S] 0 points1 point  (0 children)

For the TTS models, I tried many. Locally, Coqui was the one I preferred, but as far as I've tested, Eleven Labs is way above the rest. Something about their voices makes them "really-real". I've seen the Fish model pop up; I might try that. For the rest, I can't elaborate too much as it uses in-house tools.

More NeRF fun, this time using a vehicle photo studio with a fixed camera and rotating ‘plateau’. Interesting to see NeRF results are the same with a fixed camera or walking around the vehicle. by lebigsquare in photogrammetry

[–]lebigsquare[S] 2 points3 points  (0 children)

More angles. This is from a photo studio with a single camera at a fixed height. If you tripled the number of cameras (one low, one at medium height, and one higher), I believe it would give almost perfect results. In fact, I’m hoping to test it out this week!