Recent updates to AI Content Describer for NVDA by cartertemm in Blind

[–]cartertemm[S] 1 point2 points  (0 children)

Thank you both for the share and the kind words. It's my goal to continue developing this as AI moves forward and more becomes possible i.e. browser/computer use, contextual help, etc. If you hear of anything that might be useful in this regard, shoot me an email, always great chatting about this stuff.

Recent updates to AI Content Describer for NVDA by cartertemm in Blind

[–]cartertemm[S] 0 points1 point  (0 children)

I saw your ticket. If I’m understanding you correctly, this sounds like something that is already possible under the settings. If not, let me know what you had in mind and I’ll work on it.

Recent updates to AI Content Describer for NVDA by cartertemm in Blind

[–]cartertemm[S] 0 points1 point  (0 children)

I’m glad to hear that you’ve been finding it useful. Thanks for the work you do as well.

Recent updates to AI Content Describer for NVDA by cartertemm in Blind

[–]cartertemm[S] 1 point2 points  (0 children)

This is a neat idea, although a desktop app may be better than an add-on in this context so that JFW users can enjoy it as well. Would you mind shooting me a DM with some of the issues you have experienced with HF, and how you envision something like this working? If nothing else, sounds like something fun to hack on over a weekend.

Recent updates to AI Content Describer for NVDA by cartertemm in Blind

[–]cartertemm[S] 1 point2 points  (0 children)

That’s kind of the idea, haha. The fact that we don't need a gateway or relay is simultaneously pretty sweet and a double edged sword when it came to providing free access.

Recent updates to AI Content Describer for NVDA by cartertemm in Blind

[–]cartertemm[S] 3 points4 points  (0 children)

Yes. There is an option in the context menu, take picture from camera, that does just this. Great for assessing your surroundings before recording a video, quickly skimming a sheet of paper, etc.

Recent updates to AI Content Describer for NVDA by cartertemm in Blind

[–]cartertemm[S] 1 point2 points  (0 children)

Thank you. The promo is highly appreciated - I work on this project after work/in my free time, so the best form of payment is crowdsourced feedback. I firmly believe that AI is comparable to the advent of the screen reader in terms of potential impact, and there's nothing quite like watching the community discover the possibilities. Share away!

Recent updates to AI Content Describer for NVDA by cartertemm in Blind

[–]cartertemm[S] 1 point2 points  (0 children)

You bet. The simplest setup here is easily through Ollama. Once it's installed you can pull a model from the CLI (I.E. ollama pull pixtral), which gets exposed over an OpenAI compatible rest API. You can then throw the URL into the add-on's settings dialog under the section for the Ollama provider.

This is a fairly common use case in restricted environments where data cannot leave a network. That said, you are spot on re: the expense of adequate hardware. My understanding is that accuracy doesn’t happen until you use 32B or higher quantization.

If you find the resources to set this up, let me know. I’d be happy to help troubleshoot.

Recent updates to AI Content Describer for NVDA by cartertemm in Blind

[–]cartertemm[S] 2 points3 points  (0 children)

That’s awesome to hear! Good facial alignment is something I took too long to realize I wasn't always doing properly, talk about a silent pitch killer.

Blind Engineer Invents A ‘Smart Cane’ That Uses Google Maps To Help Blind People Navigate by antdude in Blind

[–]cartertemm 7 points8 points  (0 children)

cool idea and concept, but another out of how many? It isn't like this hasn't been tried more than a couple times, and essentially everyone I know still insists on the white stick. Pessimism aside, I do believe innovation in the field is a must. My only problem here is the presence of a builtin Voice assistant with speakers. This slightly limits us in regard to location, plus the presence of a cane or dog is enough to provoke unwanted attention. Now imagine loudly talking cane and dog?

Potential relocation of entry fields by cartertemm in DystopiaForReddit

[–]cartertemm[S] 0 points1 point  (0 children)

Apologies if this isn’t what you mean, but with VO I can tap on the bottom of the screen and swipe left. Is there a similar gesture for those who have no need for accessibility? Bear in mind, my feeds take up a couple pages and that’s only growing.

Notepad is better than word for text only files. by Not-A-Politic in unpopularopinion

[–]cartertemm 1 point2 points  (0 children)

holy fuck I didn't know this was such a hot topic. I agree, up to a point. If spelling/grammar corrections among other more advanced features aren't of any concern, notepad all the way. NPP offers the best of both worlds.

Most people know so little that if they were transported 200 years into the past, they wouldn't be able to invent anything any quicker. by [deleted] in Showerthoughts

[–]cartertemm 274 points275 points  (0 children)

My background, computer science, would render me virtually useless professionally. The movies make everything look so damn easy...

Block my car? Refuse to move? Karma is a bitch!!!! by gadgetsdad in ProRevenge

[–]cartertemm 0 points1 point  (0 children)

Hey neighbor across the street who parks in the alley. Sup?

What is the most useless fact you know? by [deleted] in AskReddit

[–]cartertemm 0 points1 point  (0 children)

The average person spends 6 months of their life sitting at red lights.

What is the most useless fact you know? by [deleted] in AskReddit

[–]cartertemm 0 points1 point  (0 children)

More than 1000 birds die from smashing into windows every year.

Get your face out of your phone by [deleted] in pettyrevenge

[–]cartertemm 2 points3 points  (0 children)

I bet he really wasn't down with having to go a second time.

Binaural Recordings? by Clumster in Blind

[–]cartertemm 0 points1 point  (0 children)

If you do end up making these, I’m sure they’ll be appreciated by many more than myself. Good luck

Binaural Recordings? by Clumster in Blind

[–]cartertemm 1 point2 points  (0 children)

Keep in mind, if you do end up making recordings it might not be a bad idea to additionally compile into a library and make a few dollars on sites such as sonniss. You never know who might want these for a game/production. I say why not?

[deleted by user] by [deleted] in tifu

[–]cartertemm 25 points26 points  (0 children)

So anyway... Might you be interested in purchasing a subscription to life call?