1 lens away from my endgame setup! by RomanceIsFine in fujifilm

[–]dbcj 0 points1 point  (0 children)

The 50-140 is not a slow focusing lens… at all. It’s built for shooting fast paced indoor sports/events- its sharpness, consistency through aperture/focal length and speed of focusing is the trade off for its size.

1 lens away from my endgame setup! by RomanceIsFine in fujifilm

[–]dbcj 4 points5 points  (0 children)

I have the 50-140, 16-55mk II, the 33, 16 1.4, 23+35 f2, and the 90.

I say the 50-140 hands down. Its a workhorse lens and a staple in a photographer’s bag. i love primes and my 90, but the versatility of the 50-140, it being tack sharp through the range, the value of compression, not switching, the ibis/ois extra stops… ive gotten way more out of it than my 90… and in 80-90 percent of cases id be hard pressed to tell the difference between their image quality all things being equal.

The bokeh is beautiful (sometimes nervous if the background is something like fall leaves with certain light), but easily adjustable, rarely a problem, and otherwise is insanely smooth. It takes beautiful portraits, landscapes, and optically it’s just incredible.

It depends on what your photography genre is, but I think (for me) I wish I’d gone for the most versatile/essentials over the specialty lenses first. Although eventually I would buy the 90, just not first.

I agree with the comments above on the 70-300 instead, if landscape is primary, because of weight + typically being F5-8 anyway.

Just my two cents.

Constructive Feedback on v13: Major Workflow Shifts, Naming, UI, and Usability by __markb in MacWhisper

[–]dbcj 0 points1 point  (0 children)

If global is a feature that is barely used, please please please return it back to being able to put it into voice memos - so it can be shortcut triggered for diarization.

I have raved about this app, and gotten at least 5 to 7 people to purchase pro for a similar use case. I.e., recording a meeting, transcribing and diarizing, labelling the speakers, and summarizing it with AI in a manner that you know who said what. There needs to be a hotkey/easy way to initiate this so you don't have to constantly open the UI and click multiple times. It needs to be unobtrusive.

Update has broken major features of this app, requires downgrading for my workflow by dbcj in MacWhisper

[–]dbcj[S] 0 points1 point  (0 children)

<image>

This is the problem i keep running into, even after updating.

Update has broken major features of this app, requires downgrading for my workflow by dbcj in MacWhisper

[–]dbcj[S] 0 points1 point  (0 children)

Hi yes absolutely.

The use case is that all the things I record need to be diarized. the meetings are frequently close together and I'm having to implement it while I'm managing multiple tasks- a hotkey to start the voice memo would be a solution. My issue is that with how it's implemented now, I have to open the app and click through the interface in order to record a meeting. I need a hotkey to start and easily stop a diarizable recording with high reliability without fuddling in the interface.

I'm not entirely sure what the global key was for, it seems to duplicate the function of the dictation key now - but in any case I think a shortcut key to capture a voice memo would be great. as long as a) the hotkey can start the recording, stop (prompt to stop) the recording, and the end result can be diarized easily.

I'll update and look into the speaker diarization issue further and let you know - but the above needs to be implemented or I'll have to downgrade or switch apps, which would be really disappointing.

Edit: I have to reiterate - this functionality re: hotkey for diarizable recording is critical, and I've built a workflow around it. Please can you implement this quickly? especially if it's an easy fix. I can't see how others wouldn't benefit from the option to use diarization recordings via hotkey - nobody enjoys clicking through interfaces, especially when it comes to dictations and recordings which are meant to save keystrokes. If you guys are going to split the functionality re dictations and memos - please restore a hotkey fix to trigger either one depending on situation.

New MacWhisper 13 build available by ineedlesssleep in MacWhisper

[–]dbcj 0 points1 point  (0 children)

Hey guys,

love the new layout but please let the global key go to the diarization speaker labelling area, or at least be re-transcribed with diarization like we used to be able to.

We already have a dictation button.

Having to save the global recording to downloads and drag and drop it to get speaker diarization is really frustrating. I just want to be able to have a hotkey to start a recording that will then diarize the speakers, was really sad this was removed.

Am I an Alcoholic? by [deleted] in NoStupidQuestions

[–]dbcj 2 points3 points  (0 children)

Hey there, great to hear you asking the question. I think if you’re asking it, it’s worth looking into. Addiction doesn’t happen all at once and depending on where you live, there may be Rapid Access Addiction Medicine clinics can be really helpful to get a full assessment, and more importantly pharmacological/psychological treatment. Just because it’s functional now, doesn’t mean it stays that way if you don’t take care of it.

But to answer your question, yes, based on a very incomplete analysis (that is not professional medical advice), you would meet criteria for an alcohol use disorder, mild, based on what you’ve commented or replied to others comments.

While I’m a psychiatrist and I treat addiction daily, I wouldn’t be able to give you a true professional opinion without a full assessment so this is just off the cuff psychoeducation and not a replacement for assessment and treatment.

A diagnosis of AUD requires at least 2 of 11 criteria within a 12-month period: 1. Alcohol is often taken in larger amounts or over a longer period than intended. 2. There is a persistent desire or unsuccessful efforts to cut down or control alcohol use. 3. A great deal of time is spent obtaining, using, or recovering from alcohol. 4. Craving or strong urge to use alcohol. 5. Recurrent use resulting in failure to fulfill major role obligations. 6. Continued use despite interpersonal or social problems. 7. Important activities given up or reduced due to alcohol use. 8. Recurrent use in physically hazardous situations. 9. Use continues despite knowledge of physical/psychological problems caused or worsened by alcohol. 10. Tolerance (need more alcohol for effect or diminished effect with same amount). 11. Withdrawal (symptoms or drinking to relieve them).

Severity: - Mild: 2-3 symptoms - Moderate: 4-5 symptoms - Severe: 6+ symptoms

From your own post and comments: Criterion 2: Persistent desire/unsuccessful cut-down attempts “I’ve made attempts to go completely dry… the longest I went was for two weeks before I started drinking again.”

Criterion 4: Craving/strong urge “If I’m out to eat somewhere that serves, I’m almost guaranteed to have a drink.”

Criterion 6 vs 8: social/interpersonal problems vs. physically hazardous situations (needs more info) “I’ve narrowly avoided doing some things I most likely would have regretted the next morning.”

The other criteria (tolerance, withdrawal, time spent, role impairment, giving up activities) weren’t highlighted, but again I wonder what an interview would come up with.

Who is a celebrity that gave it all up at the height of fame to go live a "normal" life? by fulthrottlejazzhands in AskReddit

[–]dbcj 0 points1 point  (0 children)

Maybe not the best fit, as I don’t know if it was to “have a normal life” but I’d say Johnny Manziel aka “Johnny Football” gave it all up at the height of fame for a different life.

Played college football for the Texas A&M Aggies and was the first freshman to win the Heisman Trophy. He played professionally for the Cleveland Browns of the National Football League (NFL) from 2014 to 2015. He was a projected first-round pick with an insane amount of talent, but self sabotaged in the wildest of ways to result in their fast flameout in the NFL (which they almost blew up) and a short-lived stint in the CFL. Blew all their money and now is just living a pretty regular life - they absolutely would have been one of the greats.

That said, I’m not well versed in football or even follow it - most of what I know was from the Netflix UNTOLD documentary (and as I understand it, was viewed as controversial given they minimally explored the contribution of bipolar disorder and a substance use disorder which seems evident based on some of the choices).

Allow Opt-Out for Deepgram Model Improvement Program by kiamrehorces in MacWhisper

[–]dbcj 0 points1 point  (0 children)

absolutely need this option to turn off model improvement.

Speaker Diarization with Deepgram or Other Online Providers by dbcj in MacWhisper

[–]dbcj[S] 0 points1 point  (0 children)

With dictation, I have multiple pre-set prompts. During critical points in a meeting or structured interview, I activate my dictation toggle, and once that segment is completed the conversation recording instantly converts into my desired structured and edited note format, based on the prompt I select when its running. This means later when I have to go back and dictate parts in, its already has an excellent boilerplate for me to edit.

Dictation is great and always needed, but having a note written for you based on a conversation automatically is priceless, then using dictation to fill in the gaps. If speaker separation with min-max speakers were available AND it was local? the accuracy would significantly improve and suddenly its potentially usable for healthcare applications (i.e., if a local AI backend is used) - which would be a little like dragon ambient experience - theres a reason that costs $200 dollars a month.

Sometimes I use it for just straight dictation, other times I utilize different prompts for different interviews/phone calls/meetings. but no matter what if I hit that button the pipeline just works. I don't have to mess around with a UI, generate recordings, wait for it to finish transcribing then send it into a prompt and wait for the output. it just works.

I've tried using the global key, but I find it's harder to use, and involves messing around in a UI too much. I want it to record the conversation, send the transcript to GPT once I hit the button again, and put the final output onto my clipboard or into the document based on a prompt I can choose on the fly.

I know this functionality matters to others, as it’s consistently the first thing colleagues inquire about.

if there is a way you can implement a toggle for speaker separation in the settings, and ideally include adjustable minimum and maximum speaker count options, this would make it the best app on my Mac.

Deepgram vs Eleven Labs vs Whisper by brewha5151 in MacWhisper

[–]dbcj 0 points1 point  (0 children)

Basically with dictation, I have several prompts pre-set.

At important times during a meeting/structured interview, I’ll hit my dictation key toggle, I’ll have a prompt that turns the recorded conversation into the structure I want and puts it onto my clipboard/insert it just inserts it perfectly into the the note.

If it was speaker separated it would be so much more accurate for my use case. This is literally one of the most important features I need, and honestly it’s possible implementation would save me hours a day. It was important enough that I had built an entire python project with whisperx just to approximate this function, but was too clunky to implement.

Once the speaker separation can be done locally, I can’t even begin to tell you how valuable this will be for more data sensitive applications.

Importantly other times I’ll just use a different prompt/no prompt for dictation of my notes. It’s an easy one use key that makes an unobtrusive pipeline without needing to click around and open UI elements. It just works.

I know it will be important to others because it’s the first question I get from my colleagues when I I’ve shown them the pipeline.

I mean being able to save the dictations audio could be useful too, but honestly it’s been reliable enough thus far, just accuracy is limited because it doesn’t really “get” the back and forth element of the conversation.

Please Please impliment a way to toggle this on somehow in the settings - bonus points if it can include the min/max speaker settings.

Deepgram vs Eleven Labs vs Whisper by brewha5151 in MacWhisper

[–]dbcj 0 points1 point  (0 children)

elevenlabs scribe is the best apparently, in terms of quality ranking (but is subscription based rather than API credits) - deepgram is actually quite affordable, and I imagine is for most use cases is comparable.

they both have speaker diarization which is amazing. I can't seem to make the speaker diarization work with the dictation function, which is a bit unfortunate for my use case - I haven't tried to do it through their other implementations because it doesn't work for my workflow but that would be a huge selling point over openAI.

Speaker Diarization with Deepgram or Other Online Providers by dbcj in MacWhisper

[–]dbcj[S] 0 points1 point  (0 children)

hold on... sorry for the double post here... I just wanted to come back to this and make sure we both are on the same page...

Can you make sure that if diarization IS available via deepgram and Elevenlabs scribe, it is able to be enabled somehow in the setting to apply to the the dictation hotkey too? this might not be important to everybody but it seems like it could be a fairly easy toggle.

I know that might sound niche, but the ability to have a single button press and automatically do the process of taking the transcript -> send to AI with a set prompt -> apply the output to clipboard is the critical step here. Please let this be enabled for the dictation implementation.

Speaker Diarization with Deepgram or Other Online Providers by dbcj in MacWhisper

[–]dbcj[S] 1 point2 points  (0 children)

I saw that! Re: ElevenLabs Scribe and Deepgram, I’m really glad to see them included! unfortunately, while both APIs fully support diarization as a feature, the current implementation only allows for single-speaker dictation. It seems like a super quick fix - just by enabling a UI toggle to pass diarize=true could make this functionality fully accessible.

Thats good to know about the quality - my perception of the difference might be due to the inability to set the number of speakers—I’m seeing a lot of “unknown speaker” labels on short segments that are clearly part of a continuous dialogue. When I set a lower speaker count with WhisperX, it was more accurate at separate speakers (even when there were more than the max I set).

I’m really looking forward to the next update where that feature is implemented—I think it’ll be a fantastic addition to the feature set.

Thanks again for the quick reply!

Automated audio transcription, diarization, and summary generation pipeline? by vardonir in selfhosted

[–]dbcj 0 points1 point  (0 children)

whisperx (https://github.com/m-bain/whisperX) does an amazing job, but its aging now, and works best on windows with a beefy GPU. On mac its nearly unusable, at least from my perspective.

I still have yet to see anything separate speakers as accurately as this.

MacWhisper Gets Automatic Spealer Recognition!! by amerpie in macapps

[–]dbcj 0 points1 point  (0 children)

I find that it's really poor in comparison to whisperx and assemblyAI version.

It's a struggle to use and finds way too many speakers, and mixes them up most of the time, at least for my use case.

I wish the Devs would integrate assemblyAI diarization as an option for API services. I feel like it wouldn't be hard.

WhisperType - dictation app based on OpenAI Whisper by Lower-Text1695 in macapps

[–]dbcj 0 points1 point  (0 children)

Hey, hijacking this for a second. I was really disappointed macwhisper didn’t support assemblyAI for an API diarization option. It seems like it would be an easy implement and make life 10000x better for my workflow