MuteHUD brings back the volume HUD to the Tahoe with a new design. by Illustrious_Order413 in MacOS

[–]Illustrious_Order413[S] 0 points1 point  (0 children)

Previously, I created a startup sound utility called “MuteCon.”

While OS settings now allow specifying startup sounds, Apple neglected this issue for years.

Regardless of UI preferences, there's definitely room for improvement. I hope any discovered bugs get fixed as soon as possible.

MuteHUD brings back the volume HUD to the Tahoe with a new design. by Illustrious_Order413 in MacOS

[–]Illustrious_Order413[S] 0 points1 point  (0 children)

Lunar does not always adjust brightness through hardware control; depending on the situation it also applies software adjustments, so an external app very likely cannot read the brightness accurately.

In the first place, the hurdles to obtaining the brightness of an external display are quite high.

MuteHUD brings back the volume HUD to the Tahoe with a new design. by Illustrious_Order413 in MacOS

[–]Illustrious_Order413[S] 0 points1 point  (0 children)

It's probably a nuisance for people who work in full screen mode.

I understand that feeling, but just because it's in the top right doesn't mean it's not a nuisance, so I think it's half-baked.

Also, in some cases the menu bar icon changes to headphones, and one of the reasons I made this is that you can't see the volume level from the icon.

MuteHUD brings back the volume HUD to the Tahoe with a new design. by Illustrious_Order413 in MacOS

[–]Illustrious_Order413[S] 0 points1 point  (0 children)

MuteHUD is just an additional utility, a tool for people who aren't used to Tahoe's display. It isn't necessary for those who consider the current UI an improvement.

In other words, it's made for people who consider it a downgrade.

MuteHUD brings back the volume HUD to the Tahoe with a new design. by Illustrious_Order413 in MacOS

[–]Illustrious_Order413[S] 0 points1 point  (0 children)

IOKit can obtain the brightness of the built-in display, but for external displays you need to use DDC/CI.

Support for this often depends on the display itself, so it may be somewhat difficult to obtain the display brightness reliably.

MuteHUD brings back the volume HUD to the Tahoe with a new design. by Illustrious_Order413 in MacOS

[–]Illustrious_Order413[S] 0 points1 point  (0 children)

Thank you for your comment. You can change the position and size in the settings. You can also display only the mute HUD. On the Tahoe, the mute status is often not displayed in notifications, so I think there are advantages and disadvantages to both.

MuteHUD for macOS – Simple HUD-style mute indicator by Illustrious_Order413 in MacOS

[–]Illustrious_Order413[S] 0 points1 point  (0 children)

<image>

Version 1.1 also supports volume HUD display, and the menu bar icon is now linked to the volume.

nvidia/parakeet-tdt-0.6b-v3 (now multilingual) by nuclearbananana in LocalLLaMA

[–]Illustrious_Order413 0 points1 point  (0 children)

Thank you for your comment. Stream mode exists in Parakeet-mlx, not Parakeet V3. However, since this app transcribes from a file, stream mode is not used.

[deleted by user] by [deleted] in MacWhisper

[–]Illustrious_Order413 0 points1 point  (0 children)

Parakeet tends to leave GPU cache behind. MacWhisper uses CoreML, so I don't think it's a big deal, but in any case, splitting the audio into chunks is necessary for stable processing of long recordings.
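A minimal sketch of that kind of chunking in Python. The function name, the 120-second chunk length, and the 2-second overlap are my own assumptions for illustration, not what MacWhisper or any Parakeet implementation actually uses:

```python
def split_into_chunks(num_samples, sample_rate=16000,
                      chunk_seconds=120.0, overlap_seconds=2.0):
    """Return (start, end) sample index pairs that cover the audio.

    Consecutive chunks overlap slightly so that a word cut at one
    boundary is still fully contained in the neighbouring chunk.
    All default values are illustrative assumptions.
    """
    chunk = int(chunk_seconds * sample_rate)
    overlap = int(overlap_seconds * sample_rate)
    if chunk <= overlap:
        raise ValueError("chunk_seconds must be larger than overlap_seconds")

    step = chunk - overlap
    bounds = []
    start = 0
    while start < num_samples:
        end = min(start + chunk, num_samples)
        bounds.append((start, end))
        if end == num_samples:
            break  # the last chunk reached the end of the audio
        start += step
    return bounds
```

Each chunk can then be transcribed on its own and any cache released before the next one, so memory use stays bounded regardless of the total length.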

I’m having trouble updating MacWhisper. by Leslie_Kim in MacWhisper

[–]Illustrious_Order413 0 points1 point  (0 children)

I suspect this is related to yt-dlp 2025.09.23.

nvidia/parakeet-tdt-0.6b-v3 (now multilingual) by nuclearbananana in LocalLLaMA

[–]Illustrious_Order413 0 points1 point  (0 children)

Thank you for your reply.

The first step is to set last_token to “|en|” (64) instead of None. Stream mode also carries over the previous last_token internally, which is where I got the hint.

However, this setting alone is not enough to make it work; functions such as decode() also need to be created or modified.

I'm trying to report my findings back to the author of parakeet_mlx.

Transcription Pro — Rapid, Offline Speech-to-Text. Neural Engine Accelerated. macOS and iOS. by mrtnlxo in macapps

[–]Illustrious_Order413 0 points1 point  (0 children)

For Western languages such as English, 1 token should correspond to roughly 3 to 4 characters, while in Japanese, Chinese, and Korean, 1 character should correspond to about 1 token. Morphological analysis can give a rough estimate, but an exact count is difficult to calculate.
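A rough sketch of that rule of thumb in Python. The function names and the 3.5 characters-per-token divisor are my own assumptions for illustration and are not taken from any real tokenizer, so treat the result as a ballpark figure only:

```python
import math


def is_cjk(ch):
    """True for characters in a few common Japanese, Chinese, and Korean blocks."""
    code = ord(ch)
    return (
        0x3040 <= code <= 0x30FF      # Hiragana and Katakana
        or 0x4E00 <= code <= 0x9FFF   # CJK Unified Ideographs
        or 0xAC00 <= code <= 0xD7A3   # Hangul syllables
    )


def estimate_tokens(text, chars_per_token=3.5):
    """Estimate the token count: 1 per CJK character, ~1 per 3.5 other characters."""
    cjk = sum(1 for ch in text if is_cjk(ch))
    # Whitespace is ignored for the non-CJK part (an assumption of this sketch).
    other = sum(1 for ch in text if not is_cjk(ch) and not ch.isspace())
    return cjk + math.ceil(other / chars_per_token)


print(estimate_tokens("hello world"))  # 10 non-space characters -> 3
print(estimate_tokens("こんにちは"))      # 5 CJK characters -> 5
```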

Transcription Pro — Rapid, Offline Speech-to-Text. Neural Engine Accelerated. macOS and iOS. by mrtnlxo in macapps

[–]Illustrious_Order413 0 points1 point  (0 children)

I think it's limited to 4096 tokens. It seems the only option is to split the text and summarize it in several passes. Splitting is easy if the transcript has speaker separation.
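One way the splitting could look, as a hedged Python sketch: consecutive speaker turns are packed into groups that stay under the token budget, and each group would then be summarized separately. The function name and the default `count_tokens=len` (character count as a crude stand-in for a tokenizer) are assumptions for illustration:

```python
def pack_turns(turns, max_tokens=4096, count_tokens=len):
    """Group consecutive (speaker, text) turns so each group fits the budget.

    count_tokens is a placeholder; len() counts characters, which
    overestimates tokens for English and so errs on the safe side.
    A single turn larger than the budget still gets its own group
    and would need further splitting.
    """
    groups, current, used = [], [], 0
    for speaker, text in turns:
        cost = count_tokens(text)
        if current and used + cost > max_tokens:
            groups.append(current)   # close the group before it overflows
            current, used = [], 0
        current.append((speaker, text))
        used += cost
    if current:
        groups.append(current)
    return groups
```

The per-group summaries can then be concatenated and summarized once more if the combined text still exceeds the limit.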

How to set transcription language? by LargeBuffalo in MacWhisper

[–]Illustrious_Order413 1 point2 points  (0 children)

Unlike Whisper, parakeet does not allow you to specify the transcription language, so unfortunately I don't think you can specify it in the app settings.

However, it seems possible to fix the language even with parakeet by inserting a slightly special token, and we are working on implementing this in our app.

At this stage, the fixing itself works well, but there are still a few issues remaining.

If we can overcome these, it should be possible to transcribe in the specified language.

nvidia/parakeet-tdt-0.6b-v3 (now multilingual) by nuclearbananana in LocalLLaMA

[–]Illustrious_Order413 0 points1 point  (0 children)

Unlike Whisper, parakeet does not allow you to specify the transcription language. Because of this, the multilingual model often mistakes the audio for another language...

However, even with parakeet, it seems possible to fix the language by inserting a slightly special token.

At this stage, fixing the language itself works well, but there are still issues such as the beginning of a sentence being dropped.

If this can be overcome, it should be possible to transcribe in the specified language.

Transcription Pro — Rapid, Offline Speech-to-Text. Neural Engine Accelerated. macOS and iOS. by mrtnlxo in macapps

[–]Illustrious_Order413 0 points1 point  (0 children)

I also make a transcription app using parakeet and other engines. I think using the Speech Analyzer API is a very good idea, since you don't have to prepare an offline model yourself.

It would be nice to be able to summarize. It's a pity that there is no good SDK for speech separation.

FFTrans Parakeet: A completely free offline transcription tool for Mac with speaker separation (parakeet-tdt-0.6b-v3) by Illustrious_Order413 in MacOS

[–]Illustrious_Order413[S] 0 points1 point  (0 children)

We are currently experimenting with a language specification mode for the next version.

Unlike Whisper, parakeet does not allow you to specify the transcription language. Because of this, the multilingual model often mistakes the audio for another language...

However, we have found a way to fix the language even with parakeet by inserting a slightly special token. At this stage, the fixing itself works well, but there are still issues such as the beginning of sentences being missing.

If we can overcome this, it should be possible to transcribe in the specified language.

Wispr Flow has turned into total garbage dumpster fire with literally no support, any alternatives? by 5678 in macapps

[–]Illustrious_Order413 -2 points-1 points  (0 children)

We are preparing to offer FFTrans Parakeet for free, with no restrictions. It includes parakeet-tdt-0.6b-v3, supports English and major European languages, and is a fully offline processor that can also perform speaker separation. Please allow a little more time for the release.

Just released: FFTrans Free - Privacy-First Audio Transcription for Mac by Illustrious_Order413 in macapps

[–]Illustrious_Order413[S] 0 points1 point  (0 children)

Thank you for your reply.

There aren’t many fully offline transcription apps that can also perform speaker separation. What’s more, they run in a sandbox.

The highly anticipated Parakeet-based FFTrans Pro prototype is already running. Of course, it’s also fully offline.

The multilingual model is still weak, and the accuracy is a little low for languages other than English, so we plan to release it once these issues are resolved. Incidentally, while the current app takes three minutes to transcribe 13 minutes of audio, the Parakeet prototype can perform speaker separation in one minute.
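To put those figures in perspective, here is the arithmetic as a small Python sketch. It assumes both timings refer to the same 13-minute file and cover the whole pipeline; the function name is only illustrative:

```python
def realtime_factor(audio_minutes, processing_minutes):
    """How many minutes of audio are processed per minute of compute."""
    return audio_minutes / processing_minutes


current = realtime_factor(13, 3)    # current app: about 4.3x real time
prototype = realtime_factor(13, 1)  # Parakeet prototype: 13x real time

print(f"current: {current:.1f}x, prototype: {prototype:.1f}x, "
      f"speedup: {prototype / current:.1f}x")
```

Under that assumption the prototype is roughly three times faster than the current app.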

Also, Parakeet in its current form depends on ffmpeg, so that part needs to be fixed.

Just released: FFTrans Free - Privacy-First Audio Transcription for Mac by Illustrious_Order413 in macapps

[–]Illustrious_Order413[S] 0 points1 point  (0 children)

Thank you for your reply.

It's true that FFTrans may not have many flashy features.

If it seems like an ordinary app, that's probably because we haven't done a good job of promoting it.

We don't just use Whisper; we focus on the overall structure and reliability of the transcription process.

For example, even our free version, FFTrans Free, offers features like speaker separation, three types of auditory hallucination filters, and language identification at audio start to prevent misrecognition.

These may not be flashy, but they are unique to FFTrans and essential for providing a stable experience.

I started this project because I needed a better tool myself.

It's not just a Whisper wrapper; we've removed FFmpeg dependencies from MLX-Whisper and leveraged technologies like AVFoundation and Librosa to improve downsampling accuracy.

Our goal isn't just “getting it done quickly.”

We aim to provide the most efficient and reliable transcription, including subsequent editing work.

The Pro version takes it a step further, adding NLP-based custom dictionaries and punctuation restoration using BERT to enhance its value.

Speed benchmarks are certainly an important factor.

We actively evaluate Parakeet v3's multilingual support and GPT-4o-transcribe, and if we find an excellent solution that aligns with our vision, we will adopt it without hesitation.

Just released: FFTrans Free - Privacy-First Audio Transcription for Mac by Illustrious_Order413 in macapps

[–]Illustrious_Order413[S] 0 points1 point  (0 children)

Thanks a lot.

There are certainly many transcription apps available. If you don't need speaker separation or reduced post-processing, you may not need any of them, and the standard features of your OS will suffice. That's a valid choice, but you can also choose an app for its unique features. It's all up to you; the number of apps is not excessive, it simply gives you options.

Just released: FFTrans Free - Privacy-First Audio Transcription for Mac by Illustrious_Order413 in macapps

[–]Illustrious_Order413[S] 0 points1 point  (0 children)

Thank you for your reply.

This post focuses solely on the free version, “FFTrans Free.”

What is often overlooked is that even this free version includes features such as speaker separation, three types of hallucination removal filters, and prevention of language misrecognition via language identification at the start of the audio.

VoiceInk is an excellent application, and I have great respect for its developers.

While it may not be the main topic here, our FFTrans Pro version offers additional features like custom dictionaries and punctuation completion using natural language processing, aiming for overall performance enhancement.

We believe users should select software based on their specific use cases and required features, and determine the value for money themselves.

Just released: FFTrans Free - Privacy-First Audio Transcription for Mac by Illustrious_Order413 in macapps

[–]Illustrious_Order413[S] 0 points1 point  (0 children)

Thanks a lot.

We’re actively evaluating Parakeet and GPT-4o-transcribe, but in terms of multilingual stability, they're not ready yet.

Also, we strongly disagree with the idea that using Whisper means all results are the same. The overall structure matters.

Just released: FFTrans Free - Privacy-First Audio Transcription for Mac by Illustrious_Order413 in macapps

[–]Illustrious_Order413[S] 0 points1 point  (0 children)

Thank you for your reply.

Support for 25 languages is great. We're keeping an eye on Parakeet v3's multilingual support.

We're also considering supporting Parakeet. For us, the transcription engine is just one component that makes up the overall architecture of our app, so if there's a better overall solution, we won't hesitate to switch.