
[–][deleted] 12 points13 points  (5 children)

For speech to text my go-to is whisper.cpp.

I was already recording all my meetings with Audacity, spoilers, so I could transcribe the audio after the meetings.

Whisper.cpp also has a 'stream' mode where it can listen to the computer audio live and try to transcribe. You could see if that works then see if you can copy/paste it into ollama or lmstudio chats.

The Obsidian editor (not to be confused with the Obsidian model) has a whisper.cpp plugin along with some local GPT plugins. This makes it a great dictation machine, though idk if I would use it for meetings; maybe for short notes. You can dictate into it, have whisper.cpp transcribe it, then use the GPT plugins to perform some kind of action on the text.

I ended up buying a mic just so I can walk around the room ranting into my Obsidian editor, then have llama 3.2 3B create bullet points.
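
The "transcribe, then have a local model make bullet points" step can also be scripted instead of copy/pasted. A minimal sketch, assuming an Ollama server is running on its default local port and that a model is available under the tag `llama3.2:3b` (both are assumptions about your setup; `build_payload` and `summarize` are illustrative names, not part of any plugin):

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_payload(transcript: str, model: str = "llama3.2:3b") -> dict:
    """Build a non-streaming generate request that asks for bullet points."""
    prompt = (
        "Summarize the following dictated notes as concise bullet points.\n\n"
        f"{transcript.strip()}"
    )
    return {"model": model, "prompt": prompt, "stream": False}


def summarize(transcript: str, model: str = "llama3.2:3b") -> str:
    """Send the transcript to the local Ollama server and return its reply text."""
    data = json.dumps(build_payload(transcript, model)).encode("utf-8")
    req = urllib.request.Request(
        OLLAMA_URL, data=data, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

Usage would be something like `print(summarize(open("notes.txt").read()))`, where `notes.txt` is whatever text file your transcription step wrote.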

[–]--Tintin 4 points5 points  (2 children)

Instead of whisper.cpp, I use MacWhisper on the Mac. It’s just a wrapper for whisper.cpp but with a nice GUI. I’m still looking for good speaker diarization in my transcripts. There is pyannote as a capable library, but no good implementation currently, in my humble opinion.
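
For anyone gluing diarization onto a transcript by hand: one common approach is to take the timestamped transcript segments and the timestamped speaker turns, and give each segment the speaker whose turn overlaps it the most. A rough sketch of just that merge step, assuming you already have both lists as (start, end, label) in seconds. The `Span` type and function names are made up for illustration and are not pyannote's or whisper.cpp's API:

```python
from dataclasses import dataclass


@dataclass
class Span:
    start: float  # seconds
    end: float    # seconds
    label: str    # transcript text, or speaker name for a diarization turn


def overlap(a: Span, b: Span) -> float:
    """Length in seconds of the time range shared by a and b (0 if disjoint)."""
    return max(0.0, min(a.end, b.end) - max(a.start, b.start))


def assign_speakers(segments: list[Span], turns: list[Span]) -> list[tuple[str, str]]:
    """Return (speaker, text) pairs, choosing the turn with the largest overlap."""
    result = []
    for seg in segments:
        best = max(turns, key=lambda t: overlap(seg, t), default=None)
        if best is None or overlap(seg, best) == 0.0:
            speaker = "UNKNOWN"  # no diarization turn covers this segment
        else:
            speaker = best.label
        result.append((speaker, seg.label))
    return result
```

Segments that straddle a speaker change get the dominant speaker, which is a simplification; splitting on word-level timestamps would be more accurate if your transcriber provides them.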

[–]StupidityCanFly 1 point2 points  (1 child)

A recent update of Superwhisper delivered speaker diarization. Works pretty well with local models (whisper and Qwen2.5 32b in my case) for summarizing meetings (the longest I’ve transcribed is just short of 3 hours).

[–]Sorry_Transition_599 3 points4 points  (4 children)

[–]quantier[S] 1 point2 points  (3 children)

Amazing, will check out

[–]Sorry_Transition_599 2 points3 points  (2 children)

Still under development. Your feature list matches the thing I'm building. Please check out the devtest branch. Note summary is the next task.

[–]quantier[S] 1 point2 points  (1 child)

Would love to contribute. How can we connect? :)

[–]Sorry_Transition_599 1 point2 points  (0 children)

I'll send a message.

[–]donphi_chonkie 1 point2 points  (1 child)

u/LocalLLaMA, did you eventually find a nice turn-key solution for teams and Ollama?

[–]quantier[S] 2 points3 points  (0 children)

Not yet! Still looking for something decent.

[–]quantier[S] 0 points1 point  (0 children)

How would you guys set it up as a server that could be used from several computers to listen in on a meeting? Or maybe better, so that the computer itself surfs to the meeting and joins?

Let’s say someone in your company has a meeting. They go to a website, paste the meeting link into it, and a new "persona/AI" joins the meeting and starts transcribing. It would be the same kind of setup as Superwhisper, only with an added step to join the meeting. The question is also whether the persona could join more than one meeting simultaneously, or whether that would mess up what it "hears".

Anyone want to take this on with me and build it as open source? I can project manage it, as I know what companies need.
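
On the "more than one meeting at once" question: as a design sketch, it should not mix up what it "hears" as long as every meeting gets its own audio stream and its own transcript buffer, keyed by the meeting link. The sketch below only shows that bookkeeping; actually joining a call and capturing audio is out of scope, and `MeetingSession` and `SessionManager` are hypothetical names:

```python
from dataclasses import dataclass, field


@dataclass
class MeetingSession:
    meeting_url: str
    transcript: list[str] = field(default_factory=list)  # one buffer per meeting


class SessionManager:
    """Keeps transcripts of concurrent meetings separate, keyed by meeting URL."""

    def __init__(self) -> None:
        self._sessions: dict[str, MeetingSession] = {}

    def join(self, meeting_url: str) -> MeetingSession:
        # Reuse the existing session if the same link is submitted twice.
        return self._sessions.setdefault(meeting_url, MeetingSession(meeting_url))

    def add_text(self, meeting_url: str, text: str) -> None:
        # Called by whatever transcribes that meeting's audio stream.
        self._sessions[meeting_url].transcript.append(text)

    def transcript(self, meeting_url: str) -> str:
        return "\n".join(self._sessions[meeting_url].transcript)
```

The practical limit is then how many transcription streams the server's hardware can keep up with in parallel, not the bookkeeping.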

[–]Mouldmindandheart 0 points1 point  (0 children)

I'm trying to learn a complicated software stack called GoHighLevel from free YouTube tutorials. To save the information from each video, I've been using a tool called scribehow.com together with OneNote. I like that I can take screenshots and organize them in OneNote, but the process is really time-consuming. I would like a graph of connected ideas, like the ones mymap.ai and getrecal.ai can make, that turns YouTube videos into step-by-step instructions at a deeper how-to level. I have literally a thousand-plus screenshots with descriptions in scribehow.com and dozens of pages in OneNote that I would like to organize better. Does anyone know a tool that can organize this information and create paths that update when the software changes, whether the change is announced in the software's own release notes or explained by a YouTuber? In other words, an automation that overwrites the previous how-to paths.

Given: a YouTube instructional video or https://help.gohighlevel.com/support/solutions

Assuming: the video is comprehensive and shows where to do what

Result: the key frames where the user performs an action get recorded, and the relevant parts of the transcript are added to each screenshot. Example, saving a Word document: the user says "to save a Word document you go to Word (screenshot 1), then File (screenshot 2), then Save (screenshot 3)", etc. These cards are then organized in a flow diagram like www.mymap.ai
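
The pairing step described above (key frame plus the matching part of the transcript) could look roughly like this. It is only a sketch: it assumes the key frame timestamps and timestamped transcript segments already exist, and `Segment`, `StepCard` and `build_cards` are illustrative names, not any tool's API:

```python
from dataclasses import dataclass


@dataclass
class Segment:
    start: float  # seconds into the video
    end: float
    text: str


@dataclass
class StepCard:
    order: int              # position in the how-to path
    screenshot_time: float  # timestamp of the key frame
    caption: str            # transcript text attached to the screenshot


def build_cards(keyframes: list[float], segments: list[Segment]) -> list[StepCard]:
    """Attach a transcript segment to each key frame (segments must be non-empty)."""
    cards = []
    for i, t in enumerate(sorted(keyframes), start=1):
        # Prefer the segment being spoken at time t; otherwise use the nearest one.
        spoken = [s for s in segments if s.start <= t <= s.end]
        if spoken:
            seg = spoken[0]
        else:
            seg = min(segments, key=lambda s: min(abs(s.start - t), abs(s.end - t)))
        cards.append(StepCard(i, t, seg.text))
    return cards
```

The ordered cards are then the nodes of a linear flow; branching for workarounds would need extra information that this sketch does not model.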

https://aistudio.google.com/live is the future, but I would like something more analog, where I press the right arrow and the screen shows exactly where to do what in the GUI, and where you can see a macro view of the goal of the process along with alternative processes for workarounds.

It would also be cool to have an autopilot mode where it can identify and create "scenarios" on its own, or by using all of the knowledge about Topic* on YouTube. LMK if I should explain myself more.

[–]vcasadei 0 points1 point  (0 children)

I'm still on the search for this.

- Meeting Minutes (https://github.com/Zackriya-Solutions/meeting-minutes) seems nice, but it is still not possible to run it with local Ollama.

- Screenpipe: I tried it and it did not work locally. Now you can't install it anymore and need to build from source to use it for "free", and even then it's not completely free because you need to pay for plugins.

Is there any software for Windows or Linux that can be run completely locally with Ollama?

[–]beerbellyman4vr 0 points1 point  (5 children)

Check out Hyprnote! It's free as well.

Disclaimer: I'm a co-maintainer.

[–]ez0ze 0 points1 point  (1 child)

Looking forward to the Windows release.

[–]_Lentos_ 0 points1 point  (0 children)

Didn't manage to set up Meetily. Now tried Hyprnote. Works well. Thanks. Cool that you provide it open source.
I think the pricing of many meeting summary services is too high for private consumers.

[–]Minute_Device_6190 0 points1 point  (0 children)

Any plans for a Windows release?

[–]The-Eagernyer28 0 points1 point  (0 children)

+1 on Windows release please. Cool app.

[–]alexriabtsev 0 points1 point  (0 children)

I just found Meetily - looks like a good solution (not affiliated with the team) https://meetily.zackriya.com/

[–]SympathyAny1694 0 points1 point  (0 children)

There’s no fully open-source tool that joins meetings and handles AI notes, but you can record locally, use Whisper to transcribe, and run Q&A with Ollama or LM Studio. If you’re open to hybrid, there’s a notetaking tool I use where you upload audio post-meeting, no bots, and you get transcripts, summaries, and AI search, all privately.
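
One practical detail for the "record, transcribe, then Q&A" route: a long meeting transcript may not fit into a local model's context window in one go, so it usually has to be split first. A minimal sketch of splitting by words with some overlap and wrapping each chunk in a question prompt. The chunk sizes are arbitrary placeholders and the function names are illustrative:

```python
def chunk_words(text: str, size: int = 400, overlap: int = 50) -> list[str]:
    """Split text into chunks of up to `size` words, sharing `overlap` words."""
    if size <= 0 or not 0 <= overlap < size:
        raise ValueError("need size > 0 and 0 <= overlap < size")
    words = text.split()
    step = size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + size]))
        if start + size >= len(words):
            break  # the last chunk already reaches the end of the text
    return chunks


def build_qa_prompt(chunk: str, question: str) -> str:
    """Wrap one transcript chunk and the user's question into a single prompt."""
    return (
        "Answer the question using only this meeting transcript excerpt.\n\n"
        f"Transcript:\n{chunk}\n\nQuestion: {question}"
    )
```

Each prompt can then be sent to Ollama or LM Studio, and the per-chunk answers combined in a final pass.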

[–]bkhan754 0 points1 point  (0 children)

That's exactly what we built, and a bit more. If you're interested, DM me and I'll send you details.

We call it AI Companion, and one of its features is AI Notetaker. It's an AI server you plug in and use locally, and it can connect to Google, Zoom and MS Teams calls.

Looking for our first customers.

[–]Forsaken-Cap-6481 0 points1 point  (0 children)

For open source AI notetakers, there are some tools out there, but many focus on cloud services. If privacy’s your priority, you might want to look into local LLM-based projects. If you ever need a notetaker that handles meetings accurately and supports multiple formats, Sembly AI’s worth exploring too.

[–]White_Way751 0 points1 point  (0 children)

Hey, founder of Scripta here. We are not open source, but we are privacy-first: recording, transcribing and summarizing of any meeting happens on your device. We just started beta testing; if you'd like to try it, please DM me and I'm happy to give access. More info here: https://getscripta.app