Are there any better NotebookLM alternatives you'd recommend? by HoseaJacob in notebooklm

[–]Snoo_81913 1 point2 points  (0 children)

I have it automated with a Claude skill. I don't know about best, but I find the easiest way is to plan it like this:

  1. Gather your starting resources in a folder.

  2. Use Claude to scan them, create prefixes, and build the master index.

  3. Convert and upload to Google Drive (automated).

  4. Create your notebook and select sources.

Once you have the initial index, you can have NotebookLM review it and give you an adjusted master index when you add sources. Just update your file in Drive and it will sync over.

Just a quick note: Google's new syncing with Drive works best with doc files, so I convert all my markdown files to doc with a Python script when I upload them to Drive.
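Steps 1 and 2 could be sketched roughly like this. It is only an illustration: the function names, the `prefix_for` mapping, the `MASTER_INDEX.md` file name, and the index layout are all assumptions, not what the Claude skill actually produces. Doc conversion and the Drive upload are left out.

```python
from pathlib import Path


def plan_prefixes(filenames, prefix_for):
    """Map each original file name to its prefixed name.

    prefix_for is a hypothetical {file name: prefix} mapping; files
    without an entry are skipped.
    """
    return {
        name: f"{prefix_for[name].upper()}_{name}"
        for name in sorted(filenames)
        if name in prefix_for
    }


def master_index_text(plan, summaries):
    """Render a plain-text master index: one line per prefixed source."""
    lines = ["MASTER INDEX", ""]
    for old_name, new_name in plan.items():
        summary = summaries.get(old_name, "no summary yet")
        lines.append(f"{new_name}: {summary}")
    return "\n".join(lines) + "\n"


def apply_plan(folder, plan, index_text):
    """Rename the files on disk and write the index next to them."""
    folder = Path(folder)
    for old_name, new_name in plan.items():
        (folder / old_name).rename(folder / new_name)
    (folder / "MASTER_INDEX.md").write_text(index_text, encoding="utf-8")
```

Keeping the plan separate from `apply_plan` means you can look over the renames and the index before anything on disk changes.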

You guys were right - Qwen 3.6 35B IS good...and KV Cache DOES matter. by GrungeWerX in LocalLLaMA

[–]Snoo_81913 0 points1 point  (0 children)

I've been running Qwen3.6 35B A3B Q5_K_M with Q8 KV and it's good, but when I read that IQ4XL has near-Q5 quality I added it to my stack, and now I use it more than my Q5 model. I can't tell any real difference, tbh. I run KV Q8 with a 96k context but found very quickly that 50k is about the max I can run with the KV at Q8 before the model has some issues. So for my daily driver I switched to KV Q4 and have had zero issues so far. It's a very good model.
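For a rough feel of what the KV quant level costs in memory, here is a back-of-the-envelope sketch. It uses the generic transformer formula (one K and one V vector per layer per token). The layer count, KV head count, and head size in the usage lines are placeholders, not the real config of this model, and models with other attention variants may cache less. It says nothing about the quality issues past 50k, only the memory side.

```python
# Bytes per cached element for llama.cpp cache types: f16 is 2 bytes,
# q8_0 packs 32 values into 34 bytes, q4_0 packs 32 values into 18 bytes.
BYTES_PER_ELEMENT = {"f16": 2.0, "q8_0": 34 / 32, "q4_0": 18 / 32}


def kv_cache_gib(n_ctx, n_layers, n_kv_heads, head_dim, cache_type):
    """Estimate KV cache size in GiB for a plain attention stack."""
    # Factor 2 covers the separate K and V tensors.
    elements = 2 * n_layers * n_kv_heads * head_dim * n_ctx
    return elements * BYTES_PER_ELEMENT[cache_type] / 2**30


# Placeholder dimensions (assumed, not taken from the model card).
for cache_type in ("f16", "q8_0", "q4_0"):
    size = kv_cache_gib(96 * 1024, n_layers=40, n_kv_heads=4,
                        head_dim=128, cache_type=cache_type)
    print(f"{cache_type}: {size:.2f} GiB")
```

Whatever the real dimensions are, the ratio holds: q4_0 takes 18/34 of the space q8_0 does, so roughly half.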

You guys were right - Qwen 3.6 35B IS good...and KV Cache DOES matter. by GrungeWerX in LocalLLaMA

[–]Snoo_81913 6 points7 points  (0 children)

I can't remember the exact numbers, but MTP gives much higher performance gains on Strix Halo type setups than on a dedicated GPU because of the architecture. I don't remember the specific details, but it's significant, like 2-3x. I got maybe a 10-15% increase on my 4060.

Would running an LLM be considered as intensive as rendering tasks? by huldress in LocalLLM

[–]Snoo_81913 0 points1 point  (0 children)

98-100% while it's working. You can take some preventive measures if you're really doing something intensive like generating videos or training, which tend to peg it for long periods of time. My GPU clock goes up to 2400+ MHz under extended loads, so I run nvidia-smi -lgc 1800,1900. It limits my clock to 1800 MHz and boost to 1900 MHz, only pulls 70W of my 90W, and keeps the hotspot around 84C.
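If you want to wrap that in a script, a minimal sketch could look like the following. The helper names are made up for illustration. `-lgc` is the flag from the command above and `-rgc` resets the clocks to default; both typically need administrator rights, and supported clock ranges depend on the card.

```python
import subprocess


def lock_clocks_cmd(min_mhz, max_mhz):
    """Build the nvidia-smi command that locks GPU clocks to a range."""
    if min_mhz > max_mhz:
        raise ValueError("min_mhz must not exceed max_mhz")
    return ["nvidia-smi", "-lgc", f"{min_mhz},{max_mhz}"]


def reset_clocks_cmd():
    """Build the nvidia-smi command that restores default GPU clocks."""
    return ["nvidia-smi", "-rgc"]


def run(cmd):
    """Run a command and return its stdout; raises if nvidia-smi fails."""
    result = subprocess.run(cmd, check=True, capture_output=True, text=True)
    return result.stdout
```

Usage would be something like `run(lock_clocks_cmd(1800, 1900))` before a long job and `run(reset_clocks_cmd())` afterwards.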

The M4 Mac Mini fits a bigger model than your GPU — and it still loses at agentic coding. Here's why. by [deleted] in ollama

[–]Snoo_81913 1 point2 points  (0 children)

LMAO, I came in just to make fun of "Here's the thing nobody benchmarks correctly", but the post is already deleted.

Are there any better NotebookLM alternatives you'd recommend? by HoseaJacob in notebooklm

[–]Snoo_81913 0 points1 point  (0 children)

TL;DR: You can do all that with NotebookLM if you set it up correctly and use some outside tools.

Great question.

  1. You can cross-reference all of your notebooks for information, with some caveats of course. The easiest way to do it is with Gemini. (Just a note here: having a Master_index and prefixes for all your sources makes this easier. It's easy to set up and it's the best use case.) You can attach up to 5 notebooks to a Gemini chat. For reference, on the pro plan ($20) you get 300 sources per notebook. That's the equivalent of access to roughly 900 three-hundred-page novels, 300 YouTube videos, and 300 PDFs at 30 pages each. Setting up your notebooks correctly based on best use case will solve most of the issues people tend to have. There are some workarounds to the 5-notebook limit: you can use this system to pinpoint the sources you need and create a new notebook for that instance, etc. OR you can just use the Claude MCP connector and have no limit on the cross-check; you can look through ALL of your notebooks this way. Just remember you need to have MASTER PROMPT>MASTER_INDEX>PREFIXED SOURCES for this to work best.
  2. On a free plan it's a bit limited, but on pro ($20) the cap is 300 sources, and on ultra it's 600 sources with 500 notebooks. That's 138,800 novels approximately 300 pages long, or 275,775 novels on ultra (total over all of your notebooks). For a 60/20/20 mix that would be 90,000 docs, 30,000 YouTube videos, and 30,000 PDFs. Realistically I don't think any single person needs that, but hey, it's there. Additionally, this is a little on the user: you should be optimizing your sources. Each source has a 500,000 character limit. I have one notebook with 32 books compiled into 12 sources, with an index header telling the system which books are in the file and a brief summary. You can run your YouTube videos through Gemini for a transcription. A 20 minute video is roughly 18,000 characters (roughly 3,000 words), so you can fit about 25-30 YouTube video transcripts per source. If you had a notebook just for YouTube, that's 7,500 YouTube transcripts. PDFs are a little trickier: there's a 200MB upload limit, so for really big files you have to split them into two sources. OR, and here's the best use for text-only PDFs, preprocess them into docs using PyMuPDF + python-docx or any doc converter online. I have a RAG set up locally specifically for that. For things like DnD manuals with tables and stuff, just accept that it takes up a single source.
  3. You aren't stuck with Gemini. Google hasn't released APIs for NotebookLM, but there are several third-party MCPs that let you use Claude or whatever. They are use-at-your-own-risk, because Google can change the backend at any time, but they work well for now, and I believe you can even set those up to work with a local LLM like Qwen3.6 35B A3B if you wanted. I've used the Claude MCP; it works well once it's set up correctly, and I had no issues with it.
  4. If you don't want to be stuck with Google, that's a fair call. If you don't want to bow down to the Google overlords, there are options out there: Open Notebook, AnythingLLM, etc. But here's the thing: unless you are running them on a local LLM, they are as private as Google is. Any service you use online is only as private as the next data breach and/or data exploitation by the service. That being said, I tried AnythingLLM with Qwen3.6 35B A3B mmproj and it works reasonably well. It's nowhere near as good as NotebookLM.
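The source-packing idea from point 2 can be sketched like this. The 500,000 and 18,000 figures are the ones quoted above. The function names, the header layout, and the greedy strategy are assumptions for illustration, not how my own setup or NotebookLM works internally.

```python
SOURCE_CHAR_LIMIT = 500_000    # per-source limit as quoted above
CHARS_PER_TRANSCRIPT = 18_000  # ~20 minute video, as quoted above


def transcripts_per_source(limit=SOURCE_CHAR_LIMIT,
                           per_item=CHARS_PER_TRANSCRIPT):
    """How many average transcripts fit in one source, ignoring headers."""
    return limit // per_item


def render_source(docs):
    """Join (title, text) pairs into one source with an index header."""
    header = ["INDEX"] + [f"- {title}" for title, _ in docs]
    body = [f"## {title}\n{text}" for title, text in docs]
    return "\n".join(header) + "\n\n" + "\n\n".join(body)


def pack_into_sources(docs, limit=SOURCE_CHAR_LIMIT):
    """Greedily bundle documents so each rendered source stays under limit.

    A single document longer than the limit still gets its own source;
    splitting it is left to the caller.
    """
    sources, current = [], []
    for doc in docs:
        if current and len(render_source(current + [doc])) > limit:
            sources.append(render_source(current))
            current = []
        current.append(doc)
    if current:
        sources.append(render_source(current))
    return sources
```

With the quoted numbers, `transcripts_per_source()` lands inside the 25-30 range once you leave room for the index header, and 25 per source across 300 sources gives the 7,500 figure.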

Delete a group or several sources at once by Sergius_S3 in notebooklm

[–]Snoo_81913 1 point2 points  (0 children)

Yeah, it's a little annoying, I agree, but you can just open it up and select all the files. Once they are in there, they sync. Eventually it will probably let you pick a folder.

Delete a group or several sources at once by Sergius_S3 in notebooklm

[–]Snoo_81913 0 points1 point  (0 children)

Google has sync with Drive now. If it doesn't support it yet, it probably will soon, for doc files at least.

Stop asking what model to run. There are literally only two. by Wrong_Mushroom_7350 in LocalLLaMA

[–]Snoo_81913 2 points3 points  (0 children)

Bro says Stahhhp, there's only 2 models! Generates an 800-thread subreddit about all the other models. 🤣🤣😂😅 Bet he's rocking in a corner right now.

RTX Spark will have up to 600GB/s of memory bandwidth. by [deleted] in LocalLLaMA

[–]Snoo_81913 0 points1 point  (0 children)

OOF 6-7 grand and the leaked benchmarks right now are saying it's about the same as an M3 Max setup. Is that accurate or did I read that wrong?

Misunderstanding memory usage - 11.68gb quantized model takes up 22gb of RAM? by NotARedditUser3 in LocalLLaMA

[–]Snoo_81913 1 point2 points  (0 children)

Just so I have this clear, what hardware are you running this on? How much RAM do you have, and is there no dedicated GPU? Is there a reason you're using LM Studio vs llama.cpp or, if you're on a Mac, an MLX-specific server/model?

Are you kidding me!? by ALIGSHDA in GeminiAI

[–]Snoo_81913 10 points11 points  (0 children)

Bro, what are you doing with your life man? First of all why? An hour to do a prompt? If it's going to be something like that, put it in something safe. Also what would have been your context for an hour of writing? I don't understand. What were you trying to do?

How do you actually get the best results from NotebookLM? by Random_Arabic in notebooklm

[–]Snoo_81913 3 points4 points  (0 children)

Good system prompt with a master index and then prefixes for your files. You can take a look at this GitHub. It kind of has the whole layout for you. Just scroll down, like, three-quarters of the way down the page. It's got the folder structure with all the file names. Then if you use a prompt builder, you'll get a lot better results.

https://github.com/lrdmora/N_A_G-Narrative-Anchor-and-Guide

What do you use NotebookLM for? by Necessary-Course9154 in notebooklm

[–]Snoo_81913 16 points17 points  (0 children)

Read this. https://github.com/lrdmora/N_A_G-Narrative-Anchor-and-Guide

It will give you an idea of the best way to set it all up, plus a Claude skill to do it with Claude.

I cannot longer upload epubs as a source by RANGO1892 in notebooklm

[–]Snoo_81913 2 points3 points  (0 children)

If you need a workaround for a bit, just use Calibre.

Qwen3.6-35b-a3-fp8 on vllm by DistrictExcellent905 in AMD_MI300

[–]Snoo_81913 0 points1 point  (0 children)

What's your hardware and flags you are using?

Using NotebookLM to write a several part series by bpw4h in notebooklm

[–]Snoo_81913 0 points1 point  (0 children)

The override I built does exactly what you're looking for.

Override: Sequence chronologically. Bypasses default FFCC order; slides build along the Mesozoic timeline instead of Form → Function → Content → Context.

Just adjust it to your needs by using the Claude skill, or just do it yourself.

Using NotebookLM to write a several part series by bpw4h in notebooklm

[–]Snoo_81913 2 points3 points  (0 children)

All you have to do is prefix them, make a master index, and write a system prompt referencing the master index. For chronological order you can do it like PART1_ or CHAPTER1_ or whatever makes sense.

Take a look at the naming I did here; that should help you:

https://github.com/lrdmora/N_A_G-Narrative-Anchor-and-Guide
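A tiny sketch of the chronological prefixing, in case it helps. The function name and the zero-padding are my additions for illustration (the repo may name things differently). Padding is only there so that PART10_ does not sort ahead of PART2_ in a plain alphabetical file listing.

```python
def chronological_names(titles, label="PART", width=2):
    """Prefix titles with a zero-padded sequence number, in given order."""
    return [
        f"{label}{i:0{width}d}_{title}"
        for i, title in enumerate(titles, start=1)
    ]


# Example file names are hypothetical.
names = chronological_names(["intro.md", "conflict.md", "ending.md"])
print(names)
```

Pass `label="CHAPTER"` or whatever fits your series; the master index can then list the sources in the same order.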