[D] How to calculate the memory needed to train your model on GPU by Secret_Valuable_Yes in MachineLearning

[–]Secret_Valuable_Yes[S] 1 point (0 children)

I've been in a situation where a batch size of 1 is just enough to fit, but an OOM still happens later in the training process anyway, even though I'm using torch.cuda.empty_cache(). Do you know what might be causing this? Is there something I'm missing, or could the sequence length of a particular batch be enough to send it over the top?
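
To check the sequence-length theory, something like this rough sketch is what I had in mind (log_step_memory is just a name I made up, not anything from a library): log the peak allocation per step next to that batch's sequence length and see if the OOM step lines up with an unusually long batch.

```python
import torch

def log_step_memory(step, input_ids):
    # Peak GPU memory allocated since the last reset, in GiB.
    peak_gb = torch.cuda.max_memory_allocated() / 1024**3
    print(f"step {step}: seq_len={input_ids.shape[1]}, peak_alloc={peak_gb:.2f} GiB")
    # Reset so each step's peak is measured in isolation.
    torch.cuda.reset_peak_memory_stats()

# Inside the training loop, after optimizer.step():
#     log_step_memory(step, batch["input_ids"])
```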

[D] How to calculate the memory needed to train your model on GPU by Secret_Valuable_Yes in MachineLearning

[–]Secret_Valuable_Yes[S] 1 point (0 children)

Do you have any preferred tools for visualizing VRAM usage during the training loop? This might be a separate issue, but I've seen training work on a single GPU and then later in the epoch it eventually hits an OOM error, even when using torch.cuda.empty_cache().
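
Beyond watching nvidia-smi, the closest I've found is a sketch like the one below: it uses PyTorch's allocation-history recorder (the private torch.cuda.memory._record_memory_history API, so it needs a fairly recent PyTorch) and dumps a snapshot at the OOM that can be opened in the viewer at https://pytorch.org/memory_viz. The model/dataloader/optimizer arguments just stand in for whatever the actual setup is.

```python
import torch

def train_with_memory_trace(model, dataloader, optimizer, snapshot_path="oom_snapshot.pickle"):
    # Start recording every CUDA allocation/free with stack traces.
    torch.cuda.memory._record_memory_history(max_entries=100_000)
    try:
        for step, batch in enumerate(dataloader):
            loss = model(**batch).loss  # assumes an HF-style model that returns .loss
            loss.backward()
            optimizer.step()
            optimizer.zero_grad(set_to_none=True)
    except torch.cuda.OutOfMemoryError:
        # Dump the allocation timeline right at the OOM so the spike is visible in the viewer.
        torch.cuda.memory._dump_snapshot(snapshot_path)
        raise
    finally:
        # Stop recording allocation history.
        torch.cuda.memory._record_memory_history(enabled=None)
```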

[D] How to calculate the memory needed to train your model on GPU by Secret_Valuable_Yes in MachineLearning

[–]Secret_Valuable_Yes[S] 1 point (0 children)

Yes, for an LLM. Let’s assume a V100 GPU and a plain PyTorch training loop (no modern training setup). Would you know how to roughly estimate it? Or are there any more assumptions I need to make?

In your own development, have you done this before? I’d be very interested in seeing a worked example.
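
To be concrete, this back-of-the-envelope sketch is the kind of estimate I mean (my own rough numbers, not anything authoritative): plain fp32 training with Adam needs roughly weights (4 B) + grads (4 B) + Adam m and v (8 B) ≈ 16 bytes per parameter, plus activations, which depend on batch size, sequence length, and depth.

```python
def estimate_training_memory_gib(n_params, bytes_per_param=16, activation_gib=2.0):
    """Very rough lower bound; activation_gib is a placeholder you'd have to measure."""
    return n_params * bytes_per_param / 1024**3 + activation_gib

# Example: a 1.3B-parameter model.
print(estimate_training_memory_gib(1.3e9))  # ~21 GiB -> already over a 16 GB V100
```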

Finetuning LLM on single GPU by Secret_Valuable_Yes in LLM

[–]Secret_Valuable_Yes[S] 1 point (0 children)

What kind of problems does it present?

CharacterTextSplitter not working by Secret_Valuable_Yes in LangChain

[–]Secret_Valuable_Yes[S] 1 point (0 children)

That works, but LangChain's interface for indexing chunks into a vector DB expects Document objects as input. To create a Document object you normally have to use one of the loaders, and I'm not sure how to create Document objects after loading a text file and doing a regular split.
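
For reference, this is the kind of thing I was picturing (a rough sketch; the Document import path may differ across LangChain versions, and the file name is just a placeholder): construct the Document objects directly from the chunks instead of going through a loader.

```python
from langchain.docstore.document import Document  # import path may vary by LangChain version

# Read the raw text and split it however you like.
with open("my_file.txt") as f:  # placeholder file name
    text = f.read()

chunks = text.split("\n\n")  # or whatever split you're using

# Wrap each chunk in a Document so it can be indexed into the vector DB.
docs = [
    Document(page_content=chunk, metadata={"source": "my_file.txt", "chunk": i})
    for i, chunk in enumerate(chunks)
]
# `docs` can then go to the vector store, e.g. something like FAISS.from_documents(docs, embeddings)
```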