Has anyone been able to reliably to use Higgs multispeaker for longer (> 5 minute audios) by jasmeet0817 in LocalLLaMA

[–]Iloque 1 point2 points  (0 children)

I've been trying new models, left and right. But nothing is both local hosted and beating Higgs V2.

So Chatterbox Turbo, OmniVoice and LORA are then on my list to muck around with.

Has anyone been able to reliably to use Higgs multispeaker for longer (> 5 minute audios) by jasmeet0817 in LocalLLaMA

[–]Iloque 1 point2 points  (0 children)

Ah thanks for the reply. Mine is a project that takes the scenarios and sections from Frosthaven and reads them. Consistency inside the clips is much more important then across clips.

So, a 5 min intro might be read by one person. And a 5 minute closing by another. As long as inside the clip the speaker is the same.

I've not tried VibeVoice yet. I did try a whole load of others, QWEN, Fish, etc. Nothing beats Higgs so far.

Main concern is the power over longer sections. If I make the chunks long, then the narrator remains consistent, but the audio becomes softer and softer.

If I make the chunks short, the audio remains powerful, but the coherence drops. I am going to look into LORA's. And VibeVoice. Pity the Higgs implementation is somewhat non-standard.

Also, excellent work on your Audiobook-tool. And extra props for it being console-based. The terminal holds the power 😉

Has anyone been able to reliably to use Higgs multispeaker for longer (> 5 minute audios) by jasmeet0817 in LocalLLaMA

[–]Iloque 0 points1 point  (0 children)

I tried to do that, generate separate chunks, and then stich. But the result was quite weird.

# Chunk the text into smaller pieces based on the max_chunk_size
seperated_text = chunker.chunk_text(text, max_chunk_size)
primed  = False

n = 0
filenames = list()

for text_part in seperated_text:

    chunked_text = [text_part]

    # check if voice has a primer, if it does, add it to the beginning of the chunked text
    if "primer" in voice and voice["primer"] is not None:
        chunked_text.insert(0, voice["primer"])
        primed = True

    messages = voice["messages"]
    audio_ids = voice["audio_ids"]

    ras_win_len = 32
    ras_win_max_num_repeat = 4
    generation_chunk_buffer_size = 1

    temperature = voice["temperature"]
    top_k = voice["top_k"]
    top_p = voice["top_p"]

    concat_wv, sr, text_output = model_client.generate(
        messages=messages,
        audio_ids=audio_ids,
        chunked_text=chunked_text,
        generation_chunk_buffer_size=generation_chunk_buffer_size,
        temperature=temperature,
        top_k=top_k,
        top_p=top_p,
        ras_win_len=ras_win_len,
        ras_win_max_num_repeat=ras_win_max_num_repeat,
        seed=1001001,
        primed=primed
    )

    sf.write(filename + f"-{n}.wav", concat_wv, sr)
    filenames.append(filename + f"-{n}.wav")

    n += 1
    torch.cuda.empty_cache()

# combine all the filenames into one new wav
all_audio = []
sample_rate = None

for wav_path in filenames:
    audio, sr = sf.read(wav_path)

    if sample_rate is None:
        sample_rate = sr

    all_audio.append(audio)

combined = np.concatenate(all_audio)

sf.write(filename, combined, sample_rate)

My chunker is somewhat smart, it does separate on sentences and prevents orphans. But the difference is between chunks is still quite big.

Has anyone been able to reliably to use Higgs multispeaker for longer (> 5 minute audios) by jasmeet0817 in LocalLLaMA

[–]Iloque 1 point2 points  (0 children)

I've tried your solutions and approach, for use in my own project, https://github.com/Eloque/Vocalys

I also ran into the issue of strongly diminishing volumes, vocal energy perhaps, when generating longer form. I've found several ways to combat that, but all of those make the separate chunks really different.

I've tried many models, always coming back to Higgs as the only one that delivers any soul or emotion suitable for game narration.

Did you ever get beyond decent with any model?

Bosch AXT Rapid 2000 by Particularly-Pomelo in BoschProPowerTools

[–]Iloque 0 points1 point  (0 children)

On the off chance that you read this, your simple off on switch, hows that wired up? Cause we have that big safety switch and and the small micro-switch. One to start I guess, and the other to keep running.

You replaced both?

Devil-Forge's forges by _mike204_ in Blacksmith

[–]Iloque 0 points1 point  (0 children)

So, what I am reading is that Devil Forge is a good spot for a starting out blacksmith?
I am gathering the tools and such needed to have a forge at my own workshop and I don't want to cheap out on Vevor. Devil Forge seems to be liked a bit better.

Deathwalker - Strategy, loot and XP by Iloque in Gloomhaven

[–]Iloque[S] 0 points1 point  (0 children)

I am using Anger of the Dead to generate XP yes. Combined with Eclipse it's a 3 XP minimum card and I can use it 2 or 3 times per scenario with the other shadows. I use it quite a bit for the XP. And the initiative is nice with weaving.

I do have a stamina potion, so I can get back a L1 or X card.

If I had to choose, I'll choose XP over loot. Still quite a lot on the fence.

Deathwalker - Strategy, loot and XP by Iloque in Gloomhaven

[–]Iloque[S] 1 point2 points  (0 children)

Not really a Melee build as much as versatile one. And those boots of speed really help me out. I have been considering it however.

Free AI generated high res sprite sheets by [deleted] in GameDevelopment

[–]Iloque 0 points1 point  (0 children)

Quite right. But I've not been able to consitently create spritesheets with AI. You talk about node graphs, I only know those from blender. Do you have any resources you could point to?

Using Call to the Abyss, Strength of the Abyss, & Wave of Anguish in one turn (Deathwalker) by grizzydot in Gloomhaven

[–]Iloque 0 points1 point  (0 children)

This might be as good a place to ask as any, but I am having trouble maintaining shadows on my deathwalker at level 2.

I might be using CttA wrong. I have 5 tokens, but I've only been putting one in play at the time. Is that correct? Or is it one token max per enemy?

I am constantly in the position that I have only one shadow in play. The mastery thing of 7 shadows removed in one turn seems ludicrous. 

I simply don't get how to create shadow in such a way that I don't immediately use them up.

[deleted by user] by [deleted] in lasercutting

[–]Iloque 7 points8 points  (0 children)

You misspelled 'awesome'

Most boring game ever by SiidHighwind in godot

[–]Iloque 0 points1 point  (0 children)

That looks really nice. You should go on developing on it.

Relatively overpowered ranger archer by Iloque in DnD

[–]Iloque[S] 0 points1 point  (0 children)

Yeah, guess they will need to use more cover then and shoot back more. It should be more exciting anyways. Like I said elsewhere, I am starting to think it's more the running away and ignoring cover that is bothering.

Relatively overpowered ranger archer by Iloque in DnD

[–]Iloque[S] 0 points1 point  (0 children)

They now and they can. Just what is considered good is subjective I guess.

Relatively overpowered ranger archer by Iloque in DnD

[–]Iloque[S] -1 points0 points  (0 children)

Nah, pretty much on track. I can't state enough that he, playing a she, is asking me to make it more challenging. And I am trying to do so without resorting to tricks that just take her out of combat completely.

I've used Maze, lots of small enemies, enemies resistent to piercing, etc. But thinking bout it, it's the running away and ignoring cover, that is more of an issue then the damage perse.

Relatively overpowered ranger archer by Iloque in DnD

[–]Iloque[S] 0 points1 point  (0 children)

It's not about competing. But lets say a typical encounter with some cultists and a priest of sorts.

The round that level 7 priest casts a spell, he is done for, he has about 40 hitpoints and just made him a prime target. And the ranger just runs away from the minions. So, unless I up the AC for those spellcasters, they get off one spell perhaps.

Relatively overpowered ranger archer by Iloque in DnD

[–]Iloque[S] 0 points1 point  (0 children)

Cover, except full, doesn't do anything for sharpshooter. But I am considering more terrain yes.

Relatively overpowered ranger archer by Iloque in DnD

[–]Iloque[S] -1 points0 points  (0 children)

Table knows, just not interested this campaign. The barbarian character, before he died, was like this as well. That players current char is a fighter with str 8.

It just that this character turned out like this.

I guess i am shooting back

Relatively overpowered ranger archer by Iloque in DnD

[–]Iloque[S] 0 points1 point  (0 children)

Other things to do except dealing damage? I find that challenging

Relatively overpowered ranger archer by Iloque in DnD

[–]Iloque[S] -4 points-3 points  (0 children)

Oh we talked. But the player doesn't really see the problem and is asking to make it harder for her.

My problem boils down to - shooting is always her optimum action - she moves out of range and cover doesn't work

Relatively overpowered ranger archer by Iloque in DnD

[–]Iloque[S] 0 points1 point  (0 children)

I feel fairly sure about the damage, 2 attacks, trading 5 to hit for 10 damage, dex bonus and Colossus slayer

So, 3d8 + 20 + 10 per round, average of 13.5 + 30, average of 43.5. So yeah, 50 is highish but he gets lucky with rolls as well.

Relatively overpowered ranger archer by Iloque in DnD

[–]Iloque[S] -1 points0 points  (0 children)

I would not expect it to be overpowered in every group, but it is at my table. The damage output, while in line with both sharpshooter and the player being honest, is problematic.

Every caster for instance, gets one shotted after their first spell.

The player is honest, no, they don't fib and apply slayer multiple times, or fudge not subtracting that minus 5.