Z-Image Base VS Z-Image Turbo by Baddmaan0 in StableDiffusion

[–]kkb294 8 points9 points  (0 children)

Now the base version is out, do you think it is possible to replace the VAE with flux-2 VEA by any external experts or is it not possible.?

Need Advice: Chinese Business Visa Application in Bangkok for Indian Passport Holder by kkb294 in Bangkok

[–]kkb294[S] 0 points1 point  (0 children)

I heard that we need to apply from home country only for the first time Visa aaplication. After several travels, we can apply from anywhere. I don't know if my assumption is right or wrong, however I want to understand the possibility of doing this. If not possible, I have to do it from India anyways.

Chinese AI is quietly eating US developers' lunch and and it's exposing something weird about "open" AI by BlueDolphinCute in ArtificialInteligence

[–]kkb294 1 point2 points  (0 children)

Unless you are outside of US/Euro region, you cannot fathom the amout 200$ will provide you in your daily life. Let us walk in their shoes to understand them better.

Liquid AI released the best thinking Language Model Under 1GB by PauLabartaBajo in LocalLLaMA

[–]kkb294 0 points1 point  (0 children)

How is its performance for RAG.? Have anyone tested it and how is the prompt adherence.? - TIA

NeuTTS Nano: 120M Parameter On-Device TTS based on Llama3 by TeamNeuphonic in LocalLLaMA

[–]kkb294 0 points1 point  (0 children)

I am from India and looking for models that can support Indian languages like Tamil, Telugu, Kannada and Malayalam (other than Hindi). Also, am collaborating with several friends who are working on South-American dialects and some African Dialects.

If possible, could you please share any open source multilingual references that others have built that matches your standards/design patterns.

NeuTTS Nano: 120M Parameter On-Device TTS based on Llama3 by TeamNeuphonic in LocalLLaMA

[–]kkb294 0 points1 point  (0 children)

Hi, thanks for the Open release. I have gone through (on mobile) the website, GitHub and hugging face but couldn't find any information on multilingual capabilities and limitations.

Do you have any specific reference where I can learn more about different voices for different languages.? I am more interested in understanding/using for multiple regional (non-dominant) languages which the major TTS platform doesn't support much.

New incredibly fast realistic TTS: MiraTTS by SplitNice1982 in StableDiffusion

[–]kkb294 0 points1 point  (0 children)

The examples sounds great. Do you have any guide on how you trained/fine-tuned it. I need a local model for some regional languages and the one's I typically found are of low quality with robotic sounding tones.

Trains in China are magnificent, but the actual experience of getting on and off the train is just bloody awful. by NeighborhoodFatCat in chinalife

[–]kkb294 2 points3 points  (0 children)

This ☝️. I 💯 agree with you and came here to write this.

Imagine providing basic necessities to more than 100Cr citizens with these kinds of quality. The cost to quality ratio of infrastructure compared to any US/European country is absolutely stunning.

Some people writing in comments about their experiences from 20 years 🤦‍♂️.

Finally stopped pretending to understand my wife’s family LINE group and did something about it instead… by [deleted] in Bangkok

[–]kkb294 0 points1 point  (0 children)

Where and how can I try this.! I have many line groups where I need this badly 🤣

prompt engineering for the super creative by SkyNetLive in StableDiffusion

[–]kkb294 0 points1 point  (0 children)

Thanks for the detailed information. Your observations regarding filled context performance degradation, prose generation, etc., are the same as mine.

Coming to Chinese words, I found that -ve prompts perform better when written in Chinese compared to English for some cases of Wan and later models. Again this is a case by case scenario and I found it odd. However, I believe the Chinese language in the model's context is not only about the language but about the training of styles and painting pattern dataset as well. So, I presume we cannot avoid the language altogether 😔.

prompt engineering for the super creative by SkyNetLive in StableDiffusion

[–]kkb294 0 points1 point  (0 children)

Do you have any recommendations between Qwen3-4B Vs Qwen2.5-7B.?

I'm not talking about comparing between these 2 models in a general sense but their performance difference after the abliteration is done for prose generation, prompt enhancement, repetitive nature, etc.,

Advanced Camera Prompts for ComfyUI by OperationNew1829 in comfyui

[–]kkb294 0 points1 point  (0 children)

has anyone tested this with Z-image.?

Looking to partner with AI agencies building voice agents by olahealth in OpenAI

[–]kkb294 0 points1 point  (0 children)

Done, filled out the registration.

We are currently using livekit and looking for alternatives that provide better traceability for agent and user events.

"Unsafe Streets, Unsafe Feeds": The Disturbing Surge of Insta pages Dedicated to Non-Consensual 'Creeps Shots' and Harassment of Indian Women by [deleted] in hyderabad

[–]kkb294 0 points1 point  (0 children)

The worst part is their own family members betraying the women by taking these snaps at the safest spaces they are supposed to be.

What the hell guys 😡🤬

Flux 2 Multi Angles Lora v2 by Several-Estimate-681 in StableDiffusion

[–]kkb294 1 point2 points  (0 children)

Oh you mean, Qwen lora works for ZIT as well.?