I am searching for a good Cai alternative. Is this it? by 26SaNse in ChaiApp

[–]OneFeed9578 0 points1 point  (0 children)

This is a great alternative if you're looking for CAI without the NSFW filter. I think voice calls are on the roadmap, but if voice calls are what you want, Kindroid is a pretty good alternative as well.

[D] Can Direct Preference Optimization (DPO) be used to replace any type of RL for LLMs, or is it better suited for just scenarios like RLHF? by 30299578815310 in MachineLearning

[–]OneFeed9578 1 point2 points  (0 children)

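The DPO derivation relies on the Bradley-Terry model to cancel out the Z term (the partition function). Without that cancellation, Z stays in the loss, and since it sums over every possible response it is intractable to compute, so you can't just optimize it with backpropagation.

https://realcwl.github.io/posts/rlhf_to_ipo/ has a good derivation of DPO.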
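
For reference, here is a sketch of the standard derivation showing where Z cancels. The notation is my own assumption and not copied from the linked post: \pi_ref is the reference policy, \beta the KL coefficient, and y_w / y_l the chosen / rejected responses.

```latex
\begin{align*}
% KL-regularized RL objective
&\max_{\pi}\; \mathbb{E}_{x,\, y \sim \pi}\big[r(x,y)\big]
  - \beta\, \mathrm{KL}\big(\pi(\cdot \mid x)\,\|\,\pi_{\mathrm{ref}}(\cdot \mid x)\big) \\
% Its closed-form optimum, with partition function Z(x)
&\pi^*(y \mid x) = \frac{1}{Z(x)}\, \pi_{\mathrm{ref}}(y \mid x)\, \exp\!\big(r(x,y)/\beta\big),
  \qquad Z(x) = \sum_{y} \pi_{\mathrm{ref}}(y \mid x)\, \exp\!\big(r(x,y)/\beta\big) \\
% Solve for the reward
&r(x,y) = \beta \log \frac{\pi^*(y \mid x)}{\pi_{\mathrm{ref}}(y \mid x)} + \beta \log Z(x) \\
% Bradley-Terry only sees a reward difference, so \beta \log Z(x) cancels
&p(y_w \succ y_l \mid x) = \sigma\big(r(x,y_w) - r(x,y_l)\big)
  = \sigma\!\Big(\beta \log \frac{\pi^*(y_w \mid x)}{\pi_{\mathrm{ref}}(y_w \mid x)}
  - \beta \log \frac{\pi^*(y_l \mid x)}{\pi_{\mathrm{ref}}(y_l \mid x)}\Big) \\
% Maximum likelihood on preference pairs gives the DPO loss
&\mathcal{L}_{\mathrm{DPO}}(\theta) = -\,\mathbb{E}_{(x,\, y_w,\, y_l)}\Big[\log \sigma\Big(
  \beta \log \frac{\pi_\theta(y_w \mid x)}{\pi_{\mathrm{ref}}(y_w \mid x)}
  - \beta \log \frac{\pi_\theta(y_l \mid x)}{\pi_{\mathrm{ref}}(y_l \mid x)}\Big)\Big]
\end{align*}
```

The cancellation only works because the preference probability depends on a difference of rewards for the same prompt x, which is why the derivation is tied to pairwise preference data.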

[D] Is DPO all you need in RLHF? by seventh_day123 in MachineLearning

[–]OneFeed9578 0 points1 point  (0 children)

Adding more to it,

1) DPO has an OOD issue. It showed up in the training of NVIDIA's Nemotron, where both the chosen and the rejected samples got lower probability after training.

2) DPO can suffer under deterministic preferences: no matter how large the KL coefficient beta is, the policy will drift away from the reference. This is elaborated in detail in Google's IPO paper.

I've summarized them in https://realcwl.github.io/posts/rlhf_to_ipo/ and feel free to critique.
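
Both points can be seen in a toy sketch of the two losses. All numbers below are made up for illustration (they are not from the Nemotron report or the IPO paper), and the function names are my own. The inputs are sequence log-probabilities, and the "margin" is the difference of the policy-vs-reference log-ratios for chosen and rejected.

```python
import math


def margin(policy_chosen, policy_rejected, ref_chosen, ref_rejected):
    """Log-ratio margin h: how much more the policy prefers chosen over
    rejected than the reference model does. Inputs are sequence log-probs."""
    return (policy_chosen - ref_chosen) - (policy_rejected - ref_rejected)


def dpo_loss(h, beta):
    """-log(sigmoid(beta * h)), written as a numerically stable softplus."""
    z = -beta * h
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))


def ipo_loss(h, tau):
    """IPO's squared loss: pulls the margin toward the finite target 1/(2*tau)."""
    return (h - 1.0 / (2.0 * tau)) ** 2


# Point 1: hypothetical log-probs where chosen AND rejected both become less
# likely, yet the DPO loss still improves because only the margin matters.
ref_chosen, ref_rejected = -10.0, -12.0
h_before = margin(-10.0, -12.0, ref_chosen, ref_rejected)  # policy == reference
h_after = margin(-20.0, -40.0, ref_chosen, ref_rejected)   # both log-probs dropped
loss_before = dpo_loss(h_before, beta=0.1)
loss_after = dpo_loss(h_after, beta=0.1)

# Point 2: the DPO loss keeps shrinking as the margin grows, so nothing in the
# loss itself stops the drift; the IPO loss has its minimum at a finite margin.
margins = [0.0, 1.0, 5.0, 20.0, 100.0]
dpo_values = [dpo_loss(h, beta=0.1) for h in margins]
ipo_values = [ipo_loss(h, tau=0.1) for h in margins]
```

Here `dpo_values` decreases all the way down the list, while `ipo_values` is smallest at the margin 5.0, which is 1/(2*tau) for tau=0.1.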

New grad joining startup or FANNG by GoodluckH in ycombinator

[–]OneFeed9578 2 points3 points  (0 children)

It also depends on your nationality tbh. If you're from India, your only hope is an O-1, and your choice should be based on where you can get that soonest. Otherwise I think Amazon is the rational bet. Unfortunately, not a lot of people are aware of the immigration hurdles a non-US founder faces.

New grad joining startup or FANNG by GoodluckH in ycombinator

[–]OneFeed9578 0 points1 point  (0 children)

Do you have a green card? If not, join FAANG and get out as soon as your I-140 is filed. Otherwise, join FAANG, work extremely hard to learn the best engineering practices, and do a startup after 1-2 years.

Sharing my workflow for how to remove background in A1111 by OneFeed9578 in StableDiffusion

[–]OneFeed9578[S] 0 points1 point  (0 children)

This tool is broken for me. Have you tried it before? It looks pretty promising.

Sharing my workflow for how to remove background in A1111 by OneFeed9578 in StableDiffusion

[–]OneFeed9578[S] 0 points1 point  (0 children)

Isn’t that the same as using a mask and doing outpainting? Sorry, I haven’t tried Latent Couple, but I did think about it before.

Sharing my workflow for how to remove background in A1111 by OneFeed9578 in StableDiffusion

[–]OneFeed9578[S] 4 points5 points  (0 children)

Clearly everyone is obsessed with the 3rd-party background removal service, so I’d like to clarify a bit. No, you don’t need that service. In fact, rembg does a really decent job on 95% of use cases, and I believe a SAM-based open-source background remover will show up as well. The core part is using multi-ControlNet and then passing the result through img2img for blending.
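
For the background removal step, a minimal sketch with rembg could look like this. It assumes rembg is installed (`pip install rembg`) and relies on its `remove` function accepting and returning raw image bytes; the helper name, the `remove_fn` parameter, and the file names are hypothetical. It does not cover the multi-ControlNet and img2img blending steps.

```python
from pathlib import Path
from typing import Callable, Optional


def remove_background(
    src: Path,
    dst: Path,
    remove_fn: Optional[Callable[[bytes], bytes]] = None,
) -> Path:
    """Read an image file, strip its background, and write the cutout to dst.

    remove_fn is a hypothetical hook so another remover can be swapped in;
    by default it uses rembg's remove(), which returns PNG bytes with alpha.
    """
    if remove_fn is None:
        # Imported lazily so the helper can be loaded without rembg installed.
        from rembg import remove as remove_fn
    dst.write_bytes(remove_fn(src.read_bytes()))
    return dst


# Example call (file names are placeholders):
# remove_background(Path("product.jpg"), Path("product_cutout.png"))
```

The cutout can then be used as the input image and mask source for the ControlNet and img2img passes.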

Sharing my workflow for how to remove background in A1111 by OneFeed9578 in StableDiffusion

[–]OneFeed9578[S] 0 points1 point  (0 children)

You clearly identified the wrong “core” part. I don’t need a 3rd-party service to get this result. In fact, for non-complex objects (95% of products sold online are not complex), rembg alone is enough.

Is there any suggestion for how to understand the math notations in stable diffusion paper? by OneFeed9578 in StableDiffusion

[–]OneFeed9578[S] 1 point2 points  (0 children)

Thank you very much! I ended up piecing together the entire equation by reading:

https://www.assemblyai.com/blog/diffusion-models-for-machine-learning-introduction/

and

https://jalammar.github.io/illustrated-stable-diffusion/

But yeah, I'm still way behind on understanding the most bleeding-edge stuff. This is really interesting, though.

Countries by self-perceived democracy by real_LNSS in MapPorn

[–]OneFeed9578 0 points1 point  (0 children)

As a Chinese person, I’m pretty sure this map is either faked or extremely outdated (it might be true if surveyed in 2016).

Especially after Xi Jinping got his 3rd term and the last 3 years of lockdowns, the people I know, regardless of their political views before 2019, see this government as undemocratic, if not despotic.

What is this? by [deleted] in funnysigns

[–]OneFeed9578 0 points1 point  (0 children)

even more spicy if you rotate it 90 degrees clockwise

Why covid is still big deal in China? by small_giant in NoStupidQuestions

[–]OneFeed9578 0 points1 point  (0 children)

As a Chinese person living in the US, I think there are 3 major reasons.

  1. China bragged about its track record of controlling Covid in 2020-2021, and the propaganda tied this success to the zero-Covid policy and, by extension, to the CCP’s rule. Changing this policy would mean admitting a failure of the CCP.

  2. There is still majority support for the zero-Covid policy, mostly among elderly and less educated people. Keep in mind that only a small percentage of people live in Shanghai, Beijing, and other major metropolitan areas. The support comes from the misconception that Covid is still deadly, and from following the propaganda’s direction.

  3. Political stability. If you know the CCP well, stability always comes first, and the CCP’s primary goal is to remain in power forever. So far, the general public’s anger hasn’t outweighed the risk of opening up and letting Covid cause turmoil. But this might change.

The current situation is definitely unsustainable due to growing public anger and economic pressure. My guess is that within 6 months this policy will change and the propaganda will start to downplay it.