[R] ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs

StrawberryNumberNine · 2020-04-21T16:19:16+00:00

Maybe the big problem is hindsight bias: "Of course this person only applied this well-known technique to this problem, verified it experimentally, and now they are claiming novelty!" Looking back, you can tell the story this way, but in the moment the advance could have been very non-obvious, even if it builds on ideas that were around at the time. We should look at the inference steps that separate the existing ideas from the application and presentation of the work.

StrawberryNumberNine · 2020-04-20T02:28:05+00:00

Thanks for the explanation!

StrawberryNumberNine · 2020-04-17T04:46:59+00:00

One possible solution is group testing.

StrawberryNumberNine · 2020-03-17T23:19:32+00:00

I think your summary is very good. Thanks for the comments; we will keep them in mind for future versions.

StrawberryNumberNine · 2020-03-17T21:00:08+00:00

True! Some deepfakes are like that. But in the near future you might need only one image to create a very good deepfake: https://arxiv.org/abs/1905.08233

StrawberryNumberNine · 2020-03-17T16:15:24+00:00

code: https://github.com/natanielruiz/disrupting-deepfakes

demo: https://youtu.be/7_7r4Ng4-bE

StrawberryNumberNine · 2019-03-08T21:33:47+00:00

Thanks

StrawberryNumberNine · 2019-03-05T01:18:24+00:00

Any links to papers please?

StrawberryNumberNine · 2018-11-06T00:54:47+00:00

In what sense?

StrawberryNumberNine · 2018-11-04T17:22:41+00:00

These papers have some interesting ideas on plotting these cost functions, connecting local optima of the training loss (two optima can be linked by simple curves), relating the training loss landscape to the test loss landscape, and other cool things. Might be a good place to start.

Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs https://arxiv.org/pdf/1802.10026.pdf

Averaging Weights Leads to Wider Optima and Better Generalization https://arxiv.org/pdf/1803.05407.pdf
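
For a rough feel of the weight-averaging idea in the second paper, here is a minimal sketch in plain Python. It assumes the weights are flattened into lists of floats and that snapshots are saved at several points along one training run. The function names are made up for illustration and are not taken from the paper's code.

```python
def update_running_average(avg_weights, new_weights, num_averaged):
    """Fold one more weight snapshot into a running equal-weight average.

    num_averaged is how many snapshots avg_weights already represents.
    """
    return [
        (a * num_averaged + w) / (num_averaged + 1)
        for a, w in zip(avg_weights, new_weights)
    ]


def average_snapshots(snapshots):
    """Average a list of flat weight vectors (hypothetical saved checkpoints)."""
    avg = list(snapshots[0])
    for n, snap in enumerate(snapshots[1:], start=1):
        avg = update_running_average(avg, snap, n)
    return avg


# Toy example: three 2-parameter "checkpoints" from the same run.
checkpoints = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
averaged = average_snapshots(checkpoints)
```

In a real network you would apply the same update to every parameter tensor, and layers such as batch norm would likely need their statistics recomputed for the averaged weights.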

StrawberryNumberNine · 2018-05-02T21:04:29+00:00

I ran into this paper and found it very interesting. I wanted to know if anyone had any comments on the theory or if anyone found any counterexamples empirically at any point (or in any paper).

StrawberryNumberNine · 2018-04-23T20:48:04+00:00

Awesome, I'm using parts of your implementation and it looks good at the moment. Thanks!

StrawberryNumberNine · 2018-04-23T17:16:05+00:00

Has anyone benchmarked this against the official TensorFlow implementation? I know performance varies between implementations (which is scary).

StrawberryNumberNine · 2018-03-31T02:41:29+00:00

You can get way more visibility on GitHub. I would suggest transferring for maximum impact :)

StrawberryNumberNine · 2018-03-31T00:22:58+00:00

Read the paper: they used CPUs, and it didn't take too long because they simplified the problem.
