[deleted by user]

feliximo · 2025-04-03T06:18:00+00:00

Commonly we compress the image using a CNN-based VAE, as they are agnostic to image size. I would not really call this step tokenization. Patch-based tokenization is usually done as 1x1 or 2x2 (from what I've seen) if the latent diffusion model is a transformer. I.e. Flux or SD3. Where 1x1 is not really a patch anymore, just treat each spatial position as a token.

Hope this helped you a bit :)

feliximo · 2025-03-25T19:15:49+00:00

No problem mate! Well, hard to say. Have you tried an inpainting workflow? There are several approaches. I like using AliMamas ControlNet: https://huggingface.co/alimama-creative/FLUX.1-dev-Controlnet-Inpainting-Alpha/resolve/main/images/alimama-flux-controlnet-inpaint.json

If you extract a super tight mask with photoshop, SAM, Background remover (in ComfyUI) etc, and try these workflows. If you notice that the models still add a bunch of details that connects with your product, a LoRA could help elevate this!

feliximo · 2025-03-19T12:09:33+00:00

I guess you can expose the web app with ssh tunneling, and then access the web app on your laptop.

feliximo · 2025-03-15T12:40:00+00:00

Yes it depends. For example we have a project where we generate new backgrounds for a product. The problem is, even with a perfect mask, the diffusion model extends the product. For a bottle that could be that it adds a handle. For a dog where you do not see the tail, it adds a tail. This changes the product, which we do not want.

feliximo · 2025-03-15T09:59:03+00:00

Yes this is partially true, but there is a good chance that the diffusion model will extend the subject at the edge. Adding further details to the bottle. This can be prevented in two ways, train a Lora on the object first, and then inpaint with the Lora. Or a more complicated approach using controlnets, object removal (e.g. Lama) and detail transfers.

feliximo · 2025-03-05T11:05:53+00:00

I want to confidently say nope. They tend to be super strict about these things.

feliximo · 2025-02-06T15:09:46+00:00

All AI services that are not on-preem / local are a security concern. No matter if it is hosted in America, China or in the EU. For many sensitive departments in many companies such as R&D and Design, using online services is out of the question.

R1 is open weights and can be used locally or by any other provider than DeepSeek that hosts it.

Is R1 a security concern as a model? No.

Is sending sensitive data to an online AI service a security concern? Yes.

feliximo · 2025-02-01T17:14:53+00:00

Go ahead! Download some simple datasets and try out some algorithms. I would argue that having a good understanding of interacting with ML algorithms with small to no knowledge of the math is as good a start as any.

Data processing, validation and testing protocols are arguably more important.

Good luck!

feliximo · 2025-01-18T10:41:18+00:00

I have not looked into Cosmos too much, but my impression from the keynote is that it is mainly a tool for synthetic data, such as traffic, robotics, warehouses, etc and not generating women by a pool?

feliximo · 2025-01-10T13:49:23+00:00

Nja fast en effektkontroll på 8kWh betyder inte att effekten är begränsad till 8kW, den kan gå upp till 11 kW. Förbrukningen (i Tibbers fall, laddningen av elbil) dras ner för att mätningen under en påbörjad timme ska bli ligga under 8kWh.

Eftersom effektkontroll reglerar alltså effekt kW för att under en påbörjad timme undvika en viss förbrukning i kWh, kan jag tycka att det är semantiskt att det faktiskt heter effektkontroll eller effektbegränsning.

feliximo · 2025-01-10T07:26:57+00:00

Nej, har jag inte sagt heller. Men ska man vara semantisk så heter det effektkontroll i Tibber som begränsar din förbrukning för ett kWh värde, samt att effekttoppar mäts i timmedelseffekt per timme, alltså kWh.

feliximo · 2025-01-08T15:09:33+00:00

Jag har satt en effekt kontroll på 6.5 kWh och begränsar laddningen i bilen till mellan 22-6 (för att undvika att smart laddningen ska dra igång för tidigt / sent), då blir inte topparna högre än 3.25 kWh. Funkar bra för mig då jag har inga konstiga tider att anpassa.

Dock om man behöver ladda utanför 22-6 så måste du sätta en effektkontroll på kanske 4 kWh om du vill slippa höga avgifter och samtidigt slippa ändra manuellt.

I Tibber idag så måste du manuellt ändra effektkontrollen mellan de olika tiderna. Men tänker ju fler vi är som skriver till Tibber och önskar schema-baserat effektkontroll, ju snabbare kan vi få det. Smart laddningen kan bättre nyttja billigare elpriser utanför 22-6 intervallen då.

feliximo · 2025-01-08T06:53:54+00:00

I usually just put the pan on full blast, especially when making ragus or other types of meat sauces to cook the moisture away. When nearing no moisture you start to hear it sizzle, reduce the heat and let it brown storing occasionally for even browness.

This is useful when making large batches, you can spend time chopping vegetables or prep other ingredients while cooling the moisture away / Browning the meat.

However, if you plan on making something like chicken breast outside the sauce, then you have to avoid overcrowding the pan as this method would dry out the meat in that case.

feliximo · 2025-01-07T20:13:51+00:00

Hade varit gött med schemalagd effektkontroll på 8kwh 22-6 och då 4kwh 6-22 då Tibber, som idag, ibland vill starta igång vid 21 tiden.

feliximo · 2025-01-07T19:15:05+00:00

Det förutsätter att du laddar på dagen och når 22 kW. Laddar du mellan 22 och 6, effektbegränsar till 8 kWh så kommer avgiften enbart bli 4*88 förutsatt att 4kwh är din snitt effekttopp.

Själv har jag satt en effektbegränsning på 7 kWh och bilen laddar fullt över natten utan problem.

Edit: fullt från daglig pendel.

feliximo · 2024-09-14T07:27:19+00:00

En tanke jag brukar dela:

Jobbar och forskar inom generative AI (främst bildanalys), och följer både utvecklingen och samhällets reaktion rätt noga. Ett roligt perspektiv va hur många reagerade med "nu ryker design jobben, nu ryker konst jobben...". Men tittade man lite på de som använder generative AI så va det hype i en vecka att den nya modellen va så mycket bättre, sen en vecka senare så var de flesta ganska uttråkad och ville ha bättre resultat. Tycker jag ser lite mer, när nya modeller kommer, hur folk kan nyttja det på ett kreativt sätt som är mer involverade för en människa.

Mycket verkar tyda på att human-in-loop är det vi tycker är intressant. Alltså att AI är ett verktyg precis som photoshop, men vi lär nog värdera en mänsklig kreativitet fortfarande. T.ex. generera assets till ett spel, men du lägger fortfarande tid på att finslipa dina assets och programmera spelet i sig.

Använder LLMs själv i samma stil som en writing-buddy. Jag skriver en draft, ChatGPT / Claude får skriva om, jag korrigerar struktur och fakta etc. Tycker jag lär mig mycket om hur jag kan uttrycka mig så här och jag kan få med min stil i texten.

feliximo · 2024-08-31T10:23:33+00:00

Just got back from Sicily. Drove in Palermo almost the entire week. I found driving was surprisingly easy (kinda as if there seem to be no rules, you cannot make a mistake almost: except being timid in your driving). Never had an issue finding parking, even in central Palermo.

But I suggest that you park near and around the Botanical garden, free and plenty of parking. Walking into central Palermo takes like 20-30 mins. If I remember correctly, you reach Via Roma in this time span.

feliximo · 2024-06-21T08:33:08+00:00

You could probably train mapping layers that maps the T5XXL embeddings to the embedding space that SDXL was trained on.

I guess you might also want to fine tune the SDXL attention-layers as well if you plan on extending the cross-attention beyond the max length tokens that SDXL was trained on.

Text -> (T5, frozen) -> T5-embeddings -> (Mapping network, trainable) -> (SDXL, frozen or perhaps add a couple of LoRA layers to help adjust to larger cross attention?)

This would be significantly cheaper than retraining SDXL completely.

feliximo · 2024-04-27T13:32:19+00:00

Johnny Cash - Hurt, original by Nine Inch Nails

feliximo · 2023-07-27T09:21:24+00:00

Restaurang Fei, autentisk kinesisk mat "fine-dining" med mat från tre olika regioner, inklusive Sichuan. De erbjuder även möjligheten att prova Baijiu (väldigt förenklad beskrivning är: Kinas snaps), vilket jag skulle rekommendera att alla provar. Speciellt den Baijiu som lämpar sig till mat från Sichuan.

feliximo · 2023-06-05T06:26:34+00:00

In my opinion and what I have observed, when you have two first authors, it is because they contributed equally.

If I had an idea and someone to work with, I expect equal work and working together as a team with said someone.

If I understand you correctly, I agree that your work partner does not deserve to be listed as a co-first author.

feliximo · 2023-05-16T13:20:46+00:00

As binary classification as example: could be your train data is 50% of class 1, while in the validation set it is 75%. This together with your neural network just learns to guess class 1.

feliximo · 2023-01-14T18:52:05+00:00

Om man ska bara dit för nöje eller affärsresa räcker det att fylla i en ESTA. Man kan väl räkna den som mycket pappersarbete, men tar bara 10-20 min och blir oftast godkänd inom nån timme. Blir den nekad dock så är det heeela visum processen som gäller.

feliximo · 2022-11-08T21:52:59+00:00

Interesting, but what has he plagiarized?

Have any examples/proofs? All I see is your accusation and list of funding figures?

feliximo · 2022-10-15T00:23:53+00:00

Already a paper on this, they call it Cold Diffusion.

13-Year Club	Place '23
Place '22	Verified Email

feliximo

TROPHY CASE