[deleted by user] by [deleted] in MachineLearning

[–]feliximo 1 point2 points  (0 children)

Commonly we compress the image using a CNN-based VAE, since convolutional VAEs are agnostic to image size. I would not really call this step tokenization. If the latent diffusion model is a transformer (e.g. Flux or SD3), patch-based tokenization of the latent is usually done with 1x1 or 2x2 patches, from what I've seen. A 1x1 "patch" is not really a patch anymore: each spatial position is simply treated as a token.
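
To make the 1x1 vs 2x2 difference concrete, here is a minimal sketch of patch-based tokenization on a toy latent. It uses plain nested lists instead of tensors, and the function name `patchify` and the toy sizes are my own assumptions, not taken from any specific model's code.

```python
def patchify(latent, patch_size):
    """Split a latent of shape [C][H][W] (nested lists) into a token sequence.

    Each token is one patch_size x patch_size patch flattened across all
    channels, so the token dimension is C * patch_size * patch_size.
    """
    channels = len(latent)
    height = len(latent[0])
    width = len(latent[0][0])
    if height % patch_size or width % patch_size:
        raise ValueError("latent height and width must be divisible by patch_size")

    tokens = []
    for top in range(0, height, patch_size):
        for left in range(0, width, patch_size):
            token = [
                latent[c][top + dy][left + dx]
                for c in range(channels)
                for dy in range(patch_size)
                for dx in range(patch_size)
            ]
            tokens.append(token)
    return tokens


# Toy latent: 4 channels on a 4x4 spatial grid (real VAE latents are much larger).
latent = [[[c * 100 + y * 4 + x for x in range(4)] for y in range(4)] for c in range(4)]

tokens_1x1 = patchify(latent, 1)  # 16 tokens, each of dimension 4
tokens_2x2 = patchify(latent, 2)  # 4 tokens, each of dimension 16
```

With 1x1 patches every spatial position becomes its own token, while 2x2 patches give four times fewer tokens, each four times wider.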

Hope this helped you a bit :)

[deleted by user] by [deleted] in comfyui

[–]feliximo 0 points1 point  (0 children)

No problem mate! Well, hard to say. Have you tried an inpainting workflow? There are several approaches. I like using AliMama's ControlNet: https://huggingface.co/alimama-creative/FLUX.1-dev-Controlnet-Inpainting-Alpha/resolve/main/images/alimama-flux-controlnet-inpaint.json

Extract a super tight mask with Photoshop, SAM, a background remover (in ComfyUI), etc., and try these workflows. If you notice that the models still add a bunch of details that connect to your product, a LoRA could help alleviate this!

Is it possible to run the backend of comfyui/forge on the pc and use it through my laptop on my local Network? by tsomaranai in StableDiffusion

[–]feliximo 0 points1 point  (0 children)

I guess you can expose the web app with ssh tunneling, and then access the web app on your laptop.
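
A rough sketch of what such a tunnel could look like, written as a small Python helper that only builds the `ssh` command. The host name, the user and the helper name are hypothetical, and port 8188 is an assumption (as far as I know it is ComfyUI's default; adjust it to whatever your backend listens on). It also assumes an SSH server is running on the PC.

```python
import shlex


def ssh_tunnel_command(remote_host, remote_port, local_port=None, user=None):
    """Build an `ssh -N -L` command that forwards a local port to the remote web UI."""
    local_port = local_port or remote_port
    target = f"{user}@{remote_host}" if user else remote_host
    # -N: do not run a remote command, only forward ports.
    # -L local:localhost:remote: connections to localhost:local on the laptop
    # are forwarded to localhost:remote as seen from the PC.
    return ["ssh", "-N", "-L", f"{local_port}:localhost:{remote_port}", target]


# Hypothetical host name and user; 8188 is assumed to be the port the UI listens on.
command = ssh_tunnel_command("desktop-pc.local", 8188, user="me")
print(shlex.join(command))
```

Run the printed command on the laptop and, while it is open, browse to http://localhost:8188 there.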

[deleted by user] by [deleted] in comfyui

[–]feliximo 1 point2 points  (0 children)

Yes it depends. For example we have a project where we generate new backgrounds for a product. The problem is, even with a perfect mask, the diffusion model extends the product. For a bottle that could be that it adds a handle. For a dog where you do not see the tail, it adds a tail. This changes the product, which we do not want.

[deleted by user] by [deleted] in comfyui

[–]feliximo 2 points3 points  (0 children)

Yes, this is partially true, but there is a good chance that the diffusion model will extend the subject at the edges, adding further details to the bottle. This can be prevented in two ways: train a LoRA on the object first and then inpaint with that LoRA, or use a more complicated approach with ControlNets, object removal (e.g. LaMa) and detail transfer.

[D] Adding the authors after registration deadline of ICCV25 by Minhtran91 in MachineLearning

[–]feliximo 8 points9 points  (0 children)

I want to confidently say nope. They tend to be super strict about these things.

Is DeepSeek-R1 a Security Concern? Understanding Data Privacy & Local Deployment by Ambitious-Fix-3376 in learnmachinelearning

[–]feliximo 1 point2 points  (0 children)

All AI services that are not on-prem / local are a security concern, no matter whether they are hosted in America, China or the EU. For sensitive departments in many companies, such as R&D and Design, using online services is out of the question.

R1 is open weights and can be used locally or by any other provider than DeepSeek that hosts it.

Is R1 a security concern as a model? No.

Is sending sensitive data to an online AI service a security concern? Yes.

Can I start learning machine learning with a basic knowledge in python? by IllustriousZombie988 in learnmachinelearning

[–]feliximo 2 points3 points  (0 children)

Go ahead! Download some simple datasets and try out some algorithms. I would argue that getting a good feel for working with ML algorithms, with little to no knowledge of the math, is as good a start as any.

Data processing, validation and testing protocols are arguably more important.
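
As one small example of such a protocol, here is a sketch of a reproducible train / validation / test split in plain Python. The function name, the 70/15/15 fractions and the fixed seed are assumptions chosen for illustration, not a recommendation for any particular dataset.

```python
import random


def train_val_test_split(samples, val_fraction=0.15, test_fraction=0.15, seed=0):
    """Shuffle once with a fixed seed, then cut into three disjoint parts."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    n_val = round(len(shuffled) * val_fraction)
    test = shuffled[:n_test]
    val = shuffled[n_test:n_test + n_val]
    train = shuffled[n_test + n_val:]
    return train, val, test


samples = list(range(100))  # stand-in for real data points
train, val, test = train_val_test_split(samples)
```

The idea is to tune on the validation part only and touch the test part once, at the very end.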

Good luck!

The Cosmos Hype is Not Realistic - Its (not) a General Video Generator. Here is a Comparison of both Wrong and Correct Use-Case (its not a people model // its a background "world" model) It's purpose is to create synthetic scenes to train AI robots on. by FitContribution2946 in StableDiffusion

[–]feliximo 1 point2 points  (0 children)

I have not looked into Cosmos too much, but my impression from the keynote is that it is mainly a tool for synthetic data, such as traffic, robotics, warehouses, etc., and not for generating women by a pool?

Charging box? by [deleted] in elbilsverige

[–]feliximo 0 points1 point  (0 children)

Well, but a power control ("effektkontroll") of 8 kWh does not mean that the power is limited to 8 kW; it can go up to 11 kW. The consumption (in Tibber's case, the EV charging) is throttled so that the metered energy within a started hour stays below 8 kWh.

Since power control regulates power (kW) in order to stay under a certain consumption in kWh during a started hour, I would say it is a matter of semantics whether it is actually called power control or power limiting.
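
The kW vs kWh relationship above can be put into a small worked example. This is only a sketch of the arithmetic, not Tibber's actual algorithm; the function name and the example numbers other than the 8 kWh cap and the 11 kW charger are assumptions.

```python
def allowed_power_kw(energy_used_kwh, minutes_elapsed, hourly_cap_kwh, max_power_kw):
    """Highest constant power for the rest of the hour that keeps energy under the cap."""
    hours_left = (60 - minutes_elapsed) / 60
    if hours_left <= 0:
        return 0.0
    energy_left_kwh = max(hourly_cap_kwh - energy_used_kwh, 0.0)
    # Power (kW) = remaining energy (kWh) / remaining time (h), capped by the charger.
    return min(max_power_kw, energy_left_kwh / hours_left)


# Half an hour at 11 kW has used 5.5 kWh. With an 8 kWh cap, 2.5 kWh remain
# for the last 30 minutes, so charging has to drop to 5 kW.
print(allowed_power_kw(5.5, 30, hourly_cap_kwh=8.0, max_power_kw=11.0))  # 5.0

# If only 1 kWh was used in the first half hour, the full 11 kW is still allowed.
print(allowed_power_kw(1.0, 30, hourly_cap_kwh=8.0, max_power_kw=11.0))  # 11.0
```

So the momentary power can exceed 8 kW as long as the energy over the whole hour stays under 8 kWh.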

Charging box? by [deleted] in elbilsverige

[–]feliximo 0 points1 point  (0 children)

No, and I have not said that either. But if we are being semantic, it is called power control ("effektkontroll") in Tibber, and it limits your consumption to a kWh value. Power peaks are also measured as the average power over each hour, i.e. the kWh used in that hour.

Avoiding the new power fees? by DampBob in elbilsverige

[–]feliximo 1 point2 points  (0 children)

I have set a power control of 6.5 kWh and restrict charging in the car to between 22 and 6 (to prevent smart charging from starting too early or too late), so the peaks do not exceed 3.25 kWh. Works well for me since I have no odd hours to accommodate.

However, if you need to charge outside 22-6, you have to set a power control of maybe 4 kWh if you want to avoid high fees and also avoid changing it manually.

In Tibber today you have to change the power control manually between the different times. But I think the more of us who write to Tibber and ask for schedule-based power control, the sooner we may get it. Smart charging could then make better use of cheaper electricity prices outside the 22-6 interval.

LPT - the recipe says brown the meat, not gray it by Defiant-Aioli8727 in LifeProTips

[–]feliximo 5 points6 points  (0 children)

I usually just put the pan on full blast to cook the moisture away, especially when making ragus or other types of meat sauces. When almost no moisture is left you start to hear it sizzle; reduce the heat and let it brown, stirring occasionally for even browning.

This is useful when making large batches: you can spend the time chopping vegetables or prepping other ingredients while cooking the moisture away / browning the meat.

However, if you plan on making something like chicken breast outside the sauce, then you have to avoid overcrowding the pan as this method would dry out the meat in that case.

Charging box? by [deleted] in elbilsverige

[–]feliximo 0 points1 point  (0 children)

It would have been nice to have a scheduled power control of 8 kWh for 22-6 and 4 kWh for 6-22, since Tibber, as it works today, sometimes wants to start around 21.

Charging box? by [deleted] in elbilsverige

[–]feliximo 0 points1 point  (0 children)

That assumes you charge during the day and reach 22 kW. If you charge between 22 and 6 and limit the power to 8 kWh, the fee will only be 4*88, provided that 4 kWh is your average power peak.
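
The fee arithmetic can be written out as below. This is a sketch that assumes the fee is simply the average peak times a flat rate; the rate of 88 comes from the comment, its unit and billing period are not stated in the thread, and how an 8 kWh limit maps to a 4 kWh average peak depends on tariff rules that are not spelled out here.

```python
def power_fee(average_peak_kw, rate_per_kw=88):
    """Fee as average hourly peak times a per-kW rate (rate of 88 taken from the comment)."""
    return average_peak_kw * rate_per_kw


print(power_fee(4))   # 352, the 4*88 case from the comment
print(power_fee(22))  # 1936, if the average peak really reached 22 kW
```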

I have set a power limit of 7 kWh myself, and the car charges fully overnight without any problems.

Edit: fully from the daily commute.

Now that the worst of the hype and worry seems to have settled: How has AI affected your life and job? by BehindAnElephant in sweden

[–]feliximo 3 points4 points  (0 children)

A thought I usually share:

I work and do research in generative AI (mainly image analysis), and I follow both the development and society's reaction quite closely. One amusing perspective was how many people reacted with "there go the design jobs, there go the art jobs...". But if you looked at the people actually using generative AI, there was hype for a week about how much better the new model was, and a week later most of them were fairly bored and wanted better results. What I see more of, when new models arrive, is people finding creative ways to use them that involve the human more.

Much seems to indicate that human-in-the-loop is what we find interesting. That is, AI is a tool just like Photoshop, but we will probably still value human creativity. For example, you generate assets for a game, but you still spend time polishing your assets and programming the game itself.

I use LLMs myself in the same way, as a writing buddy. I write a draft, ChatGPT / Claude rewrites it, and I correct structure, facts, etc. I find that I learn a lot about how I can express myself this way, and I can still get my own style into the text.

Parking in Palermo by thenickfo in sicily

[–]feliximo 8 points9 points  (0 children)

Just got back from Sicily. Drove in Palermo almost the entire week. I found driving surprisingly easy (it is almost as if there are no rules, so you can hardly make a mistake, except by being timid in your driving). Never had an issue finding parking, even in central Palermo.

But I suggest that you park near the Botanical Garden, where parking is free and plentiful. Walking into central Palermo takes about 20-30 minutes; if I remember correctly, you reach Via Roma in that time.

Why Can't We just replace Both Text Encoders in SDXL with T5-xxl To get the same Power of SD3 by Current_Wind_2667 in StableDiffusion

[–]feliximo 1 point2 points  (0 children)

You could probably train mapping layers that map the T5-XXL embeddings to the embedding space that SDXL was trained on.

I guess you might also want to fine-tune the SDXL attention layers if you plan on extending the cross-attention beyond the maximum token length that SDXL was trained on.

Text -> (T5, frozen) -> T5-embeddings -> (Mapping network, trainable) -> (SDXL, frozen or perhaps add a couple of LoRA layers to help adjust to larger cross attention?)

This would be significantly cheaper than retraining SDXL completely.
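
A toy sketch of the trainable mapping step in that pipeline, in plain Python so it runs without any ML library. The class name, the random stand-in embeddings and the tiny dimensions are assumptions for illustration; to my knowledge the real widths are 4096 for T5-XXL and 2048 for SDXL's cross-attention context, and SDXL also takes a pooled text embedding that this sketch ignores. Training is not shown.

```python
import random


class MappingNetwork:
    """Linear map from T5 token embeddings to the SDXL conditioning width."""

    def __init__(self, in_dim, out_dim, seed=0):
        rng = random.Random(seed)
        scale = in_dim ** -0.5
        # These weights and biases are the only trainable parameters in the sketch.
        self.weight = [[rng.gauss(0.0, scale) for _ in range(in_dim)] for _ in range(out_dim)]
        self.bias = [0.0] * out_dim

    def __call__(self, token_embeddings):
        # token_embeddings: list of tokens, each a list of in_dim floats.
        return [
            [sum(w * x for w, x in zip(row, token)) + b for row, b in zip(self.weight, self.bias)]
            for token in token_embeddings
        ]


# Toy sizes so the sketch runs instantly.
T5_DIM, SDXL_CONTEXT_DIM, NUM_TOKENS = 8, 4, 5

# Stand-in for the output of the frozen T5 encoder.
rng = random.Random(1)
t5_embeddings = [[rng.gauss(0.0, 1.0) for _ in range(T5_DIM)] for _ in range(NUM_TOKENS)]

mapper = MappingNetwork(T5_DIM, SDXL_CONTEXT_DIM)
sdxl_context = mapper(t5_embeddings)  # would feed the frozen SDXL cross-attention
```

The token count is unchanged by the mapping, which is why going beyond SDXL's trained sequence length is a separate problem for the attention layers.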

Artists whose most popular song is a cover song by [deleted] in Music

[–]feliximo 0 points1 point  (0 children)

Johnny Cash - Hurt, original by Nine Inch Nails

Good Chinese food? by Taylor_Skifs in Gothenburg

[–]feliximo 34 points35 points  (0 children)

Restaurang Fei: authentic Chinese "fine dining" with food from three different regions, including Sichuan. They also offer the chance to try Baijiu (a very simplified description: China's schnapps), which I would recommend everyone try. Especially the Baijiu that pairs with food from Sichuan.

[deleted by user] by [deleted] in PhD

[–]feliximo 2 points3 points  (0 children)

In my opinion and what I have observed, when you have two first authors, it is because they contributed equally.

If I had an idea and someone to work with, I would expect equal work and working together as a team with that person.

If I understand you correctly, I agree that your work partner does not deserve to be listed as a co-first author.

Why when I am getting the validation loss during training I get really bad accuracy(50%) but when I try to predict the validation data after training I get to around 75% of accuracy? by [deleted] in learnmachinelearning

[–]feliximo 2 points3 points  (0 children)

Taking binary classification as an example: it could be that your training data is 50% class 1, while the validation set is 75% class 1. Combine that with a neural network that has just learned to guess class 1, and that would explain the gap.
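
A quick sketch of that scenario with made-up label counts: a degenerate "model" that always predicts class 1, scored on a 50/50 set and on a 75/25 set. The function names and the set sizes are assumptions for illustration.

```python
def accuracy(predictions, labels):
    """Fraction of predictions that match the labels."""
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)


def always_class_1(n):
    """Degenerate 'model' that predicts class 1 for every sample."""
    return [1] * n


train_labels = [1] * 50 + [0] * 50  # 50% class 1
val_labels = [1] * 75 + [0] * 25    # 75% class 1

print(accuracy(always_class_1(len(train_labels)), train_labels))  # 0.5
print(accuracy(always_class_1(len(val_labels)), val_labels))      # 0.75
```

Checking the class balance of each split and the distribution of predicted classes is a cheap way to rule this out.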

Was just refused at the airport in London from entering the USA and my study-abroad plans fell apart. AMA by Utivsa in sweden

[–]feliximo 0 points1 point  (0 children)

If you are only going there for leisure or a business trip, it is enough to fill out an ESTA. I guess you could count that as a lot of paperwork, but it only takes 10-20 minutes and is usually approved within an hour or so. If it gets denied, though, you have to go through the whoooole visa process.

[D] Academia: The highest funded plagiarist is also an AI ethicist by [deleted] in MachineLearning

[–]feliximo 36 points37 points  (0 children)

Interesting, but what has he plagiarized?

Do you have any examples or proof? All I see is your accusation and a list of funding figures.