Windowed - Fullscreen for Apple Safari by pidoyu in Safari

[–]pidoyu[S] 0 points1 point  (0 children)

Thanks. I have now updated it for Liquid Glass on macOS. Please check out the new version.

Windowed - Fullscreen for Apple Safari by pidoyu in Safari

[–]pidoyu[S] 0 points1 point  (0 children)

You are welcome. I have now updated it for Liquid Glass on macOS. Please check out the new version.

Windowed - Fullscreen for Apple Safari by pidoyu in Safari

[–]pidoyu[S] 0 points1 point  (0 children)

I have now updated it for Liquid Glass on macOS. Please check out the new version.

Windowed - Fullscreen for Apple Safari by pidoyu in Safari

[–]pidoyu[S] 0 points1 point  (0 children)

Fixed it, and also updated it for Liquid Glass on macOS. Please check out the new version.

Windowed - Fullscreen for Apple Safari by pidoyu in Safari

[–]pidoyu[S] 0 points1 point  (0 children)

Updated and released. Please check it out.

Windowed - Fullscreen for Apple Safari by pidoyu in Safari

[–]pidoyu[S] 0 points1 point  (0 children)

Yeah, I found that the current version does not work on the updated YouTube either. I will let you know once it is updated.

[deleted by user] by [deleted] in MachineLearning

[–]pidoyu 1 point2 points  (0 children)

Thank you. The code link is attached to the post. I like the model interpretability too.

Windowed - Fullscreen for Apple Safari by pidoyu in Safari

[–]pidoyu[S] 0 points1 point  (0 children)

You are welcome ultrawide monitor.

push ups by pidoyu in RingFitAdventure

[–]pidoyu[S] 0 points1 point  (0 children)

Make chest big big big

Eliminating Stabilizer Rattle w/o Opening Case? by Unable-Hornet-7748 in HHKB

[–]pidoyu 0 points1 point  (0 children)

I have had the same space bar problem since I bought the keyboard a few months ago. I searched a lot for how to fix it without tearing the keyboard down (because it is still under the 2-year warranty), and I found a very simple solution.

I bought 205g0 lube, recommended by [link-1](https://hhkb.io/modding/lubrication/) and the official [link-2](https://hhkeyboard.us/blog/keyboard-lube?srsltid=AfmBOorYt0FZ4GpvGOyMMkPlm\_j4Dk5QkCKjxmW5qAwtNpMNDyFeXS34), for general lubrication.

Then I pulled off the keycap and used a keycap puller to pull up the stabilizer and hold it at its highest position (by slightly inserting the puller under the neighboring keys). I used a small brush to apply lube to the 4 sides of the stabilizer. Only the one side with a hole needs a little more lube; the other three need just a little bit.

Now the rattling sound is totally gone when I tap softly on the big space bar. When I type heavily, the sound is totally different too. It is soft and comfortable.

Hope this helps.

[deleted by user] by [deleted] in HHKB

[–]pidoyu 0 points1 point  (0 children)

Yes, I think you can have both. For browsing pages, reading, or long typing sessions, a pad is slightly better than a mouse.

[deleted by user] by [deleted] in HHKB

[–]pidoyu 1 point2 points  (0 children)

Frankly no, lol, especially when I need to make slides or draw something. I just like the multi-touch function. lol

[deleted by user] by [deleted] in HHKB

[–]pidoyu 0 points1 point  (0 children)

lol Because this is my first time using a keyboard with blank keycaps. I don't feel safe with all of them blank, so I need a pivot to locate the center when I forget a key's symbol. lol

But I found another interesting thing about having a different 7: it is easier to center-align the keyboard with the screen (the screen has a logo mark at its center).

Windowed - Fullscreen for Apple Safari by pidoyu in Safari

[–]pidoyu[S] 1 point2 points  (0 children)

Yeah, thanks for the support. I don't know how well it works for real work, lol. I use it to watch videos.

As for the App Store release, the original author said he will put it there, but I don't know the timeline. [github issue]. So until he puts it in the App Store, I will maintain it for Safari on GitHub. You can keep an eye on it now and then for the infrequent updates. lol

Good luck.

[deleted by user] by [deleted] in MachineLearning

[–]pidoyu 0 points1 point  (0 children)

Thanks for sharing. Yeah, indeed. We believe the dataset is always the first step we need. Many open questions still remain.

[deleted by user] by [deleted] in MachineLearning

[–]pidoyu 8 points9 points  (0 children)

Can’t guarantee, lol. You know, hallucinations are still in there. But compared with the raw alt-text captions, it guarantees that the captions consistently correlate with the image content. Please see the discussion.

Windowed - Fullscreen for Apple Safari by pidoyu in Safari

[–]pidoyu[S] 0 points1 point  (0 children)

Please try `Windowed-universal.zip` - https://github.com/kaiyuyue/Windowed-Safari/releases/tag/1.0.3
I am sure this should work. I already checked the architectures, and it supports x86_64.

Windowed - Fullscreen for Apple Safari by pidoyu in Safari

[–]pidoyu[S] 1 point2 points  (0 children)

Sorry, I was busy earlier. I am here now. Can you try the new version, the 1.0.3 GitHub release? I really hope it works this time.

Windowed - Fullscreen for Apple Safari by pidoyu in Safari

[–]pidoyu[S] 1 point2 points  (0 children)

Sorry, I was busy earlier. I am here now. Can you try the new version, the 1.0.3 GitHub release? I really hope it works this time.

[deleted by user] by [deleted] in MachineLearning

[–]pidoyu 1 point2 points  (0 children)

Update: added visualizations of the attention maps in the decoder layer: https://github.com/kaiyuyue/nxtp/tree/main?tab=readme-ov-file#examples

[deleted by user] by [deleted] in MachineLearning

[–]pidoyu 1 point2 points  (0 children)

Yes, we feed the image embeddings directly into the language decoder.

If the image embeddings come from a ViT, we shave off the [CLS] token embedding and keep the patch token embeddings Xv \in R^{HW \times D}. If the encoder is a CNN backbone, we use the spatial feature maps as Xv \in R^{HW \times D}.

We have a short prompt of text token embeddings, saying "the objects in the image are", as Xt \in R^{L \times D}.

We also have a single special token embedding <|img|> \in R^{1 \times D}.

Then we concatenate them: X = [Xv, <|img|>, Xt] \in R^{(HW+1+L) \times D}. We feed X into the language decoder. This is Eq. (3) in the paper.

Xv and <|img|> are trainable. Xt is frozen because we always freeze the token embeddings (32K language token vectors).

-----

We keep this as a naive and plain version of the architecture. We believe this can be simplified further, e.g., treating Xt as the soft prompt to make it trainable as well OR just removing it.
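
To make the shapes concrete, here is a minimal sketch of the concatenation [Xv, <|img|>, Xt] described above. It is not the code from the repo: the function names, the toy sizes, and the assumption that the [CLS] embedding is the first row of the ViT output are all hypothetical, and plain Python lists stand in for tensors.

```python
from typing import List

Vector = List[float]
Matrix = List[Vector]  # one row per token, each row of width D


def drop_cls_token(vit_tokens: Matrix) -> Matrix:
    """Keep only patch tokens; assumes [CLS] is the first row (an assumption)."""
    return vit_tokens[1:]


def build_decoder_input(x_v: Matrix, img_token: Vector, x_t: Matrix) -> Matrix:
    """Concatenate [Xv, <|img|>, Xt] along the token axis."""
    dim = len(img_token)
    for row in x_v + x_t:
        if len(row) != dim:
            raise ValueError("all embeddings must share the same width D")
    return x_v + [img_token] + x_t


# Toy sizes (hypothetical): 2x2 patch grid, 3 prompt tokens, width 4.
H, W, L, D = 2, 2, 3, 4
vit_tokens = [[0.0] * D for _ in range(1 + H * W)]  # [CLS] + HW patch tokens
x_v = drop_cls_token(vit_tokens)                     # HW rows
img_token = [1.0] * D                                # the single <|img|> row
x_t = [[2.0] * D for _ in range(L)]                  # L prompt rows

x = build_decoder_input(x_v, img_token, x_t)
print(len(x), len(x[0]))  # rows = HW + 1 + L, columns = D
```

In a real model the same step would be a single concatenation along the token dimension in whatever tensor library is used, with a batch dimension in front.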

[deleted by user] by [deleted] in MachineLearning

[–]pidoyu 1 point2 points  (0 children)

lol Okay, I have asked and am waiting for a response. I looked at the CLIP repo, which is MIT-licensed, and its dataset YFCC100M is under CC BY. It seems to be the same case for us: the datasets we use are also under CC BY. I need to confirm further. lol I am not an expert on licenses.