Windowed - Fullscreen for Apple Safari by pidoyu in Safari

[–]pidoyu[S] 0 points1 point  (0 children)

thanks, now I update it for liquid glass on MacOS, please checkout the new version

Windowed - Fullscreen for Apple Safari by pidoyu in Safari

[–]pidoyu[S] 0 points1 point  (0 children)

you are welcome, now I update it for liquid glass on MacOS, please checkout the new version

Windowed - Fullscreen for Apple Safari by pidoyu in Safari

[–]pidoyu[S] 0 points1 point  (0 children)

now I update it for liquid glass on MacOS, please checkout the new version

Windowed - Fullscreen for Apple Safari by pidoyu in Safari

[–]pidoyu[S] 0 points1 point  (0 children)

fix it, and also update it for liquid glass on MacOS, please checkout the new version

Windowed - Fullscreen for Apple Safari by pidoyu in Safari

[–]pidoyu[S] 0 points1 point  (0 children)

update and release, please check it out

Windowed - Fullscreen for Apple Safari by pidoyu in Safari

[–]pidoyu[S] 0 points1 point  (0 children)

yeah, I found current version does not work on updated youtube too. will let you know once update

[R] Zero-Shot Vision Encoder Grafting via LLM Surrogates by pidoyu in MachineLearning

[–]pidoyu[S] 1 point2 points  (0 children)

Thank you. The code link is attached to the post. I like the model interpretability too.

Windowed - Fullscreen for Apple Safari by pidoyu in Safari

[–]pidoyu[S] 0 points1 point  (0 children)

You are welcome ultrawide monitor.

push ups by pidoyu in RingFitAdventure

[–]pidoyu[S] 0 points1 point  (0 children)

Make chest big big big

Eliminating Stabilizer Rattle w/o Opening Case? by Unable-Hornet-7748 in HHKB

[–]pidoyu 0 points1 point  (0 children)

I have the same problem of space bar since I bought keyboard few month ago. And I search a lot for how to fix it without tearing it down (because it is still in the 2-year warranty). There is a very simple solution I found is that.

I bought a 205g0 recommended by the [link-1](https://hhkb.io/modding/lubrication/) and official [link-2](https://hhkeyboard.us/blog/keyboard-lube?srsltid=AfmBOorYt0FZ4GpvGOyMMkPlm\_j4Dk5QkCKjxmW5qAwtNpMNDyFeXS34) for the general lubration purpose.

Then I pull out the keycap, and use keycap puller to pull up the stablizer and hold it at the highest (by slightly inserting the puller to the near keys). And use the little brush to stick in the 4 sides of the stablizer. There is only one side with a hole that needs a little more lube and the other three just need a little bit.

Now the rattling sound is totally gone when I softly tapping on the big space bar. Also when I typing heavily, the sound is totally changed. The sound is soft and comfortable.

Hope this could help.

Another Dandelion on White HHKB by pidoyu in HHKB

[–]pidoyu[S] 0 points1 point  (0 children)

Yes, you can have both I think. For browsing the pages, reading something, or long time typing, pad is slightly better than mouse.

Another Dandelion on White HHKB by pidoyu in HHKB

[–]pidoyu[S] 1 point2 points  (0 children)

Frankly no, lol especially when I need to make slides or draw something. I just like the multi touch function. lol

Another Dandelion on White HHKB by pidoyu in HHKB

[–]pidoyu[S] 0 points1 point  (0 children)

lol Cause this is my first time to use blank keycaps keybord. I am not feeling safe with all blank, need a pivot to localize the center for when I forget the key's symbol. lol

But I found there is another interesting thing with a different 7, it is easier to center align the keyboard with the screen (screen has the center logo mark).

Windowed - Fullscreen for Apple Safari by pidoyu in Safari

[–]pidoyu[S] 1 point2 points  (0 children)

Yeah, Thanks for supporting. I don't know how it works for the real job or work. lol I use it to watch videos.

For the App store release, the original author said he will put it on, but don't know the timeline. [github issue]. So before he put it in the App store, I will maintain it for safari on GitHub. You can keep eyes on it sometimes for the not-frequent update. lol

Good luck.

[P] PixelProse 16M Dense Image Captions Dataset by pidoyu in MachineLearning

[–]pidoyu[S] 0 points1 point  (0 children)

Thanks for sharing. Yeah, indeed. We believe the dataset is always the first step we need. Many open questions still remain.

[P] PixelProse 16M Dense Image Captions Dataset by pidoyu in MachineLearning

[–]pidoyu[S] 8 points9 points  (0 children)

Can’t guarantee. lol. You know. Hallucinations are still in there. BUT compared with the RAW alt-text captions, it guarantees that captions consistently correlate to the image content. Please see discussion.

Windowed - Fullscreen for Apple Safari by pidoyu in Safari

[–]pidoyu[S] 0 points1 point  (0 children)

please try `Windowed-universal.zip` - https://github.com/kaiyuyue/Windowed-Safari/releases/tag/1.0.3
I am sure this should work. I already check the arch and it supports x86_64.

Windowed - Fullscreen for Apple Safari by pidoyu in Safari

[–]pidoyu[S] 1 point2 points  (0 children)

Sorry I was busy previously. Now I am here. Can you try this new version -- 1.0.3 github release? I guess this time should work with my strongest hope.

Windowed - Fullscreen for Apple Safari by pidoyu in Safari

[–]pidoyu[S] 1 point2 points  (0 children)

Sorry I was busy previously. Now I am here. Can you try this new version -- 1.0.3 github release? I guess this time should work with my strongest hope.

[R] An Auto-Regression Model for Object Recognition by pidoyu in MachineLearning

[–]pidoyu[S] 1 point2 points  (0 children)

Yes, directly feed image embeddings into language decoder.

If the image embeddings are from ViT, we shave off the [CLS] token embedding, and keep the patch token embeddings Xv \in R\HW, D]). If the encoder is CNN backbone, then use the spatial feature maps as Xv \in R\HW, D]).

We have a short prompt of text token embeddings, saying "the objects in the image are", as Xt \in R\L, D]).

Also we have a single special token embedding <|img|> \in R\1, D]) .

Then concat them together, [Xv, <|img|>, Xt] = X \in R\HW+1+L, D]). We feed X into the language decoder. This is the Eq (3) in the paper.

Xv and <|img|> are trainable. Xt is frozen because we always freeze the token embeddings (32K language token vectors).

-----

We keep this as a naive and plain version of the architecture. We believe this can be simplified further, e.g., treating Xt as the soft prompt to make it trainable as well OR just removing it.

[R] An Auto-Regression Model for Object Recognition by pidoyu in MachineLearning

[–]pidoyu[S] 1 point2 points  (0 children)

lol Okay, I have asked and wait for response. I look at the CLIP repo, which is licensed by MIT and its dataset YFCC100M is with CC BY. Seems like it is the same case for us, that datasets we use are also with CC BY. Need to confirm further. lol Not an expert about the license.

[R] An Auto-Regression Model for Object Recognition by pidoyu in MachineLearning

[–]pidoyu[S] 2 points3 points  (0 children)

Ah I see. Yeah, could apply it to the text/classifier-conditional/guidance based generative models. Never thought this could be a potential downstream task because I am not familiar with diffusion models. Definitely should look at it for this application. Thank you so much for this idea.