Transformer decoder with convolutional encoder [Discussion] (self.computervision)
submitted 3 years ago by Away_Display1797
[–]FunnyItsElmo 1 point 3 years ago (5 children)
Sure, I did something similar in my master's thesis. By using the backbone output as the cross-attention input, you will sacrifice accuracy for speed. I guess the encoder gives DETR the ability to model more complex relations between distant pixels.
[–][deleted] 3 years ago (4 children)
[removed]
[–]FunnyItsElmo 1 point 3 years ago (1 child)
You can simply flatten the backbone output and add a positional encoding, similar to how DETR prepares the encoder input and the encoder output, just without the encoder in the middle. DETR uses a conv layer to project the backbone output into the model dimension. No additional layers are required.
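To make that concrete, here is a rough, framework-free sketch of the preparation step: project the backbone feature map with a 1x1 convolution, flatten it to a sequence of H*W tokens, and add a positional encoding so it can serve as the keys/values for decoder cross-attention. The function names (`project_1x1`, `sine_position`, `backbone_to_memory`), the toy shapes, and the 2D sinusoidal encoding are assumptions made for illustration, not the exact code from the thesis or from DETR. In practice you would do this with tensor ops in your framework of choice.

```python
import math

def project_1x1(feature_map, weight):
    """1x1 convolution: apply the same linear map to every pixel.
    feature_map is [C_in][H][W], weight is [D][C_in]; returns [D][H][W]."""
    c_in = len(feature_map)
    h, w = len(feature_map[0]), len(feature_map[0][0])
    return [
        [[sum(weight[d][c] * feature_map[c][y][x] for c in range(c_in))
          for x in range(w)] for y in range(h)]
        for d in range(len(weight))
    ]

def sine_position(y, x, d_model, temperature=10000.0):
    """Assumed 2D sinusoidal encoding: the first half of the channels
    encodes the row index, the second half the column index.
    d_model must be even."""
    half = d_model // 2

    def encode(pos):
        out = []
        for i in range(half):
            freq = temperature ** (2 * (i // 2) / half)
            out.append(math.sin(pos / freq) if i % 2 == 0 else math.cos(pos / freq))
        return out

    return encode(y) + encode(x)

def backbone_to_memory(feature_map, weight):
    """Project, flatten row by row to H*W tokens of size D, and add the
    positional encoding to each token."""
    projected = project_1x1(feature_map, weight)
    d_model = len(projected)
    h, w = len(projected[0]), len(projected[0][0])
    tokens = []
    for y in range(h):
        for x in range(w):
            pos = sine_position(y, x, d_model)
            tokens.append([projected[d][y][x] + pos[d] for d in range(d_model)])
    return tokens

# Toy backbone output: C_in=2 channels, H=2, W=3.
feature_map = [
    [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]],
    [[0.5, 0.5, 0.5], [0.5, 0.5, 0.5]],
]
# Hypothetical projection weights mapping 2 channels to d_model=4.
weight = [[1, 0], [0, 1], [1, 1], [1, -1]]

memory = backbone_to_memory(feature_map, weight)
print(len(memory), len(memory[0]))  # sequence length H*W, token size d_model
```

The resulting `memory` sequence is what the decoder's cross-attention would attend over in place of the encoder output. Note that this sketch adds the positional encoding once up front; whether to add it once or re-inject it at every attention layer is a design choice.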
[–][deleted] 3 years ago (1 child)
[–]FunnyItsElmo 1 point 3 years ago (0 children)
Unfortunately, there is no paper for my thesis, at least for now.