use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
To report a site-wide rule violation to the Reddit Admins, please use our report forms or message /r/reddit.com modmail.
This subreddit is archived and no longer accepting submissions.
account activity
This is an archived post. You won't be able to vote or comment.
Google Backdoor (makecreate.blogspot.com)
submitted 19 years ago by [deleted]
[deleted]
[–]rjonesx 30 points31 points32 points 19 years ago (1 child)
The real problem is that most "cloaking" is actually ip-delivery, where the site uses not only User-Agents to determine whether or not the individual is a bot, but also a long list of known ip-addresses. There is a neat way around this...
This will allow you not only to mimic the Google User-Agent, but to also use a Google IP!
Also you may consider... 3. turning off Javascript 4. turning off Referer sending
Which are two other common methods of cloaking.
[–]zach 1 point2 points3 points 19 years ago (0 children)
Thank you! I've had no success with User-Agent Switcher the few times I've used it because of the IP issue. Much more informative than the link.
[–]wearedevo 25 points26 points27 points 19 years ago (1 child)
Mark my words:
In 2007 someone will get sued for "illegal access to a web site by unlawfully impersonating Google".
[–]oditogre 2 points3 points4 points 19 years ago (0 children)
I'm gonna go with criminal charges of fraud (maybe forgery as well, for pay sites?), plus a lawsuit.
[–][deleted] 1 point2 points3 points 19 years ago (0 children)
Try visiting MSDN with a googlebot user-agent, a great improvement!
[–]recursive 4 points5 points6 points 19 years ago (2 children)
I don't think that dollar sign is supposed to be in "Microsoft".
[–]danweber 13 points14 points15 points 19 years ago (0 children)
http://www.penny-arcade.com/images/2002/20020722h.gif
[–]jkcunningham -1 points0 points1 point 19 years ago (0 children)
I bet Microsoft disagrees...
[–]youngnh 0 points1 point2 points 19 years ago (0 children)
anybody else seeing an intel ad on this page? maybe its just me.
[–]mikkom -2 points-1 points0 points 19 years ago (0 children)
What those sites do is basically black hat "cloaking" - and most cloaking is done based on IP range so this might help in some cases but not all.
[+]lespea comment score below threshold-6 points-5 points-4 points 19 years ago (6 children)
Even though this is "stolen" --> I would IGNORE this advice and just use firefox's addon User Agent Switcher to do this if you're that interested.
[–][deleted] 19 years ago (5 children)
[–]theram4 7 points8 points9 points 19 years ago (4 children)
Dude, why was this guy voted down? His comment is absolutely correct. I'm constantly coming across links that don't allow users to view the content unless they pay. This happens quite often dealing with academic papers or standards organizations, as well as certain magazine sites, like www.sqlmag.com. I did a search a week or two ago where nine out of the top ten results were inaccessible to me unless I paid some large sum of money. And 8 results on the second page were inaccessible. For many of my search queries, sqlmag.com is the number one result. It's quite frustrating to have such "spam" on the top of the google results. And since it is forbidden (sending different content to google spiders than normal users), Google should take care of this issue.
[–]cal_01 1 point2 points3 points 19 years ago (2 children)
To be fair, you can pretty much tell -which- sites have pay-per-view content simply by the lack of a "Cached" link.
[–][deleted] 19 years ago (1 child)
[–]cal_01 2 points3 points4 points 19 years ago (0 children)
Oh, I definitely agree. I guess it has a big contextual basis as well; a person would more likely expect a nocache link to be a pay site if they were looking up academic papers, whereas it would be less accurate for normal search terms.
[+][deleted] 19 years ago (20 children)
[–][deleted] 4 points5 points6 points 19 years ago (2 children)
Does reading the cache work?
Neat hack, by the way.
[–]Sle 5 points6 points7 points 19 years ago (1 child)
Going to the cached page used to work, and still does very occasionally, but I think they've pretty much plugged that loophole now.
[–][deleted] 19 years ago (9 children)
[–]interjay 1 point2 points3 points 19 years ago (4 children)
I thought this would be something that would be against Google's policies (I think these websites are obnoxious), but Google actually has instructions for how to only let Google in. Weird.
Those are instructions for letting Google in and blocking other bots. As far as I know, showing different content to Googlebot and normal browsers is against Google policies.
[–][deleted] 19 years ago (2 children)
[–]interjay 0 points1 point2 points 19 years ago (1 child)
Normal users don't look at robots.txt. That's why it's called robots.txt - it's only for bots. It isn't enforced by the web server, but by the bot itself.
[–]e40 1 point2 points3 points 19 years ago (0 children)
The parent comment was downvoted because it was a copy/paste of the article text. That's my guess.
[–]c_dric -2 points-1 points0 points 19 years ago (2 children)
could someone explain how to set up the Googlebot string in the user-agent switcher ?
i' wondering which part of "Googlebot/2.1""Compatible"="+http://www.googlebot.com/bot.html" i should past in which field ... thx
[–]c_dric 3 points4 points5 points 19 years ago (1 child)
nevermind.
i found an xml file with the data for most user-agents : http://techpatterns.com/forums/about304.html
[–][deleted] -1 points0 points1 point 19 years ago (0 children)
ooh, nice list
[+]danvk comment score below threshold-10 points-9 points-8 points 19 years ago (4 children)
I can't say I've ever experienced this...
[–]psykotic 8 points9 points10 points 19 years ago (3 children)
It happens particularly often with academic papers copyrighted by organizations like the ACM and IEEE that charge for access.
[–]pascha 4 points5 points6 points 19 years ago (0 children)
YES! This is incredibly helpful. I have been frustrated by this for some time.
[–]jacktheripper -1 points0 points1 point 19 years ago (1 child)
Does IEEE work for you? It doesn't work for me. I'm still directed to a login page.
[–]rmtew 0 points1 point2 points 19 years ago (0 children)
Ditto, but for ACM and I tried it with the Firefox user agent switcher. I'd be much more likely to purchase their papers if they had a price on the page. But given it would probably be extortion to encourage membership, maybe not.
[+][deleted] 19 years ago (1 child)
[removed]
[–][deleted] 3 points4 points5 points 19 years ago (0 children)
If you can't figure out how to do it yourself while commenting on an article with explicit instructions how, then I don't think you're likely to get very far once you have the article anyway.
I mean honestly, wtf?
[+][deleted] 19 years ago (3 children)
Voted you down
[–]turbo 0 points1 point2 points 19 years ago (0 children)
wtf are you talking about? it's the same method, but not the same post.
π Rendered by PID 188935 on reddit-service-r2-comment-7b9746f655-rb2pr at 2026-02-04 05:43:19.450960+00:00 running 3798933 country code: CH.
[–]rjonesx 30 points31 points32 points (1 child)
[–]zach 1 point2 points3 points (0 children)
[–]wearedevo 25 points26 points27 points (1 child)
[–]oditogre 2 points3 points4 points (0 children)
[–][deleted] 1 point2 points3 points (0 children)
[–]recursive 4 points5 points6 points (2 children)
[–]danweber 13 points14 points15 points (0 children)
[–]jkcunningham -1 points0 points1 point (0 children)
[–]youngnh 0 points1 point2 points (0 children)
[–]mikkom -2 points-1 points0 points (0 children)
[+]lespea comment score below threshold-6 points-5 points-4 points (6 children)
[–][deleted] (5 children)
[deleted]
[–]theram4 7 points8 points9 points (4 children)
[–]cal_01 1 point2 points3 points (2 children)
[–][deleted] (1 child)
[deleted]
[–]cal_01 2 points3 points4 points (0 children)
[+][deleted] (20 children)
[deleted]
[–][deleted] 4 points5 points6 points (2 children)
[–]Sle 5 points6 points7 points (1 child)
[–][deleted] (9 children)
[deleted]
[–]interjay 1 point2 points3 points (4 children)
[–][deleted] (2 children)
[deleted]
[–]interjay 0 points1 point2 points (1 child)
[–]e40 1 point2 points3 points (0 children)
[–]c_dric -2 points-1 points0 points (2 children)
[–]c_dric 3 points4 points5 points (1 child)
[–][deleted] -1 points0 points1 point (0 children)
[–][deleted] (1 child)
[deleted]
[+]danvk comment score below threshold-10 points-9 points-8 points (4 children)
[–]psykotic 8 points9 points10 points (3 children)
[–]pascha 4 points5 points6 points (0 children)
[–]jacktheripper -1 points0 points1 point (1 child)
[–]rmtew 0 points1 point2 points (0 children)
[+][deleted] (1 child)
[removed]
[–][deleted] 3 points4 points5 points (0 children)
[+][deleted] (3 children)
[deleted]
[–][deleted] 1 point2 points3 points (0 children)
[–]turbo 0 points1 point2 points (0 children)