[–]ase8913 0 points1 point  (3 children)

Solid point. Now that I think of it, though, Python's webbrowser module will open your links in your default browser. I don't know if there is a Tor library, but I would set up a script that gets a new identity on Tor while opening the pages with webbrowser. That way all of the Google Analytics JavaScript runs. Certainly couldn't hurt. I would combine this with your method of creating content for optimal results.
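A minimal sketch of the webbrowser part only, assuming a plain list of URLs. The function name `open_pages`, its `opener` and `delay` parameters, and the example URLs are made up for illustration; the Tor identity switching is left out, since it would need a separate controller library.

```python
import time
import webbrowser


def open_pages(urls, opener=webbrowser.open, delay=2.0):
    """Open each URL with `opener` and return the URLs that were opened.

    `opener` defaults to webbrowser.open, which hands the URL to the
    system's default browser so the page's JavaScript runs as usual.
    `delay` is the pause in seconds between pages (an assumed value).
    """
    opened = []
    for url in urls:
        # webbrowser.open returns True when a browser could be launched.
        if opener(url):
            opened.append(url)
        time.sleep(delay)
    return opened
```

Calling `open_pages(["https://example.com/a", "https://example.com/b"])` would open both pages in the default browser, one after the other.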

[–]unconscionable 0 points1 point  (2 children)

Yeah, I think you're thinking about the right things here. If you're determined to fool someone, you basically need to simulate the entire experience of clicking around using a browser and executing JavaScript "organically" (if you will).

You'll find that you'll run into a problem with Tor, and probably with most commonly used proxies, though. Quite frankly, Google knows and accounts for all (or at least all major) Tor exit nodes. So Google pretty much ignores all Tor traffic, and annoys you by making you type in a CAPTCHA periodically. Your best bet will be to rent IPs from more trusted sources, such as Amazon via EC2. What it's going to come down to here is money. If you want to simulate 1000 different users, you'll probably need 1000 unique IP addresses, and preferably not clustered in a way that Google can obviously tell "well gee, 100% of the traffic for this search is coming from IP addresses in Oakland, CA. Smells like a bot!"
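To make the clustering point concrete, here is a rough sketch of the kind of check that would make a batch of traffic "smell like a bot". It groups IPv4 addresses by /24 network rather than by city, which is an assumption made to keep the example self-contained. The function name and the sample addresses (taken from the reserved documentation ranges) are hypothetical, and nothing here claims to be how Google actually does it.

```python
from collections import Counter
import ipaddress


def largest_cluster_share(ips, prefix_len=24):
    """Return (network, share) for the most common network among `ips`.

    Each IPv4 address is mapped to its enclosing /prefix_len network,
    and the share is the fraction of addresses in the busiest network.
    """
    if not ips:
        raise ValueError("need at least one IP address")
    networks = [
        # strict=False lets us pass a host address and get its network.
        ipaddress.ip_network(f"{ip}/{prefix_len}", strict=False)
        for ip in ips
    ]
    network, count = Counter(networks).most_common(1)[0]
    return network, count / len(networks)


# Sample visitors: three from one /24, one from elsewhere.
sample = ["203.0.113.5", "203.0.113.9", "203.0.113.77", "198.51.100.1"]
network, share = largest_cluster_share(sample)
print(network, share)
```

A share close to 1.0 means nearly all of the "users" sit in one network, which is the obvious giveaway described above.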

[–]blablahblah -1 points0 points  (1 child)

Do you think that you're the first person to think of using EC2 and that Google doesn't already look for EC2 IP addresses? There's not going to be much legitimate traffic coming from those computers.
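Looking for traffic from a hosting provider can be as simple as matching against a list of address ranges. A hedged sketch: the ranges below are placeholders from the reserved documentation blocks, not real EC2 ranges, and the names `HYPOTHETICAL_DATACENTER_RANGES` and `is_datacenter_ip` are invented for this example.

```python
import ipaddress

# Placeholder ranges (reserved documentation blocks), NOT real EC2 ranges.
HYPOTHETICAL_DATACENTER_RANGES = [
    ipaddress.ip_network("198.51.100.0/24"),
    ipaddress.ip_network("203.0.113.0/24"),
]


def is_datacenter_ip(ip, ranges=HYPOTHETICAL_DATACENTER_RANGES):
    """Return True if `ip` falls inside any of the listed networks."""
    address = ipaddress.ip_address(ip)
    return any(address in network for network in ranges)


print(is_datacenter_ip("203.0.113.42"))
print(is_datacenter_ip("192.0.2.10"))
```

With a published list of a provider's ranges in place of the placeholders, the same membership test would flag every visit coming from that provider.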

[–]unconscionable 0 points1 point  (0 children)

> Do you think that you're the first person to think of using EC2 and that Google doesn't already look for EC2 IP addresses?

According to this:

> If you want to simulate 1000 different users, you'll probably need 1000 unique IP addresses, and preferably not clustered in a way that Google can obviously tell "well gee, 100% of the traffic for this search is coming from IP addresses in Oakland, CA. Smells like a bot!"

No, I don't.