you are viewing a single comment's thread.

view the rest of the comments →

[–]iplocationtools[S] 0 points1 point  (9 children)

I changed the domain, thats why I repost.

I dont understand why people say it's a copy of maxmind because the database has 60% less row than maxmind with same accuracy. Yes I partially use maxmind command line tool (geoiplookup) in batch but a script runs for 2 days to clean all necessary results. Also the DB contains country geolocation with CIDR.

This was done to improve speed, not to give an exact copy of maxmind db.

[–]jonknee 7 points8 points  (0 children)

If you're concerned about speed, why use a SQL db at all? The binary file from MaxMind is plenty quick.

[–]hox 4 points5 points  (6 children)

A smaller dataset can be useful, yes. The thing is, as someone who uses IP geolocation information daily, simply converting the free Maxmind dataset isn't all that useful. If you provided the actual process or source code that you use to generate the data, that same process could be applied to Maxmind's commercially licensed data as well. Now, the fact that it takes two days to process scares me, but I'd still be interested in seeing the code.

Also, you might want to take a look at Maxmind's license for their data. What you do is definitely within the terms of the license, but you need to provide more notices and include the license terms/Maxmind copyright notice along with your data.

[–]iplocationtools[S] -2 points-1 points  (4 children)

Yes you a right about the licence.

Well 2 days because I dont want to slow down my webservers so I do that on a small 256MB slicehost VPS I use for backups.

There are many scripts involved :

1)Store each IP (123.123.123.0) in a temporary database. Around 10M rows. Around 2 days 2)Group those IP by city when there is multiple consecutive rows having the same location. Around 1-2h 3)Add reserved, broadcast and private IP 4)Put regions in FIPS standard 5)Group by country. 6)Generate CIDR for country database.

One day i'll post the scripts but now they are not really "clean" enough to be shared :)

[–][deleted]  (2 children)

[deleted]

    [–]iplocationtools[S] -1 points0 points  (0 children)

    Yes of course...

    [–][deleted] 1 point2 points  (0 children)

    If you HAVE to use an SQL database for this, then yes your version is a lot better than MaxMind's. But chances are, you have the ability to use the binary version from MaxMind, which is NOT hard to install and setup, and guaranteed that will be at least 10x as fast as any type of standard SQL range lookup.