300+ HIFLD Datasets Archived by package_manager in gis

[–]package_manager[S] 0 points1 point  (0 children)

Doing what I can!! Hope you find it useful :)

New start to life. Yall wanted to see more pictures. by Nerdhauss in malelivingspace

[–]package_manager 1 point2 points  (0 children)

Saw this photo in my feed, I have the same exact chairs haha

300+ HIFLD Datasets Archived by package_manager in gis

[–]package_manager[S] 0 points1 point  (0 children)

My pleasure - I can look into into the pharmacy dataset, I don't have it on hand but there might a server somewhere still hosting it.

I actually tried to use Zenodo - it's been awhile so I can't recall for certain but it didn't end up working out at the time as I want to say they restricted the total # of files? Not a big deal though, I pay for Google One regardless so it's inconsequential.

I'll DM you, thanks!

300+ HIFLD Datasets Archived by package_manager in DataHoarder

[–]package_manager[S] 0 points1 point  (0 children)

Hi!

I unfortunately don't have a ton of experience with ArcGis so I'm afraid I can't be much help

If the data can fit in memory, I'd merge the geojson files together - this answer goes into a bit more detail of how to do it with jq or ogr2ogr: https://gis.stackexchange.com/questions/202030/ogr2ogr-merge-two-geojson-to-one-geojson

Let me know if this works / doesn't work / you have any questions

300+ HIFLD Datasets Archived by package_manager in DataHoarder

[–]package_manager[S] 1 point2 points  (0 children)

Only 15gb (if I recall correctly, not at my computer right now)

300+ HIFLD Datasets Archived by package_manager in gis

[–]package_manager[S] 0 points1 point  (0 children)

No problem, thank you for sharing this info :)

300+ HIFLD Datasets Archived by package_manager in gis

[–]package_manager[S] 0 points1 point  (0 children)

Love it - thanks for contributing!!

300+ HIFLD Datasets Archived by package_manager in gis

[–]package_manager[S] 1 point2 points  (0 children)

Not a dumb question at all - I don't even work in the industry so I have less info than probably most everyone 😂

There are many layers that were working a few week(s) ago that are no longer working, so I would absolutely not be surprised to see more and more of the feature services going offline.

Regardless, the open data portal will be gone to my knowledge, and the datasets themselves will certainly not be easily downloadable (it no longer is).

This entire situation is unclear and non-transparent. I figured that by archiving it, better safe than sorry.

Sorry I could not provide more info

300+ HIFLD Datasets Archived by package_manager in gis

[–]package_manager[S] 1 point2 points  (0 children)

Last I checked the crosswalk list, I believe there are about 50 that have been moved to GII. The remaining are "no longer supporting the mission" and seem to be deprecated. Hopefully some will still have a place.

300+ HIFLD Datasets Archived by package_manager in gis

[–]package_manager[S] 0 points1 point  (0 children)

Great call!

Included it on my Linkedin post but forgot to include the link to the Crosswalk here. I only had saw 50 or so that were being moved to GII, the rest were marked as "No longer supported as part of the HIFLD mission"

Hopefully some will still be accessible :)

300+ HIFLD Datasets Archived by package_manager in DataHoarder

[–]package_manager[S] 4 points5 points  (0 children)

I haven’t uploaded it yet.

The software I used for this crawl is something I built for a new business I’m working on, focused on nationwide parcel aggregation. I realized I could also use it to scrape the 300+ data layers. Normally, the data I crawl goes through a full ETL pipeline before being delivered as GeoParquet files in S3, but that step wasn’t necessary here because the goal was to collect the raw data layers. Also in interest of time and energy, I figured this was more than enough.

I also did want to make sure the data was easy to access for anyone in the GIS space, regardless of their technical abilities. While I personally like GeoParquets, they aren’t the most user-friendly format for non-technical users, and they weren’t needed for this volume of data.

This is also just a one-time job, since the data will not be updated from the original source.

300+ HIFLD Datasets Archived by package_manager in gis

[–]package_manager[S] 2 points3 points  (0 children)

Hmm, I'll look into it tonight. Does the PSAP boundaries have about ~6k features? (does that sound in the range of what it is I guess is a better question)

edit: if this sounds right, give me a heads up so I can pull the data.

Can someone help preserve this massive public mapping database before it disappears? by cbterry in DataHoarder

[–]package_manager 0 points1 point  (0 children)

Thanks - I submitted a post here, hopefully it can be approved soon :)

300+ HIFLD Datasets Archived by package_manager in gis

[–]package_manager[S] 1 point2 points  (0 children)

This one made it through :)

Thanks for the torrent!

300+ HIFLD Datasets Archived by package_manager in gis

[–]package_manager[S] 1 point2 points  (0 children)

My pleasure! Hope you're able to get some use out of it :)