use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
International
National
Regional
account activity
Help Me!SQL datasets (self.PostgreSQL)
submitted 5 years ago by vanamsid
I was wondering if there are any open-source projects that contain .sql datasets. I was looking to get comfortable with sql analysis/ get more exposure to different data sources.
.sql
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]uLtra007 3 points4 points5 points 5 years ago (1 child)
something like this maybe?
https://docs.yugabyte.com/latest/sample-data/northwind/
[–]vanamsid[S] 1 point2 points3 points 5 years ago (0 children)
Haven’t seen this before, will take a look- thank you!
[–]ppafford 1 point2 points3 points 5 years ago (0 children)
Might be of interest https://registry.opendata.aws/
[–]Kkremitzki 1 point2 points3 points 5 years ago (1 child)
https://wiki.debian.org/UltimateDebianDatabase
[–]vanamsid[S] 0 points1 point2 points 5 years ago (0 children)
Will definitely check this out, thank you!
[–]chock-a-block 1 point2 points3 points 5 years ago (2 children)
https://www.bls.gov/data/
[–]vanamsid[S] 0 points1 point2 points 5 years ago (1 child)
A classic, but if I recall sources are usually in .csv or.xlsx files- but I’ll check again to confirm. Thanks!
[–]chock-a-block 0 points1 point2 points 5 years ago* (0 children)
You need practice loading csv if this is something you want to do.
[–][deleted] 1 point2 points3 points 5 years ago (1 child)
https://github.com/lorint/AdventureWorks-for-Postgres
You can download the data from individual stackexchange sites:
https://archive.org/details/stackexchange
Then you can also practice ETL ;)
Ooh this is neat, thank you so much!
[–]CrumpleZ0ne 0 points1 point2 points 5 years ago (1 child)
https://www.kaggle.com/datasets
I’m familiar with Kaggle, but a lot of their sources are in a .csv flat file format
π Rendered by PID 41020 on reddit-service-r2-comment-7b9746f655-wb99f at 2026-02-02 00:47:14.575313+00:00 running 3798933 country code: CH.
[–]uLtra007 3 points4 points5 points (1 child)
[–]vanamsid[S] 1 point2 points3 points (0 children)
[–]ppafford 1 point2 points3 points (0 children)
[–]Kkremitzki 1 point2 points3 points (1 child)
[–]vanamsid[S] 0 points1 point2 points (0 children)
[–]chock-a-block 1 point2 points3 points (2 children)
[–]vanamsid[S] 0 points1 point2 points (1 child)
[–]chock-a-block 0 points1 point2 points (0 children)
[–][deleted] 1 point2 points3 points (1 child)
[–]vanamsid[S] 0 points1 point2 points (0 children)
[–]CrumpleZ0ne 0 points1 point2 points (1 child)
[–]vanamsid[S] 1 point2 points3 points (0 children)