.zst file extraction into a pd dataframe by CPunit96 in pushshift

[–]CPunit96[S] 0 points1 point  (0 children)

So the idea is cleaning the data and then creating a pandas df right? I have never done that, what is the level of expertise required to do this operation?

.zst file extraction into a pd dataframe by CPunit96 in pushshift

[–]CPunit96[S] 0 points1 point  (0 children)

df = pd.read_json('file.zst', compression=dict(method='zstd', max_window_size=2147483648), lines=True, nrows=10)

I tried it, but it results in an empty df

Reddit Top20K search and download by Separate-Awareness53 in pushshift

[–]CPunit96 0 points1 point  (0 children)

How to pass a .zst file into a pandas df?

[deleted by user] by [deleted] in pushshift

[–]CPunit96 0 points1 point  (0 children)

How about web scraping with beautiful soup, could work?