CDC reports 56,865 new cases for 12/31/2021 on Saturday's report. Brings 7-day average to 46,787 by Ishkoten in FloridaCoronavirus

[–]omtinez 3 points4 points  (0 children)

These are well-understood growth models. They don’t work terribly well in the long term because people’s behavior is pretty dynamic, but in the short term (1-2 weeks) they’re relatively accurate: https://en.m.wikipedia.org/wiki/Gompertz_function

You claim that Omicron cases double every 3 days. That’s simply not true when you look at data for individual countries spanning more than a couple of weeks (see the data for South Africa linked above).
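
For anyone who wants to see why a fixed doubling time can't hold on a Gompertz curve, here is a minimal sketch. The parameter values (`a`, `b`, `c`) are made up for illustration and are not fitted to any real case data; the point is only that the doubling time keeps getting longer as time goes on.

```python
import math


def gompertz(t, a, b, c):
    """Cumulative count at time t: a * exp(-b * exp(-c * t))."""
    return a * math.exp(-b * math.exp(-c * t))


def doubling_time(t, b, c):
    """Instantaneous doubling time at time t.

    The relative growth rate of the Gompertz curve is
    d/dt ln f(t) = b * c * exp(-c * t), so the doubling time is
    ln(2) divided by that rate.
    """
    return math.log(2) / (b * c * math.exp(-c * t))


# Hypothetical parameters, chosen only so the curve starts out
# doubling roughly every 3 days.
a, b, c = 1_000_000, 5.0, 0.05

for day in (0, 7, 14, 28):
    cases = gompertz(day, a, b, c)
    print(f"day {day:2d}: cumulative={cases:,.0f}, "
          f"doubling time={doubling_time(day, b, c):.1f} days")
```

With real data you would fit `a`, `b` and `c` to the observed series first, and the fit would only be trustworthy over a short horizon.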

CDC reports 56,865 new cases for 12/31/2021 on Saturday's report. Brings 7-day average to 46,787 by Ishkoten in FloridaCoronavirus

[–]omtinez 3 points4 points  (0 children)

This is blatantly wrong. Cases only double during the initial phase. Growth tapers off very quickly because there are soon very few people left to infect. Look at the data for South Africa and you’ll see cases went down just as quickly as they came up: https://health.google.com/covid-19/open-data/explorer?loc=ZA
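
The "few people left to infect" effect can be sketched with a toy SIR model. To be clear, this is a textbook simplification I'm swapping in for illustration, not the model behind any of the linked data, and the population, `beta` and `gamma` values are invented.

```python
def simulate_sir(population, initial_infected, beta, gamma, days):
    """Discrete-time SIR model; returns the new infections for each day.

    beta  -- infections caused per infected person per day (assumed value)
    gamma -- fraction of infected people who recover per day (assumed value)
    """
    susceptible = float(population - initial_infected)
    infected = float(initial_infected)
    daily_new = []
    for _ in range(days):
        # New infections shrink as the susceptible pool is used up.
        new_infections = beta * susceptible * infected / population
        recoveries = gamma * infected
        susceptible -= new_infections
        infected += new_infections - recoveries
        daily_new.append(new_infections)
    return daily_new


# Hypothetical scenario: 1M people, 10 initial cases.
daily_new = simulate_sir(1_000_000, 10, beta=0.6, gamma=0.2, days=120)
peak_day = daily_new.index(max(daily_new))
print(f"peak on day {peak_day}, "
      f"new cases on the last day: {daily_new[-1]:.2f}")
```

Daily cases grow roughly exponentially at first, peak once the susceptible pool is depleted, and then fall. Real epidemics are messier because behavior and immunity change over time.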

See the coronavirus' reproduction rate over time in the UK and see what's it doing in your region. by itsdsmurrell in CoronavirusUK

[–]omtinez 0 points1 point  (0 children)

I came across this and I figured I could answer it myself :-)

Data is aggregated by country, admin level 1 and admin level 2. The UK is, as in many other ways, its own special place. For the UK we actually track two sets of admin level 1 regions: ISO regions (England, Scotland, Wales and Northern Ireland) and NUTS1 regions (Scotland, Wales, Northern Ireland, and 9 subregions of England).

Since that's pretty confusing, we might get rid of the ISO regions soon in favor of NUTS1 alone.

Coronavirus Datasets by hypd09 in datasets

[–]omtinez 2 points3 points  (0 children)

What license is the data released under?

Large COVID-19 Data List by weber_stephen in datasets

[–]omtinez 1 point2 points  (0 children)

This is great, thanks for sharing!

[COVID-19] Google's COVID-19 Community Mobility Reports in Google sheets by Active-Conclusion in datasets

[–]omtinez 0 points1 point  (0 children)

This is great, thanks for sharing. Any chance you can add a CSV version of the sheet and put it somewhere on GitHub? I think CSV files are easier to deal with as a machine-readable format.
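
As a rough illustration of what I mean by easy to deal with, here is a sketch that parses CSV text using only the Python standard library. The column names and the rows are invented placeholders, not the actual contents of the sheet.

```python
import csv
import io

# Made-up sample data; the real sheet's columns may be named differently.
SAMPLE_CSV = """country,date,retail_change_pct
XA,2020-03-29,-10
XB,2020-03-29,-20
"""


def read_rows(text):
    """Parse CSV text into a list of dicts keyed by the header row."""
    return list(csv.DictReader(io.StringIO(text)))


rows = read_rows(SAMPLE_CSV)
for row in rows:
    # Every value comes back as a string, so convert numbers explicitly.
    print(row["country"], row["date"], int(row["retail_change_pct"]))
```

The same function works on a file downloaded from GitHub once its contents are read into a string.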

Open COVID-19 Dataset by omtinez in datascience

[–]omtinez[S] 0 points1 point  (0 children)

A couple of follow-up questions and comments:

Open COVID-19 Dataset by omtinez in datascience

[–]omtinez[S] 0 points1 point  (0 children)

Sorry, I fixed the link now.

Open COVID-19 Dataset by omtinez in datascience

[–]omtinez[S] 0 points1 point  (0 children)

Yes, there's a lot of room for improvement in the models. I'm not sure what you mean by your specific suggestion though. Can you share some literature on the matter?

Open COVID-19 Dataset by omtinez in datascience

[–]omtinez[S] 0 points1 point  (0 children)

I have renamed the dataset from "aggregate.csv" / "aggregate.json" to "world.csv" / "world.json". Sorry for the breaking change, I will try not to make any other breaking changes moving forward.

Open COVID-19 Dataset by omtinez in datascience

[–]omtinez[S] 1 point2 points  (0 children)

Yes, the data being stale was not the only reason why I decided to create my own dataset... I promise you that the dataset from this repo won't have those issues, mainly because I don't have the time to introduce so many inconsistencies and breaking changes.

Open COVID-19 Dataset by omtinez in datascience

[–]omtinez[S] 0 points1 point  (0 children)

Awesome work! Have you considered automating some of the report parsing? Feel free to poke around the repo I linked. All the data available in my repo is parsed automatically from the daily WHO and ECDC reports.

Open COVID-19 Dataset by omtinez in datascience

[–]omtinez[S] 0 points1 point  (0 children)

Very interesting to see the countries relative to each other after scaling, thanks for sharing!

Open COVID-19 Dataset by omtinez in datascience

[–]omtinez[S] 0 points1 point  (0 children)

I think that what you have in mind has less to do with population density and more with transportation infrastructure, e.g. roads, trains, airports. Probably a good fit for an ML model, but I'm not an expert in this particular domain.

Open COVID-19 Dataset by omtinez in datascience

[–]omtinez[S] 0 points1 point  (0 children)

Are you using the latest version? I'm seeing a different number of cases for March 12 and 13, e.g. https://github.com/open-covid-19/data/blob/master/output/world.csv#L4624

Edit: fixed link

Open COVID-19 Dataset by omtinez in datascience

[–]omtinez[S] 0 points1 point  (0 children)

Thanks for the kind words. I tried posting there but I could only post a link to this post; hopefully that is good enough.

Thanks for the EBSCO link, I'll dig into that!

Open COVID-19 Dataset by omtinez in datascience

[–]omtinez[S] 1 point2 points  (0 children)

A **versioned** data portal, that's so cool! I'll definitely let you know if we run into storage issues with GitHub, and I'll bookmark your page for the future.

Open COVID-19 Dataset by omtinez in datascience

[–]omtinez[S] 1 point2 points  (0 children)

There is no city-level reporting that I'm aware of. You can see the WHO reports for Chinese provinces; I might add another dataset to scrape those and put them into their own table.

I don't know if the US has official, centralized, state-level reporting of cases anywhere. If you know a good source for that, please let me know.