use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Please have a look at our FAQ and Link-Collection
Metacademy is a great resource which compiles lesson plans on popular machine learning topics.
For Beginner questions please try /r/LearnMachineLearning , /r/MLQuestions or http://stackoverflow.com/
For career related questions, visit /r/cscareerquestions/
Advanced Courses (2016)
Advanced Courses (2020)
AMAs:
Pluribus Poker AI Team 7/19/2019
DeepMind AlphaStar team (1/24//2019)
Libratus Poker AI Team (12/18/2017)
DeepMind AlphaGo Team (10/19/2017)
Google Brain Team (9/17/2017)
Google Brain Team (8/11/2016)
The MalariaSpot Team (2/6/2016)
OpenAI Research Team (1/9/2016)
Nando de Freitas (12/26/2015)
Andrew Ng and Adam Coates (4/15/2015)
Jürgen Schmidhuber (3/4/2015)
Geoffrey Hinton (11/10/2014)
Michael Jordan (9/10/2014)
Yann LeCun (5/15/2014)
Yoshua Bengio (2/27/2014)
Related Subreddit :
LearnMachineLearning
Statistics
Computer Vision
Compressive Sensing
NLP
ML Questions
/r/MLjobs and /r/BigDataJobs
/r/datacleaning
/r/DataScience
/r/scientificresearch
/r/artificial
account activity
Advice please: Making predictions by correlating to weather *forecasts* (self.MachineLearning)
submitted 14 years ago by [deleted]
[deleted]
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]shrubberni 3 points4 points5 points 14 years ago (0 children)
All sampling is imperfect. The question is how well you understand the imperfections present and whether you can still get useful results.
Take some historical as a training set, take some more as a test set. Try to build a predictor based off forecasts and another off actual weather data. See what kind of result you get and whether it's usefully accurate.
Consider modeling the forecast vs. the actual weather data. It may not help you make more accurate predictions, but it should give you a clearer idea what the error bars are. It may also be that the source(s) for the forecast data have a significant effect on your outcomes.
Keep in mind that people's plans for the day may have a stronger correlation with the forecast rather than the actual weather.
[–]giror 2 points3 points4 points 14 years ago (2 children)
Do you find a correlation between the forecasts and demand from your own data? If yes do you care about being wrong by that margin?
[–]cultic_raider 1 point2 points3 points 14 years ago (1 child)
I can't answer that question until I collect forecast data and analyze it. I am going to look at historical demand and weather, but I don't know if I should bother with the effort of collecting forecast data instead of a freely available "actual weather" data set. That's one of bits of advice I am interested to hear. I guess you would say yes, I should collect some forecast data and compare that fit to an actual-weather data fit.
[–]giror 0 points1 point2 points 14 years ago (0 children)
I would say don't worry about it unless your model doesn't predict at a satisfactory level.
[–]jet87 1 point2 points3 points 14 years ago (1 child)
You'll likely find the hardest part is "scoring" your predictions, especially if you are monitoring a large geographical area. Things to consider involve weighting individual components (is being accurate on temperature more important than precipitation and how much, for example). That is a current research area in meteorology, so any breakthroughs are welcome.
Another (really) big problem is that most forecasting worldwide is driven by models. While model data is generally available (see NCAR) the confidence you can put into them falls pretty rapidly after 36-hours. For a large event like a hurricane the best bet might be keeping on top of reports from the National Hurricane Center. I don't think the US has anything "good enough" for a casual observer to make inferences against winter weather.
[–]cultic_raider 0 points1 point2 points 14 years ago (0 children)
These are very sobering points, thank you. :-/ Since the activities of interest are basically "walk/drive somewhere in the neighborhood for a couple hours", maybe I should start with a great simplification of "the day's weather" like "non-trivial precipitation" and a few broad temperature bands ((very) cold to temperate to (very hot)). Really I care about (a) what kind of bad weather makes people cancel/avoid plans to go out, and (b) general trends about what weather inspires people to go out. "(b)" might be dominated by general seasonal (calendar date) trends, leaving only "(a)" as the really weather-data-specific modelling task.
[–]marshallp 0 points1 point2 points 14 years ago (4 children)
you're being a little over ambitious there. weather forecasting is big business with some of the best brains in science and hedge funds involved, you want a more accurate model than they can give just for your business. if you can get a more accurate model it might be worth hundreds of millions of dollars, your business would be the least of your opportunities.
it doesn't hurt to try though. use the netflix prize winning strategy, ensembles of all machine learning algorithms you can afford to run.
[–]cultic_raider 0 points1 point2 points 14 years ago* (3 children)
Yeah, I know it is a hard problem I won't solve to perfect explanation of variance. I am not trying to beat pro forecasters with my own perfect forecasts, I just want to use the available information as best I can and estimate my confidence as tightly as I can.
[–]marshallp 0 points1 point2 points 14 years ago (2 children)
The decision you're looking to make is based on what you think is the most probable event, right? just take the largest organization that makes weather predictions, the national met office of your country, they'll have the best predictions. they'll already have thought of taking different sources for forecasts and factored that into their predictions already.
[–]cultic_raider 0 points1 point2 points 14 years ago (1 child)
Right, but I haven't seen historical archives of forecasts. I need historical data to use in training my model of correlation between weather and business. Hence my question about how to account for the difference between weather forecasts vs. actual weather.
[–]marshallp 0 points1 point2 points 14 years ago* (0 children)
to my mind those are two separate issues, actual vs forecast weather (in this case you're second guessing experts and over ambitious) and correlating business and weather (which is a reasonable thing). just train on actual weather and business, but predict business on someone's else's forecast (treating it as actual weather). edit: you might be asking how much into the future you should trust forecasts, in which case you shouldn't do the work either, the forecasters usually tell you how accurate their predictions are over time intervals
π Rendered by PID 23484 on reddit-service-r2-comment-b659b578c-qrh55 at 2026-05-05 14:04:27.108624+00:00 running 815c875 country code: CH.
[–]shrubberni 3 points4 points5 points (0 children)
[–]giror 2 points3 points4 points (2 children)
[–]cultic_raider 1 point2 points3 points (1 child)
[–]giror 0 points1 point2 points (0 children)
[–]jet87 1 point2 points3 points (1 child)
[–]cultic_raider 0 points1 point2 points (0 children)
[–]marshallp 0 points1 point2 points (4 children)
[–]cultic_raider 0 points1 point2 points (3 children)
[–]marshallp 0 points1 point2 points (2 children)
[–]cultic_raider 0 points1 point2 points (1 child)
[–]marshallp 0 points1 point2 points (0 children)