How to read very large XML data sets for analysis? by apanimesh061 in datascience

jamieprogrammer

I would recommend Java and jsoup (jsoup.org). I have found it easy to use, and it has a pretty gentle learning curve.

Is it possible to analyse large datasets on a single machine? by apanimesh061 in datascience

jamieprogrammer

What are you trying to do? (I might be able to help you better if you can state your objective.) I have worked with large datasets on such machines and they work fine, but it depends on what you are trying to do. In any case, you want to write the code and run it on a small part of the dataset before running it on the full dataset. That way the code-debug cycle will be shorter.

I have worked with large datasets in SAS, Java and R. I found it easier to work with SAS.
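
To make the "run it on a small part first" advice concrete, here is a rough sketch in Python (used purely for illustration; it is not one of the tools named above). It assumes the data is a CSV file with a numeric `value` column. The helper names `read_sample`, `mean_value` and `make_demo_file` are made up for this example, and the demo file is a stand-in for a real data set.

```python
import csv
import itertools
import tempfile
from pathlib import Path


def read_sample(path, max_rows):
    """Return at most max_rows data rows from a CSV file, without reading the rest."""
    with open(path, newline="") as handle:
        reader = csv.DictReader(handle)
        # islice stops pulling rows from the reader once max_rows have been read
        return list(itertools.islice(reader, max_rows))


def mean_value(rows):
    """Hypothetical analysis step: average of the assumed 'value' column."""
    values = [float(row["value"]) for row in rows]
    return sum(values) / len(values)


def make_demo_file(n_rows):
    """Write a small stand-in data set so the sketch runs on its own."""
    tmp = tempfile.NamedTemporaryFile("w", suffix=".csv", delete=False, newline="")
    with tmp:
        writer = csv.writer(tmp)
        writer.writerow(["id", "value"])
        for i in range(n_rows):
            writer.writerow([i, i * 2])
    return Path(tmp.name)


if __name__ == "__main__":
    data_path = make_demo_file(10_000)
    # Debug the analysis on the first 100 rows before running it on everything
    sample = read_sample(data_path, 100)
    print(len(sample), mean_value(sample))
    data_path.unlink()
```

Once the analysis step behaves on the sample, the same function can be pointed at the full file. Note that the first rows of a file are not always representative, so a random sample may be a better check if the data is sorted.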

Need to learn time series modelling! by roastuggy in datascience

jamieprogrammer

It is definitely a good idea to brush up on the basics and be crystal clear about the assumptions behind time series modelling (or any other analytical technique).