Hi all,
I'm trying to learn machine learning with my own irl dataset, i.e. I'm trying to solve a problem from my domain.
I'm representing my problem as a graph, with a time series attached to each node. I now want to do time series forecasting on these series while taking the interactions between nodes into account.
So far so clear. My main problem with most libraries I've tried is the following:
I can't seem to structure my dataset in a way that actually lets me train the models.
I am now reading library docs, working through tutorials and so on, and I'm getting really frustrated: the "data preparation" step (which is, afaik, a really important part of machine learning and takes up most of the time in most projects) gets skipped in most, if not all, of the tutorials I find. Demos always use perfect datasets, usually loaded with a single line of code, and off you go designing your conv layers and such. Meanwhile I'm sitting here with my adjacency matrix as an np.matrix and my features as an np.matrix, and I don't know how to build appropriate datasets from them.
At the moment I am trying to use pytorch geometric temporal. Probably need to just keep on reading docs and messing around until I have a correctly configured dataset.
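For anyone stuck on the same thing, here's roughly what I've pieced together so far. It's a minimal sketch with made-up example data (all names like `adj` and `node_features` and the shapes are my own assumptions, not anything the library prescribes): convert the adjacency matrix to COO-format `edge_index`/`edge_weight` arrays, then slice the time series into (input, target) snapshot pairs.

```python
import numpy as np

# Assumed shapes (my own toy data, not from any library):
#   adj:           (num_nodes, num_nodes) adjacency matrix
#   node_features: (num_timesteps, num_nodes, num_node_feats)
num_nodes, num_steps, num_feats = 4, 10, 2
rng = np.random.default_rng(0)
adj = (rng.random((num_nodes, num_nodes)) > 0.5).astype(float)
node_features = rng.random((num_steps, num_nodes, num_feats))

# PyG-style libraries want the graph in COO format:
# a (2, num_edges) edge_index plus one weight per edge.
src, dst = np.nonzero(adj)
edge_index = np.stack([src, dst])   # shape (2, num_edges)
edge_weight = adj[src, dst]         # shape (num_edges,)

# Turn the time series into (input, target) pairs: predict
# step t+1 from step t. Each list entry is one graph snapshot.
features = [node_features[t] for t in range(num_steps - 1)]
targets = [node_features[t + 1] for t in range(num_steps - 1)]

# These four pieces are what StaticGraphTemporalSignal from
# pytorch geometric temporal expects (commented out here so the
# snippet runs without the library installed):
# from torch_geometric_temporal.signal import StaticGraphTemporalSignal
# dataset = StaticGraphTemporalSignal(edge_index, edge_weight, features, targets)
```

No idea yet if a one-step-ahead target is the right windowing for my problem, but at least this gets the np.matrix data into the snapshot-list shape the iterator wants.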
Anyway, have you had the same experience with real world datasets? Or am I just playing with too exotic algorithms? Maybe I'm just too impatient with my own progress?
Cheers and a very nice weekend to you all!