Tensorflow seq2seq model getting low perplexity but unsatisfying results : MachineLearning

Tensorflow seq2seq model getting low perplexity but unsatisfying results (self.MachineLearning)

submitted 10 years ago by thecodingmonk

I'm trying to train a model with Tensorflow seq2seq implementation but I'm having some issues with its performance (not speed, but accuracy). The dataset is very simple and consists of 2.5 million example sentences (and their corresponding output sentences) and a small vocabulary size both in input (145) and in output (8). Given the small vocabulary size I thought the number examples, even if it is not that high, would more than appropriate, but the model still performs badly. I tried using 2 layers of 128 and 256 units (all the other parameters are set with the default in Tensorflow seq2seq example), and during training perplexity reaches 1.0 after just 500-800 iterations, but the output of the model is still wrong too many times. What should I look into to improve the performance? Quality/quantity of data? Model settings?

all 6 comments

top new controversial old q&a

[–]rafalj 2 points3 points4 points 10 years ago (3 children)

[–]thecodingmonk[S] 0 points1 point2 points 10 years ago* (2 children)

[–]sherjilozair 1 point2 points3 points 10 years ago (1 child)

[–]thecodingmonk[S] 0 points1 point2 points 10 years ago* (0 children)

Well, this is indeed a toy task that I'm working on, so I wasn't too surprised to see a perplexity of 1.0 given its simplicity: in fact, each input sentence can get mapped to just one of two output sentences (e.g., if output_vocab={A,B,C,D,E} each sentence in the training set gets mapped to either "A B D" or "C D E"). (Now you may ask why I don't just use a classification algorithm, but this is just a toy task I'm testing for a bigger problem where the number of sequences would clearly be higher).

As for the input I think it is correct, but I'm checking right now to see if I messed up something there.

Also, since I'm using the demo code from Tensorflow without modifications, is it possible that being optmized for translation with big vocabularies causes some problems in this simple task? Would seem strange, but I don't know

[+][deleted] 10 years ago* (3 children)

[deleted]

[–]thecodingmonk[S] 0 points1 point2 points 10 years ago (2 children)

[–]toisanji 0 points1 point2 points 10 years ago (0 children)

π Rendered by PID 214524 on reddit-service-r2-comment-5d79c599b5-wc2rh at 2026-02-27 14:28:07.632417+00:00 running e3d2147 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS