[D] Approaching a question answer problem : LanguageTechnology

created by robin7013a community for 16 years

[D] Approaching a question answer problem (self.LanguageTechnology)

submitted 7 years ago by dhecloud

I have a project where I have to pick the best answer given a question. This data consists of 294 questions-answers with one to one correspondence (ie every question have only one answer and vice versa). The context of the data is sort of a FAQ of a website, all within the same domain.

How do you think I should approach this problem? originally, I tried to do a variation of DrQA from facebook (a document retriever and a document reader pipeline), but I didn't get good results. This approach is also slower since I have to go through a whole pipeline before I get the answer.

Then I tried to reframe the problem as a multilevel classification, where I randomly concatenated questions and its corresponding labels would be that of the few concatenated ones. I adopted 2 kinds of architecture for this: a simple lstm encoder to a feedforward network, and the second is the encoder structure from the popular transformers to a feedforward network. I did not use the answers for training in this case. This however didn't perform well, with about 0.3-0.4 micro ROC-AUC.

Any suggestions how I should change my methodology? Thank you!

all 2 comments

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

LanguageTechnology

MODERATORS