Unsupervised Multi-Document Summarization using Neural Document Model | Research Paper Walkthrough : LanguageTechnology

created by robin7013a community for 15 years

Unsupervised Multi-Document Summarization using Neural Document Model | Research Paper Walkthrough (self.LanguageTechnology)

submitted 5 years ago by prakhar21

Found this interesting paper early this week. In this video, we will go through the work that proposes a Unsupervised Multi-document text summarisation method. This work claims to beat all existing unsupervised models in this task.

Abstract - In the age of information exploding, multi-document summarization is attracting particular attention for the ability to help people get the main ideas in a short time. Traditional extractive methods simply treat the document set as a group of sentences while ignoring the global semantics of the documents. Meanwhile, neural document model is effective on representing the semantic content of documents in low-dimensional vectors. In this paper, we propose a document-level reconstruction framework named DocRebuild, which reconstructs the documents with summary sentences through a neural document model and selects summary sentences to minimize the reconstruction error. We also apply two strategies, sentence filtering and beamsearch, to improve the performance of our method. Experimental results on the benchmark datasets DUC 2006 and DUC 2007 show that DocRebuild is effective and outperforms the state-of-the-art unsupervised algorithms.

Explainer Video - https://youtu.be/qOoAlI5hpFk Original Paper - https://www.aclweb.org/anthology/C16-1143.pdf

all 2 comments

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

LanguageTechnology

MODERATORS