|Author(s):||Yi-Kai Liu; John M. Conroy; Sashka T. Davis; Jeff Kubina; Dianne P. O'Leary; Judith D. Schlesinger|
|Title:||Multilingual Summarization: Dimensionality Reduction and a Step Towards Optimal Term Coverage|
|Published:||August 9, 2013|
|Abstract:||In this paper we present three term-weighting approaches for multilingual document summarization and give results on the DUC 2002 data as well as on the 2013 Multilingual Wikipedia feature articles data set. We introduce a new interval-bounded nonnegative matrix factorization. We use this new method, latent semantic analysis (LSA), and latent Dirichlet allocation (LDA) to give three term-weighting methods for multi-document multilingual summarization. Results on DUC and TAC data, as well as on the MultiLing 2013 data, demonstrate that these methods are very promising, since they achieve oracle coverage scores in the range of humans for 6 of the 10 test languages.|
|Dates:||August 9, 2013|
|Keywords:||document summarization, nonnegative matrix factorization|
|Research Areas:||Data Mining|