New Approach for Multi-Document Update Summarization

dc.contributor.authorLong, Chong
dc.contributor.authorHuang, Min-Lie
dc.contributor.authorZhu, Xiao-Yan
dc.contributor.authorLi, Ming
dc.date.accessioned2012-07-12T13:41:39Z
dc.date.available2012-07-12T13:41:39Z
dc.date.copyright2010
dc.date.issued2010
dc.description.abstractFast changing knowledge on the Internet can be acquired more efficiently with the help of automatic document summarization and updating techniques. This paper describes a novel approach for multi-document update summarization. The best summary is defined to be the one which has the minimum information distance to the entire document set. The best update summary has the minimum conditional information distance to a document cluster given that a prior document cluster has already been read. Experiments on the DUC/TAC 2007 to 2009 datasets (http://duc.nist.gov/, http://www.nist.gov/tac/) have proved that our method closely correlates with the human summaries and outperforms other programs such as LexRank in many categories under the ROUGE evaluation criterion.en
dc.formatTexten
dc.format.extentp. 739-749en
dc.identifier.citationLong, C., Huang, M., Zhu, X., & Li, M. (2010). A New Approach for Multi-Document Update Summarization. Journal of Computer Science and Technology, 25 (4): 739-749. doi: 10.1007/s11390-010-9361-xen
dc.identifier.issn1000-9000
dc.identifier.urihttp://hdl.handle.net/10625/49746
dc.language.isoen
dc.relation.journalJournal of Computer Science and Technology
dc.subjectDATA MININGen
dc.subjectTEXT MININGen
dc.subjectKOLMOGOROV COMPLEXITYen
dc.subjectINFORMATION DISTANCEen
dc.subjectDOCUMENT SUMMARIZATIONen
dc.subjectINFORMATION DISTANCEen
dc.subjectCLUSTER ANALYSISen
dc.subjectINFORMATION THEORYen
dc.titleNew Approach for Multi-Document Update Summarizationen
dc.typeAbstracten
idrc.copyright.holderSpringer Science + Business Media, LLC & Science Press
idrc.dspace.accessIDRC Onlyen
idrc.noaccessDue to copyright restrictions the full text of this research output is not available in the IDRC Digital Library or by request from the IDRC Library. / Compte tenu des restrictions relatives au droit d'auteur, le texte intégral de cet extrant de recherche n'est pas accessible dans la Bibliothèque numérique du CRDI, et il n'est pas possible d'en faire la demande à la Bibliothéque du CRDI.en
idrc.project.componentnumber104519006
idrc.project.number104519
idrc.project.titleInternational Research Chairs Initiative (IRCI)en
idrc.rims.adhocgroupIDRC SUPPORTEDen

Files