Home » Proposal » Multi document summarization thesis proposal

Multi document summarization thesis proposal

Multi document summarization thesis proposal Multi-document Summarization System

  • Shiyan Ou Affiliated with Division of Information Studies, School of Communication Information, Nanyang Technological University
  • . Christopher S. G. Khoo Affiliated with Division of Information Studies, School of Communication Information, Nanyang Technological University
  • . Dion H. Goh Affiliated with Division of Information Studies, School of Communication Information, Nanyang Technological University

* Final gross prices may vary according to local VAT.

Abstract

The design, implementation and evaluation of a multi-document summarization system for sociology dissertation abstracts are described. The system focuses on extracting variables and their relationships from different documents, integrating the extracted information, and presenting the integrated information using a variable-based framework. Two important summarization steps – information extraction and information integration were evaluated by comparing system-generated output against human-generated output. Results indicate that the system-generated output achieves good precision and recall while extracting important concepts from each document, as well as good clusters of similar concepts from the set of documents.

Boros, E. Kanto, P.B. Neu, D.J. A clustering based approach to creating multi-document summaries. In: Document Understanding Conferences 2002 (2002), Available at, www-nlpir.nist.gov/projects/duc/pubs/2001papers/rutgers_final.pdf

Harabagiu, S.M. Lacatusu, F. Generating single and multi-document summaries with GISTEXTER. In: Document Understanding Conferences 2002 (2002), Available at, www-nlpir.nist.gov/projects/duc/pubs/2002papers/utdallas_sanda.pdf

Multi document summarization thesis proposal Multi-document Summarization System for

Macskassy, S.A. Banerjee, A. Davison, B.D. Hirsh, H. Human performance on clustering Web pages: A preliminary study. In: Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining, pp. 264–268. AAAI Presss, Menlo Park (1998)

Mani, I. Bloedorn, E. Summarization similarities and differences among related documents. Information Retrieval 1(1), 1–23 (1999) CrossRef

Mckeown, K. Radev, D. Generating summaries of multiple news articles. In: Proceedings of the 18th Annual International ACM Conference on Research and Development in Information Retrieval (ACM SIGIR), Seattle, WA, pp. 74–82 (1995)

National Institute of Standards and Technology. In: Document Understanding Conferences (2002), Available at, www-nlpir.nist.gov/projects/duc/index.html

Otterbacher, J.C. Winkel, A.J. Radev, D.R. The Michigan single and multi-document summarizer for DUC 2002. In: Document Understanding Conferences 2002 (2002), Available at, www-nlpir.nist.gov/projects/duc/pubs/2002papers/umich_otter.pdf

Ou, S. Khoo, C. Goh, D. Multi-document summarization of dissertation abstracts using a variable-based framework. In: Proceedings of the 66th Annual Meeting of the American Society for Information Science and Technology (ASIST), Long Beach, CA, October 19-23, pp. 230–239 (2003)

Ou, S. Khoo, C. Goh, D. Heng, H.-H. Automatic discourse parsing of sociology dissertation abstracts as sentence categorization.

Multi document summarization thesis proposal Information Studies, School of Communication

In: Proceedings of the 8th International ISKO Conference, London, UK, July 13-16, pp. 345–350 (2004)

Pasi, J. Timo, J. A non-projective dependency parser. In: Proceedings of the 5th Conference on Applied Natural Language Processing, Washington, DD. Association for Computational Linguistics, pp. 64–71 (1997)

Radev, D. A common theory of information fusion from multiple text sources step one: cross-document structure. In: Proceedings of the 1st SIGdial Workshop on Discourse and Dialogue (2000), Available at, sigdial.org/sigdialworkshop/proceedings/radev.pdf

Radev, D. Jing, H. Budzikowska, M. Centroid-based summarization of multiple documents: sentence extraction, utility-based evaluation and user studies. In: Workshop held with Applied Natural Language Processing Conference / Conference of the North American Chapter of the Association for Computational Linguistics (ANLP/ANNCL), pp. 21–29 (2000)

Trochim, W. The research methods knowledge base. Atomic Dog Publishing, Cincinnati (1999)

White, M. Korelsky, T. Cardie, C. Ng, V. Pierce, D. Wagstaff, K. Multi-document summarization via information extraction. In: Proceedings of the 1st International Conference on Human Language Technology Research, HLT 2001 (2001)

Zhang, Z. Blair-Goldensohn, S. Radev, D. Towards CST-enhanced summarization. In: Proceedings of the 18th National Conference on Artificial Intelligence (AAAI-2002), Edmonton, Canada, August 2002 (2002)


Share this:
custom writing low cost
Order custom writing

ads