MIT CogNet, The Brain Sciences ConnectionFrom the MIT Press, Link to Online Catalog
SPARC Communities
Subscriber : Stanford University Libraries » LOG IN

space

Powered By Google 
Advanced Search

Selected Title Details  
Nov 2000
ISBN 0262133725
272 pp.
68 illus.
BUY THE BOOK
The Theory and Practice of Discourse Parsing and Summarization
Daniel Marcu
Until now, most discourse researchers have assumed that full semantic understanding is necessary to derive the discourse structure of texts. This book documents the first serious attempt to construct automatically and use nonsemantic computational structures for text summarization. Daniel Marcu develops a semantics-free theoretical framework that is both general enough to be applicable to naturally occurring texts and concise enough to facilitate an algorithmic approach to discourse analysis. He presents and evaluates two discourse parsing methods: one uses manually written rules that reflect common patterns of usage of cue phrases such as "however" and "in addition to"; the other uses rules that are learned automatically from a corpus of discourse structures. By means of a psycholinguistic experiment, Marcu demonstrates how a discourse-based summarizer identifies the most important parts of texts at levels of performance that are close to those of humans.

Marcu also discusses how the automatic derivation of discourse structures may be used to improve the performance of current natural language generation, machine translation, summarization, question answering, and information retrieval systems.
Table of Contents
 Figures
 Tables
 Preface
 acknowledgements
1 Introduction
I Theoretical Foundations
2 The Linguistics of Text Structure
3 The Mathematics of Text Structure
4 A Computational Account of the Axiomatization of Valid Text Structures and its Proof Theory
5 Discussion
II The Rhetorical Parsing of Free Text
6 Rhetorical Parsing by Means of Manually Derived Rules
7 Rhetorical parsing by Means of Automatically Derived Rules
8 Discussion
III Summarization
9 Summarizing Natural Language Texts
10 Improving Summarization Performance through Rhetorical Parsing Tuning
11 Discussion
 Bibliography
 Author Index
 Subject and Notation Index
 
 


© 2010 The MIT Press
MIT Logo