A query-focused multi-document automatic summarizer is described. The system clusters similar text segments that share related (sub)topical features across multiple related documents. A document graph is constructed in which nodes are sentences and edge scores reflect a correlation measure between nodes. Clusters are then constructed from the graph. Each cluster is assigned a weight and has a cluster center. Next, query-dependent weights for each sentence are added to the edge scores as well as to the cluster scores. The top-ranked sentence of each cluster is then selected, in order, for inclusion in the output summary. It...
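The pipeline outlined in the abstract can be illustrated with a minimal Python sketch. The abstract does not name the correlation measure, the clustering method, or the weighting formulas, so everything concrete here is an assumption made for illustration: bag-of-words cosine similarity as the edge score, connected components over a similarity threshold as the clusters, the summed sentence scores as the cluster score, and the highest-scoring sentence as the cluster center. The function names (`tokenize`, `cosine`, `summarize`) and the `threshold` parameter are hypothetical and do not come from the described system.

```python
import math
import re
from collections import Counter
from itertools import combinations


def tokenize(text):
    """Lowercase a string and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def summarize(sentences, query, threshold=0.2):
    """Return one sentence per cluster, best cluster first (illustrative only)."""
    bags = [Counter(tokenize(s)) for s in sentences]
    query_bag = Counter(tokenize(query))
    n = len(sentences)

    # Assumed query-dependent weight: similarity of each sentence to the query.
    q_weight = [cosine(bag, query_bag) for bag in bags]

    # Document graph: nodes are sentences; an edge exists when similarity
    # reaches the threshold, and the query weights are added to its score.
    edges = {}
    for i, j in combinations(range(n), 2):
        score = cosine(bags[i], bags[j])
        if score >= threshold:
            edges[(i, j)] = score + q_weight[i] + q_weight[j]

    # Assumed clustering: connected components of the graph (union-find).
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i, j in edges:
        parent[find(i)] = find(j)

    clusters = {}
    for i in range(n):
        clusters.setdefault(find(i), []).append(i)

    # Sentence score: own query weight plus the scores of incident edges.
    strength = list(q_weight)
    for (i, j), weight in edges.items():
        strength[i] += weight
        strength[j] += weight

    ranked = []
    for members in clusters.values():
        # Cluster center: highest-scoring member; ties go to the earlier sentence.
        center = max(members, key=lambda i: (strength[i], -i))
        cluster_score = sum(strength[i] for i in members)
        ranked.append((cluster_score, center))

    # Emit the center of each cluster, highest-scoring cluster first.
    ranked.sort(key=lambda item: (-item[0], item[1]))
    return [sentences[center] for _, center in ranked]


if __name__ == "__main__":
    docs = [
        "The cat sat on the mat",
        "A cat sat on a mat",
        "Stock markets fell sharply today",
        "Markets fell as stock prices dropped today",
    ]
    for line in summarize(docs, "stock markets"):
        print(line)
```

In this toy input the two finance sentences form one cluster and the two cat sentences another; because the query overlaps only with the finance cluster, its center is emitted first. A real system would likely use a stronger similarity measure and a dedicated clustering algorithm in place of the thresholded components assumed here.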