ISBN-13: 9783639216271 / Angielski / Miękka / 2009 / 120 str.
The past decade has seen a tremendous growth in the amount of biomedical literature, speci cally in the area of bioinformatics. As a result, biomedical text categorization has become a central task for providing researchers with literature appropriate for their speci c information needs. Pan et al. have explored a method that automatically identi es information-bearing sentence fragments within scienti c text. Their proposed method aims to automatically classify sentence fragments into certain sets of categories de ned to satisfy speci c types of information needs. The categories are grouped into ve different dimensions known as Focus, Polarity, Certainty, Evidence, and Trend. The reason that fragments are used as the unit of classi cation is that the class value along each of these dimensions can change mid-sentence. In order to automatically annotate sentence fragments along the ve dimensions, automatically breaking sentences into fragments is a necessary step. In this study, we investigate the problem of automatic fragmentation of biomedical sentences, which is a fundamental layer in the multi- dimensional fragment classi cation."