Researchers in a number of disciplines deal with large text sets requiring both text management and text analysis. Faced with a large amount of textual data collected in marketing surveys, literary investigations, historical archives and documentary data bases, these researchers require assistance with organizing, describing and comparing texts. Exploring Textual Data demonstrates how exploratory multivariate statistical methods such as correspondence analysis and cluster analysis can be used to help investigate, assimilate and evaluate textual data....
This book is a revised version of my doctoral thesis, which was submitted in April 1993. The main extension is a chapter on evaluation of the system described in Chapter 8, as this is clearly an issue which was not treated in the original version. This required the collection of data, the development of a concept for diagnostic evaluation of linguistic word recognition systems and, of course, the actual evaluation of the system itself. The revisions made primarily concern the presentation of the latest version of the SILPA system described in an additional Subsection 8.3, the development...
This volume is a selection of papers presented at a workshop entitled Predicative Forms in Natural Language and in Lexical Knowledge Bases organized in Toulouse in August 1996. A predicate is a named relation that exists among one or more arguments. In natural language, predicates are realized as verbs, prepositions, nouns and adjectives, to cite the most frequent ones. Research on the identification, organization, and semantic representation of predicates in artificial intelligence and in language processing is a very active research field. The emergence of new paradigms in theoretical...
The last decade has been one of dramatic progress in the field of Natural Language Processing (NLP). This hitherto largely academic discipline has found itself at the center of an information revolution ushered in by the Internet age, as demand for human-computer communication and information access has exploded. Emerging applications in computer-assisted information production and dissemination, automated understanding of news, understanding of spoken language, and processing of foreign languages have given impetus to research that resulted in a new generation of robust tools, systems, and...
In both the linguistic and the language engineering community, the creation and use of annotated text collections (or annotated corpora) is currently a hot topic. Annotated texts are of interest for research as well as for the development of natural language processing (NLP) applications. Unfortunately, the annotation of text material, especially more interesting linguistic annotation, is as yet a difficult task and can entail a substantial amount of human involvement. All over the world, work is being done to replace as much as possible of this human effort by computer processing. At the...
This book evolved from the ARCADE evaluation exercise that started in 1995. The project's goal is to evaluate alignment systems for parallel texts, i.e., texts accompanied by their translation. Thirteen teams from various places around the world have participated so far and for the first time, some ten to fifteen years after the first alignment techniques were designed, the community has been able to get a clear picture of the behaviour of alignment systems. Several chapters in this book describe the details of competing systems, and the last chapter is devoted to the description of the...
The study of prosody is perhaps the area of speech research which has undergone the most noticeable development during the past ten to fifteen years. As an indication of this, one can note, for example, that at the latest International Conference on Spoken Language Processing in Philadelphia (October 1996), there were more sessions devoted to prosody than to any other area. Not only that, but within other sessions, in particular those dealing with dialogue, several of the presentations dealt specifically with prosodic aspects of dialogue research. Even at the latest Eurospeech meeting in...
ANTONIS BOTINIS. 1.1 Background. This introduction provides essential information about the structure and the objects of study of this volume. Following the introduction, fourteen papers which represent current research on intonation are organised into five thematic sections: (I) Overview of Intonation, (II) Prominence and Focus, (III) Boundaries and Discourse, (IV) Intonation Modelling, and (V) Intonation Technology. Within the sections the papers are arranged thematically, although several papers which deal with various aspects of intonation and prosody are basically intersectional. As the...
In this book we address robustness issues at the speech recognition and natural language parsing levels, with a focus on feature extraction and noise-robust recognition, adaptive systems, language modeling, parsing, and natural language understanding. This book attempts to give a clear overview of the main technologies used in language and speech processing, along with an extensive bibliography to enable topics of interest to be pursued further. It also brings together speech and language technologies often considered separately. Robustness in Language and Speech Technology...