Grammars are gaining importance in natural language processing and computational biology as a means of encoding theories and structuring algorithms. But one serious obstacle to applications of grammars is that formal language theory traditionally classifies grammars according to their weak generative capacity (what sets of strings they generate) and tends to ignore strong generative capacity (what sets of structural descriptions they generate) even though the latter is more relevant to applications.
This book develops and demonstrates a framework for carrying out rigorous comparisons of...
Grammars are gaining importance in natural language processing and computational biology as a means of encoding theories and structuring algorithms. B...
The past decade has seen a revolution in the field of spoken dialogue systems. As in other areas of Computer Science and Artificial Intelligence, data-driven methods are now being used to drive new methodologies for system development and evaluation.
This book is a unique contribution to that ongoing change. A new methodology for developing spoken dialogue systems is described in detail. The journey starts and ends with human behaviour in interaction, and explores methods for learning from the data, for building simulation environments for training and testing systems, and for...
The past decade has seen a revolution in the field of spoken dialogue systems. As in other areas of Computer Science and Artificial Intelligence, d...
The book provides an overview of more than a decade of joint R&D efforts in the Low Countries on HLT for Dutch. It not only presents the state of the art of HLT for Dutch in the areas covered, but, even more importantly, a description of the resources (data and tools) for Dutch that have been created are now available for both academia and industry worldwide.
The contributions cover many areas of human language technology (for Dutch): corpus collection (including IPR issues) and building (in particular one corpus aiming at a collection of 500M word tokens), lexicology, anaphora...
The book provides an overview of more than a decade of joint R&D efforts in the Low Countries on HLT for Dutch. It not only presents the state of t...
In order to exchange knowledge, humans need to share a common lexicon of words as well as
to access the world models underlying that lexicon. What is a natural process for a human turns out to be an extremely hard task for a machine: computers can't represent knowledge as effectively as humans do, which hampers, for example, meaning disambiguation and communication. Applied ontologies and NLP have been developed to face these challenges. Integrating ontologies with (possibly multilingual) lexical resources is an essential requirement to make human language understandable by...
In order to exchange knowledge, humans need to share a common lexicon of words as well as
to access the world models underlying that lexicon...
Questions related to language acquisition have been of interest for many centuries, as children seem to acquire a sophisticated capacity for processing language with apparent ease, in the face of ambiguity, noise and uncertainty. However, with recent advances in technology and cognitive-related research it is now possible to conduct large-scale computational investigations of these issues
The book discusses some of the latest theoretical and practical developments in the areas involved, including computational models for language tasks, tools and resources that help to approximate...
Questions related to language acquisition have been of interest for many centuries, as children seem to acquire a sophisticated capacity for proces...
The 1990s saw a paradigm change in the use of corpus-driven methods in NLP. In the field of multilingual NLP (such as machine translation and terminology mining) this implied the use of parallel corpora. However, parallel resources are relatively scarce: many more texts are produced daily by native speakers of any given language than translated. This situation resulted in a natural drive towards the use of comparable corpora, i.e. non-parallel texts in the same domain or genre. Nevertheless, this research direction has not produced a single authoritative source suitable for researchers and...
The 1990s saw a paradigm change in the use of corpus-driven methods in NLP. In the field of multilingual NLP (such as machine translation and termi...
The past decade has seen a revolution in the field of spoken dialogue systems. As in other areas of Computer Science and Artificial Intelligence, data-driven methods are now being used to drive new methodologies for system development and evaluation.
This book is a unique contribution to that ongoing change. A new methodology for developing spoken dialogue systems is described in detail. The journey starts and ends with human behaviour in interaction, and explores methods for learning from the data, for building simulation environments for training and testing systems, and for...
The past decade has seen a revolution in the field of spoken dialogue systems. As in other areas of Computer Science and Artificial Intelligence, d...
Current language technology is dominated by approaches that either enumerate a large set of rules, or are focused on a large amount of manually labelled data. The creation of both is time-consuming and expensive, which is commonly thought to be the reason why automated natural language understanding has still not made its way into "real-life" applications yet.
This book sets an ambitious goal: to shift the development of language processing systems to a much more automated setting than previous works. A new approach is defined: what if computers analysed large samples of language...
Current language technology is dominated by approaches that either enumerate a large set of rules, or are focused on a large amount of manually lab...
This book is an excellent introduction to multiword expressions. It provides a unique, comprehensive and up-to-date overview of this exciting topic in computational linguistics. The first part describes the diversity and richness of multiword expressions, including many examples in several languages. These constructions are not only complex and arbitrary, but also much more frequent than one would guess, making them a real nightmare for natural language processing applications.
The second part introduces a new generic framework for automatic acquisition of multiword...
This book is an excellent introduction to multiword expressions. It provides a unique, comprehensive and up-to-date overview of this excitin...
Nowadays most important developments in machine translation (MT) are achieved via combining data-driven and rule-based techniques. These combinations typically involve hybridization of different traditional paradigms, such as the introduction of linguistic knowledge into statistical MT paradigms, or the incorporation of data-driven components into rule-based paradigms, or statistical and rule-based pre- and post-processing for both types of MT architectures. The volume providesan overview of the field, as well as the latest relevant research conducted by linguists and practitioners from...
Nowadays most important developments in machine translation (MT) are achieved via combining data-driven and rule-based techniques. These combinations ...