A statisticallanguage model, or more simply a language model, is a prob abilistic mechanism for generating text. Such adefinition is general enough to include an endless variety of schemes. However, a distinction should be made between generative models, which can in principle be used to synthesize artificial text, and discriminative techniques to classify text into predefined cat egories. The first statisticallanguage modeler was Claude Shannon. In exploring the application of his newly founded theory of information to human language, Shannon considered language as a statistical source, and...
A statisticallanguage model, or more simply a language model, is a prob abilistic mechanism for generating text. Such adefinition is general enough to...
This book takes a unique approach to information retrieval by laying down the foundations for a modern algebra of information retrieval based on lattice theory. All major retrieval methods developed so far are described in detail, along with Web retrieval algorithms, and the author shows that they all can be treated elegantly in a unified formal way, using lattice theory as the one basic concept. The book s presentation is characterized by an engineering-like approach.
"
This book takes a unique approach to information retrieval by laying down the foundations for a modern algebra of information retrieval based on la...
Information visualization offers a way to reveal hidden patterns in a visual presentation and allows users to seek information from a visual perspective. Readers of this book will gain an in-depth understanding of the current state of information retrieval visualization. They will be introduced to existing problems along with technical and theoretical findings. The book also provides practical details for the implementation of an information retrieval visualization system.
Information visualization offers a way to reveal hidden patterns in a visual presentation and allows users to seek information from a visual perspe...
The Turn analyzes the research of information seeking and retrieval (IS&R) and proposes a new direction of integrating research in these two areas: the fields should turn off their separate and narrow paths and construct a new avenue of research. An essential direction for this avenue is context as given in the subtitle Integration of Information Seeking and Retrieval in Context. Other essential themes in the book include:
IS&R research models, frameworks and theories; search and works tasks and situations in context; interaction between humans and machines; information...
The Turn analyzes the research of information seeking and retrieval (IS&R) and proposes a new direction of integrating research in these t...
This book covers content recognition in text, elaborating on past and current most successful algorithms and their application in a variety of settings: news filtering, mining of biomedical text, intelligence gathering, competitive intelligence, legal information searching, and processing of informal text. Today, there is considerable interest in integrating the results of information extraction in retrieval systems, because of the demand for search engines that return precise answers to flexible information queries.
This book covers content recognition in text, elaborating on past and current most successful algorithms and their application in a variety of sett...
Information Retrieval has become a very active research field in the 21st century. Many from academia and industry present their innovations in the field in a wide variety of conferences and journals. Companies transfer this new knowledge directly to the general public via services such as web search engines in order to improve their information seeking experience.
In parallel, teaching IR is turning into an important aspect of IR generally, not only because it is necessary to impart effective search techniques to make the most of the IR tools available, but also because we must...
Information Retrieval has become a very active research field in the 21st century. Many from academia and industry present their innovations in the...
Describes highly effective information retrieval models for both smaller, classical data sets, and larger Web collections. In a shift away from heuristic, hand-tuned ranking functions and complex probabilistic models, the author presents feature-based retrieval models.
Describes highly effective information retrieval models for both smaller, classical data sets, and larger Web collections. In a shift away from heuris...
The purposeofthis book is to providea recordofthe stateofthe art in Topic Detection and Tracking (TDT) in a single place. Research in TDT has been going on for about five years, and publications related to it are scattered all over the place as technical reports, unpublished manuscripts, or in numerous conference proceedings. The third and fourth in a series of on-going TDT evaluations marked a turning point in the research. As such. it provides an excellent time to pause. review the state of the art. gather lessons learned, and describe the open challenges. This book is a collection...
The purposeofthis book is to providea recordofthe stateofthe art in Topic Detection and Tracking (TDT) in a single place. Research in TDT has been goi...
Mining the World Wide Web: An Information Search Approach explores the concepts and techniques of Web mining, a promising and rapidly growing field of computer science research. Web mining is a multidisciplinary field, drawing on such areas as artificial intelligence, databases, data mining, data warehousing, data visualization, information retrieval, machine learning, markup languages, pattern recognition, statistics, and Web technology. Mining the World Wide Web presents the Web mining material from an information search perspective, focusing on issues relating to the...
Mining the World Wide Web: An Information Search Approach explores the concepts and techniques of Web mining, a promising and rapidly growing...
Content-based image retrieval is the set of techniques for retrieving relevant images from an image database on the basis of automatically- derived image features. The need for efficient content-based image re- trieval has increased tremendously in many application areas such as biomedicine, the military, commerce, education, and Web image clas- sification and searching. In the biomedical domain, content-based im- age retrieval can be used in patient digital libraries, clinical diagnosis, searching of 2-D electrophoresis gels, and pathology slides. I started my work on content-based image...
Content-based image retrieval is the set of techniques for retrieving relevant images from an image database on the basis of automatically- derived im...