Graham Cormode (University of Warwick), Ke Yi (Hong Kong University of Science and Technology)
The massive volume of data generated in modern applications can overwhelm our ability to conveniently transmit, store, and index it. For many scenarios, building a compact summary of a dataset that is vastly smaller enables flexibility and efficiency in a range of queries over the data, in exchange for some approximation. This comprehensive introduction to data summarization, aimed at practitioners and students, showcases the algorithms, their behavior, and the mathematical underpinnings of their operation. The coverage starts with simple sums and approximate counts, building to more advanced...
The massive volume of data generated in modern applications can overwhelm our ability to conveniently transmit, store, and index it. For many scenario...