Data-Mining the Web

ISBN-13: 9780471666554 / English / Hardcover / 2007 / 218 pp.

Zdravko Markov; Daniel T. Larose
Price: 491.65 zł
(net: 468.24 zł, VAT: 5%)

Lowest price in the last 30 days: 487.62 zł
Order fulfillment time:
approx. 30 business days
No guarantee of delivery before the holidays

Free delivery!

This book introduces the reader to methods of data mining on the web, including uncovering patterns in web content (classification, clustering, language processing), structure (graphs, hubs, metrics), and usage (modeling, sequence analysis, performance).

Categories:
Computer Science, Databases
BISAC categories:
Computers > System Administration - Storage & Retrieval
Computers > Data Science - Data Analytics
Publisher:
Wiley-Interscience
Language:
English
ISBN-13:
9780471666554
Publication year:
2007
Number of pages:
218
Weight:
0.52 kg
Dimensions:
23.55 x 16.56 x 2.06
Binding:
Hardcover
Volumes:
1
Additional information:
Illustrated edition

"...it has to be noted that this book is an excellent resource for conducting Web mining lectures or single units within Data mining class. The data can be used for small as well as quite comprehensive business intelligence projects. The book's content is easy to access; even students with very basic statistical skills can get the flavor of the intriguing aspects of Web mining." (Journal of Statistical Software, April 2008)

"...highlight[s] the exciting research related to data mining the Web...a detailed summary of the current state of the art." (CHOICE, December 2007)

"I can say I really enjoyed reading this book...a great educational resource for students and teachers." (Information Retrieval, 2008)

PREFACE.

PART I: WEB STRUCTURE MINING.

1 INFORMATION RETRIEVAL AND WEB SEARCH.

Web Challenges.

Web Search Engines.

Topic Directories.

Semantic Web.

Crawling the Web.

Web Basics.

Web Crawlers.

Indexing and Keyword Search.

Document Representation.

Implementation Considerations.

Relevance Ranking.

Advanced Text Search.

Using the HTML Structure in Keyword Search.

Evaluating Search Quality.

Similarity Search.

Cosine Similarity.

Jaccard Similarity.

Document Resemblance.

References.

Exercises.

2 HYPERLINK-BASED RANKING.

Introduction.

Social Networks Analysis.

PageRank.

Authorities and Hubs.

Link-Based Similarity Search.

Enhanced Techniques for Page Ranking.

References.

Exercises.

PART II: WEB CONTENT MINING.

3 CLUSTERING.

Introduction.

Hierarchical Agglomerative Clustering.

k-Means Clustering.

Probability-Based Clustering.

Finite Mixture Problem.

Classification Problem.

Clustering Problem.

Collaborative Filtering (Recommender Systems).

References.

Exercises.

4 EVALUATING CLUSTERING.

Approaches to Evaluating Clustering.

Similarity-Based Criterion Functions.

Probabilistic Criterion Functions.

MDL-Based Model and Feature Evaluation.

Minimum Description Length Principle.

MDL-Based Model Evaluation.

Feature Selection.

Classes-to-Clusters Evaluation.

Precision, Recall, and F-Measure.

Entropy.

References.

Exercises.

5 CLASSIFICATION.

General Setting and Evaluation Techniques.

Nearest-Neighbor Algorithm.

Feature Selection.

Naive Bayes Algorithm.

Numerical Approaches.

Relational Learning.

References.

Exercises.

PART III: WEB USAGE MINING.

6 INTRODUCTION TO WEB USAGE MINING.

Definition of Web Usage Mining.

Cross-Industry Standard Process for Data Mining.

Clickstream Analysis.

Web Server Log Files.

Remote Host Field.

Date/Time Field.

HTTP Request Field.

Status Code Field.

Transfer Volume (Bytes) Field.

Common Log Format.

Identification Field.

Authuser Field.

Extended Common Log Format.

Referrer Field.

User Agent Field.

Example of a Web Log Record.

Microsoft IIS Log Format.

Auxiliary Information.

References.

Exercises.

7 PREPROCESSING FOR WEB USAGE MINING.

Need for Preprocessing the Data.

Data Cleaning and Filtering.

Page Extension Exploration and Filtering.

De-Spidering the Web Log File.

User Identification.

Session Identification.

Path Completion.

Directories and the Basket Transformation.

Further Data Preprocessing Steps.

References.

Exercises.

8 EXPLORATORY DATA ANALYSIS FOR WEB USAGE MINING.

Introduction.

Number of Visit Actions.

Session Duration.

Relationship between Visit Actions and Session Duration.

Average Time per Page.

Duration for Individual Pages.

References.

Exercises.

9 MODELING FOR WEB USAGE MINING: CLUSTERING, ASSOCIATION, AND CLASSIFICATION.

Introduction.

Modeling Methodology.

Definition of Clustering.

The BIRCH Clustering Algorithm.

Affinity Analysis and the A Priori Algorithm.

Discretizing the Numerical Variables: Binning.

Applying the A Priori Algorithm to the CCSU Web Log Data.

Classification and Regression Trees.

The C4.5 Algorithm.

References.

Exercises.

INDEX.

Zdravko Markov, PhD, is Associate Professor of Computer Science at Central Connecticut State University. The author of three textbooks, Dr. Markov teaches undergraduate and graduate courses in computer science and artificial intelligence. He is currently a Principal Investigator (PI) on a National Science Foundation-funded project designed to introduce machine learning to undergraduates.

Daniel T. Larose, PhD, is Professor of Statistics in the Department of Mathematical Sciences at Central Connecticut State University. He is the author of three data mining books and a forthcoming textbook in undergraduate statistics. He developed and directs CCSU's DataMining@CCSU programs.

Learn How To Convert Web Data Into Web Knowledge

This text demonstrates how to extract knowledge by finding meaningful connections among data spread throughout the Web. Readers learn methods and algorithms from the fields of information retrieval, machine learning, and data mining which, when combined, provide a solid framework for mining the Web. The authors walk readers through the algorithms with the aid of examples and exercises.

This text is divided into three parts:

  • Part One, Web Structure Mining, presents basic concepts and techniques for extracting information from the Web. Readers learn how to collect and index Web documents as well as search and rank Web pages according to their textual content and hyperlink structure.

  • Part Two, Web Content Mining, offers two approaches, clustering and classification, for organizing Web content. For both approaches, the authors set forth specific algorithms that enable readers to convert Web data into knowledge.

  • Part Three, Web Usage Mining, demonstrates the application of data mining methods to uncover meaningful patterns of Internet usage.

Methods and algorithms are illustrated by simple examples. More than 100 exercises help readers assess their grasp of the material. Further, thirty-four hands-on analysis problems ask readers to use their new data mining expertise to solve real problems, working with large data sets. All the data sets needed for the examples, exercises, and analysis problems are available on the companion Web site.

The extensive use of examples, along with the opportunity to test and apply data mining skills, makes this text ideal for graduate students and upper-level undergraduates in computer science and engineering. Web designers and researchers will find that this text gives them a new set of tools to further mine the Web for knowledge and move well beyond the capabilities of standard search engines.
