000 -LEADER |
fixed length control field |
02248nam a22003137a 4500 |
003 - CONTROL NUMBER IDENTIFIER |
control field |
CITU |
005 - DATE AND TIME OF LATEST TRANSACTION |
control field |
20230214115725.0 |
007 - PHYSICAL DESCRIPTION FIXED FIELD--GENERAL INFORMATION |
fixed length control field |
cr an aaaaaaaa |
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION |
fixed length control field |
210823b ||||| |||| 00| 0 eng d |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER |
International Standard Book Number |
9781139058452 |
041 ## - LANGUAGE CODE |
Language code of text/sound track or separate title |
eng. |
082 ## - DEWEY DECIMAL CLASSIFICATION NUMBER |
Classification number |
006.312 |
100 1# - MAIN ENTRY--PERSONAL NAME |
Preferred name for the person |
Rajaraman, Anand. |
Relator term |
author |
245 ## - TITLE STATEMENT |
Title |
Mining of massive datasets / |
Statement of responsibility, etc |
Anand Rajaraman, Jeffrey David Ullman. |
264 #1 - PUBLICATION, DISTRIBUTION, ETC. (IMPRINT) |
Place of publication, distribution, etc |
Cambridge: |
Name of publisher, distributor, etc |
Cambridge University Press, |
Date of publication, distribution, etc |
c2012. |
300 ## - PHYSICAL DESCRIPTION |
Extent |
1 online resource (327 pages) : |
336 ## - CONTENT TYPE |
Source |
rdacontent |
Content type term |
text |
Content type code |
text |
337 ## - MEDIA TYPE |
Source |
rdamedia |
Media type term |
computer |
Media type code |
c |
338 ## - CARRIER TYPE |
Source |
rdacarrier |
Carrier type term |
online resource |
Carrier type code |
cr |
504 ## - BIBLIOGRAPHY, ETC. NOTE |
Bibliography, etc |
Includes bibliographical references and index. |
505 ## - CONTENTS |
Formatted contents note |
Data mining -- Large-scale file systems and map-reduce -- Finding similar items -- Mining data streams -- Link analysis -- Frequent itemsets -- Clustering -- Advertising on the web -- Recommendation systems. |
520 ## - SUMMARY, ETC. |
Summary, etc |
The popularity of the Web and Internet commerce provides many extremely large datasets from which information can be gleaned by data mining. This book focuses on practical algorithms that have been used to solve key problems in data mining and which can be used on even the largest datasets. It begins with a discussion of the map-reduce framework, an important tool for parallelizing algorithms automatically. The authors explain the tricks of locality-sensitive hashing and stream processing algorithms for mining data that arrives too fast for exhaustive processing. The PageRank idea and related tricks for organizing the Web are covered next. Other chapters cover the problems of finding frequent itemsets and clustering. The final chapters cover two applications: recommendation systems and Web advertising, each vital in e-commerce. Written by two authorities in database and Web technologies, this book is essential reading for students and practitioners alike. |
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name as entry element |
Data mining. |
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name as entry element |
Computer algorithms. |
655 #0 - INDEX TERM--GENRE/FORM |
Genre/form data or focus term |
Electronic books. |
700 1# - ADDED ENTRY--PERSONAL NAME |
Personal name |
Ullman, Jeffrey D. |
Dates associated with a name |
1942- |
Relator term |
author |
856 ## - ELECTRONIC LOCATION AND ACCESS |
Link text |
Full text available at Cambridge Online Library Click here to view |
Uniform Resource Identifier |
https://www.cambridge.org/core/books/mining-of-massive-datasets/A06D57FC616AE3FD10007D89E73F8B92 |
942 ## - ADDED ENTRY ELEMENTS |
Source of classification or shelving scheme |
|
Item type |
EBOOK |