Asia Pacific University Library catalogue


An architecture for fast and general data processing on large clusters (Record no. 383489)

000 -LEADER
fixed length control field 03974nam a22005297a 4500
001 - CONTROL NUMBER
control field 20437654
003 - CONTROL NUMBER IDENTIFIER
control field APU
005 - DATE AND TIME OF LATEST TRANSACTION
control field 20220824171944.0
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field 210803t20162016nyu b 000 0 eng d
010 ## - LIBRARY OF CONGRESS CONTROL NUMBER
LC control number 2017471066
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 9781970001587 (epub)
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 1970001585 (epub)
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 9781970001570 (pdf)
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 1970001577 (pdf)
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 9781970001594
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 9781970001563
035 ## - SYSTEM CONTROL NUMBER
System control number (OCoLC)ocn953497712
040 ## - CATALOGING SOURCE
Original cataloging agency YDXCP
Language of cataloging eng
Transcribing agency APU
Modifying agency SF
042 ## - AUTHENTICATION CODE
Authentication code lccopycat
050 00 - LIBRARY OF CONGRESS CALL NUMBER
Classification number QA76.9.D5
Item number Z34 2016eb
082 04 - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number 004.36
Edition number 23
100 1# - MAIN ENTRY--PERSONAL NAME
Personal name Zaharia, Matei,
9 (RLIN) 47413
245 13 - TITLE STATEMENT
Title An architecture for fast and general data processing on large clusters
Medium [electronic resources] /
Statement of responsibility, etc Matei Zaharia.
260 ## - PUBLICATION, DISTRIBUTION, ETC. (IMPRINT)
Place of publication, distribution, etc [New York] : Association for Computing Machinery ; [San Rafael] :
Name of publisher, distributor, etc Morgan & Claypool Publishers,
Date of publication, distribution, etc c2016.
300 ## - PHYSICAL DESCRIPTION
Extent 1 online resource (208 pages) ;
300 ## - PHYSICAL DESCRIPTION
Extent 1 pdf (208 pages) ;
490 1# - SERIES STATEMENT
Series statement ACM books ;
Volume number/sequential designation #11
International Standard Serial Number 2374-6777 ;
504 ## - BIBLIOGRAPHY, ETC. NOTE
Bibliography, etc Includes bibliographical references (pages 119-128).
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note 1. Introduction -- 1.1 Problems with specialized systems -- 1.2 Resilient distributed datasets (RDDs) -- 1.3 Models implemented over RDDs -- 1.4 Summary of results -- 1.5 Book overview --
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note 2. Resilient distributed datasets -- 2.1 Introduction -- 2.2 RDD abstraction -- 2.3 Spark programming interface -- 2.4 Representing RDDs -- 2.5 Implementation -- 2.6 Evaluation -- 2.7 Discussion -- 2.8 Related work -- 2.9 Summary --
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note 3. Models built over RDDs -- 3.1 Introduction -- 3.2 Techniques for implementing other models on RDDs -- 3.3 Shark: SQL on RDDs -- 3.4 Implementation -- 3.5 Performance -- 3.6 Combining SQL with complex analytics -- 3.7 Summary --
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note 4. Discretized streams -- 4.1 Introduction -- 4.2 Goals and background -- 4.3 Discretized streams (D-streams) -- 4.4 System architecture -- 4.5 Fault and straggler recovery -- 4.6 Evaluation -- 4.7 Discussion -- 4.8 Related work -- 4.9 Summary --
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note 5. Generality of RDDs -- 5.1 Introduction -- 5.2 Expressiveness perspective -- 5.3 Systems perspective -- 5.4 Limitations and extensions -- 5.5 Related work -- 5.6 Summary --
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note 6. Conclusion -- 6.1 Lessons learned -- 6.2 Evolution of spark in industry -- 6.3 Future work -- References -- Author's biography.
520 ## - SUMMARY, ETC.
Summary, etc The past few years have seen a major change in computing systems, as growing data volumes and stalling processor speeds require more and more applications to scale out to clusters. Today, a myriad data sources, from the Internet to business operations to scientific instruments, produce large and valuable data streams. However, the processing capabilities of single machines have not kept up with the size of data. As a result, organizations increasingly need to scale out their computations over clusters. At the same time, the speed and sophistication required of data processing have grown. In addition to simple queries, complex algorithms like machine learning and graph analysis are becoming common. And in addition to batch processing, streaming analysis of real-time data is required to let organizations take timely action. Future computing platforms will need to not only scale out traditional workloads, but support these new applications too.
538 ## - SYSTEM DETAILS NOTE
System details note Mode of access: World Wide Web.
538 ## - SYSTEM DETAILS NOTE
System details note System requirements: Adobe Acrobat Reader.
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element Electronic data processing
General subdivision Distributed processing.
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element Distributed databases.
9 (RLIN) 11734
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element Big data.
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element Big data.
Source of heading or term fast
9 (RLIN) 47414
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element Distributed databases.
Source of heading or term fast
9 (RLIN) 11734
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element Electronic data processing
General subdivision Distributed processing.
Source of heading or term fast
9 (RLIN) 47415
830 #0 - SERIES ADDED ENTRY--UNIFORM TITLE
Uniform title ACM books ;
Volume number/sequential designation #11.
9 (RLIN) 47379
856 ## - ELECTRONIC LOCATION AND ACCESS
Uniform Resource Identifier https://dl-acm-org.ezproxy.apu.edu.my/doi/book/10.1145/2886107
Public note Available in ACM Digital Library. Requires Log In to view full text.
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Source of classification or shelving scheme
Koha item type E-Book
Holdings
Withdrawn status Lost status Source of classification or shelving scheme Damaged status Not for loan Collection code Home library Current library Shelving location Date acquired Source of acquisition Total Checkouts Full call number Date last seen Copy number Price effective from Koha item type
Not Withdrawn Available   Not Damaged Available for loan E-Book APU Library APU Library Online Database 07/03/2022 OTHERS   QA76.9.D5 Z34 2016eb 07/03/2022 1 07/03/2022 General Circulation