OReilly.Hadoop.The.Definitive.Guide.4th.Edition
Introduction
电子版自序
Foreword
Preface
Part I. Hadoop Fundamentals
Part II. MapReduce
Part III. Hadoop Operations
- Chapter 10. Setting Up a Hadoop Cluster
- Chapter 11. Administering Hadoop
Part IV. Related Projects
Part V. Case Studies
- Chapter 22. Composable Data at Cerner
  - From CPUs to Semantic Integration
  - Enter Apache Crunch
  - Building a Complete Picture
  - Integrating Healthcare Data
  - Composability over Frameworks
  - Moving Forward
- Chapter 23. Biological Data Science: Saving Lives with Software
  - The Structure of DNA
  - The Genetic Code: Turning DNA Letters into Proteins
  - Thinking of DNA as Source Code
  - The Human Genome Project and Reference Genomes
  - Sequencing and Aligning DNA
  - ADAM, A Scalable Genome Analysis Platform
    - Literate programming with the Avro interface description language (IDL)
    - Column-oriented access with Parquet
    - A simple example: k-mer counting using Spark and ADAM
  - From Personalized Ads to Personalized Medicine
  - Join In
- Chapter 24. Cascading
  - Fields, Tuples, and Pipes
  - Operations
  - Taps, Schemes, and Flows
  - Cascading in Practice
  - Flexibility
  - Hadoop and Cascading at ShareThis
  - Summary
Appendix A. Installing Apache Hadoop
- Prerequisites
- Installation
- Configuration
  - Standalone Mode
  - Pseudodistributed Mode
  - Fully Distributed Mode
Appendix B. Cloudera’s Distribution Including Apache Hadoop
Appendix C. Preparing the NCDC Weather Data
Appendix D. The Old and New Java MapReduce APIs
Index
Colophon

Powered by GitBook

User-Defined Functions

results matching ""

No results matching ""