MapReduce Design Patterns ppt

Document information

[...]... choosing the right design to solve your problem with MapReduce is extremely important and can yield performance gains of several orders of magnitude. Before we dive into some design patterns in the chapters following this one, we’ll talk a bit about how and why design patterns and MapReduce together make sense, and give a bit of a history lesson on how we got here.

Design Patterns

Design patterns have been making...

... solving, come unique design patterns. What is a MapReduce design pattern? It is a template for solving a common and general data manipulation problem with MapReduce. A pattern is not specific to a domain such as text processing or graph analysis, but it is a general approach to solving a problem. Using design patterns is all about using tried and true design principles to build better software. Designing good...

Preface

Welcome to MapReduce Design Patterns! This book will be unique in some ways and familiar in others. First and foremost, this book is obviously about design patterns, which are templates or general guides to solving problems. We took a look at other design patterns books that have been written in the past as inspiration, particularly Design Patterns: Elements of Reusable Object-Oriented...

... added by Pig or Hive will matter, in which case you should use Java MapReduce. Pig and Hive are likely to influence MapReduce design patterns more than anything else. New feature requests in Pig and Hive will likely translate down into something that could be a design pattern in MapReduce. Likewise, as more design patterns are developed for MapReduce, some of the more popular ones will become first-class...
... have patterns of their own and experts will start documenting more as they solve more problems. Hive has the benefit of building off of decades of SQL patterns, but not all patterns in SQL are smart in Hive and vice versa. Perhaps as these platforms gain more popularity, cookbook and design pattern books will be written for them.

Chapter 2. Summarization Patterns...

... part, the MapReduce design patterns in this book are intended to be platform independent. MapReduce, being a paradigm published by Google without any actual source code, has been reimplemented a number of times, both as a standalone system (e.g., Hadoop, Disco, Amazon Elastic MapReduce) and as a query language within a larger system (e.g., MongoDB, Greenplum DB, Aster Data). Even if design patterns...

... concepts conveyed in the book useful in modern languages such as Ruby and Python. The patterns in this book should be usable with systems other than Hadoop. You’ll just have to use the code examples as a guide to developing your own code.

MapReduce History

How did we get to the point where a MapReduce design patterns book is a good idea? At a certain point, the community’s momentum and widespread...

... developers with a common language in which to communicate verbally and through code. Simply saying “abstract factory” is easier than explaining what an abstract factory is over and over. Also, when looking at a stranger’s code that implements an abstract factory, you already have a general understanding of what the code is trying to accomplish. MapReduce design patterns...
... arbitrary and is very important to what the MapReduce job is accomplishing. The key is what the data will be grouped on and the value is the information pertinent to the analysis in the reducer. Plenty of detail will be provided in the design patterns in this book to explain what key/value pair is chosen and why. One major differentiator between MapReduce design patterns is the semantics of this pair...

... MapReduce design patterns in an ecosystem with Hive and Pig. However, we would like to take this opportunity early in the book to explain why MapReduce design patterns are still important. Pig and Hive are higher-level abstractions of MapReduce. They provide an interface that has nothing to do with “map” or “reduce,” but the systems interpret the higher-level language into a series of MapReduce jobs. Much...

MapReduce Design Patterns, by Donald Miner and Adam Shook. ISBN: 978-1-449-32717-0 [LSI]
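The excerpt on key/value pairs says that the key decides how data is grouped and the value carries what the reducer needs. A minimal sketch of that idea follows, using word count as the example. It is a plain in-memory Python simulation, not Hadoop code and not the book's own listing; the function names `map_words`, `shuffle`, `reduce_counts`, and `word_count` are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def map_words(line):
    """Map step (hypothetical helper): emit one (key, value) pair per word.

    The word is the key, so all occurrences of a word end up grouped together.
    """
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs):
    """Group values by key, imitating what a framework does between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_counts(key, values):
    """Reduce step (hypothetical helper): collapse all values for one key into a total."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in memory over an iterable of text lines."""
    pairs = (pair for line in lines for pair in map_words(line))
    return dict(reduce_counts(key, values) for key, values in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["the quick brown fox", "the lazy dog", "The end"]))
```

Choosing a different key (for example, the first letter of each word) would change what gets grouped without touching the shuffle step, which is the sense in which the semantics of the pair distinguish one pattern from another.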

Date posted: 22/03/2014, 17:20

Table of contents

  • Copyright

  • Table of Contents

  • Preface

    • Intended Audience

    • Pattern Format

    • The Examples in This Book

    • Conventions Used in This Book

    • Using Code Examples

    • Safari® Books Online

    • How to Contact Us

    • Acknowledgements

  • Chapter 1. Design Patterns and MapReduce

    • Design Patterns

    • MapReduce History

    • MapReduce and Hadoop Refresher

    • Hadoop Example: Word Count

    • Pig and Hive

  • Chapter 2. Summarization Patterns

    • Numerical Summarizations

      • Pattern Description

      • Numerical Summarization Examples

    • Inverted Index Summarizations

      • Pattern Description

      • Inverted Index Example

    • Counting with Counters

      • Pattern Description
