Using Flume

Using Flume

4.11 - 1251 ratings - Source

How can you get your data from frontend servers to Hadoop in near real time? With this complete reference guide, youa€™ll learn Flumea€™s rich set of features for collecting, aggregating, and writing large amounts of streaming data to the Hadoop Distributed File System (HDFS), Apache HBase, SolrCloud, Elastic Search, and other systems. Using Flume shows operations engineers how to configure, deploy, and monitor a Flume cluster, and teaches developers how to write Flume plugins and custom components for their specific use-cases. Youa€™ll learn about Flumea€™s design and implementation, as well as various features that make it highly scalable, flexible, and reliable. Code examples and exercises are available on GitHub. Learn how Flume provides a steady rate of flow by acting as a buffer between data producers and consumers Dive into key Flume components, including sources that accept data and sinks that write and deliver it Write custom plugins to customize the way Flume receives, modifies, formats, and writes data Explore APIs for sending data to Flume agents from your own applications Plan and deploy Flume in a scalable and flexible waya€”and monitor your cluster once ita€™s running... the compression ratioalso improves, as does the timetaken tocompress the data . You can read about deflate compression and compression levels in the zlib manual [zlibmanual]. Compression Type Mismatches If the RPC client is configuredanbsp;...

Title:Using Flume
Author:Hari Shreedharan
Publisher:"O'Reilly Media, Inc." - 2014-09-16


You Must CONTINUE and create a free account to access unlimited downloads & streaming