Learn Apache Flume

Apache Flume – Sequence Generator Source

In the previous chapter, we saw how to fetch data from a Twitter source into HDFS. This chapter explains how to fetch data from the Sequence Generator source.…

Apache Flume – NetCat Source

This chapter uses an example to show how you can generate events and then log them to the console. For this, we use the NetCat source and th…

Apache Flume – Fetching Twitter Data

Using Flume, we can fetch data from various services and transport it to centralized stores (HDFS and HBase). This chapter explains how to fetch data from Twitt…

Apache Flume – Environment

We discussed the architecture of Flume in the previous chapter. In this chapter, let us see how to download and set up Apache Flume.…

Apache Flume – Configuration

After installing Flume, we need to configure it using a configuration file, which is a Java properties file made up of key-value pairs. We need to pass values to the…

Apache Flume – Data Transfer In Hadoop

Big Data, as we know, is a collection of large datasets that cannot be processed using traditional computing techniques. Big Data, when analyzed, gives valuable…

Apache Flume – Architecture

The following illustration depicts the basic architecture of Flume. As shown in the illustration, data generators (such as Facebook and Twitter) generate data whic…

Apache Flume – Data Flow

Flume is a framework used to move log data into HDFS. Generally, events and log data are generated by log servers, and these servers have Flume agent…

Apache Flume Tutorial

Flume is a standard, simple, robust, flexible, and extensible tool for ingesting data from various data producers (web servers) into Hadoop. In this tutorial, we…

Apache Flume – Introduction

Apache Flume is a tool/service/data ingestion mechanism for collecting, aggregating, and transporting large amounts of streaming data such as log files, events (e…