Showing posts with the label DataIngestion. Show all posts

Monday, November 21, 2016

Keedio FTP Flume Source

Keedio-flume-ftp was created to meet the need to process information stored on an FTP server. The information is processed by Apache Flume, whose basic unit of data is the “event”.
Data on an FTP server is usually loaded in bulk, which is a completely different usage paradigm from the event-based paradigm on which Flume relies.
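
To make the gap between bulk files and events concrete, here is a minimal sketch of one way to turn a file that grows on a server into a stream of line-sized events. It is not Keedio-flume-ftp's actual implementation: the `Event` and `BulkFileToEvents` names, the `file` header, and the one-event-per-line policy are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Event:
    """Flume-style event: a byte payload plus string headers (illustrative model)."""
    body: bytes
    headers: Dict[str, str] = field(default_factory=dict)


class BulkFileToEvents:
    """Hypothetical helper that remembers how many bytes of each file were already emitted."""

    def __init__(self) -> None:
        self._offsets: Dict[str, int] = {}

    def poll(self, name: str, content: bytes) -> List[Event]:
        """Return one event per complete new line found in `content` since the last poll."""
        start = self._offsets.get(name, 0)
        new_data = content[start:]
        # Emit only complete lines; a trailing partial line waits for the next poll.
        end = new_data.rfind(b"\n") + 1
        events = [
            Event(body=line, headers={"file": name})
            for line in new_data[:end].splitlines()
        ]
        self._offsets[name] = start + end
        return events
```

Calling `poll` again with the same file after more data has been uploaded yields only the lines that were not emitted before, which is roughly the behaviour an event-based consumer would need from a bulk-loaded source.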

Apache Flume - Introduction

What is Flume?

Apache Flume is a tool, service, and data ingestion mechanism for collecting, aggregating, and transporting large amounts of streaming data, such as log files and events, from various sources to a centralized data store.
Flume is a highly reliable, distributed, and configurable tool. It is principally designed to copy streaming data (log data) from various web servers to HDFS.