Apache Flume Apache Software Foundation

Welcome to Apache Flume

Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault-tolerant, with tunable reliability mechanisms and many failover and recovery mechanisms. It uses a simple, extensible data model that supports online analytic applications.

Agent component diagram
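
The streaming data flow described above can be pictured as events passing from a source, through a buffering channel, to a sink (the component names used in the release notes below). The following is a minimal Python sketch of that idea only. It is not Flume's API: the class and function names (Event, MemoryChannel, run_source, run_sink) and the capacity parameter are hypothetical and chosen purely for illustration.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class Event:
    """A unit of data moving through the flow: a body plus optional headers."""
    body: bytes
    headers: dict = field(default_factory=dict)


class MemoryChannel:
    """Toy bounded buffer between a source and a sink (hypothetical, not a Flume class)."""

    def __init__(self, capacity: int) -> None:
        self.capacity = capacity
        self._queue = deque()

    def put(self, event: Event) -> bool:
        # Refuse the event when the buffer is full so the caller can retry or drop it.
        if len(self._queue) >= self.capacity:
            return False
        self._queue.append(event)
        return True

    def take(self):
        # Return the oldest buffered event, or None when the channel is empty.
        return self._queue.popleft() if self._queue else None

    def __len__(self) -> int:
        return len(self._queue)


def run_source(lines, channel: MemoryChannel) -> int:
    """Wrap each log line in an Event and offer it to the channel; return the number accepted."""
    accepted = 0
    for line in lines:
        if channel.put(Event(body=line.encode("utf-8"))):
            accepted += 1
    return accepted


def run_sink(channel: MemoryChannel, destination: list) -> int:
    """Drain the channel into a destination list; return the number delivered."""
    delivered = 0
    while (event := channel.take()) is not None:
        destination.append(event.body.decode("utf-8"))
        delivered += 1
    return delivered


if __name__ == "__main__":
    channel = MemoryChannel(capacity=100)
    store = []
    run_source(["GET /index.html 200", "GET /missing 404"], channel)
    run_sink(channel, store)
    print(store)
```

The channel sits between the two sides so that the source and the sink can run at different rates. A real deployment would replace the in-memory queue with a component that offers the durability the text calls "tunable reliability"; this sketch makes no attempt to model that.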

News

July 16, 2014 - Apache Flume 1.5.0.1 Released

The Apache Flume team is pleased to announce the release of Flume 1.5.0.1.

Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of streaming event data.

Version 1.5.0.1 is the sixth Flume release as an Apache top-level project. Flume 1.5.0.1 is stable, production-ready software, and is backwards-compatible with previous versions of the Flume 1.x codeline.

Apache Flume 1.5.0.1 is a maintenance release primarily meant to add support for building against Apache HBase 0.98.x. This release adds a new build profile that builds Flume against HBase 0.98.2.

The Apache BigTop 0.8.0 release will ship Flume binaries built against HBase 0.98.x.

The full change log and documentation are available on the Flume 1.5.0.1 release page.

This release can be downloaded from the Flume Download page.

Your contributions, feedback, help and support make Flume better! For more information on how to report problems or contribute, please visit our Get Involved page.

The Apache Flume Team

May 20, 2014 - Apache Flume 1.5.0 Released

The Apache Flume team is pleased to announce the release of Flume 1.5.0.

Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of streaming event data.

Version 1.5.0 is the fifth Flume release as an Apache top-level project. Flume 1.5.0 is stable, production-ready software, and is backwards-compatible with previous versions of the Flume 1.x codeline.

Several months of active development went into this release: 123 patches were committed since 1.4.0, representing many features, enhancements, and bug fixes. While the full change log can be found on the 1.5.0 release page (link below), here are a few new feature highlights:

  • New in-memory channel that can spill to disk
  • A new dataset sink that uses the Kite API to write data to HDFS and HBase
  • Support for Elastic Search HTTP API in Elastic Search Sink
  • Much faster replay in the File Channel

The full change log and documentation are available on the Flume 1.5.0 release page.

This release can be downloaded from the Flume Download page.

Your contributions, feedback, help and support make Flume better! For more information on how to report problems or contribute, please visit our Get Involved page.

The Apache Flume Team

July 2, 2013 - Apache Flume 1.4.0 Released

The Apache Flume team is pleased to announce the release of Flume 1.4.0.

Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of streaming event data.

Version 1.4.0 is the fourth Flume release as an Apache top-level project. Flume 1.4.0 is stable, production-ready software, and is backwards-compatible with previous versions of the Flume 1.x codeline.

Six months of active development went into this release: 261 patches were committed since 1.3.1, representing many features, enhancements, and bug fixes. While the full change log can be found on the 1.4.0 release page (link below), here are a few new feature highlights:

  • New JMS Source
  • New Solr Sink with ETL capabilities
  • Updated ElasticSearch sink to support ES version 0.90
  • Support for secure SSL transport over Avro-RPC clients, sources & sinks
  • Support for Thrift-RPC as a transport mechanism
  • Support for embedding a Flume agent within applications
  • Support for a new plugins.d directory structure for managing Flume addons
  • Support for reading Avro files via the Spooling Directory source
  • Support for writing Avro files with arbitrary schemas via the HDFS sink
  • Support for ingesting Avro-serializable objects via the log4j API
  • Improvements to the file channel to keep a backup checkpoint to avoid replays
  • Performance improvements to the file channel, including group commit
  • New file channel consistency check tool

Below is the list of people (from Git/SVN logs) who submitted and/or reviewed improvements to Flume during the 1.4.0 development cycle:

  • Alexander Alten-Lorenz
  • Aline Guedes Pinto
  • Brock Noland
  • Cameron Gandevia
  • Chris Birchall
  • Christopher Nagy
  • Deepesh Khandelwal
  • Denny Ye
  • Edward Sargisson
  • Hari Shreedharan
  • Israel Ekpo
  • Ivan Bogdanov
  • Jarek Jarcec Cecho
  • Jeff Lord
  • Joey Echeverria
  • Jolly Chen
  • Juhani Connolly
  • Mark Grover
  • Mike Percy
  • Mubarak Seyed
  • Nitin Verma
  • Oliver B. Fischer
  • Patrick Wendell
  • Paul Chavez
  • Pedro Urbina Escos
  • Phil Scala
  • Rahul Ravindran
  • Ralph Goers
  • Roman Shaposhnik
  • Roshan Naik
  • Sravya Tirukkovalur
  • Steve Hoffman
  • Ted Malaska
  • Thiruvalluvan M. G.
  • Thom DeCarlo
  • Tim Bacon
  • Tom White
  • Venkat Ranganathan
  • Venkatesh Sivasubramanian
  • Will McQueen
  • Wolfgang Hoschek

The full change log and documentation are available on the Flume 1.4.0 release page.

This release can be downloaded from the Flume Download page.

Your contributions, feedback, help and support make Flume better! For more information on how to report problems or contribute, please visit our Get Involved page.

The Apache Flume Team

January 2, 2013 - Apache Flume 1.3.1 Released

The Apache Flume team is pleased to announce the release of Flume version 1.3.1. Apache Flume 1.3.1 is the fifth release under the auspices of Apache of the so-called “NG” codeline, and our third release as a top-level Apache project! Flume 1.3.1 has been put through many stress and regression tests, is stable, production-ready software, and is backwards-compatible with Flume 1.3.0 and Flume 1.2.0.

Apache Flume 1.3.1 is a maintenance release for the 1.3.0 release, and includes several bug fixes and performance enhancements.

This release can be downloaded from the Flume download page at: http://flume.apache.org/download.html

The change log and documentation are available on the 1.3.1 release page: http://flume.apache.org/releases/1.3.1.html

Your help and feedback are more than welcome!

December 2, 2012 - Apache Flume 1.3.0 Released

The Apache Flume team is pleased to announce the release of Flume version 1.3.0.

Apache Flume 1.3.0 is the fourth release under the auspices of Apache of the so-called “NG” codeline, and our second release as a top-level Apache project! Flume 1.3.0 has been put through many stress and regression tests, is stable, production-ready software, and is backwards-compatible with Flume 1.2.0.

Four months of very active development went into this release: a whopping 221 patches were committed since 1.2.0, representing many features, enhancements, and bug fixes. While the full change log can be found in the link below, here are a few new feature highlights:

  • New HTTP Post Source
  • New Spool Directory Source
  • New Multi-port Syslog Source
  • New Elastic Search Sink
  • New Regex Extractor Interceptor
  • File Channel Encryption

This release can be downloaded from the Flume download page at: http://flume.apache.org/download.html

The change log and documentation are available on the 1.3.0 release page: http://flume.apache.org/releases/1.3.0.html

Your help and feedback are more than welcome!

July 26, 2012 - Apache Flume 1.2.0 Released

The Apache Flume team is pleased to announce the release of Flume version 1.2.0.

Apache Flume 1.2.0 is the third release under the auspices of Apache of the so-called “NG” codeline, and our first release as a top-level Apache project! Flume 1.2.0 has been put through many stress and regression tests, is stable, production-ready software, and is backwards-compatible with Flume 1.1.0.

Four months of very active development went into this release: a whopping 192 patches were committed since 1.1.0, representing many features, enhancements, and bug fixes. While the full change log can be found in the link below, here are a few new feature highlights:

  • New durable file channel
  • New client API
  • New HBase sinks (two different implementations)
  • New Interceptor interface (a plugin processing API)
  • New JMX-based monitoring support

This release can be downloaded from the Flume download page at: http://flume.apache.org/download.html

The change log and documentation are available on the 1.2.0 release page: http://flume.apache.org/releases/1.2.0.html

Your help and feedback are more than welcome!

July 26, 2012 - Flume Meetup NYC

Strata + Hadoop World (http://strataconf.com/stratany2012) will be held October 23-25 in NYC. A Flume Meetup is being organized around this event. This Meetup will be a good place for users to interact with each other and with Flume developers.

In order to help with organization, a form has been set up with a few questions about what kind of Meetup the community wants, and which evening is best. Please fill out the form. Feel free to post to the User’s mailing list with any questions.