Apache Flume Apache Software Foundation

Version 1.6.0ΒΆ

Status of this release

Apache Flume 1.6.0 is the ninth release of Flume as an Apache top-level project (TLP). Apache Flume 1.6.0 is production-ready software.

Release Documentation

Changes

Release Notes - Flume - Version v1.6.0

** Sub-task
** Bug
  • [FLUME-1793] - Unit test TestElasticSearchLogStashEventSerializer fails with IBM JDK
  • [FLUME-1934] - Spoolingdir source exception when reading multiple zero size files
  • [FLUME-2024] - Add ExecSource flush timeout to Flume User Guide
  • [FLUME-2126] - Problem in elasticsearch sink when the event body is a complex field
  • [FLUME-2148] - Windows : Add flume-env.ps1
  • [FLUME-2214] - FileChannelIntegrityTool needs documentation in user guide
  • [FLUME-2245] - HDFS files with errors unable to close
  • [FLUME-2337] - export JAVA_HOME in flume-env.sh.template and increase heap size
  • [FLUME-2346] - idLogFileMap in Log can lose track of file ids
  • [FLUME-2404] - Default maxReadBufferBytes might cause OOM and cause scribe source exit
  • [FLUME-2408] - Remove FLUME-1899 from Flume 1.5.0 change log.
  • [FLUME-2416] - Use CodecPool in compressed stream to prevent leak of direct buffers
  • [FLUME-2420] - HDFSEventSink#stop might throw ConcurrentModificationException
  • [FLUME-2425] - FileChannel should trim data and checkpoint directories
  • [FLUME-2432] - Update Kite dependency to 0.15.0
  • [FLUME-2438] - Make Syslog source message body configurable
  • [FLUME-2450] - Improve replay index insertion speed.
  • [FLUME-2466] - Embedded agent name cannot contain spaces
  • [FLUME-2470] - Kafka Sink and Source must use camel case for all configs.
  • [FLUME-2472] - SyslogUtils fails when system locale is not English
  • [FLUME-2475] - toLowerCase/toUpperCase used without Locale parameter to manipulate Enum
  • [FLUME-2479] - Kafka property auto.commit.enable is incorrect for KafkaSource
  • [FLUME-2480] - Typo in file channel exception: “Usable space exhaused”
  • [FLUME-2481] - TestFileChannelEncryption fails with System locale other than en_US
  • [FLUME-2482] - Race condition in File Channels’ Log.removeOldLogs
  • [FLUME-2486] - TestExecSource fails on some environments
  • [FLUME-2487] - SyslogParser has rounding errors in timestamp parsing
  • [FLUME-2488] - TestElasticSearchRestClient fails on Oracle JDK 8
  • [FLUME-2489] - Upgrade Apache Curator to 2.6.0
  • [FLUME-2492] - Flume’s Kafka Source doesn’t account time correctly
  • [FLUME-2495] - Kafka Source may miss events when channel is not available
  • [FLUME-2497] - TCP and UDP syslog sources parsing the timestamp incorrectly
  • [FLUME-2500] - Add a channel that uses Kafka
  • [FLUME-2505] - Test added in FLUME-2502 is flaky
  • [FLUME-2517] - Performance issue: SimpleDateFormat constructor takes 30% of HDFSEventSink.process()
  • [FLUME-2525] - flume should handle a zero byte .flumespool-main.meta file for the spooldir source
  • [FLUME-2538] - TestResettableFileInputStream fails on JDK 8
  • [FLUME-2556] - TestBucketPath.testDateFormatHours fails intermittently
  • [FLUME-2557] - DatasetSink doesn’t work with Kerberos
  • [FLUME-2560] - Kafka channel entries missing for output jar generation
  • [FLUME-2568] - Additional fix for TestReliableSpoolingFileEventReader
  • [FLUME-2578] - Kafka source throws NPE if Kafka record has null key
  • [FLUME-2586] - HDFS Sink should have an option to try rename even if close fails
  • [FLUME-2592] - Specify main manifest attribute in flume tools jar
  • [FLUME-2594] - Close Async HBase Client if there are large number of consecutive timeouts
  • [FLUME-2595] - Add option to checkpoint on file channel shutdown
  • [FLUME-2624] - Improve Hive Sink performance
  • [FLUME-2626] - Remove trustmanager-type from Thrift RPC client
  • [FLUME-2633] - Update Kite dependency to 1.0.0
  • [FLUME-2639] - SecureThriftRpcClient client privileged calls throw IllegalAccessError
  • [FLUME-2645] - ipFilter.rules property name is wrong
  • [FLUME-2654] - Memory channel error message related to byteCapacity needs refining
  • [FLUME-2657] - Upgrade to Hive 1.0
  • [FLUME-2658] - Upgrade to Hbase 1.0
  • [FLUME-2664] - Site profile fails due to dependency issues
  • [FLUME-2665] - Update documentation for hdfs.closeTries based on FLUME-2586
  • [FLUME-2670] - Modify dependencies to not pull hadoop/hive related jars
  • [FLUME-2673] - Remove unused import in TestLog
  • [FLUME-2678] - Upgrade xalan to 2.7.2 to take care of CVE-2014-0107 vulnerability
  • [FLUME-2679] - Make hbase-1 the default profile
** Dependency upgrade
  • [FLUME-2443] - org.apache.hadoop.fs.FSDataOutputStream.sync() is deprecated in hadoop 2.4
** Documentation
  • [FLUME-1594] - Document JMX options
  • [FLUME-2509] - Add description for properties in Failover Sink Processor
  • [FLUME-2630] - Update documentation for Thrift Src/Sink SSL support and Kerberos support
  • [FLUME-2635] - Documentation for zookeeper dynamic configuration in flume
  • [FLUME-2668] - Document SecureThriftRpcClient/SecureRpcClientFactory in Flume Developer Guide
** Improvement
  • [FLUME-1334] - Write an startscript for flume agents on Windows
  • [FLUME-1491] - Dynamic configuration from Zookeeper watcher
  • [FLUME-1521] - Document the StressSource
  • [FLUME-1710] - JSONEvent.getBody should not return null
  • [FLUME-1920] - Test case TestFileChannel fails when flume is built from paths containing the string “hadoop”
  • [FLUME-2237] - Example config in User Guide to use same notation for Agent name
  • [FLUME-2246] - event body data size can make it configurable for logger sinker
  • [FLUME-2273] - ElasticSearchSink: Add handling for header substitution in indexName
  • [FLUME-2373] - Enable configuration to switch thrift source serialization protocol
  • [FLUME-2385] - Flume spans log file with “Spooling Directory Source runner has shutdown” messages at INFO level
  • [FLUME-2401] - Optionally Compress Backup Checkpoint
  • [FLUME-2439] - Update DatasetSink for Kite 0.15.0
  • [FLUME-2462] - Remove use of deprecated methods in DatasetSink
  • [FLUME-2463] - Add support for Hive and HBase datasets to DatasetSink
  • [FLUME-2469] - DatasetSink should load dataset when needed, not at startup
  • [FLUME-2499] - Include Kafka Message Key in Event Header, Updated Comments
  • [FLUME-2502] - Spool source’s directory listing is inefficient
  • [FLUME-2558] - Update javadoc for StressSource
  • [FLUME-2562] - Metrics for Flafka components
  • [FLUME-2591] - Kite DatasetSink 2.0
  • [FLUME-2613] - Tool/script for deleting individual message from queue
  • [FLUME-2642] - Limit kerberos relogin attempt
** New Feature
  • [FLUME-1734] - Create a Hive Sink based on the new Hive Streaming support
  • [FLUME-2242] - Flume Sink and Source for Apache Kafka
  • [FLUME-2426] - Support interceptors in the Embedded Agent
  • [FLUME-2431] - Add simple regex search-and-replace interceptor
  • [FLUME-2570] - Add option to not pad date fields
  • [FLUME-2574] - SSL Support for Thrift Rpc
  • [FLUME-2631] - End to End authentication in Flume
** Task
  • [FLUME-2365] - Please create a DOAP file for your TLP
  • [FLUME-2454] - Support batchSize to allow multiple events per transaction to the Kafka Sink
  • [FLUME-2455] - Documentation update for Kafka Sink
  • [FLUME-2523] - Document Kafka channel
  • [FLUME-2612] - Update kite to 0.17.1
** Test
  • [FLUME-1501] - Flume Scribe Source needs unit tests.