Package cascading.tap.hadoop

Class Summary
Dfs Class Dfs is a Tap class that provides access to the Hadoop Distributed File System.
GlobHfs Class GlobHfs is a type of MultiSourceTap that accepts Hadoop-style 'file globbing' expressions, so that multiple files matching the given pattern may be used as the input sources for a given Flow.
HadoopTupleEntrySchemeCollector Class HadoopTupleEntrySchemeCollector is a kind of TupleEntryCollector that writes tuples to the resource managed by a particular Tap instance.
HadoopTupleEntrySchemeIterator  
Hfs Class Hfs is the base class for all Hadoop file system access.
Lfs Class Lfs is a Tap class that provides access to the Local File System via Hadoop.
MultiInputFormat Class MultiInputFormat accepts multiple InputFormat class declarations, allowing a single MapReduce job to read data from incompatible file types.
MultiInputSplit Class MultiInputSplit is used by MultiInputFormat.
MultiRecordReaderIterator Class MultiRecordReaderIterator is an implementation of CloseableIterator.
RecordReaderIterator Class RecordReaderIterator is a utility class for handing off a single RecordReader instance via the Iterator interface.
TapOutputCollector  
TempHfs Class TempHfs creates a temporary Tap instance for use internally.
TemplateTap Class TemplateTap can be used to write tuple streams out to sub-directories based on the values in the Tuple instance.
TemplateTap.TemplateScheme  
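
The TemplateTap entry above describes writing tuple streams to sub-directories chosen from the values in each Tuple. The sketch below illustrates that idea in plain Java only and does not use the Cascading API. It assumes the path template is a java.util.Formatter-style format string; the class TemplatePathSketch and the helpers subDirectory and partition are hypothetical names introduced for this illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration, not part of cascading.tap.hadoop.
public class TemplatePathSketch {

    // Derive a sub-directory name by applying a Formatter-style template
    // (an assumption for this sketch) to the values of one tuple.
    static String subDirectory(String pathTemplate, Object... tupleValues) {
        return String.format(pathTemplate, tupleValues);
    }

    // Group tab-joined tuples by the sub-directory their values select,
    // preserving the order in which sub-directories are first seen.
    static Map<String, List<String>> partition(String pathTemplate, List<String[]> tuples) {
        Map<String, List<String>> out = new LinkedHashMap<>();
        for (String[] tuple : tuples) {
            String dir = subDirectory(pathTemplate, (Object[]) tuple);
            out.computeIfAbsent(dir, k -> new ArrayList<>()).add(String.join("\t", tuple));
        }
        return out;
    }

    public static void main(String[] args) {
        List<String[]> tuples = Arrays.asList(
            new String[] {"us", "2011"},
            new String[] {"uk", "2011"},
            new String[] {"us", "2011"});
        // Each map key stands in for one output sub-directory.
        partition("%s-%s", tuples).forEach((dir, rows) ->
            System.out.println(dir + " <- " + rows.size() + " tuple(s)"));
    }
}
```

In this sketch each distinct formatted value becomes one output location, which is why a template built from low-cardinality fields keeps the number of sub-directories manageable.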
 
Copyright © 2007-2011 Concurrent, Inc. All Rights Reserved.