public abstract class AbstractSplitter extends java.lang.Object implements Splitter
Splitter
implementations.Constructor and Description |
---|
AbstractSplitter()
Instantiates a new abstract splitter.
|
AbstractSplitter(org.apache.hadoop.conf.Configuration configuration)
Instantiates a new abstract splitter.
|
Modifier and Type | Method and Description |
---|---|
protected Split |
buildSplit(long start,
long length,
java.lang.String[] hosts)
Builds the split.
|
protected long |
computeSplitSize(long blockSize,
long minSize,
long maxSize)
Compute split size.
|
protected int |
getBlockIndex(org.apache.hadoop.fs.BlockLocation[] blocks,
long offset)
Gets the block index.
|
org.apache.hadoop.conf.Configuration |
getConfiguration()
Gets the hadoop configuration.
|
abstract java.util.List<Split> |
getSplits(org.apache.hadoop.fs.Path path)
Gets the input splits for a
Path . |
void |
setConfiguration(org.apache.hadoop.conf.Configuration configuration)
Sets the configuration.
|
public AbstractSplitter()
public AbstractSplitter(org.apache.hadoop.conf.Configuration configuration)
configuration
- the configurationpublic abstract java.util.List<Split> getSplits(org.apache.hadoop.fs.Path path) throws java.io.IOException
Splitter
Path
. A path needs to
be a resource which can be split into a list of splits. Actual
implementation will define if split is enforced to be a single
file or a collection of files.public org.apache.hadoop.conf.Configuration getConfiguration()
@Autowired(required=false) public void setConfiguration(org.apache.hadoop.conf.Configuration configuration)
configuration
- the new configurationprotected long computeSplitSize(long blockSize, long minSize, long maxSize)
blockSize
- the block sizeminSize
- the min sizemaxSize
- the max sizeprotected int getBlockIndex(org.apache.hadoop.fs.BlockLocation[] blocks, long offset)
blocks
- the blk locationsoffset
- the offsetjava.lang.IllegalArgumentException
- if offset is outside of blocksprotected Split buildSplit(long start, long length, java.lang.String[] hosts)
start
- the startlength
- the lengthhosts
- the hosts