Spring for Apache Hadoop

org.springframework.data.hadoop.pig
Class PigServerFactoryBean

java.lang.Object
  extended by org.springframework.data.hadoop.pig.PigServerFactoryBean
All Implemented Interfaces:
org.springframework.beans.factory.Aware, org.springframework.beans.factory.BeanNameAware, org.springframework.beans.factory.FactoryBean<PigServerFactory>

public class PigServerFactoryBean
extends java.lang.Object
implements org.springframework.beans.factory.FactoryBean<PigServerFactory>, org.springframework.beans.factory.BeanNameAware

Factory for creating PigServer instances. Because PigServer is not thread-safe and the Pig API provides no factory of its own, this factory bean returns a PigServerFactory (which handles the creation of PigServer instances) rather than a raw PigServer object, which cannot be reused. The caller is responsible for object clean-up, specifically calling PigServer.shutdown(). In general, to avoid leaks, using the PigTemplate is recommended.
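
The factory-per-use pattern described above can be sketched roughly as follows. This is a minimal illustration using only the JDK: StubServer and StubServerFactory are hypothetical stand-ins for PigServer and the factory object this bean returns, not the library's actual types or method names. It shows a fresh instance being created for each unit of work and shut down in a finally block.

```java
import java.util.concurrent.atomic.AtomicInteger;

public class FactoryPerUseSketch {

    /** Hypothetical stand-in for a non-thread-safe, non-reusable server such as PigServer. */
    static final class StubServer {
        private final int id;
        private boolean shutDown;

        StubServer(int id) {
            this.id = id;
        }

        int id() {
            return id;
        }

        void shutdown() {
            shutDown = true;
        }

        boolean isShutDown() {
            return shutDown;
        }
    }

    /** Hypothetical stand-in for the factory object handed out by the factory bean. */
    interface StubServerFactory {
        StubServer create();
    }

    /** Returns a factory that builds a new server instance on every call. */
    static StubServerFactory newFactory() {
        AtomicInteger counter = new AtomicInteger();
        return () -> new StubServer(counter.incrementAndGet());
    }

    /** Obtains one server for one unit of work and always shuts it down afterwards. */
    static StubServer useOnce(StubServerFactory factory) {
        StubServer server = factory.create();
        try {
            // Scripts would be executed against the server here.
            return server;
        } finally {
            // Clean-up is the caller's job; the factory does not track instances.
            server.shutdown();
        }
    }

    public static void main(String[] args) {
        StubServerFactory factory = newFactory();
        StubServer first = useOnce(factory);
        StubServer second = useOnce(factory);
        System.out.println("distinct instances: " + (first.id() != second.id()));
        System.out.println("both shut down: " + (first.isShutDown() && second.isShutDown()));
    }
}
```

Sharing the factory rather than a server instance keeps each caller isolated, at the cost of making every caller responsible for shutdown; the description above recommends PigTemplate for callers who would rather not manage that themselves.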

Author:
Costin Leau

Constructor Summary
PigServerFactoryBean()
           
 
Method Summary
protected  org.apache.pig.PigServer createPigInstance()
           
 PigServerFactory getObject()
           
 java.lang.Class<?> getObjectType()
           
 boolean isSingleton()
           
 void setBeanName(java.lang.String name)
           
 void setJobName(java.lang.String jobName)
          Sets the job name.
 void setJobPriority(java.lang.String jobPriority)
          Sets the job priority.
 void setParallelism(java.lang.Integer parallelism)
          Sets the parallelism.
 void setPathsToSkip(java.util.Collection<java.lang.String> pathToSkip)
          Sets the paths to skip.
 void setPigContext(org.apache.pig.impl.PigContext pigContext)
          Sets the PigContext to use.
 void setScripts(java.util.Collection<PigScript> scripts)
          Sets the scripts to execute at startup.
 void setUser(java.lang.String user)
          Sets the user impersonation (optional) for executing Pig jobs.
 void setValidateEachStatement(java.lang.Boolean validateEachStatement)
          Indicates whether each statement should be validated or not.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

PigServerFactoryBean

public PigServerFactoryBean()

Method Detail

getObject

public PigServerFactory getObject()
                           throws java.lang.Exception
Specified by:
getObject in interface org.springframework.beans.factory.FactoryBean<PigServerFactory>
Throws:
java.lang.Exception

getObjectType

public java.lang.Class<?> getObjectType()
Specified by:
getObjectType in interface org.springframework.beans.factory.FactoryBean<PigServerFactory>

isSingleton

public boolean isSingleton()
Specified by:
isSingleton in interface org.springframework.beans.factory.FactoryBean<PigServerFactory>

createPigInstance

protected org.apache.pig.PigServer createPigInstance()
                                              throws java.lang.Exception
Throws:
java.lang.Exception

setBeanName

public void setBeanName(java.lang.String name)
Specified by:
setBeanName in interface org.springframework.beans.factory.BeanNameAware

setPigContext

public void setPigContext(org.apache.pig.impl.PigContext pigContext)
Sets the PigContext to use.

Parameters:
pigContext - The pigContext to set.

setPathsToSkip

public void setPathsToSkip(java.util.Collection<java.lang.String> pathToSkip)
Sets the paths to skip.

Parameters:
pathToSkip - The pathToSkip to set.

setScripts

public void setScripts(java.util.Collection<PigScript> scripts)
Sets the scripts to execute at startup.

Parameters:
scripts - The scripts to set.

setParallelism

public void setParallelism(java.lang.Integer parallelism)
Sets the parallelism.

Parameters:
parallelism - The parallelism to set.

setJobName

public void setJobName(java.lang.String jobName)
Sets the job name.

Parameters:
jobName - The jobName to set.

setJobPriority

public void setJobPriority(java.lang.String jobPriority)
Sets the job priority.

Parameters:
jobPriority - The jobPriority to set.

setValidateEachStatement

public void setValidateEachStatement(java.lang.Boolean validateEachStatement)
Indicates whether each statement should be validated or not. By default it is unset, relying on the Pig defaults.

Parameters:
validateEachStatement - whether to validate each statement or not.
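
The "unset by default" behaviour explains why the parameter is a java.lang.Boolean rather than a primitive boolean: null can then mean "leave the Pig default alone". The sketch below illustrates that general idea only. It is not the implementation of PigServerFactoryBean; the class, the applyTo method, and the property key are all hypothetical.

```java
import java.util.Properties;

public class OptionalFlagSketch {

    /** Hypothetical property key, used only for this illustration. */
    static final String VALIDATE_KEY = "sketch.validate.each.statement";

    /** null means "unset": the target's own default is left untouched. */
    private Boolean validateEachStatement;

    public void setValidateEachStatement(Boolean validateEachStatement) {
        this.validateEachStatement = validateEachStatement;
    }

    /** Copies the flag onto the target only when it was explicitly set. */
    void applyTo(Properties target) {
        if (validateEachStatement != null) {
            target.setProperty(VALIDATE_KEY, validateEachStatement.toString());
        }
    }

    public static void main(String[] args) {
        Properties untouched = new Properties();
        new OptionalFlagSketch().applyTo(untouched);
        System.out.println("key present when unset: " + untouched.containsKey(VALIDATE_KEY));

        OptionalFlagSketch configured = new OptionalFlagSketch();
        configured.setValidateEachStatement(Boolean.FALSE);
        Properties overridden = new Properties();
        configured.applyTo(overridden);
        System.out.println("value when set: " + overridden.getProperty(VALIDATE_KEY));
    }
}
```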

setUser

public void setUser(java.lang.String user)
Sets the user impersonation (optional) for executing Pig jobs. Should be used when running against a Hadoop Kerberos cluster.

Parameters:
user - user/group information
