Uses of Interface
org.apache.crunch.contrib.text.Extractor

Packages that use Extractor
org.apache.crunch.contrib.text   
 

Uses of Extractor in org.apache.crunch.contrib.text
 

Classes in org.apache.crunch.contrib.text that implement Extractor
 class AbstractCompositeExtractor<T>
          Base class for Extractor instances that delegates the parsing of fields to other Extractor instances, primarily used for constructing composite records that implement the Tuple interface.
 class AbstractSimpleExtractor<T>
          Base class for the common case Extractor instances that construct a single object from a block of text stored in a String, with support for error handling and reporting.
 

Methods in org.apache.crunch.contrib.text that return Extractor
static Extractor<Boolean> Extractors.xboolean()
          Returns an Extractor for booleans.
static Extractor<Boolean> Extractors.xboolean(Boolean defaultValue)
           
static
<T> Extractor<Collection<T>>
Extractors.xcollect(TokenizerFactory scannerFactory, Extractor<T> extractor)
           
static
<T extends Tuple>
Extractor<T>
Extractors.xcustom(Class<T> clazz, TokenizerFactory scannerFactory, Extractor... extractors)
          Returns an Extractor for a subclass of Tuple with a constructor that has the given extractor types that uses the given TokenizerFactory for parsing the sub-fields.
static Extractor<Double> Extractors.xdouble()
          Returns an Extractor for doubles.
static Extractor<Double> Extractors.xdouble(Double defaultValue)
           
static Extractor<Float> Extractors.xfloat()
          Returns an Extractor for floats.
static Extractor<Float> Extractors.xfloat(Float defaultValue)
           
static Extractor<Integer> Extractors.xint()
          Returns an Extractor for integers.
static Extractor<Integer> Extractors.xint(Integer defaultValue)
          Returns an Extractor for integers.
static Extractor<Long> Extractors.xlong()
          Returns an Extractor for longs.
static Extractor<Long> Extractors.xlong(Long defaultValue)
          Returns an Extractor for longs.
static
<K,V> Extractor<Pair<K,V>>
Extractors.xpair(TokenizerFactory scannerFactory, Extractor<K> one, Extractor<V> two)
          Returns an Extractor for pairs of the given types that uses the given TokenizerFactory for parsing the sub-fields.
static
<A,B,C,D> Extractor<Tuple4<A,B,C,D>>
Extractors.xquad(TokenizerFactory scannerFactory, Extractor<A> a, Extractor<B> b, Extractor<C> c, Extractor<D> d)
          Returns an Extractor for quads of the given types that uses the given TokenizerFactory for parsing the sub-fields.
static Extractor<String> Extractors.xstring()
          Returns an Extractor for strings.
static Extractor<String> Extractors.xstring(String defaultValue)
           
static
<A,B,C> Extractor<Tuple3<A,B,C>>
Extractors.xtriple(TokenizerFactory scannerFactory, Extractor<A> a, Extractor<B> b, Extractor<C> c)
          Returns an Extractor for triples of the given types that uses the given TokenizerFactory for parsing the sub-fields.
static Extractor<TupleN> Extractors.xtupleN(TokenizerFactory scannerFactory, Extractor... extractors)
          Returns an Extractor for an arbitrary number of types that uses the given TokenizerFactory for parsing the sub-fields.
 

Methods in org.apache.crunch.contrib.text with parameters of type Extractor
static
<T> PCollection<T>
Parse.parse(String groupName, PCollection<String> input, Extractor<T> extractor)
          Parses the lines of the input PCollection<String> and returns a PCollection<T> using the given Extractor<T>.
static
<T> PCollection<T>
Parse.parse(String groupName, PCollection<String> input, PTypeFamily ptf, Extractor<T> extractor)
          Parses the lines of the input PCollection<String> and returns a PCollection<T> using the given Extractor<T> that uses the given PTypeFamily.
static
<K,V> PTable<K,V>
Parse.parseTable(String groupName, PCollection<String> input, Extractor<Pair<K,V>> extractor)
          Parses the lines of the input PCollection<String> and returns a PTable<K, V> using the given Extractor<Pair<K, V>>.
static
<K,V> PTable<K,V>
Parse.parseTable(String groupName, PCollection<String> input, PTypeFamily ptf, Extractor<Pair<K,V>> extractor)
          Parses the lines of the input PCollection<String> and returns a PTable<K, V> using the given Extractor<Pair<K, V>> that uses the given PTypeFamily.
static
<T> Extractor<Collection<T>>
Extractors.xcollect(TokenizerFactory scannerFactory, Extractor<T> extractor)
           
static
<T extends Tuple>
Extractor<T>
Extractors.xcustom(Class<T> clazz, TokenizerFactory scannerFactory, Extractor... extractors)
          Returns an Extractor for a subclass of Tuple with a constructor that has the given extractor types that uses the given TokenizerFactory for parsing the sub-fields.
static
<K,V> Extractor<Pair<K,V>>
Extractors.xpair(TokenizerFactory scannerFactory, Extractor<K> one, Extractor<V> two)
          Returns an Extractor for pairs of the given types that uses the given TokenizerFactory for parsing the sub-fields.
static
<K,V> Extractor<Pair<K,V>>
Extractors.xpair(TokenizerFactory scannerFactory, Extractor<K> one, Extractor<V> two)
          Returns an Extractor for pairs of the given types that uses the given TokenizerFactory for parsing the sub-fields.
static
<A,B,C,D> Extractor<Tuple4<A,B,C,D>>
Extractors.xquad(TokenizerFactory scannerFactory, Extractor<A> a, Extractor<B> b, Extractor<C> c, Extractor<D> d)
          Returns an Extractor for quads of the given types that uses the given TokenizerFactory for parsing the sub-fields.
static
<A,B,C,D> Extractor<Tuple4<A,B,C,D>>
Extractors.xquad(TokenizerFactory scannerFactory, Extractor<A> a, Extractor<B> b, Extractor<C> c, Extractor<D> d)
          Returns an Extractor for quads of the given types that uses the given TokenizerFactory for parsing the sub-fields.
static
<A,B,C,D> Extractor<Tuple4<A,B,C,D>>
Extractors.xquad(TokenizerFactory scannerFactory, Extractor<A> a, Extractor<B> b, Extractor<C> c, Extractor<D> d)
          Returns an Extractor for quads of the given types that uses the given TokenizerFactory for parsing the sub-fields.
static
<A,B,C,D> Extractor<Tuple4<A,B,C,D>>
Extractors.xquad(TokenizerFactory scannerFactory, Extractor<A> a, Extractor<B> b, Extractor<C> c, Extractor<D> d)
          Returns an Extractor for quads of the given types that uses the given TokenizerFactory for parsing the sub-fields.
static
<A,B,C> Extractor<Tuple3<A,B,C>>
Extractors.xtriple(TokenizerFactory scannerFactory, Extractor<A> a, Extractor<B> b, Extractor<C> c)
          Returns an Extractor for triples of the given types that uses the given TokenizerFactory for parsing the sub-fields.
static
<A,B,C> Extractor<Tuple3<A,B,C>>
Extractors.xtriple(TokenizerFactory scannerFactory, Extractor<A> a, Extractor<B> b, Extractor<C> c)
          Returns an Extractor for triples of the given types that uses the given TokenizerFactory for parsing the sub-fields.
static
<A,B,C> Extractor<Tuple3<A,B,C>>
Extractors.xtriple(TokenizerFactory scannerFactory, Extractor<A> a, Extractor<B> b, Extractor<C> c)
          Returns an Extractor for triples of the given types that uses the given TokenizerFactory for parsing the sub-fields.
static Extractor<TupleN> Extractors.xtupleN(TokenizerFactory scannerFactory, Extractor... extractors)
          Returns an Extractor for an arbitrary number of types that uses the given TokenizerFactory for parsing the sub-fields.
 

Constructor parameters in org.apache.crunch.contrib.text with type arguments of type Extractor
AbstractCompositeExtractor(TokenizerFactory scannerFactory, List<Extractor<?>> extractors)
           
 



Copyright © 2014 The Apache Software Foundation. All Rights Reserved.