AccumuloDefaultIndexScanner (Hive 3.1.3 API)

java.lang.Object
- org.apache.hadoop.hive.accumulo.AccumuloDefaultIndexScanner

All Implemented Interfaces:

AccumuloIndexScanner
```
public class AccumuloDefaultIndexScanner
extends Object
implements AccumuloIndexScanner
```
This default index scanner expects indexes to be in the same format as presto's accumulo index tables defined as: [rowid=field value] [cf=cfname_cqname] [cq=rowid] [visibility] [value=""]
This handler looks for the following hive serde properties: 'accumulo.indextable.name' = 'table_idx' (required - name of the corresponding index table) 'accumulo.indexed.columns' = 'name,age,phone' (optional - comma separated list of indexed hive columns if not defined or defined as '*' all columns are assumed to be indexed ) 'accumulo.index.rows.max' = '20000' (optional - maximum number of match indexes to use before converting to a full table scan default=20000' Note: This setting controls the size of the in-memory list of rowids each search predicate. Using large values for this setting or having very large rowid values may require additional memory to prevent out of memory errors 'accumulo.index.scanner' = 'org.apache.hadoop.hive.accumulo.AccumuloDefaultIndexScanner' (optional - name of the index scanner)
To implement your own index table scheme it should be as simple as sub-classing this class and overriding getIndexRowRanges() and optionally init() if you need more config settings

Constructor Summary

Constructors
Constructor and Description

AccumuloDefaultIndexScanner()

Constructors
Constructor and Description
`AccumuloDefaultIndexScanner()`

Method Summary

All Methods Instance Methods Concrete Methods
Modifier and Type	Method and Description
`protected Map<String,String>`	`createColumnMap(org.apache.hadoop.conf.Configuration conf)`
`org.apache.accumulo.core.security.Authorizations`	`getAuths()`
`org.apache.accumulo.core.client.Connector`	`getConnect()`
`protected org.apache.accumulo.core.client.Connector`	`getConnector()`
`AccumuloConnectionParameters`	`getConnectParams()`
`Set<String>`	`getIndexColumns()`
`AccumuloIndexParameters`	`getIndexParams()`
`List<org.apache.accumulo.core.data.Range>`	`getIndexRowRanges(String column, org.apache.accumulo.core.data.Range indexRange)` Get a list of rowid ranges by scanning a column index.
`String`	`getIndexTable()`
`int`	`getMaxRowIds()`
`void`	`init(org.apache.hadoop.conf.Configuration conf)` Initialize object based on configuration.
`boolean`	`isIndexed(String column)` Test if column is defined in the index table.
`void`	`setConnectParams(AccumuloConnectionParameters connectParams)`

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Constructor Detail
- AccumuloDefaultIndexScanner
```
public AccumuloDefaultIndexScanner()
```

Method Detail

init
```
public void init(org.apache.hadoop.conf.Configuration conf)
```
Initialize object based on configuration.

Specified by:

init in interface AccumuloIndexScanner

Parameters:

conf - - Hive configuration

getIndexRowRanges
```
public List<org.apache.accumulo.core.data.Range> getIndexRowRanges(String column,
                                                                   org.apache.accumulo.core.data.Range indexRange)
```
Get a list of rowid ranges by scanning a column index.

Specified by:

getIndexRowRanges in interface AccumuloIndexScanner

Parameters:

column - - the hive column name

indexRange - - Key range to scan on the index table

Returns:

List of matching rowid ranges or null if too many matches found if index values are not found a newline range is added to list to short-circuit the query

isIndexed
```
public boolean isIndexed(String column)
```
Test if column is defined in the index table.

Specified by:

isIndexed in interface AccumuloIndexScanner

Parameters:

column - - hive column name

Returns:

true if the column is defined as part of the index table

createColumnMap

protected Map<String,String> createColumnMap(org.apache.hadoop.conf.Configuration conf)

getConnector

protected org.apache.accumulo.core.client.Connector getConnector()
                                                          throws org.apache.accumulo.core.client.AccumuloSecurityException,
                                                                 org.apache.accumulo.core.client.AccumuloException

Throws:: org.apache.accumulo.core.client.AccumuloSecurityException; org.apache.accumulo.core.client.AccumuloException

setConnectParams

public void setConnectParams(AccumuloConnectionParameters connectParams)

getConnectParams

public AccumuloConnectionParameters getConnectParams()

getIndexParams

public AccumuloIndexParameters getIndexParams()

getMaxRowIds
```
public int getMaxRowIds()
```

getAuths

public org.apache.accumulo.core.security.Authorizations getAuths()

getIndexTable
```
public String getIndexTable()
```

getIndexColumns
```
public Set<String> getIndexColumns()
```

getConnect

public org.apache.accumulo.core.client.Connector getConnect()

Class AccumuloDefaultIndexScanner

Constructor Summary

Method Summary

Methods inherited from class java.lang.Object

Constructor Detail

AccumuloDefaultIndexScanner

Method Detail

init

getIndexRowRanges

isIndexed

createColumnMap

getConnector

setConnectParams

getConnectParams

getIndexParams

getMaxRowIds

getAuths

getIndexTable

getIndexColumns

getConnect