public interface Reader
Modifier and Type | Interface and Description |
---|---|
static class |
Reader.Options
Options for creating a RecordReader.
|
Modifier and Type | Method and Description |
---|---|
CompressionKind |
getCompressionKind()
Get the compression kind.
|
int |
getCompressionSize()
Get the buffer size for the compression.
|
long |
getContentLength()
Get the length of the file.
|
OrcProto.FileTail |
getFileTail()
Get the file tail (footer + postscript)
|
OrcFile.Version |
getFileVersion()
Get the file format version.
|
List<String> |
getMetadataKeys()
Get the user metadata keys.
|
int |
getMetadataSize() |
ByteBuffer |
getMetadataValue(String key)
Get a user metadata value.
|
long |
getNumberOfRows()
Get the number of rows in the file.
|
List<OrcProto.ColumnStatistics> |
getOrcProtoFileStatistics() |
List<OrcProto.StripeStatistics> |
getOrcProtoStripeStatistics() |
long |
getRawDataSize()
Get the deserialized data size of the file
|
long |
getRawDataSizeFromColIndices(List<Integer> colIds)
Get the deserialized data size of the specified columns ids
|
long |
getRawDataSizeOfColumns(List<String> colNames)
Get the deserialized data size of the specified columns
|
int |
getRowIndexStride()
Get the number of rows per a entry in the row index.
|
TypeDescription |
getSchema()
Get the type of rows in this ORC file.
|
ByteBuffer |
getSerializedFileFooter() |
ColumnStatistics[] |
getStatistics()
Get the statistics about the columns in the file.
|
List<StripeInformation> |
getStripes()
Get the list of stripes.
|
List<StripeStatistics> |
getStripeStatistics() |
List<OrcProto.Type> |
getTypes()
Deprecated.
use getSchema instead
|
List<Integer> |
getVersionList() |
OrcFile.WriterVersion |
getWriterVersion()
Get the version of the writer of this file.
|
boolean |
hasMetadataValue(String key)
Did the user set the given metadata value.
|
RecordReader |
rows()
Create a RecordReader that reads everything with the default options.
|
RecordReader |
rows(Reader.Options options)
Create a RecordReader that uses the options given.
|
long getNumberOfRows()
long getRawDataSize()
long getRawDataSizeOfColumns(List<String> colNames)
colNames
- long getRawDataSizeFromColIndices(List<Integer> colIds)
colIds
- - internal column id (check orcfiledump for column ids)List<String> getMetadataKeys()
ByteBuffer getMetadataValue(String key)
key
- a key given by the userboolean hasMetadataValue(String key)
key
- the key to checkCompressionKind getCompressionKind()
int getCompressionSize()
int getRowIndexStride()
List<StripeInformation> getStripes()
long getContentLength()
ColumnStatistics[] getStatistics()
TypeDescription getSchema()
List<OrcProto.Type> getTypes()
OrcFile.Version getFileVersion()
OrcFile.WriterVersion getWriterVersion()
OrcProto.FileTail getFileTail()
RecordReader rows() throws IOException
IOException
RecordReader rows(Reader.Options options) throws IOException
options
- the options to read withIOException
List<Integer> getVersionList()
int getMetadataSize()
List<OrcProto.StripeStatistics> getOrcProtoStripeStatistics()
List<StripeStatistics> getStripeStatistics() throws IOException
IOException
List<OrcProto.ColumnStatistics> getOrcProtoFileStatistics()
ByteBuffer getSerializedFileFooter()
Copyright © 2016 The Apache Software Foundation. All rights reserved.