public interface FSIndex<T extends FeatureStructure> extends Iterable<T>
Notice that each feature structure index uses its own ordering relation that will usually be different from index to index. In particular, equality with respect to the index ordering will in general not imply object identity for the feature structures.
We currently support three different kinds of indexes: sorted, set and bag indexes. The default index is a sorted index. In a sorted index, an FS that is committed (added to the index) is added unless that same FS is already in the index. This behavior is new as of version 2.7.0; the old behavior can be restored by specifying the JVM property "uima.allow_duplicate_add_to_indexes". Even without this flag, distinct FS instances that compare equal will all be added to the index.
The index is sorted in the sense that iterators will output FSs in sorted order according to the comparator for that index. The order of FSs that are equal with respect to the comparator is arbitrary but fixed. That is, if you iterate over the same index several times, you will see the same relative order of FSs every time. We also guarantee that reverse iterators will produce exactly the reverse sequence of forward iteration.
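The sorted-index behavior described above can be modeled with plain JDK collections. The following is a minimal sketch, not the UIMA implementation: the class SortedIndexSketch, its nested Item type and the single integer sort key are all hypothetical, and the choice of insertion order as the fixed order among equal items is an assumption made for the example.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.Comparator;
import java.util.List;

/** Toy model of sorted-index behavior; hypothetical, not the UIMA implementation. */
class SortedIndexSketch {

    /** Hypothetical stand-in for a feature structure with one sort key. */
    static final class Item {
        final String label;
        final int key;

        Item(String label, int key) {
            this.label = label;
            this.key = key;
        }
    }

    private final Comparator<Item> comparator = Comparator.comparingInt(item -> item.key);
    private final List<Item> items = new ArrayList<>();

    /** Adds the item unless this very instance is already present. */
    boolean add(Item item) {
        for (Item existing : items) {
            if (existing == item) {
                return false; // identical instance: not added a second time
            }
        }
        // Skip past every element that is less than or equal to the new one,
        // so items that compare equal keep a fixed relative order.
        int position = 0;
        while (position < items.size() && comparator.compare(items.get(position), item) <= 0) {
            position++;
        }
        items.add(position, item);
        return true;
    }

    /** Labels in forward (sorted) order. */
    List<String> forward() {
        List<String> labels = new ArrayList<>();
        for (Item item : items) {
            labels.add(item.label);
        }
        return labels;
    }

    /** Labels in reverse order: exactly the forward sequence, reversed. */
    List<String> backward() {
        List<String> labels = forward();
        Collections.reverse(labels);
        return labels;
    }
}
```

In this sketch, two distinct items with the same key are both kept, while adding the same instance a second time is a no-op.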
A set index will contain no duplicates of the same type, where a duplicate is defined by the indexing comparator. That is, if you commit two feature structures of the same type that are equal with respect to the indexing comparator, only the first one will be entered into the index. Note that you can still have duplicates with respect to the indexing order if they are of a different type. A set index is not guaranteed to be sorted.
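The per-type duplicate rule for set indexes can be illustrated with a small stand-alone model. This is a sketch under stated assumptions, not UIMA code: SetIndexSketch, its Item type, the string type name and the integer key are hypothetical names introduced only for the example.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

/** Toy model of set-index behavior; hypothetical, not the UIMA implementation. */
class SetIndexSketch {

    /** Hypothetical stand-in for a typed feature structure with one key. */
    static final class Item {
        final String type;
        final int key;

        Item(String type, int key) {
            this.type = type;
            this.key = key;
        }
    }

    private final Comparator<Item> comparator = Comparator.comparingInt(item -> item.key);
    private final List<Item> items = new ArrayList<>();

    /** Adds the item unless an item of the same type already compares equal to it. */
    boolean add(Item item) {
        for (Item existing : items) {
            if (existing.type.equals(item.type) && comparator.compare(existing, item) == 0) {
                return false; // duplicate within the same type: the first one stays
            }
        }
        items.add(item); // no ordering is maintained in this model
        return true;
    }

    int size() {
        return items.size();
    }
}
```

Here a second item of type "Token" with the same key is rejected, while an item of type "Sentence" with that key is still accepted.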
Finally, a bag index simply stores everything, without any guaranteed order. Note that operations such as find() or FSIterator.moveTo() will not produce useful results on bag indexes, since bag indexes do not honor comparators (except that find() is useful for indicating whether the FS is in the index). Only use a bag index if you want very fast adding and will have to iterate over the whole index anyway.
Indexes have a top-most type, either the top of the type hierarchy, or some subtype of that. The top-most type in an index is represented by the generic type parameter T.
Modifier and Type | Field | Description |
---|---|---|
static int | BAG_INDEX | Indexing strategy: bag index. |
static int | DEFAULT_BAG_INDEX | Special indexes used by the framework to implement FSIndexRepository.getAllIndexedFS(Type). |
static int | SET_INDEX | Indexing strategy: set index. |
static int | SORTED_INDEX | Indexing strategy: sorted index. |
Modifier and Type | Method | Description |
---|---|---|
int | compare(FeatureStructure fs1, FeatureStructure fs2) | Compare two feature structures according to the ordering relation of the index. |
boolean | contains(FeatureStructure fs) | Check if the index contains an element equal to the given feature structure according to the comparators defined for this index. |
FeatureStructure | find(FeatureStructure fs) | Find an entry in the index "equal to" the given feature structure according to the comparators specified for this index. |
int | getIndexingStrategy() | Return the indexing strategy. |
Type | getType() | Return the type of feature structures this index contains. |
FSIterator<T> | iterator() | Return an iterator over the index. |
FSIterator<T> | iterator(FeatureStructure fs) | Return an iterator over the index. |
int | size() | Return the number of feature structures in this index. |
FSIndex<T> | withSnapshotIterators() | Creates a shared copy of this FSIndex configured to produce snapshot iterators that don't throw ConcurrentModificationExceptions. |
Methods inherited from interface java.lang.Iterable: forEach, spliterator
static final int SORTED_INDEX
static final int SET_INDEX
static final int BAG_INDEX
static final int DEFAULT_BAG_INDEX
Special indexes used by the framework to implement FSIndexRepository.getAllIndexedFS(Type). Not user-definable.

int size()

Type getType()

boolean contains(FeatureStructure fs)
Parameters: fs - A Feature Structure used as a template to match for equality with the FSs in the index.
Returns: true if the index contains such an element.

FeatureStructure find(FeatureStructure fs)
Parameters: fs - A Feature Structure used as a template to match with the Feature Structures in the index.
Returns: An entry equal to fs, or null if no such FS exists.
See Also: FSIterator.moveTo(FeatureStructure)
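Both contains and find match by comparator equality against a template, not by object identity. The sketch below models that idea with JDK types only. TemplateLookupSketch, Item and the begin/end keys are hypothetical names chosen for illustration, and the linear scan is a simplification rather than how UIMA performs the lookup.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

/** Toy model of template-based lookup; hypothetical, not the UIMA implementation. */
class TemplateLookupSketch {

    /** Hypothetical stand-in for a feature structure with two key features. */
    static final class Item {
        final String label;
        final int begin;
        final int end;

        Item(String label, int begin, int end) {
            this.label = label;
            this.begin = begin;
            this.end = end;
        }
    }

    // Only begin and end take part in the ordering; the label is ignored.
    private final Comparator<Item> comparator =
            Comparator.comparingInt((Item item) -> item.begin)
                      .thenComparingInt(item -> item.end);
    private final List<Item> items = new ArrayList<>();

    void add(Item item) {
        items.add(item);
    }

    /** Returns an indexed item that compares equal to the template, or null. */
    Item find(Item template) {
        for (Item candidate : items) {
            if (comparator.compare(candidate, template) == 0) {
                return candidate;
            }
        }
        return null;
    }

    /** True if some indexed item compares equal to the template. */
    boolean contains(Item template) {
        return find(template) != null;
    }
}
```

A template built with the same begin and end values as an indexed item is found even though it is a different object with a different label.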

int compare(FeatureStructure fs1, FeatureStructure fs2)
Parameters: fs1 - the first Feature Structure to compare
fs2 - the second Feature Structure to compare
Returns: -1 if fs1 < fs2; 0 if fs1 = fs2; 1 else.

FSIterator<T> iterator()
Specified by: iterator in interface Iterable<T extends FeatureStructure>
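The -1 / 0 / 1 return convention documented for compare can be sketched with a plain JDK comparator. This is an illustration only: CompareContractSketch, the Span type and the chosen ordering (ascending begin, then descending end) are assumptions for the example, not the ordering of any particular index.

```java
import java.util.Comparator;

/** Toy three-way comparison; hypothetical, not the UIMA implementation. */
class CompareContractSketch {

    /** Hypothetical stand-in for a feature structure with two key features. */
    static final class Span {
        final int begin;
        final int end;

        Span(int begin, int end) {
            this.begin = begin;
            this.end = end;
        }
    }

    // Assumed ordering for the example: ascending begin, then descending end.
    private static final Comparator<Span> ORDER =
            Comparator.comparingInt((Span span) -> span.begin)
                      .thenComparing(Comparator.comparingInt((Span span) -> span.end).reversed());

    /** Returns -1, 0 or 1, mirroring the documented return values of compare. */
    static int compare(Span first, Span second) {
        // Comparator only promises a negative, zero or positive value,
        // so the result is normalized with Integer.signum.
        return Integer.signum(ORDER.compare(first, second));
    }
}
```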

FSIterator<T> iterator(FeatureStructure fs)
Return an iterator over the index. The iterator is positioned such that the feature structure returned by its get() method is greater than or equal to fs, and any previous FS is less than fs (the iterator is positioned at the earliest of equal values). If no such position exists, the iterator will be invalid.
Parameters: fs - The feature structure at which the iterator should be positioned.
Returns: An iterator positioned at fs, if it exists. An invalid iterator, else.

int getIndexingStrategy()
Returns: SORTED_INDEX, BAG_INDEX or SET_INDEX.

FSIndex<T> withSnapshotIterators()
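The motivation for snapshot iterators can be shown with ordinary JDK collections: modifying a collection while iterating over it fails fast, whereas iterating over a copy taken beforehand does not. The sketch below is an analogy using java.util.ArrayList only. SnapshotIterationSketch and its method names are hypothetical, and it makes no claim about how withSnapshotIterators() is implemented.

```java
import java.util.ArrayList;
import java.util.ConcurrentModificationException;
import java.util.List;

/** JDK-only analogy for snapshot iteration; hypothetical, not the UIMA implementation. */
class SnapshotIterationSketch {

    /**
     * Removes elements equal to target while iterating over the live list.
     * Returns true if the iteration failed with a ConcurrentModificationException.
     */
    static boolean removeFromLiveIteration(List<String> values, String target) {
        try {
            for (String value : values) {
                if (value.equals(target)) {
                    values.remove(value); // structural change during iteration
                }
            }
            return false;
        } catch (ConcurrentModificationException e) {
            return true;
        }
    }

    /** Removes elements equal to target while iterating over a snapshot copy. */
    static void removeFromSnapshotIteration(List<String> values, String target) {
        List<String> snapshot = new ArrayList<>(values); // copy taken before iterating
        for (String value : snapshot) {
            if (value.equals(target)) {
                values.remove(value); // safe: the snapshot is what is being iterated
            }
        }
    }
}
```

The trade-off in this analogy is the cost of the copy in exchange for being able to add to or remove from the underlying collection inside the loop.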
Copyright © 2006–2017 The Apache Software Foundation. All rights reserved.