org.apache.uima.java
true
org.spin.scrubber.uima.annotator.RegexAnnotator
Regex Annotator
Matches regular expressions in document text.
Filenames
list of external resource dependency keys that need to be initialized for this annotator
String
true
true
CaseSensitiveFile
boolean flag to determine if files should be interpreted as case sensitive or not.
Boolean
true
true
Filenames
PatternFile
HospitalNameFile
PrivateFile
CaseSensitiveFile
true
false
false
uima.tcas.Annotation
org.spin.scrubber.uima.type.OntologyMatch
true
true
false
PatternFile
An required external file containing regular expressions to match.
File format is as follows:
- Lines starting with // or whitepsace are ignored
- Lines starting with # are the regex name
- Lines starting with % indicate an annotation type.
- All other lines are regular expressions.
false
HospitalNameFile
An optional external file containing names to match.
File format is as follows:
- Lines starting with // or whitepsace are ignored
- Lines starting with # are the name
- Lines starting with % indicate an annotation type.
- All other lines consist of strings to match.
true
PrivateFile
An optional external file containing names to match.
File format is as follows:
- Lines starting with // or whitepsace are ignored
- Lines starting with # are the name
- Lines starting with % indicate an annotation type.
- All other lines consist of strings to match.
true
regex
file:conf/regex_patterns.txt
hospital
file:conf/hospital_names.txt
private
file:conf/private_dict.txt
PatternFile
regex
HospitalNameFile
hospital
PrivateFile
private