org.apache.uima.java
edu.mayo.bmi.uima.termspotter.cr.RadiologyRecordsCollectionReader
lines from file collection reader
This collection reader facilitates reading "documents" from a single file. Each line in the document will be considered an entity to be analyzed by the CPE. That is each line will be treated as a "document" and will have its own CAS.
1.0
Mayo Clinic
InputFileName
specifies the location of the file that will be run through this collection reader.
String
false
true
Language
ISO language code for the documents
String
false
false
CommentString
specifies a comment string. Any line that begins with the string will be ignored and not be added as a "document" to the CPE.
String
false
false
IgnoreBlankLines
Optional parameter determines whether a blank line will be processed as a document or will be ignored. The default will be set to 'true' which means that blank lines will be ignored.
Boolean
false
false
IdDelimeter
Name of optional configuration parameter that specifies a character (or string) that delimits
the id of the document from the text of the document. For example, if the parameter is
set to '|' then the following line from a file:
1234|this is some text
would have an id of 1234 and text 'this is some text'.
* If this parameter is not set, then
* the id of a document will be its line number in the file.
String
false
false
ColumnCount
Number of columns contained in the radiology record. Typically, all the columns will be skipped for actual annotation except the final column which contains the details of the examination.
Integer
false
false
FilterExamTypes
Specifies the file which contains the valid types of exams which will be processed by the pipeline.
String
false
false
FilterExamColumnNumber
The column number of the input file that contains the compare string to parse applicable exam types.
Integer
false
false
InputFileName
change me
IgnoreBlankLines
true
ColumnCount
0
FilterExamColumnNumber
0
true
false
false