PAD term spotter

Overview

The ‘PAD term spotter’ pipeline processes radiology note textual extractions specifically pertaining to the diagnosis, treatment, etc. of lower limb Peripheral Artery Disease (PAD) (e.g. stenosis/occlusion paired with popliteal/femoral). The main feature is classifying each document for the presence of PAD. Descriptive text of diagnosis and illness terms are paired with the site designated terms to build a relational tie, indicating a hit. The pipeline assesses presence of phrases indicative of peripheral arterial disease (PAD) in one or more sentences contained in radiology related documents.

Note
Disclaimer

This should be considered a beta release of this annotator. See the Clinical Text Analysis and Knowledge Extraction System User Guide documentation located in the cTAKES <pipeline-root>/docs/userguide/cTAKES_userguide.htm for detailed install and setup information pertaining to all the cTAKES components. For additional, documentation pertaining to this pipeline see <pipeline-root>/PAD term spotter/doc/NLP OF RADIOLOGY REPORTSv1.pdf and <pipeline-root>/PAD term spotter/README (for prerequisite and installation steps see <pipeline-root>/PAD term spotter/README).

Collection readers (annotator)

Analysis engines (annotator)

Note

The following classes and files have Mayo specific site and terminology terms that are being leveraged, especially as it pertains to the subsection handling:

1) ‘/PAD term spotter/src/edu/mayo/bmi/fsm/machine/SubSectionPadIdFSM.java’ -Terms; "smh","rmh","gonda","romayo" are indicative of names of buildings on the Mayo campus which are used to mark subsection begin/end - terms; "indications","bleindications","exam","showing" are special terms which often contain the terms being screened for relating to PAD, but since they are titles of examinations, revision sections, and generalized screenings they are to be ignored in the Mayo cohort.

2) ‘/PAD term spotter/src/main/java/org/apache/ctakes/pad/impl/PADConsumerImpl.java’ - Terms; "indications:" and "showing" are special terms which often contain the terms being screened for relating to PAD, but since they are titles of examinations, revision sections, and generalized screenings they are to be ignored in the Mayo cohort. "maxSubsectionSize" is used to limit the overall scope of where the subsection tokens will be searched. It has been hardcoded to 300 in the shipped class.

3) ‘/PAD term spotter/resources/lookup/radiology/ExamTitleWords.txt’ - Comma delimited terms which represent key values to distinguish the type of radiology examination being utilized: US_EXAM (ultrasound), LOWER_EXT (lower extremity), US_LOWER_EXT (ultrasound lower extremity), US_LOWER_SOLO (ultrasound lower extremity one side only), CT_EXAM (CAT scan), CT_EXAM_SOLO (CAT Scan one side only).

4) ‘/PAD term spotter/resources/lookup/radiology/ExamsForPAD.csv’ - Provides a list of valid examination codes to act as a filter to eliminate the need to parse records not related to PAD.

CAS consumers

Resources