Title: Apache cTAKES - Glossary Notice: Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. See the NOTICE file distributed with this work for additional information regarding copyright ownership. The ASF licenses this file to you under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at . http://www.apache.org/licenses/LICENSE-2.0 . Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License. **CDA** - Clinical Document Architecture – an HL7 XML-based standard for clinical documents. **component** - cTAKES is made up of a number of components, some optional. Some components include a single annotator. Others, such as 'core', include multiple annotators. For example, 'core' includes a sentence detector annotator and a tokenizer annotator. **cTAKES** - Clinical Text Analysis and Knowledge Extraction System **GENIA** - A collection of biomedical literature. It has been compiled and annotated within the scope of the [GENIA project](http://www-tsujii.is.s.u-tokyo.ac.jp/GENIA/). **LVG** - One of the NLM's Lexical Tools, used by cTAKES to find the normalized form of words. Refer to the [LVG information on the NLM website](http://lexsrv2.nlm.nih.gov/LexSysGroup/Projects/lvg/current/docs/userDoc/tools/lvg.html). **Orange Book** - A list from the U.S. Food and Drug Administration (FDA) of FDA-approved prescription drugs, including new and generic drugs. Refer to the [Orange information on the FDA website](http://www.fda.gov/cder/orange/). **Penn Treebank** - Corpus of naturally-occurring text annotated for linguistic structure. Refer to the [Treebank information on the University of Pennsylvania website](http://www.cis.upenn.edu/~treebank/). **pipeline** - Refers to one or more cTAKES components configured to process documents. **RxNORM** - A "standardized nomenclature for clinical drugs," produced by the National Library of Medicine (NLM). Refer to the [RxNORM information on the NLM website](http://www.nlm.nih.gov/research/umls/rxnorm/). **SNOMED** - Systematized Nomenclature of Medicine – Clinical Terms (SNOMED CT®), a comprehensive clinical terminology, originally created by the College of American Pathologists (CAP) and now owned, maintained, and distributed by the International Health Terminology Standards Development Organisation (IHTSDO). [Refer to SNOMED CT](http://www.ihtsdo.org/snomed-ct/). See also [the SNOMED information on the NLM website](http://www.nlm.nih.gov/research/umls/Snomed/snomed_main.html). **UIMA** - Unstructured Information Management Architecture. Refer to the [UMIA information on the Apache website](http://uima.apache.org/index.html). **UMLS** - U.S. National Library of Medicine's Unified Medical Language System®. Refer to the [UMLS information on the NLM website](http://www.nlm.nih.gov/research/umls/).