Apache UIMA-DUCC (Unstructured Information Management Architecture - Distributed UIMA Cluster Computing ) v.2.2.0 Release Notes

Contents

1. What is UIMA-DUCC?
2. Major Changes in this Release
3. Migration from a Prior Release
4. Limitations

1. What is UIMA-DUCC?

DUCC stands for Distributed UIMA Cluster Computing. DUCC is a cluster management system providing tooling, management, and scheduling facilities to automate the scale-out of applications written to the UIMA framework. Core UIMA provides a generalized framework for applications that process unstructured information such as human language, but does not provide a scale-out mechanism. UIMA-AS provides a scale-out mechanism to distribute UIMA pipelines over a cluster of computing resources, but does not provide job or cluster management of the resources. DUCC defines a formal job model that closely maps to a standard UIMA pipeline. Around this job model DUCC provides cluster management services to automate the scale-out of UIMA pipelines over computing clusters.

2. Major Changes in this Release

Apache UIMA DUCC 2.2.0 is a major release containing new features and bug fixes. What's new:



For a complete list of issues fixed and up-to-date information on UIMA-DUCC issues, see our issue tracker: https://issues.apache.org/jira/issues/?jql=project%20%3D%20UIMA%20AND%20fixVersion%20%3D%20%222.2.0-Ducc%22%20

3. Migration from a Prior Release

An existing DUCC installation can be updated in place by using the ducc_update script which can be copied from the UIMA Downloads page or extracted from the binary distribution. Additional steps are required to convert existing history and state files for database access. Details are in the INSTALL document and the DuccBook.

4. Limitations

On some systems cgroups swap accounting is not enabled and duccmon will show N/A for swap. To confirm, please check memory.stat file in /ducc/ folder. If swap accounting is enabled there should be "swap" property defined. If it's missing, you need to add a kernel parameter swapaccount=1. Details of how to do this can be found here.