# Licensed to the Apache Software Foundation (ASF) under one or more
# contributor license agreements. See the NOTICE file distributed with
# this work for additional information regarding copyright ownership.
# The ASF licenses this file to You under the Apache License, Version 2.0
# (the "License"); you may not use this file except in compliance with
# the License. You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
Source: oozie
Section: misc
Priority: extra
Maintainer: Bigtop
Build-Depends: debhelper (>= 6)
Depends: zip, unzip
Standards-Version: 3.8.0
Homepage: http://incubator.apache.org/oozie/

Package: oozie-client
Architecture: all
Depends: bigtop-utils
Description: Client for Oozie Workflow Engine
 Oozie client is a command line client utility that allows remote
 administration and monitoring of workflows. Using this client
 utility you can submit workflows, start/suspend/resume/kill
 workflows and find out their status at any time. Apart from such
 operations, you can also change the status of the entire system and
 get version information. This client utility also allows you to
 validate workflows before they are deployed to the Oozie server.

Package: oozie
Architecture: all
Depends: oozie-client (= ${source:Version}), hadoop-client, bigtop-tomcat
Description: Oozie is a system that runs workflows of Hadoop jobs.
 Oozie is a system that runs workflows of Hadoop jobs.
 Oozie workflows are actions arranged in a control dependency DAG
 (Directed Acyclic Graph).
 .
 Oozie coordinator functionality allows workflows to be started at
 regular frequencies and when data becomes available in HDFS.
 .
 An Oozie workflow may contain the following types of action nodes:
 map-reduce, map-reduce streaming, map-reduce pipes, pig, file-system,
 sub-workflows, java, hive, sqoop and ssh (deprecated).
 .
 Flow control operations within the workflow can be done using decision,
 fork and join nodes. Cycles in workflows are not supported.
 .
 Actions and decisions can be parameterized with job properties, action
 output (e.g. Hadoop counters) and HDFS file information (file exists,
 file size, etc). Formal parameters are expressed in the workflow
 definition as ${VARIABLE NAME} variables.
 .
 A workflow application is an HDFS directory that contains the workflow
 definition (an XML file) and all the files necessary to run the actions:
 JAR files for Map/Reduce jobs, shell scripts for streaming Map/Reduce
 jobs, native libraries, Pig scripts, and other resource files.
 .
 Workflow jobs are run via command line tools, a WebServices API or a
 Java API.
 .
 The system and workflow jobs can be monitored via a web console, the
 command line tools, the WebServices API and the Java API.
 .
 Oozie is a transactional system and it has built-in automatic and
 manual retry capabilities.
 .
 If a workflow job fails, it can be rerun, skipping previously completed
 actions; the workflow application can be patched before the rerun.