~~ Licensed under the Apache License, Version 2.0 (the "License");
~~ you may not use this file except in compliance with the License.
~~ You may obtain a copy of the License at
~~
~~   http://www.apache.org/licenses/LICENSE-2.0
~~
~~ Unless required by applicable law or agreed to in writing, software
~~ distributed under the License is distributed on an "AS IS" BASIS,
~~ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
~~ See the License for the specific language governing permissions and
~~ limitations under the License. See accompanying LICENSE file.

  ---
  Apache Oozie, Workflow Scheduler for Hadoop ${project.version}
  ---
  ${maven.build.timestamp}

Apache Oozie, Workflow Scheduler for Hadoop ${project.version}

* Overview

  Oozie is an open-source workflow/coordination service to manage data processing jobs for Apache Hadoop™.
  It is an extensible, scalable and data-aware service to orchestrate dependencies between jobs running on
  Hadoop (including HDFS, Pig and MapReduce).

  Oozie is a lot of things, but it is not:

    * A workflow solution for off-Hadoop processing

    * Another query processing API, a la Cascading

* Oozie Benefits

  Complex workflow action dependencies: An Oozie workflow comprises actions and the dependencies among them.
  Users create Directed Acyclic Graphs (DAGs) to model their workflows. At runtime, Oozie manages the
  dependencies and executes each action once the dependencies identified in the DAG are satisfied.

  Reduces Time-To-Market (TTM): The DAG specification enables users to specify the workflow, so they save the
  time otherwise spent building and maintaining custom solutions for dependency and workflow management.

  Frequency execution: The Oozie workflow specification supports both data and time triggers. Users can
  specify an execution frequency and can wait for data arrival to trigger an action in the workflow.

  Native Hadoop stack integration: Oozie supports all types of Hadoop jobs and is integrated with the Hadoop stack. The Yahoo!
  distribution of Oozie is validated against the Hadoop stack.

  Oozie was designed for Yahoo!'s complex workflows and data pipelines at global scale. It is integrated with
  the Yahoo! Distribution of Hadoop with security and is a primary mechanism for managing a variety of complex
  data analysis workloads across Yahoo!.

* License

  Oozie is distributed under the {{{http://www.apache.org/licenses/}Apache License 2.0}}.

* Source Repository

  {{http://svn.apache.org/repos/asf/incubator/oozie/}}