The BatchEE project aims to provide a JBatch implementation (aka JSR352) and a set of useful extensions for this specification.
Entered incubation 2013-10-03 .
FIXME
Group 1 = January, April, July, October (Currently monthly: August)
DataFu provides a collection of Hadoop MapReduce jobs, and functions in higher-level languages based on it, to perform data analysis. It provides functions for common statistics tasks (e.g. quantiles, sampling), PageRank, stream sessionization, and set and bag operations. DataFu also provides Hadoop jobs for incremental data processing in MapReduce.
Entered incubation 2014-01-05 .
Jakob Homan
Group 1 = January, April, July, October (Currently monthly: August)
Edgent is a stream processing programming model and lightweight runtime to execute analytics at devices on the edge or at the gateway. (Formerly known as Quarks)
Entered incubation 2016-02-29 .
Katherine Marsden
Group 2 = February, May, August, November
Guacamole is an enterprise-grade, protocol-agnostic, remote desktop gateway. Combined with cloud hosting, Guacamole provides an excellent alternative to traditional desktops. Guacamole aims to make cloud-hosted desktop access preferable to traditional, local access.
Entered incubation 2016-02-10 .
Jean-Baptiste Onofre
Group 2 = February, May, August, November
Gobblin is a distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
Entered incubation 2017-02-23 .
Olivier Lamy
Group 1 = January, April, July, October (Currently monthly: April, May, June, July, August, September)
A real-time, distributed, fault-tolerant stream processing engine.
Entered incubation 2017-06-23 .
Julien Le Dem
Group 2 = February, May, August, November (Currently monthly: August, September, October)
Impala is a high-performance C++ and Java SQL query engine for data stored in Apache Hadoop-based clusters.
Entered incubation 2015-12-03 .
Tom White
Group 2 = February, May, August, November
Open source system that enables the orchestration of IoT devices.
Entered incubation 2016-01-20 .
Hadrian Zbarcea
Group 2 = February, May, August, November
Joshua is a statistical machine translation toolkit.
Entered incubation 2016-02-13 .
Chris Mattmann
Group 2 = February, May, August, November
Livy is a web service that exposes a REST interface for managing long-running Apache Spark contexts in your cluster. With Livy, new applications that require fine-grained interaction with many Spark contexts can be built on top of Apache Spark.
Entered incubation 2017-06-05 .
Sean Busbey
Group 1 = January, April, July, October (Currently monthly: July, August, September)
MRQL is a query processing and optimization system for large-scale, distributed data analysis, built on top of Apache Hadoop, Hama, Spark, and Flink.
Entered incubation 2013-03-13 .
Edward J. Yoon
Group 3 = March, June, September, December (Currently monthly: July, August)
NetBeans is a development environment, tooling platform and application framework.
Entered incubation 2016-10-01 .
Bertrand Delacretaz
Group 1 = January, April, July, October (Currently monthly: August)
PredictionIO is an open source machine learning server, built on top of a state-of-the-art open source stack, that enables developers to manage and deploy production-ready predictive services for various kinds of machine learning tasks.
Entered incubation 2016-05-26 .
Andrew Purtell
Group 2 = February, May, August, November
Pulsar is a highly scalable, low latency messaging platform running on commodity hardware. It provides simple pub-sub semantics over topics, guaranteed at-least-once delivery of messages, automatic cursor management for subscribers, and cross-datacenter replication.
Entered incubation 2017-06-01 .
Bryan Call
Group 3 = March, June, September, December (Currently monthly: July, August, September)
Ratis is a Java implementation of the Raft consensus protocol.
Entered incubation 2017-01-03 .
Jitendra Pandey
Group 2 = February, May, August, November
S2Graph is a distributed and scalable OLTP graph database built on Apache HBase to support fast traversal of extremely large graphs.
Entered incubation 2015-11-29 .
Hyunsik Choi
Group 2 = February, May, August, November
Slider is a collection of tools and technologies to package, deploy, and manage long-running applications on Apache Hadoop YARN clusters.
Entered incubation 2014-04-29 .
Vinod K
Group 2 = February, May, August, November
Superset is an enterprise-ready web application for data exploration, data visualization and dashboarding.
Entered incubation 2017-05-21 .
Daniel Dai
Group 3 = March, June, September, December (Currently monthly: June, July, August, September)
Tamaya is a highly flexible configuration solution based on a modular, extensible and injectable key/value design, which should provide a minimal but extensible, modern and functional API leveraging SE, ME and EE environments.
Entered incubation 2014-11-14 .
David Blevins
Group 2 = February, May, August, November
Toree provides applications with a mechanism to interactively and remotely access Apache Spark.
Entered incubation 2015-12-02 .
Sam Ruby
Group 2 = February, May, August, November
Unomi is a reference implementation of the OASIS Context Server specification currently being worked on by the OASIS Context Server Technical Committee. It provides a high-performance user profile and event tracking server.
Entered incubation 2015-10-05 .
Jean-Baptiste Onofre
Group 2 = February, May, August, November