The Apache Software Foundation has been key to the development of the Web since 1999
The Apache Software Foundation has been key to the development of the Web since 1999

The Apache Software Foundation (ASF) offers a wide range of tools, libraries, frameworks, and data stores for building enterprise applications. The purpose of this list is to keep track of what they are and what their current status is, with a strict focus on enterprise apps.

This document is a constant work in progress and will probably never be finished, not because I am lazy, but because all those projects are moving fast. I monitor closely comments, so do not hesitate to report errors, comments, updates. If you think we are missing an important information about a project, feel free to ask for it.

This document was last updated on[lastupdated before=””].

All Projects

Backed by Apple

Backed by CMU

Backed by CloudOps

Backed by Cloudera

Backed by Confluent

Backed by CyberAgent

Backed by Data Fueled

Backed by DataStax

Backed by DataTorrent

Backed by Databricks

Backed by Dell Software

Backed by Ecetera

Backed by Facebook

Backed by Georgia Institute of Technology

Backed by GyroLabs

Backed by Hortonworks

Backed by Huawei

Backed by IBM

Backed by InMobi

Backed by Intel

Backed by Jive

Backed by MapR Technologies

Backed by Microsoft

Backed by Midokura

Backed by NASA

Backed by NASA JPL

Backed by NTT DATA

Backed by Nextiva

Backed by Nicira

Backed by ObjectStyle

Backed by Oracle

Backed by Pivotal

Backed by Pragsis

Backed by Red Hat

Backed by RentTheRunway

Backed by Riot Games

Backed by ScalingData

Backed by StreamSets

Backed by Talend

Backed by Twitter

Backed by UC Berkeley

Backed by UC Irvine

Backed by UC Riverside

Backed by Uber

Backed by University of Michigan

Backed by Vanderbilt University

Backed by WANdisco

Backed by Xing Cloud

Backed by Yahoo!

Backed by Zymergen

Category Analytics

Category Application Server

Category Big Data

Category Data

Category Data Store

Category ETL

Category Framework

Category Infrastructure

Category Java Library

Category LDAP

Category Messaging

Category NLP

Category ORM

Category OSGi

Category Publication

Category RPC

Category Resource Management

Category Security

Category Stream

Category Tools

Category UI

Committer:

Committer: Alibaba

Committer: Alluxio

Committer: Altiscale

Committer: Apple

Committer: Bizo

Committer: BlueTalon

Committer: Brazil Outsource

Committer: Carnegie Mellon University

Committer: ClearStory Data

Committer: CloudOps

Committer: Cloudera

Committer: Clover Health

Committer: Computer Associates

Committer: Confluent

Committer: Couchbase

Committer: CyberAgent

Committer: Data Fueled

Committer: DataStax

Committer: DataTorrent

Committer: Databricks

Committer: Dell Software

Committer: EMC

Committer: Ecetera

Committer: Facebook

Committer: Georgia Institute of Technology

Committer: Getopt

Committer: Google

Committer: Groupon

Committer: HP

Committer: HPI

Committer: Hebrew University

Committer: Hortonworks

Committer: Huawei

Committer: IBM

Committer: INRIA

Committer: InMobi

Committer: Indiana University

Committer: Intel

Committer: JPL

Committer: KACST Saudi Arabia

Committer: Lanka Software Foundation

Committer: LinkedIn

Committer: MIT

Committer: MapR Technologies

Committer: Mesosphere

Committer: Microsoft

Committer: Midokura

Committer: NASA

Committer: NASA JPL

Committer: NTNU Trondheim

Committer: NTT DATA

Committer: NTT Data

Committer: Netflix

Committer: Nextiva

Committer: Nicira

Committer: Northeastern University

Committer: ObjectStyle

Committer: Persistent Systems Limited

Committer: Pivotal

Committer: Purdue University

Committer: Quantifind

Committer: QuestTec

Committer: QuestTec B.V.

Committer: Rackspace

Committer: Red Hat

Committer: RentTheRunway

Committer: Rutgers University

Committer: ScalingData

Committer: Sonic Software

Committer: Sosnoski Software

Committer: StreamSets

Committer: Talend

Committer: Teradata

Committer: The Ohio State University

Committer: Twitter

Committer: UC Berkeley

Committer: UC Irvine

Committer: UC Riverside

Committer: UC Santa Barbara

Committer: Uber

Committer: University of Michigan

Committer: Vanderbilt University

Committer: WANdisco

Committer: WSO2

Committer: Webtrends

Committer: X15 Software

Committer: Yahoo!

Committer: YouTube

Committer: Zymergen

Committer: eBay

Name Apache Accumulo
Website http://accumulo.apache.org/
Twitter @ApacheAccumulo
Categories Big Data, Tools, Data Store
Description Apache Accumulo™ is a sorted, distributed key/value store that provides robust, scalable data storage and retrieval.

Apache Accumulo is based on the design of Google’s BigTable and is powered by Apache Hadoop, Apache Zookeeper, and Apache Thrift.

Last update on JGP.net
VP
Production Version 1.7.2
Last release date
Development Version
Last development drop date
Backers (PMC)
Committers

Name Apache ACE
Website http://ace.apache.org
Twitter
Categories Framework, OSGi, Tools
Description Apache ACE™ is a software distribution framework that allows you to centrally manage and distribute software components, configuration data, and other artifacts to target systems. It is built using OSGi and can be deployed in different topologies. The target systems are usually also OSGi based, but don’t have to be.

Last update on JGP.net
VP
Production Version 2.1.0
Last release date Tuesday, February 9, 2016
Development Version
Last development drop date
Backers (PMC)
Committers

Name Apache Ambari
Website http://ambari.apache.org/
Twitter
Categories Infrastructure
Description Apache Ambari™ aimes at making Hadoop management simpler.

Ambari enables System Administrators to provision, Manage, and monitor a Hadoop Cluster. Ambari provides an intuitive, easy-to-use Hadoop management web UI backed by its RESTful APIs.

Last update on JGP.net
VP Yusaku Sako
Production Version 2.2.2
Last release date Monday, May 9, 2016
Development Version
Last development drop date
Backers (PMC) Hortonworks, Red Hat, IBM, WANdisco
Committers Hortonworks, Pivotal, Red Hat, BlueTalon, IBM, WANdisco, eBay, Rutgers University, Microsoft, HP, Teradata, Rackspace, EMC, The Ohio State University, Carnegie Mellon University

Name Apache Apex
Website http://apex.apache.org/
Twitter @ApacheApex
Categories Big Data, Tools, Stream
Description Apache Spark™ is an enterprise-grade unified stream and batch processing engine. Apex is a Hadoop add-on that unifies stream and batch processing. It processes big data in-motion in a way that is highly scalable, highly performant, fault tolerant, stateful, secure, distributed, and easily operable.
Last update on JGP.net
VP Thomas Weise
Production Version 3.4.0
Last release date Wednesday, May 25, 2016
Development Version
Last development drop date
Backers (PMC) DataTorrent
Committers DataTorrent, Google

Name Apache Arrow
Website http://arrow.apache.org/
Twitter
Categories Big Data, Analytics
Description Apache Arrow™ powers columnar and in-memory analytics.

Arrow enables execution engines to take advantage of the latest SIMD (Single Input Multiple Data) operations included in modern processors, for native vectorized optimization of analytical data processing. Columnar layout of data also allows for a better use of CPU caches by placing all data relevant to a column operation in as compact of a format as possible.

Last update on JGP.net
VP Jacques Nadeau
Production Version
Last release date
Development Version
Last development drop date
Backers (PMC)
Committers

Name Apache AsterixDB
Website http://asterixdb.apache.org/
Twitter @ASTERIXUCI
Categories Big Data, Data Store
Description Apache AsterixDB™ is a scalable Big Data Management System (BDMS). AsterixDB offers support for flexible data model, distributed storage and transaction, fast data ingestion, scalable and data-parallel query execution runtime, declarative query language.

AsterixDB supports various storage and indexing options, including managed datasets, external datasets (e.g., data on HDFS – Hadoop File System), and secondary indexes.

Last update on JGP.net
VP Till Westmann
Production Version
Last release date
Development Version 0.8.8-incubating
Last development drop date
Backers (PMC) UC Irvine, UC Riverside, Oracle
Committers Couchbase, UC Irvine, NTNU Trondheim, UC Riverside, Hebrew University, HPI, Couchbase, Microsoft, KACST Saudi Arabia, X15 Software

Name Apache Aurora
Website http://aurora.apache.org/
Twitter @ApacheAurora
Categories Infrastructure, Framework
Description Apache Aurora™ is a Mesos framework for long-running services and cron jobs.

Aurora runs applications and services across a shared pool of machines, and is responsible for keeping them running, forever. When machines experience failure, Aurora intelligently reschedules those jobs onto healthy machines.

Last update on JGP.net
VP Bill Farner
Production Version
Last release date
Development Version 0.14.0
Last development drop date Tuesday, June 14, 2016
Backers (PMC)
Committers

Name Apache Avro
Website http://avro.apache.org/
Twitter
Categories Data, Data Store, RPC
Description Apache Avro™ is a data serialization system, which provides rich data structures, a compact, fast, binary data format, a container file, to store persistent data, Remote Procedure Call (RPC), and simple integration with dynamic languages.

Code generation is not required to read or write data files nor to use or implement RPC protocols. Code generation can be done for optimization purpose, but only worth implementing for statically typed languages.

Avro relies on schemas. When Avro data is read, the schema used when writing it is always present. This permits each datum to be written with no per-value overheads, making serialization both fast and small. This also facilitates use with dynamic, scripting languages, since data, together with its schema, is fully self-describing.

When Avro data is stored in a file, its schema is stored with it, so that files may be processed later by any program. If the program reading the data expects a different schema this can be easily resolved, since both schemas are present.

When Avro is used in RPC, the client and server exchange schemas in the connection handshake. This can be optimized so that, for most calls, no schemas are actually transmitted. Since both client and server both have the other’s full schema, correspondence between same named fields, missing fields, extra fields, etc. can all be easily resolved.

Last update on JGP.net
VP
Production Version 1.8.1
Last release date Thursday, May 19, 2016
Development Version
Last development drop date
Backers (PMC)
Committers

Name Apache Axis2
Website http://axis.apache.org/axis2/java/core/
Twitter
Categories Infrastructure
Description Apache Axis2™ is a Web Services, SOAP, and WSDL engine, the successor to the widely used Apache Axis SOAP stack. There are two implementations of the Apache Axis2 Web services engine: Apache Axis2/Java and Apache Axis2/C.
Last update on JGP.net
VP Deepal Jayasinghe
Production Version 1.7.3
Last release date Monday, May 30, 2016
Development Version
Last development drop date
Backers (PMC) Georgia Institute of Technology
Committers WSO2, IBM, Sonic Software, Indiana University, Lanka Software Foundation, Georgia Institute of Technology, Brazil Outsource, HP, Computer Associates, Sosnoski Software, Purdue University, UC Santa Barbara

Name Apache Bahir
Website http://bahir.apache.org/
Twitter
Categories Analytics, Big Data
Description Apache Bahir™ provides extensions to distributed analytic platforms such as Apache Spark.
Last update on JGP.net
VP Luciano Resende
Production Version
Last release date
Development Version
Last development drop date
Backers (PMC) NASA, Talend, IBM, Cloudera, Yahoo!, Hortonworks
Committers Databricks, UC Berkeley, NASA, Netflix, Alluxio, QuestTec, Talend, Cloudera, NTT Data, IBM, ClearStory Data, Yahoo!, Clover Health, Hortonworks

Name Apache Bigtop
Website http://bigtop.apache.org/
Twitter
Categories Big Data, Infrastructure
Description Apache Bigtop™ is a project targeted for Infrastructure Engineers and Data Scientists looking for comprehensive packaging, testing, and configuration of the leading open source big data components. Bigtop supports a wide range of components/projects, including, but not limited to, Hadoop, HBase and Spark.
Last update on JGP.net
VP Olaf Flebbe
Production Version 1.1.0
Last release date
Development Version 1.2.0-SNAPSHOT
Last development drop date
Backers (PMC)
Committers

Name Apache Calcite
Website http://calcite.apache.org/
Twitter
Categories Data Store
Description Apache Calcite™ is a dynamic data management framework.

It contains many of the pieces that comprise a typical database management system, but omits some key functions: storage of data, algorithms to process data, and a repository for storing metadata.

Calcite intentionally stays out of the business of storing and processing data. This makes an excellent candidate for mediating between applications and one (or more) data storage locations and data processing engines.

Last update on JGP.net
VP Julian Hyde
Production Version 1.8.0
Last release date Monday, June 13, 2016
Development Version
Last development drop date
Backers (PMC)
Committers

Name Apache Cassandra
Website http://cassandra.apache.org/
Twitter
Categories Data Store
Description Apache Cassandra™ is a NoSQL database focusing on scalability and high availability, without compromising performance. Linear scalability and proven fault-tolerance on commodity hardware or cloud infrastructure make it the perfect platform for mission-critical data. Cassandra’s support for replicating across multiple datacenters is best-in-class, providing lower latency for your users and the peace of mind of knowing that you can survive regional outages.

Cassandra’s data model offers the convenience of column indexes with the performance of log-structured updates, strong support for denormalization and materialized views, and powerful built-in caching.

Last update on JGP.net
VP Jonathan Ellis
Production Version 3.0.7, 2.2.6
Last release date
Development Version
Last development drop date
Backers (PMC) DataStax
Committers DataStax

Name Apache Cayenne
Website http://cayenne.apache.org/
Twitter
Categories Java Library, Tools
Description Apache Cayenne™ provides Object-Relational Mapping (ORM) and remoting services. With a wealth of unique and powerful features, Cayenne can address a wide range of persistence needs. Cayenne seamlessly binds one or more database schemas directly to Java objects, managing atomic commit and rollbacks, SQL generation, joins, sequences, and more. With Cayenne’s Remote Object Persistence, those Java objects can even be persisted out to clients via Web Services.

Cayenne is designed to be easy to use, without sacrificing flexibility or design. To that end, Cayenne supports database reverse engineering and generation, as well as a Velocity-based class generation engine.

All of these functions can be controlled directly through a GUI tool called CayenneModeler. No cryptic XML or annotation based configuration is required! An entire database schema can be mapped directly to Java objects within minutes, all from the comfort of the GUI-based CayenneModeler.

Cayenne supports numerous other features, including caching, a complete object query syntax, relationship pre-fetching, on-demand object and relationship faulting, object inheritance, database auto-detection, and generic persisted objects. Most importantly, Cayenne can scale up or down to virtually any project size. With a mature, 100% open source framework, an energetic user community, and a track record of solid performance in high-volume environments, Cayenne is an exceptional choice for persistence services.

Last update on JGP.net
VP Andrus Adamchik
Production Version 3.1.1
Last release date Monday, May 16, 2016
Development Version 4.0m3
Last development drop date Friday, February 12, 2016
Backers (PMC) ObjectStyle
Committers ObjectStyle

Name Apache Chukwa
Website http://chukwa.apache.org/
Twitter
Categories Infrastructure, Tools
Description Apache Chukwa™ is a data collection system for monitoring large distributed systems. Chukwa is built on top of the Hadoop Distributed File System (HDFS) and Map/Reduce framework and inherits Hadoop’s scalability and robustness. Chukwa also includes a flexible and powerful toolkit for displaying, monitoring and analyzing results to make the best use of the collected data.
Last update on JGP.net
VP Eric Yang
Production Version
Last release date
Development Version 0.7.0
Last development drop date Monday, December 21, 2015
Backers (PMC) UC Berkeley, Twitter, IBM, Riot Games, GyroLabs, Intel, Xing Cloud, Pragsis, CMU
Committers

Name Apache CloudStack
Website http://cloudstack.apache.org/
Twitter
Categories Infrastructure
Description Apache CloudStack™ is designed to deploy and manage large networks of virtual machines, as a highly available, highly scalable Infrastructure as a Service (IaaS) cloud computing platform. CloudStack is used by service providers to offer public cloud services, and by many companies to provide an on-premises (private) cloud offering, or as part of a hybrid cloud solution.

CloudStack is a turnkey solution that includes the entire “stack” of features most organizations want with an IaaS cloud: compute orchestration, Network-as-a-Service, user and account management, a full and open native API, resource accounting, and a first-class User Interface (UI).

CloudStack currently supports the most popular hypervisors: VMware, KVM, Citrix XenServer, Xen Cloud Platform (XCP), Oracle VM server, and Microsoft Hyper-V.

Users can manage their cloud with an easy to use Web interface, command line tools, and/or a full-featured RESTful API. In addition, CloudStack provides an API that’s compatible with AWS EC2 and S3 for organizations that wish to deploy hybrid clouds.

Last update on JGP.net
VP Will Stevens
Production Version 4.8.0.1
Last release date
Development Version
Last development drop date
Backers (PMC) CloudOps
Committers CloudOps

Name Apache Commons
Website http://commons.apache.org/
Twitter
Categories Java Library
Description Apache Commons™ is a project focused on all aspects of reusable Java components.

Last update on JGP.net
VP
Production Version Varies by sub-project
Last release date
Development Version
Last development drop date
Backers (PMC)
Committers

Name Apache CouchDB
Website http://couchdb.apache.org/
Twitter
Categories Data Store
Description Apache CouchDB™ is a database that uses JSON for documents, JavaScript for MapReduce indexes, and regular HTTP for its API.
Last update on JGP.net
VP
Production Version 1.6.1
Last release date Wednesday, September 3, 2014
Development Version 2.0.0
Last development drop date
Backers (PMC)
Committers

Name Apache Crunch
Website http://crunch.apache.org/
Twitter
Categories Big Data, Tools
Description The Apache Crunch™ Java library provides a framework for writing, testing, and running MapReduce pipelines. Its goal is to make pipelines that are composed of many user-defined functions simple to write, easy to test, and efficient to run.

Running on top of Hadoop MapReduce and Apache Spark, the Apache Crunch library is a simple Java API for tasks like joining and data aggregation that are tedious to implement on plain MapReduce. The APIs are especially useful when processing data that does not fit naturally into relational model, such as time series, serialized object formats like protocol buffers or Avro records, and HBase rows and columns.

Last update on JGP.net
VP
Production Version
Last release date
Development Version
Last development drop date
Backers (PMC)
Committers

Name Apache cTAKES
Website http://ctakes.apache.org/
Twitter
Categories Java Library, NLP
Description Apache cTAKES™ is a natural language processing system for extraction of information from electronic medical record clinical free-text. cTAKES can discover codable entities, temporal events, properties and relations. It relies on customized pipelines, which can be used with any Apache UIMA compatible component.
Last update on JGP.net
VP
Production Version
Last release date
Development Version
Last development drop date
Backers (PMC)
Committers

Name Apache Curator
Website http://curator.apache.org/
Twitter @ApacheCurator
Categories Big Data, Tools
Description Apache Curator™ is an Apache ZooKeeper keeper: it provides implementations of some of the common ZooKeeper “recipes”. The implementations are built on top of the Curator Framework, which is a high-level API that greatly simplifies using ZooKeeper. It adds many features that build on ZooKeeper and handles the complexity of managing connections to the ZooKeeper cluster and retrying operations.
Last update on JGP.net
VP
Production Version
Last release date
Development Version
Last development drop date
Backers (PMC)
Committers

Name Apache CXF
Website http://cxf.apache.org/
Twitter
Categories Java Library, Tools
Description Apache CXF™ is a services-oriented framework. CXF helps you build and develop services using frontend programming APIs, like JAX-WS and JAX-RS. These services can speak a variety of protocols such as SOAP, XML/HTTP, RESTful HTTP, or CORBA and work over a variety of transports such as HTTP, JMS or JBI.
Last update on JGP.net
VP
Production Version 3.1.6
Last release date Tuesday, March 29, 2016
Development Version
Last development drop date
Backers (PMC)
Committers

Name Apache DB
Website http://db.apache.org/
Twitter
Categories Data Store, Tools, ORM
Description Apache DB™ is charged with the creation and maintenance of commercial-quality database solutions based on software licensed to the Foundation, for distribution at no charge to the public. It includes several subprojects.

DdlUtils is a small, easy-to-use component for working with Database Definition (DDL) files. These are XML files that contain the definition of a database schema, e.g. tables and columns.

Apache Derby™ is an relational database implemented entirely in Java and available. Some key advantages include: small footprint – about 2.6 megabytes for the base engine and embedded JDBC driver, is based on the Java, JDBC, and SQL standards, provides an embedded JDBC driver that lets you embed Derby in any Java-based solution. Derby is derived from Informix Cloudscape.

Apache JDO™ is a standard way to access persistent data in databases, using plain old Java objects (POJO) to represent persistent data.

Apache Torque is an object-relational mapper (ORM) for Java. It lets you access and manipulate data in a relational database using Java objects, without using reflection to access user-provided classes, but by generating the necessary classes (including the Data Objects) from an XML schema describing the database layout.

Last update on JGP.net
VP
Production Version Varies by sub-project
Last release date
Development Version
Last development drop date
Backers (PMC)
Committers

Name Apache DeltaSpike
Website http://deltaspike.apache.org
Twitter
Categories Security, Tools
Description Apache DeltaSpike™ is a collection of portable CDI (Context and Dependency injection) extensions. These ready-to-use modules enable you to integrate tested API extensions into your Java projects.

DeltaSpike consists of a core module and a number of optional modules for providing additional enterprise functionality to your applications. The modules include features for enhanced security with type-safe control over method invocations, integration with schedulers, injection of CDI objects into validators, a transactional context and scope, and much more.

Last update on JGP.net
VP
Production Version 1.7.1
Last release date Wednesday, July 20, 2016
Development Version
Last development drop date
Backers (PMC)
Committers

Name Apache DeviceMap
Website http://devicemap.apache.org/
Twitter
Categories Data
Description Apache DeviceMap™ is a data repository containing device information, images and other relevant information for all sorts of mobile devices, e.g. smartphones and tablets. While the focus is on the data, an set of APIs exist to use and manage it.
Last update on JGP.net
VP
Production Version 1.0.3
Last release date
Development Version
Last development drop date
Backers (PMC)
Committers

Name Apache Directory
Website http://directory.apache.org/
Twitter
Categories Security, Tools, LDAP
Description The Apache Directory™ strives to increase LDAP awareness, comfort, and adoption to bring forth what we call the Modern LDAP Renaissance. Directory provides directory solutions entirely written in Java, including a directory server, which has been certified as LDAP v3 compliant by the Open Group (ApacheDS), and Eclipse-based directory tools (Apache Directory Studio).
Last update on JGP.net
VP
Production Version Varies by sub-project
Last release date
Development Version
Last development drop date
Backers (PMC)
Committers

Name Apache Drill
Website http://drill.apache.org/
Twitter
Categories Data, Big Data, Analytics, Tools
Description Apache Drill is a schema-free SQL Query Engine for Hadoop, NoSQL, and Cloud Storage

Drill allows to get faster insights without the overhead (data loading, schema creation and maintenance, transformations, etc.)
Flexibility, analyze the multi-structured and nested data in non-relational datastores directly without transforming or restricting the data; and leverage your existing SQL skillsets and BI tools including Tableau, Qlikview, MicroStrategy, Spotfire, Excel, and more.

Last update on JGP.net
VP
Production Version 1.7.0
Last release date Tuesday, June 28, 2016
Development Version
Last development drop date
Backers (PMC)
Committers

Name Apache Empire-db
Website http://empire-db.apache.org/
Twitter
Categories Java Library
Description Apache Empire-db™ is a relational database abstraction layer and data persistence component that allows developers to take a much more SQL-centric approach in application development than traditional Object-Relational Mapping frameworks (ORM), like JPA or Hibernate. By providing a unique type-safe object orientated command API Empire-db allows building highly efficient SQL-statements that take full advantage of all database features while eliminating the need for error-prone string operations and literals. This, together with DBMS independent record and metadata managment leads to an unprecedented level of ease-of-use and compile-time-safety.
Last update on JGP.net Tuesday, August 9, 2016
VP
Production Version 2.4.4
Last release date Tuesday, August 18, 2015
Development Version
Last development drop date
Backers (PMC)
Committers

Name Apache Falcon
Website http://falcon.apache.org/
Twitter
Categories Infrastructure, Big Data
Description Apache Falcon™ is a feed processing and management system aimed at making it easier for end consumers to onboard their feed processing and management on Hadoop clusters.
Last update on JGP.net Tuesday, August 9, 2016
VP
Production Version
Last release date
Development Version 0.10
Last development drop date Monday, August 8, 2016
Backers (PMC) InMobi, Hortonworks, Talend
Committers InMobi, Hortonworks, Talend

Name Apache Felix
Website http://felix.apache.org/
Twitter
Categories Framework, OSGi, Tools
Description Apache Felix™ is an implementation the OSGi framework and service platform, as well as other interesting OSGi-related technologies. The OSGi specifications originally targeted embedded devices and home services gateways, but they are ideally suited for any project interested in the principles of modularity, component-orientation, and/or service-orientation. OSGi technology combines aspects of these aforementioned principles to define a dynamic service deployment framework that is amenable to remote management.

It is composed of AutoConf Resource Processor, Bundle Repository, Configuration Admin, Connect, Coordinator, Dependency Manager 4, Deployment Admin, Event Admin, File Install, Framework Security, Framework, Gogo Command, Gogo Runtime, Gogo Shell, HTTP Service API, HTTP Service Base, HTTP Service Bridge, HTTP Service Bundle, HTTP Service Cometd, HTTP Service Jetty, HTTP Service Proxy, HTTP Service SSL filter, HTTP Service Whiteboard, HTTP Servlet 2.6 + 3.0 API, Inventory Printer, iPOJO (Core), iPOJO Annotations, iPOJO API, iPOJO Architecture Command for Gogo, iPOJO Architecture Command for the Felix Shell, iPOJO BND Plugin, iPOJO Composite, iPOJO Event Admin handler, iPOJO Extender pattern handler, iPOJO JMX handler, iPOJO Manipulator Project (containing all iPOJO manipulator modules), iPOJO Manipulator, iPOJO Runtime Project (containing all iPOJO runtime modules), iPOJO Temporal Dependency handler, iPOJO WebConsole Plugin, iPOJO Whiteboard pattern handler, Jaas Support Boot Classpath Jar, Jaas Support Bundle, junit4osgi – shell command, junit4osgi – swing gui, junit4osgi, Lightweight HTTP Service Complete, Lightweight HTTP Service Core, Log, Main, Metatype, OSGi OBR service API, Preferences, Remote Shell, Resolver, SCR (Declarative Services), SCR Annotations, SCR Bnd Plugin, SCR Compat (Declarative Services), SCR DS Annotations, SCR Ext Anno, SCR Generator, Shell Text UI, Shell, UPnP Base Driver (JDK 1.3), UPnP Base Driver (JDK 1.4+), UPnP Extra, UPnP Tester, User Admin, User Admin File-based store, User Admin MongoDB-based store, Utils, Web Console, Web Console (all-in-one bundle), Web Console DS Plugin, Web Console Event Admin Plugin, Web Console Memory Usage Plugin, Web Console OBR Plugin, Web Console PackageAdmin Plugin, Web Console Script Console Plugin, Web Console ServiceDiagnostics Plugin, Web Console Subsystems Plugin, Web Console UPNP Plugin, and Web Console User Admin Plugin.

Each module has it’s own version and might be more recent than the core module.

Last update on JGP.net Tuesday, August 9, 2016
VP
Production Version 5.4.0 (Core Frmaework)
Last release date Friday, October 16, 2015
Development Version
Last development drop date
Backers (PMC)
Committers

Name Apache Flex
Website http://flex.apache.org/
Twitter @ApacheFlex
Categories UI
Description Apache Flex™ is a highly productive, open source application framework for building and maintaining expressive web applications that deploy consistently on all major browsers, desktops and devices (including smartphones, tablets and tv). It provides a modern, standards-based language and programming model that supports common design patterns suitable for developers from many backgrounds. Flex applications can be deployed to the ubiquitous Adobe® Flash® Player in the browser, Adobe® AIR™ on desktop and mobile or to native Android™, IOS™, QNX®, Windows® or Mac® applications.
Last update on JGP.net Tuesday, August 9, 2016
VP Alex Harui
Production Version Apache Flex SDK : 4.15 (January 2016)
Apache FlexJS SDK : 0.6.0 (April 2016)
Blaze DS : 4.7.0 (Mar 2015)
Flex Mavenizer : 4.8.0 (Jan 2013)
FlexUnit : 4.2.0 (Apr 2014)
SDK Installer : 3.2.0 (June 2015)
Squiggly : 1.1 (Oct 2014)
Tour De Flex : 1.2 (Nov 2014)
Last release date
Development Version
Last development drop date
Backers (PMC)
Committers

Name Apache Flink
Website http://flink.apache.org/
Twitter @ApacheFlink
Categories Infrastructure, Big Data
Description Apache Flink™ is a platform for distributed stream and batch data processing.

Flink’s core is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations over data streams.

Last update on JGP.net Tuesday, August 9, 2016
VP
Production Version 1.1.0
Last release date Monday, August 8, 2016
Development Version
Last development drop date
Backers (PMC)
Committers

Name Apache Flume
Website http://flume.apache.org/
Twitter
Categories Big Data, ETL, Infrastructure
Description Apache Flume™ is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms and many failover and recovery mechanisms. It uses a simple extensible data model that allows for online analytic application.

Last update on JGP.net Tuesday, August 9, 2016
VP
Production Version 1.6.0
Last release date Wednesday, May 20, 2015
Development Version
Last development drop date
Backers (PMC) Apple, Apple, Cloudera, Cloudera, Cloudera, Cloudera, Cloudera, Cloudera, Cloudera, Cloudera, Cloudera, CyberAgent, Data Fueled, Hortonworks, Nextiva, ScalingData, StreamSets, Vanderbilt University, Zymergen
Committers Apple, Cloudera, Cloudera, Databricks, , Apple, Apple, Cloudera, Cloudera, Cloudera, Cloudera, Cloudera, Cloudera, Cloudera, Cloudera, Cloudera, CyberAgent, Data Fueled, Hortonworks, Nextiva, ScalingData, StreamSets, Vanderbilt University, Zymergen

Name Apache Forrest
Website http://forrest.apache.org/
Twitter
Categories Infrastructure, Publication
Description Apache Forrest™ software is a publishing framework that transforms input from various sources into a unified presentation in one or more output formats. The modular and extensible plug-in architecture of Apache Forrest is based on Apache Cocoon and the relevant industry standards that separate presentation from content. Forrest can generate static documents, or be used as a dynamic server, or be deployed by its automated facility.
Last update on JGP.net Tuesday, August 9, 2016
VP
Production Version
Last release date
Development Version 0.9
Last development drop date Saturday, July 2, 2011
Backers (PMC)
Committers

Name Apache Geronimo
Website http://geronimo.apache.org/
Twitter
Categories Application Server
Description Apache Geronimo™ is a server runtime that integrates the best open source projects to create Java/OSGi server runtimes that meet the needs of enterprise developers and system administrators. Our most popular distribution is a fully certified Java EE 6 application server runtime.
Last update on JGP.net Monday, September 12, 2016
VP
Production Version 3.0.1
Last release date Tuesday, December 5, 2028
Development Version
Last development drop date
Backers (PMC)
Committers

Name Apache Giraph
Website http://giraph.apache.org/
Twitter
Categories Analytics, Big Data
Description Apache Giraph™ is an iterative graph processing system built for high scalability. For example, it is currently used at Facebook to analyze the social graph formed by users and their connections.
Last update on JGP.net Monday, September 12, 2016
VP
Production Version 1.1.0
Last release date Wednesday, November 19, 2014
Development Version 1.2.0-SNAPSHOT
Last development drop date Wednesday, August 31, 2016
Backers (PMC)
Committers

Name Apache Gora
Website http://gora.apache.org/
Twitter @ApacheGora
Categories Data, Data Store, Big Data
Description Apache Gora™ provides an in-memory data model and persistence for big data. Gora supports persisting to column stores, key value stores, document stores and RDBMSs, and analyzing the data with extensive Apache Hadoop™ MapReduce support.
Last update on JGP.net Monday, September 12, 2016
VP
Production Version
Last release date
Development Version 0.6.1
Last development drop date Monday, September 14, 2015
Backers (PMC)
Committers

Name Apache Hadoop
Website http://hadoop.apache.org/
Twitter
Categories Data Store, Big Data
Description Apache Hadoop® offers open-source software for reliable, scalable, distributed computing.

The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures.

Last update on JGP.net
VP Chris (Christopher) Douglas
Production Version 2.6.4
Last release date Thursday, February 11, 2016
Development Version
Last development drop date
Backers (PMC) NTT DATA, Hortonworks, InMobi, Cloudera, Yahoo!, Microsoft, Intel, Facebook, Jive, WANdisco, Twitter, Huawei, Uber
Committers NTT DATA, Getopt, Hortonworks, Cloudera, InMobi, Yahoo!, Altiscale, eBay, LinkedIn, Huawei, WANdisco, Microsoft, Intel, Intel, IBM, Twitter, UC Berkeley, Pivotal, INRIA, Uber

Name Apache Kafka
Website http://kafka.apache.org/
Twitter @apachekafka
Categories Infrastructure, Tools, Messaging
Description Apache Kafka™ is publish/subscribe messaging designed as a distributed commit log.

A single Kafka broker can handle hundreds of megabytes of reads and writes per second from thousands of clients. It is designed to allow a single cluster to serve as the central data backbone for a large organization. It can be elastically expanded without downtime. Data streams are partitioned and spread over a cluster of machines to allow data streams larger than the capability of any single machine and to allow clusters of coordinated consumers. Messages are persisted on disk and replicated within the cluster to prevent data loss. Each broker can handle terabytes of messages without performance impact.

Last update on JGP.net
VP Jun Rao
Production Version
Last release date
Development Version 0.10.0
Last development drop date
Backers (PMC)
Committers

Name Apache Kudu
Website http://kudu.apache.org/
Twitter @ApacheKudu
Categories Big Data, Analytics
Description Apache Kudu™ completes Hadoop’s storage layer to enable fast analytics on fast data.

Streamlined Architecture – Kudu provides a combination of fast inserts/updates and efficient columnar scans to enable multiple real-time analytic workloads across a single storage layer. As a new complement to HDFS and Apache HBase, Kudu gives architects the flexibility to address a wider variety of use cases without exotic workarounds.

Faster Analytics – Kudu is specifically designed for use cases that require fast analytics on fast (rapidly changing) data. Engineered to take advantage of next-generation hardware and in-memory processing, Kudu lowers query latency significantly for Apache Impala (incubating) and Apache Spark (initially, with other execution engines to come).

Last update on JGP.net Wednesday, February 1, 2017
VP Todd Lipcon
Production Version 1.2.0
Last release date Wednesday, January 18, 2017
Development Version
Last development drop date
Backers (PMC) Cloudera
Committers Cloudera

Name Apache Mesos
Website http://mesos.apache.org/
Twitter
Categories Infrastructure, Resource Management
Description Apache Mesos™ allows to program against your datacenter like it’s a single pool of resources. Mesos abstracts CPU, memory, storage, and other compute resources away from machines (physical or virtual), enabling fault-tolerant and elastic distributed systems to easily be built and run effectively.

Mesos is built using the same principles as the Linux kernel, only at a different level of abstraction. The Mesos kernel runs on every machine and provides applications (e.g., Hadoop, Spark, Kafka, Elastic Search) with API’s for resource management and scheduling across entire datacenter and cloud environments.

Last update on JGP.net
VP Benjamin Hindman
Production Version
Last release date
Development Version 0.28.2
Last development drop date
Backers (PMC)
Committers Facebook, Mesosphere, Northeastern University, Twitter, UC Berkeley, YouTube, Databricks, Carnegie Mellon University, NASA JPL, NASA, Groupon, Red Hat,

Name Apache Metron
Website http://metron.apache.org/
Twitter @ApacheMetron
Categories Infrastructure, Tools, Big Data, Security
Description Apache Metron™ provides a scalable advanced security analytics framework built with the Hadoop Community, evolving from the Cisco OpenSOC Project. Metron is a cyber security application framework that provides organizations the ability to detect cyber anomalies and enable organizations to rapidly respond to identified anomalies.
Last update on JGP.net
VP
Production Version
Last release date
Development Version 0.2.0-beta-RC2
Last development drop date
Backers (PMC) Hortonworks
Committers Hortonworks

Name Apache Myriad
Website http://myriad.incubator.apache.org/
Twitter @ApacheMyriad
Categories Big Data, Tools, Resource Management
Description Apache Myriad™ bridges the gap between Apache Mesos and YARN (as of YARN in Apache Hadoop). It is still in incubation but is really a promising piece of technology when it comes to dynamic resource allocation.

It is defined as “Apache Myriad enables the co-existence of Apache Hadoop and Apache Mesos on the same physical infrastructure. By running Hadoop YARN as a Mesos framework, YARN applications and Mesos frameworks can run side-by-side, dynamically sharing cluster resources.”

Last update on JGP.net
VP
Production Version
Last release date
Development Version
Last development drop date
Backers (PMC)
Committers

Name Apache NiFi
Website http://nifi.apache.org/
Twitter
Categories Infrastructure, Big Data
Description Apache NiFi™ supports powerful and scalable directed graphs of data routing, transformation, and system mediation logic.

NiFi was built to automate the flow of data between systems. This kind of problem has been around since enterprises had more than one system, where some of the systems created data and some of the systems consumed data. NiFi aims at simplifying Enterprise Integration Patterns.

NiFi includes a web-based user interface, it is highly configurable, tracks data flows from beginning to end (aka Data Provenance). It is designed for extension.

Security is built in, with support for SSL, SSH, HTTPS, encrypted content, and pluggable role-based authentication/authorization.

Last update on JGP.net
VP Joe Witt
Production Version
Last release date
Development Version 0.6.1
Last development drop date
Backers (PMC) Hortonworks
Committers Hortonworks

Name Apache POI
Website http://poi.apache.org/
Twitter
Categories Java Library
Description The Java API for handling Microsoft Documents. The Apache POI™ Project’s mission is to create and maintain Java APIs for manipulating various file formats based upon the Office Open XML standards (OOXML) and Microsoft’s OLE 2 Compound Document format (OLE2). In short, you can read and write Microsoft Excel files using Java. In addition, you can read and write Microsoft Word and Microsoft PowerPoint files using Java. Apache POI is your Java Excel solution (for Excel 97-2008). We have a complete API for porting other OOXML and OLE2 formats and welcome others to participate.

OLE2 files include most Microsoft Office files such as XLS (Excel), DOC (Word), and PPT (PowerPoint) as well as MFC serialization API based file formats. The project provides APIs for the OLE2 Filesystem (POIFS) and OLE2 Document Properties (HPSF).

Office OpenXML Format is the new standards based XML file format found in Microsoft Office 2007 and later. This includes XLSX, DOCX and PPTX. POI provides a low level API to support the Open Packaging Conventions.

Last update on JGP.net
VP Dominik Stadler
Production Version 3.14
Last release date Saturday, March 5, 2016
Development Version 3.15-beta1
Last development drop date Friday, April 15, 2016
Backers (PMC)
Committers

Name Apache Spark
Website http://spark.apache.org/
Twitter
Categories Data Store, Big Data
Description Apache Spark™ is a fast and general engine for large-scale data processing.
Last update on JGP.net
VP Matei Zaharia
Production Version 2.0.0
Last release date Tuesday, July 26, 2016
Development Version 2.0.1
Last development drop date
Backers (PMC) IBM, Databricks, University of Michigan, Intel, Facebook, Yahoo!, UC Berkeley
Committers Databricks, University of Michigan, Intel, UC Berkeley, Facebook, Yahoo!, Bizo, ClearStory Data, QuestTec B.V., IBM, Quantifind, Alluxio, Hortonworks, Webtrends, Cloudera, Clover Health, NTT Data, Netflix, Alibaba, MIT

Name Apache Sqoop
Website http://sqoop.apache.org/
Twitter
Categories Big Data, ETL
Description Apache Sqoop™ is a tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastores such as relational databases
Last update on JGP.net
VP Jarek Jarcec Cecho
Production Version 1.4.6
Last release date Friday, April 24, 2015
Development Version 1.99.6
Last development drop date
Backers (PMC) Cloudera, Apple, StreamSets, NASA, NASA JPL, Dell Software, Zymergen, Ecetera, Pivotal
Committers Cloudera, Apple, StreamSets, JPL, Dell Software, Zymergen, Ecetera, Pivotal, Persistent Systems Limited, Google, WANdisco, Intel

Name Apache ZooKeeper
Website http://zookeeper.apache.org/
Twitter
Categories Infrastructure
Description Apache ZooKeeper™ is an open-source server, which enables highly reliable distributed coordination.

ZooKeeper is a centralized service for maintaining configuration information, naming, providing distributed synchronization, and providing group services. All of these kinds of services are used in some form or another by distributed applications. Each time they are implemented there is a lot of work that goes into fixing the bugs and race conditions that are inevitable. Because of the difficulty of implementing these kinds of services, applications initially usually skimp on them, which make them brittle in the presence of change and difficult to manage. Even when done correctly, different implementations of these services can lead to management complexity when the applications are deployed.

Last update on JGP.net
VP Flavio Junqueira
Production Version 3.4.8
Last release date Saturday, February 20, 2016
Development Version 3.5.1-alpha
Last development drop date Monday, August 31, 2015
Backers (PMC) MapR Technologies, RentTheRunway, Cloudera, Confluent, Hortonworks, Facebook, Midokura, Nicira
Committers MapR Technologies, RentTheRunway, Cloudera, Confluent, Hortonworks, Facebook, Midokura, Nicira, Intel, Google, Twitter