Apache DataFu™ (incubating)

Apache DataFu

Apache DataFu Pig

Apache DataFu Hourglass

Community

Apache DataFu Pig

Guide

Apache DataFu Pig is a collection of user-defined functions for working with large scale data in Apache Pig. It has a number of useful functions available. This guide provides examples of how to use these functiosn and serves as an overview for working with the library.

There is also Javadoc available for all UDFs in the library. We continue to add UDFs to the library. If you are interested in helping out please follow the Contributing guide.

Pig Compatibility

The current version of DataFu has been tested against Pig 0.11.1 and 0.12.0. DataFu should be compatible with some older versions of Pig, however we do not do any sort of testing with prior versions of Pig and do not guarantee compatibility. Our policy is to test against the most recent version of Pig whenever we release and make sure DataFu works with that version.

Blog Posts

Slides

Videos