If you've been slighted by your family and friends this gift-giving season, brood no more - because the very special "Disco 0.3.2: Holiday Release" was created just for you! Among the highlights of this release are chunking, DDFS tag attributes and authentication, and some new goodies in disco.func and discodb. For a more complete list of changes, see the release notes.
One of the most requested features for Disco/DDFS has been automatic splitting of inputs, and we're glad to announce that its finally supported in the form of chunking. If you prefer, you can still push raw blobs to DDFS, but we now provide another layer on top of 'push' called 'chunk', which uses some input streams and/or a reader (default is lines of text) to break your data into records, convert them to Disco's compressed internal format, and store them in size-limited blobs. Read the tutorial for a primer on chunking, or check out 'ddfs chunk --help' from the command line.
For those of you tired of writing and re-writing combiners and reducers to do basic summing, disco.func now includes some handy functions for those common tasks. We've also included a 'gzip_line_reader', for conveniently tearing through possibly corrupted, gzipped text files (unfortunately, these beasts seem to appear quite often in the wild).
As always, there are a number of bugfixes in this release, so upgrading is recommended (just beware the note about deleting OOB data in the release notes, in case you have been relying on that). Also, there are many other new features not covered here, so make sure to read through the docs if you want to get the most out of this release. We at NRC are especially delighted to see how the Disco community continues to grow. Thanks to everyone who has been asking questions on the IRC channel and on the mailing list, your feedback has been invaluable. Please enjoy!