Bucketing, multiplexing and combining in Hadoop
Great two part blog post about Bucketing, multiplexing and combining data in Hadoop by Alex Holmes (@grep_alex). He goes into great detail of how to use MultipleOutputFormat and MultipleOutputs. This post has excellent code examples and diagrams. This was incredibly useful for a project I worked on this week.
Alex Holmes is the author of Hadoop in Practice.
Part 1
Part 2
--Jason














