The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing. I became interested in it while looking at true 24/7, follow-the-sun computing. I published a blog article after reviewing Cloudsoft’s Monterey middleware. I thought it was pretty neat, although I haven’t used it in anger.
Michael Noll has published two tutorials on building Hadoop clusters, and one on building a Python Hadoop client. His index is on his wiki home page. I discuss the how-to in more detail in my article, Amazon Web Services. I should probably bring those comments across to this page, as it is meant to be focused on Hadoop, not on AWS.