Internet Archive’s Petabox The Petabox is a large scale data repository. It “is a machine designed to safely store and process one petabyte of information (a petabyte is a million gigabytes).” Here is a paper by Bruce Baumgart and Matt Laue that describes the architecture. Share this: Share on X (Opens in new window) X Share on Facebook (Opens in new window) Facebook Like this:Like Loading… Published by Rajesh Jain An Entrepreneur based in Mumbai, India. View all posts by Rajesh Jain