- taskvine
- sade
- floability
- pledge
- xgfabric
- highlights
•
•
•
•
•
-
On Parallel Programming with Processes
About once a week, a well-meaning person stops by my office to ask a question like this: I need to run about 1000 simulations that take about an hour each. …
-
Exponential Backoff in Distributed Systems
In response to my previous article, a commenter asked: Why exponential backoff? To put a finer point on the question, How should I choose the parameters for …
-
Chirp on the Blue Gene/P at Supercomputing
In a recent paper at IEEE/ACM Supercomputing, researchers at Argonne National Lab deployed our Chirp filesystem on hundreds of intermediate nodes to support …
-
BXGrid Featured in ISGTW
Our work on the Biometrics Research Grid (BXGrid), was the feature story in this week's issue of International Science Grid This Week.
-
Fail Fast, Fail Often
A common misconception among programmers is that software should always attempt to hide failures in distributed systems. This idea seems sensible at first, b…