| Publisher | Carnegie Mellon University | ||
|---|---|---|---|
| Format | 568.0KB PDF | Date added | 01 Dec 2005 |
| Topics | High Performance Computing | ||
| Downloads | 2 | ||
Designing highly dependable systems requires a good understanding of failure characteristics. Unfortunately little raw data on failures in large IT installations is publicly available, due to the confidential nature of this data. This paper analyzes soon-to-be-public failure data covering systems at a large high-performance-computing site. The data has been collected over the past 9 years at Los Alamos National Laboratory and includes 23000 failures recorded on more than 20 different systems, mostly large clusters of SMP and NUMA nodes. They study the statistics of the data, including the root cause of failures, the mean time between failures, and the mean time to repair.
Related white papers
HP print solutions and Barclays Wealth
Leading investment management advisor Barclays Wealth wanted to replace its disparate, multi-vendor print environment with a more efficient and environmentally sound solution. One of the business benefits, Annual savings from...
HP print solutions for Logica
IT services and business provider, Logica wanted to replace an ageing fleet of legacy printers and copies from different vendors with a single-vendor solution which would reduce costs and increase...
Creating a Dynamic Infrastructure through Virtualization
In almost every case,the transformation to a dynamic infrastructure will involve virtualization.Many IT professionals think of virtualization specifically in terms of servers.IBM,however,has a broader perspective,in which virtualization is seen as...
Dynamic Infrastructure Helping Build a Smarter PlanetDelivering Superior Business and IT Services with Agility and Speed
In this smarter world, we need our infrastructure to propel us forward, not hold us back. This infrastructure becomes instrumented, interconnected and intelligent to bring together the business and IT...
IBM Virtualization Services
Virtualization is a powerful technology and can have profound effects on the datacenter; however, it should be viewed as a component of an overall IT strategy that will be able...
Go Green with IBM System x Servers and Intel Xeon Processors
By "going green" with energy-efficient IBM® System x™ servers featuring Intel® Xeon® processors, you can win back control of your IT budget—and win the battle with data center power constraints.
Recommended Practices for PC Fleet Management for Mid Market and EnterpriseOrganizations
PC management is both costly and ongoing. Desktop support alone soaks up 30-45 percent1 of IT budgets. But optimizing your PC fleet management strategy will produce efficiencies and lower costs. ...



