The explosive growth in data-storage capabilities and rapid network communication protocols has allowed organizations to collect and store a staggering amount of information on specific topics. These databases may be upwards of petrabyte size (1 x 10 15 bytes, or a billion megabytes) - a truly awe-inspiring amount of data! Such massive information stores are often found in research applications (such as biology, medicine, physics, and astronomy) and government agencies (such as the IRS, Department of Defense, and Department of Labor). They may also occur in business: for example, in insurance calculations for underwriting risk. Government agencies often need to share data, but different data schemas, interfaces, and communication techniques complicate these transfers.
Related white papers
Non-Shared Disk Cluster - A Fault Tolerant, Commodity Approach to Hi-Bandwidth Data Analysis
The STAR experiment, in collaboration with the NERSC Scientific Data Management Group is prototyping / developing an approach to accomplish a high bandwidth data analysis capability using commodity components in...
The Parallel Effective I/O Bandwidth Benchmark: B_eff_io
The parallel effective I/O bandwidth benchmark (b_eff_io) is aimed at producing a characteristic average number of the I/O bandwidth achievable with parallel MPI-I/O applications exhibiting various access patterns and using...
Deterministic Wavelet Thresholding for Maximum-Error Metrics
This paper proposes novel, computationally-efficient schemes for deterministic maximum-error wavelet thresholding in one and multiple dimensions. For one-dimensional wavelets, the paper introduces an optimal, low polynomial-time thresholding algorithm based on...
Enterprise Energy Management Solution
Energy managers and energy management organizations are more than ever focusing on linking together the three key energy management areas: financial management, operations and procurement. By stitching these three areas...
Geoinformatic Hotspot Systems (GHS) for Detection, Prioritization, and Early Warning
The five year NSF DGP project has been instrumental to conceptualize surveillance geoinformatics partnership among several interested cross-disciplinary scientists in academia, agencies, and private sector. A declared need is around...
Healthcare Consultant Uses Data Analysis Tool to Save Clients $1.5 Million in Costs
When Towers Perrin sought to expand its healthcare consultancy from enterprise clients to midmarket companies, it needed a tool to offer high-quality analysis at a lower price point. Towers Perrin...
Predicting the End-Price of Online Auctions
Online auctions have become one of the fastest growing modes of online-commerce transactions. eBay has 94 million active members buying and selling goods at a staggering rate. These auctions are...

