Quality assurance (QA) plays a critical role in high-volume document digitization projects by ensuring that the specified quality standard is met within cost and time constraints. This paper takes a systematic view of the issue by summarizing and abstracting existing work in two areas: quality bottlenecks and technical solutions throughout the processing pipeline, including cataloging, capture, image analysis and recognition, and error cascading; and strategies for cost-effective QA, such as combining auto-QA with manual QA, batch QA, special-purpose QA user interfaces, and open source QA.

