Do your organization’s historical books or records need digitizing? Could you benefit from a searchable database of digitized documents?
Scanning, then digitally cleaning and watermarking those scans, begins the digital preservation process. Once that is complete, optical character recognition (OCR) and detailed quality assurance can be performed. These steps create electronic libraries that will protect your documents in perpetuity.
The following post details the steps necessary to create a digital library from your original materials. At Anderson Archival, we tailor this process and every detail to the client and the collection.

Transcription:
Dear Son
Am patiently waiting for the day to come when I am to see you again. Hope you are well and happy.
Mother
Mr. John P. Scott
U.S.S. Rhode Is.
c/o P. M. New York
N.Y.
Create Your Searchable Database
Images
Historical photographs and drawings are valuable to current and future generations. To preserve them digitally, scan these color and black-and-white images to retain the best quality possible.
Have mold or water marred your documents, or has someone written on the pages? After scanning them, digitizing teams can clean most pages to produce a readable document.
Before: An original scan from Peter Rabbit
After: The same page from Peter Rabbit digitally cleaned
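To give a rough sense of what digital cleaning can involve at its simplest, here is a minimal sketch of global thresholding, one basic technique for separating dark ink from a stained or yellowed background. It is an illustration only and not a description of any team's actual workflow, which usually combines several tools with manual review. The function name `clean_page`, the cutoff of 128, and the tiny sample grid are all assumptions made for the example.

```python
def clean_page(pixels, threshold=128):
    """Binarize a grayscale page.

    Each pixel is an integer from 0 (black) to 255 (white). Values below
    the threshold are treated as ink and set to 0; everything else is
    treated as background (including light stains) and set to 255.
    The threshold of 128 is an assumed default, not a recommended value.
    """
    return [[0 if value < threshold else 255 for value in row] for row in pixels]


# Hypothetical 3x4 scan: dark values are ink, mid-range values are a stain.
page = [
    [250, 245, 200, 240],
    [30, 180, 25, 235],
    [245, 190, 240, 20],
]
cleaned = clean_page(page)
```

A single cutoff like this works only when ink and background are clearly separated. Pages with uneven lighting or heavy staining generally need adaptive methods and a human eye.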
Optical Character Recognition
With typeset documents, special software using optical character recognition (OCR) processes the images and generates text output. Once a document has been OCR'd, the text can be exported and made searchable via indexing. OCR is not 100% accurate, so our expert team can proof the OCR output word by word against the original to ensure accuracy.
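Word-by-word proofing is ultimately a human task, but a short sketch can show the idea of comparing OCR output with a trusted text. The example below assumes a hand-checked reference transcription is available, which is an assumption for illustration, and uses Python's standard `difflib` module to list the words that differ. The function name `proof_ocr` and the misread word "clay" are hypothetical.

```python
import difflib


def proof_ocr(ocr_text, reference_text):
    """List word-level differences between OCR output and a reference text.

    Returns tuples of (operation, ocr_words, reference_words), where the
    operation is 'replace', 'delete', or 'insert'.
    """
    ocr_words = ocr_text.split()
    ref_words = reference_text.split()
    matcher = difflib.SequenceMatcher(a=ocr_words, b=ref_words, autojunk=False)
    issues = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            issues.append((tag, " ".join(ocr_words[i1:i2]), " ".join(ref_words[j1:j2])))
    return issues


# Hypothetical OCR output in which "day" was misread as "clay".
ocr = "Am patiently waiting for the clay to come when I am to see you again."
reference = "Am patiently waiting for the day to come when I am to see you again."
issues = proof_ocr(ocr, reference)
```

Each reported difference still needs a person to check it against the original page, since the reference text can contain mistakes of its own.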
A searchable PDF is a common export option from the OCR process and combines the scanned page image with an underlying text layer for user interaction. You can search the entire file by keyword or page through it like a digital book.
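Keyword search over a text layer is commonly backed by some form of index. The sketch below builds a simple inverted index that maps each word to the pages where it appears. It illustrates the general idea only and does not describe how any particular PDF viewer or OCR product works. The function names `build_index` and `search` are hypothetical, and the three sample "pages" are made up by splitting a short note for demonstration.

```python
import re
from collections import defaultdict


def build_index(pages):
    """Map each lowercase word to a sorted list of page numbers containing it."""
    index = defaultdict(set)
    for page_number, text in enumerate(pages, start=1):
        for word in re.findall(r"[a-z0-9']+", text.lower()):
            index[word].add(page_number)
    return {word: sorted(numbers) for word, numbers in index.items()}


def search(index, keyword):
    """Return the page numbers where the keyword appears (case-insensitive)."""
    return index.get(keyword.lower(), [])


# Hypothetical OCR text, one string per page.
pages = [
    "Dear Son",
    "Am patiently waiting for the day to come when I am to see you again.",
    "Hope you are well and happy. Mother",
]
index = build_index(pages)
```

With this index, `search(index, "you")` finds pages 2 and 3, while a word that never occurs returns an empty list. Any OCR error left in the text layer is also left in the index, which is one reason proofing matters for search quality.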
Watermarking
Documents can be watermarked to make it easier to identify pages within a document or specific collection. Many clients also want their digital library watermarked for attribution and to protect the pages from being repurposed by unauthorized users.
Printed Books