Received: 2005-08-05

Revision Accepted: 2005-09-10

Journal of Zhejiang University SCIENCE A 2005 Vol.6 No.11 P.1312~1317


A sustainable development OCR system in CADAL application

Author(s):  HUANG Chen, ZHAO Ji-hai, HU Xiao

Affiliation(s):  Zhejiang University Libraries, Zhejiang University, Hangzhou 310027, China

Corresponding email(s):   chuang@lib.zju.edu.cn, jhzhao@lib.zju.edu.cn, xiaohu@uiuc.edu

Key Words:  Sustainable development, Digital library, Optical character recognition (OCR), China-US Million Books Digital Library (CADAL)

This paper briefly introduces the main ideas of a sustainable development OCR system based on open architecture techniques, and then describes the construction of an optical character recognition (OCR) center built on computer clusters. The center's purpose is to dynamically improve the recognition precision of the digitized texts of the one million volumes produced by the China-US Million Books Digital Library (CADAL) Project. The experience gained at this center should provide a helpful reference for other digital library projects.



