CLC number: TP391

Received: 2005-08-05

Revision Accepted: 2005-09-10

Journal of Zhejiang University SCIENCE A 2005 Vol.6 No.11 P.1312~1317


A sustainable development OCR system in CADAL application

Author(s):  HUANG Chen, ZHAO Ji-hai, HU Xiao

Affiliation(s):  Zhejiang University Libraries, Zhejiang University, Hangzhou 310027, China

Corresponding email(s):   chuang@lib.zju.edu.cn, jhzhao@lib.zju.edu.cn, xiaohu@uiuc.edu

Key Words:  Sustainable development, Digital library, Optical character recognition (OCR), China-US Million Books Digital Library (CADAL)

HUANG Chen, ZHAO Ji-hai, HU Xiao. A sustainable development OCR system in CADAL application[J]. Journal of Zhejiang University Science A, 2005, 6(11): 1312~1317.

@article{title="A sustainable development OCR system in CADAL application",
author="HUANG Chen, ZHAO Ji-hai, HU Xiao",
journal="Journal of Zhejiang University Science A",
volume="6",
number="11",
pages="1312~1317",
year="2005",
publisher="Zhejiang University Press & Springer",
doi="10.1631/jzus.2005.A1312"
}

%0 Journal Article
%T A sustainable development OCR system in CADAL application
%A HUANG Chen
%A ZHAO Ji-hai
%A HU Xiao
%J Journal of Zhejiang University SCIENCE A
%V 6
%N 11
%P 1312~1317
%@ 1673-565X
%D 2005
%I Zhejiang University Press & Springer
%DOI 10.1631/jzus.2005.A1312

TY - JOUR
T1 - A sustainable development OCR system in CADAL application
A1 - HUANG Chen
A1 - ZHAO Ji-hai
A1 - HU Xiao
JO - Journal of Zhejiang University Science A
VL - 6
IS - 11
SP - 1312
EP - 1317
SN - 1673-565X
Y1 - 2005
PB - Zhejiang University Press & Springer
DO - 10.1631/jzus.2005.A1312
ER -

This paper briefly introduces the main ideas behind a sustainable-development OCR system based on open architecture techniques. It then describes the construction of an optical character recognition (OCR) center built on computer clusters, whose purpose is to dynamically improve the recognition precision of the digitized texts of the one million volumes produced by the China-US Million Books Digital Library (CADAL) Project. The experience gained at this center should serve as a useful reference for other digital library projects.
