CLC number: TP391
Received: 2005-08-05
Revision Accepted: 2005-09-10
HUANG Chen, ZHAO Ji-hai, HU Xiao. A sustainable development OCR system in CADAL application[J]. Journal of Zhejiang University Science A, 2005, 6(11): 1312-1317.
@article{Huang2005,
title="A sustainable development OCR system in CADAL application",
author="HUANG Chen and ZHAO Ji-hai and HU Xiao",
journal="Journal of Zhejiang University Science A",
volume="6",
number="11",
pages="1312-1317",
year="2005",
publisher="Zhejiang University Press & Springer",
doi="10.1631/jzus.2005.A1312"
}
%0 Journal Article
%T A sustainable development OCR system in CADAL application
%A HUANG Chen
%A ZHAO Ji-hai
%A HU Xiao
%J Journal of Zhejiang University SCIENCE A
%V 6
%N 11
%P 1312-1317
%@ 1673-565X
%D 2005
%I Zhejiang University Press & Springer
%DOI 10.1631/jzus.2005.A1312
TY - JOUR
T1 - A sustainable development OCR system in CADAL application
A1 - HUANG Chen
A1 - ZHAO Ji-hai
A1 - HU Xiao
JO - Journal of Zhejiang University Science A
VL - 6
IS - 11
SP - 1312
EP - 1317
SN - 1673-565X
Y1 - 2005
PB - Zhejiang University Press & Springer
DO - 10.1631/jzus.2005.A1312
ER -
Abstract: This paper briefly introduces the main ideas behind a sustainable-development OCR system based on open-architecture techniques. It then describes the construction of an optical character recognition (OCR) center built on computer clusters, whose purpose is to dynamically improve the recognition precision of the digitized texts of a million volumes of books produced by the China-US Million Books Digital Library (CADAL) Project. The experience gained at this center should provide a helpful reference for other digital library projects.