CLC number: TP391

Received: 2005-08-05

Revision Accepted: 2005-09-10

Journal of Zhejiang University SCIENCE A 2005 Vol.6 No.11 P.1341~1347


Preserving the literary past, looking to the future: the first Hong Kong Literature Database

Author(s):  MA Leo F.H., WONG Rita, LAU Paul

Affiliation(s):  University Library System, The Chinese University of Hong Kong, Hong Kong, China

Corresponding email(s):   leo-ma@cuhk.edu.hk, rita-wong@cuhk.edu.hk, Paullau@cuhk.edu.hk

Key Words:  Hong Kong Literature, Hong Kong Literature Database, XML, Metadata schema, Database structure, Unicode UTF-8, OCR technology

MA Leo F.H., WONG Rita, LAU Paul. Preserving the literary past, looking to the future: the first Hong Kong Literature Database[J]. Journal of Zhejiang University Science A, 2005, 6(11): 1341~1347.

%0 Journal Article
%T Preserving the literary past, looking to the future: the first Hong Kong Literature Database
%A MA Leo F.H.
%A WONG Rita
%A LAU Paul
%J Journal of Zhejiang University SCIENCE A
%V 6
%N 11
%P 1341~1347
%@ 1673-565X
%D 2005
%I Zhejiang University Press & Springer
%DOI 10.1631/jzus.2005.A1341

In the last two decades of the 20th century, interest in and emphasis on the study of Hong Kong Literature grew among both academics and the general public in Hong Kong. Recognizing the emerging need for resources on Hong Kong Literature, the University Library System of the Chinese University of Hong Kong set up the Hong Kong Literature Database (the “Database”) in 2000 as the first Chinese literature database on the Internet. This paper examines how the Database is constructed using XML technology and a metadata schema. The Database also employs Unicode UTF-8 as its internal encoding. A mapping table for traditional and simplified Chinese characters, created from Unihan, is used behind the scenes so that a user can input either traditional or simplified Chinese characters and retrieve results in both. Currently, OCR technology is used for 65% of the journals so that full-text searching is possible, and the Chinese OCR technology is examined in greater detail. Special features of the Database, such as the page-by-page browse mode, position highlighting for full-page newspapers, and links to tables of contents and book jackets from the Library catalogue, are described. The paper also raises the problem of massive downloading and compares state-of-the-art technologies and their shortcomings. Finally, it shows how the Hong Kong Literature Database facilitates future collaboration and data exchange by using open standards, a shareable structure and the latest technology.
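The traditional/simplified retrieval described in the abstract can be illustrated with a minimal sketch. It is not the Database's actual implementation: the five character pairs, the sample titles, and the names TRAD_TO_SIMP, fold and search are all hypothetical, and a real table would be derived from Unihan. The sketch assumes one simple strategy, folding both the query and the indexed text to a single script before matching, so that input in either script finds records in both.

```python
# Hypothetical excerpt of a traditional -> simplified mapping table.
# A production table would be built from the Unihan database; these
# five pairs are illustrative only.
TRAD_TO_SIMP = {
    "學": "学",
    "國": "国",
    "書": "书",
    "說": "说",
    "灣": "湾",
}

# Translation table usable with str.translate (single-character keys).
_FOLD = str.maketrans(TRAD_TO_SIMP)


def fold(text: str) -> str:
    """Replace each traditional character found in the table with its simplified form."""
    return text.translate(_FOLD)


def search(query: str, documents: list[str]) -> list[str]:
    """Return the documents that contain the query, ignoring script variant."""
    key = fold(query)
    return [doc for doc in documents if key in fold(doc)]


# Hypothetical sample titles mixing both scripts.
DOCUMENTS = ["香港文學", "香港文学", "中國小說", "新詩"]

if __name__ == "__main__":
    print(search("文學", DOCUMENTS))
    print(search("文学", DOCUMENTS))
```

Folding to one script keeps the lookup simple, but several traditional characters can share one simplified form (for example 發 and 髮 both simplify to 发), so a folded match may return more variants than the user typed.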


Journal of Zhejiang University-SCIENCE, 38 Zheda Road, Hangzhou 310027, China
Tel: +86-571-87952783; E-mail: cjzhang@zju.edu.cn
Copyright © 2000 - Journal of Zhejiang University-SCIENCE