Recognition-Based Digitalization of Korean Historical Archives

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 543
  • Download : 14
We present a recognition-based digitization method for building digital library of large amount of historical archives. Because the most of archives are manually transcribed in ancient Chinese characters, their digitization present unique academic and pragmatic challenges. By integrating the layout analysis and the recognition into single probabilistic framework, our system achieved 95.1% character recognition rates on test data set, despite the obsolete characters and unique variants used in the archives. Compared with intuitive verification and correction interface, the system freed the operators from repetitive typing tasks and improved the overall throughput significantly.
Publisher
Springer Verlag (Germany)
Issue Date
2004
Citation

Lecture Notes in Computer Science, Vol.3411, pp.281-288

ISSN
0302-9743
URI
http://hdl.handle.net/10203/10829
Appears in Collection
CS-Conference Papers(학술회의논문)

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0