With handwritten documents, it is not just scholars who are frustrated by their inaccessibility. Some sophisticated handwriting

admin2009-04-23  33

问题     With handwritten documents, it is not just scholars who are frustrated by their inaccessibility. Some sophisticated handwriting recognition systems are in use. But Dr. Manmatha said the experience developed from those systems was not particularly useful. The current systems have to cope with only a limited range of material—for example, names and addresses—written in a consistent format. On top of that, postal systems have large numbers of human readers as backups, something that wouldn’t be possible for a manuscript search engine.
    They began by working on a variation of an approach used to search digital photographs and the Congress documents, trying to match specific typewritten letters with digital images of their handwritten counterparts. But Dr. Manmatha said the inherent variations in handwriting quickly made that approach too cumbersome.
    The problem was also made more difficult by the fact that Washington dictated to several secretaries and wrote some of his letters personally. The result is that his papers contain at least five handwriting styles. The breakthrough came from looking at research into how people read, Dr. Manmatha said. Rather than analyzing individual letters, he said, people look at words and even parts of sentences as whole units. To develop software that would take a similar holistic approach, Dr. Manmatha turned to an idea developed to let search engine users enter queries in their own language to find Web pages written in another language. Rather than mapping words between the two languages one for one, those systems rely on software that is trained to spot common ground.
    Outside of accuracy, there were two things that needed improvement in the system. It can generally cope with the difference in the writing styles in Washington’s papers because they contain broad similarities. But somewhat like voice recognition software, the program has to be retrained before it can digest documents in a significantly different hand. Dr. Manmatha said that eliminating or minimizing that retraining step would be difficult, but that he believed it would be possible.

选项 A、The amount of such documents is huge and hard to read.
B、Handwriting differs greatly from person to person.
C、Handwritings have never got definite digital counterparts.
D、Less people are willing to read handwritings today.

答案B

解析 细节题。从文章第三段的第一、二两句我们可以看到,真正的困难在于“his papers contain at least five handwriting styles”。也就是说,不同的人的笔迹是不一样的,因此这一题的答案是B。
转载请注明原文地址:https://kaotiyun.com/show/ynZK777K
0

最新回复(0)