
admin2018-06-06  23

Question: [A] As might be expected, countermeasures to sniff out such deception are being developed. Nuance Communications, a maker of voice-activated software, is working on algorithms that detect tiny skips in frequency at the points where slices of speech are stuck together. Adobe, best known as the maker of Photoshop, an image-editing software suite, says that it may encode digital watermarks into speech fabricated by a voice-cloning feature called VoCo it is developing. Such wizardry may help computers flag up suspicious speech. Even so, it is easy to imagine the mayhem that might be created in a world which makes it easy to put authentic-sounding words into the mouths of adversaries, be they colleagues or heads of state.
[B] Until recently, voice cloning—or voice banking, as it was then known—was a bespoke industry which served those at risk of losing the power of speech to cancer or surgery. Creating a synthetic copy of a voice was a lengthy and pricey process. It meant recording many phrases, each spoken many times, with different emotional emphases and in different contexts (statement, question, command and so forth), in order to cover all possible pronunciations.
[C] More troubling, any voice—including that of a stranger—can be cloned if decent recordings are available on YouTube or elsewhere. Researchers at the University of Alabama, Birmingham, led by Nitesh Saxena, were able to use Festvox to clone voices based on only five minutes of speech retrieved online. When tested against voice-biometrics software like that used by many banks to block unauthorised access to accounts, more than 80% of the fake voices tricked the computer. Alan Black, one of Festvox’s developers, reckons systems that rely on voice-ID software are now "deeply, fundamentally insecure".
[D] Next year VivoText plans to release an app that lets users select the emphasis, speed and level of happiness or sadness with which individual words and phrases are produced. Mr Silbert refers to the emotive quality of the human voice as "the ultimate instrument". Yet this power also troubles him. VivoText licenses its software to Hasbro, an American toymaker keen to sell increasingly interactive playthings. Hasbro is aware, Mr Silbert notes, that without safeguards a prankster might, for example, type curses on his mother’s smartphone in order to see a younger sibling burst into tears on hearing them spoken by a toy using mum’s voice.
[E] Not anymore. Software exists that can store slivers of recorded speech a mere five milliseconds long, each annotated with a precise pitch. These can be shuffled together to make new words, and tweaked individually so that they fit harmoniously into their new sonic homes. This is much cheaper than conventional voice banking, and permits novel uses to be developed.
[F] Utter 160 or so French or English phrases into a phone app developed by Candy Voice, a new Parisian company, and the app’s software will reassemble tiny slices of those sounds to enunciate, in a plausible simulacrum of your own dulcet tones, whatever typed words it is subsequently fed. In effect, the app has cloned your voice. The result still sounds a little synthetic but Candy Voice’s boss, Jean-Luc Crébouw, reckons advances in the firm’s algorithms will render it increasingly natural. Similar software for English and four widely spoken Indian languages, developed under the name of Festvox, by Carnegie Mellon University’s Language Technologies Institute, is also available.
[G] And, lest people get smug about the inferiority of machines, humans have proved only a little harder to fool than software is. Dr Saxena and his colleagues asked volunteers if a voice sample belonged to a person whose real speech they had just listened to for about 90 seconds. The volunteers recognised cloned speech as such only half the time (i.e. no better than chance). The upshot, according to George Papcun, an expert witness paid to detect faked recordings produced as evidence in court, is the emergence of a technology with "enormous potential value for disinformation". Dr Papcun, who previously worked as a speech-synthesis scientist at Los Alamos National Laboratory, ponders on things like the ability to clone an enemy leader’s voice in wartime.

【B2】

Options

Answer: B

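Explanation: This blank is the second paragraph of the passage. The paragraph that follows it, E, is already given and opens with "Not anymore". E goes on to say that software now exists that can store slivers of recorded speech only five milliseconds long, each annotated with a precise pitch, and that this is much cheaper than conventional voice banking and permits new uses to be developed. The second paragraph must therefore stand in contrast to E and describe how limited the earlier technology was. Of the remaining paragraphs, only B fits: it says that "voice banking" used to be a bespoke business that was slow and expensive, which sets up the turn that E makes. B is therefore the correct answer.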