|
Moreover, it has to be a live voice, since a tape recording of the voice will not actually replicate that person's voiceprint, making it a secure system.
Ah, yes, but how do they guarantee that? This reminds me of that excellent article on biometric security devices in the german technology magazine (wish I had the link), which did a study about how easy it was to spoof fingerprint-based and face recognition-based biometric devices. A voice-based authentication technology will run into the same problem.
Why? Think about - /how/ will Nuance make sure it's a live person and not a recording? The face-recognition software tried to check for 'moving pixels', to prevent somebody holding up a photograph. This was easily circumvented by holding up a video image up to the sensor. The voice authentication technology will face a similar challenge.
If you think about it, there's not that many ways to make sure there's a live person on the other end. They usually boil down to questions of resolution and "flaws". If they're trying to authenticate by resolution, they're relying on the fact that a live voice will have some kinds of frequencies that a recording will not. Of course, the voice is transmitted through a phone line, anyhow, so some of those frequencies are lost. But you can see how this becomes a technology race - how sensitive is the security device vs. how well can a recording reproduce the voice.
Similarly, if they're relying on "flaws" to authenticate, like the face-recognition software did (i.e. make sure there's tiny imperceptible variations/movements), a good recording (or a good interference/whatever algorithm) can circumvent that.
I can think of only one way to do voice authentication that couldn't be thwarted by recordings - although it didn't sound like they're taking that path (their product description touts authentication by user saying their name). And that's by having the user say a random set of words, instead of a fixed name or password. The software would analyze the user's voice for common elements, and since the words to be spoken is different each time, no recording is possible.
It's an interesting problem, nevertheless. Always make sure to challenge vendor claims such as 'it /has/ to be a live voice'.
|