During a recent study, we asked participants how long they thought they would have to speak in order for their voice to be uniquely recognized (i.e., voice print). While their estimates varied widely – from 30 seconds to 30 minutes – most people said about three to five minutes.
The reality is, voice prints can take as little as five seconds to do accurately if the correct phrases are spoken. So the question is, as researchers, should we recommend to design for what people think and feel is the correct length so they feel secure? Or should we collect the minimum?
While I understand why we might do what is comforting for users, I think our job is to convince people that voice biometrics are secure with the minimum amount of effort required. What do you think? Join the discussion!