Shoebat.com has been consistently warning about artificial intelligence and the grave threat it poses.
Several years ago, I began covering this as it concerned the adult film industry, since individual persons began using artificial intelligence technologies in order to “modify” films to their personal taste. I warned this was a serious sign that the technology was going mainstream and that it would eventually be used not to satisfy the perversities of men’s private thoughts, but would be used to manufacture evidence for political reasons.
This prediction has shown itself to be true, as artificial intelligence applications have directly grown into an entire industry of being able to produce fake news casts with fake anchors entirely generated by A.I., and now the technology is imitating people’s voices in a matter of seconds:
With just 3.7 seconds of audio, a new AI algorithm developed by Chinese tech giant Baidu can clone a pretty believable fake voice. Much like the rapid development of machine learning software that democratized the creation of fake videos, this research shows why it’s getting harder to believe any piece of media on the internet.
Researchers at the tech giant unveiled their latest advancement in Deep Voice, a system developed for cloning voices. A year ago, the technology needed around 30 minutes of audio to create a new, fake audio clip. Now, it can create even better results with just a few seconds of training material.
Of course, the more training samples it gets, the better the output: One-source results still sound a bit garbled, but it doesn’t sound much worse than a low-quality audio file might.
The system can change a female voice to male, and a British accent to an American one—demonstrating that AI can learn to mimic different styles of speaking, personalizing text-to-speech to a new level. “Voice cloning is expected to have significant applications in the direction of personalization in human-machine interfaces,” the researchers write in a Baidu blog article on the study.
This iteration of Deep Voice marks yet another development in AI-generated voice mimicry in recent years. Adobe demonstrated its VoCo software in 2016, which could generate speech from text after 20 minutes of listening to a voice. Montreal-based AI startup Lyrebird claims it can do text-to-speech using just one minute of audio.
These technologies represent the kind of leaps in the advancement of AI that researchers and theorists raised concerns around when deepfakes democratized machine learning-generated videos. If all that’s needed is a few seconds of someone’s voice and a dataset of their face, it becomes relatively simple to fabricate an entire interview, press conference, or news segment. (source)
There is a scene in the first and second Terminator films where the Terminator robots imitate the voices of loved ones, so to deceive Sarah Connor and respectively attempt to do the same to her son, John.
This is not a scene from a movie anymore. It is real life.
The future of warfare and conflict, as well as deceit, will be driven by AI tools. It is a critical trend to watch with massive implications for the future.