Synthesized Singers 383
ctwxman writes "Over the past few decades, advances in computer hardware and software have eliminated many jobs... some technical, some menial, but none artistic. As an on-camera performer in television, I've always was believed that I was 'bulletproof' as far as replacement through technology was concerned. Not so fast. Recently, The Sinclair television stations began using 'central casting' to bring news and weather anchors from a central location (near Baltimore) to the local outlets. Still, real people are needed, just not as many. But now, even real performers may be replaced. The New York Times (inhalation of airplane glue required) reports on a new technology which allows synthesized singers to sing. Imagine having a singer with a world-class voice at your disposal, any hour of any day. She's just standing at the ready, game to perform whatever silly song you might make up for her: a ballad about her love for you, a tribute to your best friend's golf game, a stirring rendition of the evening's dinner menu. Scary."
Scary? (Score:2, Insightful)
Imagine a composer getting up in the middle of the night, going to his newfangled magical "keyboard" and whipping up an entire symphony without the need for a full orchestra..... ooooh... scary.
Man, for a bunch of geeks sometimes the
This isn't really NEW (Score:5, Insightful)
It sounds like they've gone to much greater lengths on this project than any I'm aware of in the past, but the basic thing here has been out for a long time. Most any keyboard you can buy has human voices. A single sample can be spread out over your keyboard and sing any pitch you want, even glides and stuff, pretty easily. But it's generally fairly rudimentary - 'ahh' and 'ohh' or similar, you can actually do some nice sounding background vocals but not sing verses.
From the description in the article, this 'new' thing is really just an inevitable extension of that - they spend about 5 days with a singer, recording her singing many different phonemes and different effects, so that you can then piece together the words to your own song and put it to your own melody in her voice. And, for the moment, they're still aiming at producing background vocals, just more complex ones with the ability to do actual lyrics instead of a oohs and aaahs. Could be kind of cool, but it definately doesn't sound like a 'quantum leap' - just an extension of long-existing technology. I've been expecting to see someone do this for well over 10 years now, ever since I first got to play around with a digital synthesizer.
Who would do this? (Score:2, Insightful)
Re:Scary? (Score:5, Insightful)
Re:This is not new technology... (Score:2, Insightful)
I'm not a fan of her music, but credit where credit is due!
Re:Scary? (Score:3, Insightful)
Aw, c'mon. They said the same thing about player pianos.
I, as a geek, like tech to the extent that it reduces the tedium and frees us to be creative. This is realizing that the very thing we love can be used to work against us. And that is the realization that is truly and deeply scary.
This sort of artistic Luddism has no place in today's world. If you're worthy of the self-applied title "geek," you'll find ways to use this technology to create sounds and effects, maybe even entire musical genres, that were never possible or practical before.
This isn't going to put Loreena McKennitt out of business, but I could see it giving Enya the willies.
Not really buying it. (Score:3, Insightful)
Now, "popular music" notwithstanding, it takes more than just hitting the right notes and holding them to make music. This applies muchly to instruments, and doubly so for voices.
First of all, just any combination of notes are not what makes music... artists have to play with hundreds of variations of tones to find "that perfect sequence," the collection of tones in a specific order, length, and style that produce a pleasing arrangement. Once that has been found, further arrengments of music are patterned and fitted to that sequence. You can have a synthesizer, but someone's still programming it... and not with numbers, either.
Voices are many times more complex than musical instruments, because not only is there tone, volume, and length, but there is, for lack of a better term (in my own knowledge), shape of the sound. The artist Karl Jenkins (of "Adiemus" fame) used singers and a nonsensical language specifically to capitalize on that very set of qualities... using the human voice and speech as another "Instrument," rather than as lyrics.
Now, you could synth using the phonemes and vocal qualities of a singer, but ultimately, without the feeling behind the voice, no amount of coding will put any life to it.
Re:Scary, or progress? (Score:4, Insightful)
When pitch perfection and standardised voices are available from a $300.00 software package, music made by people with interesting voices and offbeat musical philosophies will be that much more valuable.
After all, it seems unlikely that there'll be a software Tom Waits or a digital Johnny Rotten in our immediate future. Punk revival anyone?
There are more artists than performance artists (Score:5, Insightful)
Ever hear of a cel animator?
Re:Not really buying it. (Score:4, Insightful)
Artificial intelligence isn't truly artificial sentience until it has the capability of experiencing it's own existance. Living organisms that posess such self-awareness have thousands of input devices, known as nerve receptors, which alert them to the presence of anything to their immediate position. By this, one must learn to recognize the receptors' data. After a long time of learning the abilities of those receptors, and their cousins, the motor nerves (which activate muscle groups for the purpose of movement), self-awareness becomes available, because everywhere on the human body has such receptors, and what doesn't isn't really the human body.
With this knowledge, the person then begins to learn what is a pleasant experience to those receptors, and what is pain. With pleasure/pain, over time, the person begins to develop affections and apprehensions, which give way to full emotional response. Some additional functions in the body help this along, such as endorphins which improve the pleasure state in the brain, and thus, the body... further enhancing the personal experiences.
Now, a computer would have to have MASSIVE amounts of electric and processing power to activate and stimulate such receptors, should miniturization ever allow such devices to be manufactured cheaply and at such quantity to compare with the human's nervous system. And without that system, a computer cannot develop the deep, intricate levels of affection/apprehensions that would allow for emotional responses.
Add to this the fact that a computer would have to be able to process all of this in realtime, over approximately 12-18 years to truly mature into a true artificial sentience.
Now, what does this have to do with music?
Music is all about experience. People write what they know, and they sing how they feel. Experience is a byproduct of sentience, which most definitely means that computerized music, which can please and FOOL audiences, is yet a long time in coming.
Walk before you run, talk before you sing (Score:4, Insightful)
Furthermore, this tech is likely not going to be what you think. What makes a singer good is their INTERPRETATION of the notes. Even with proper synthesis, at its best, it will be like computer animation. It could be very good and maybe even perfect but it would be TIME CONSUMING. Watch the making of Making Nemo on the DVD to get an idea of how hard it is to understand emoting.
You would really need to spend a large amount of time figuring out how to make the voice sound EMOTIONAL.
Re:Macintosh speech synthesis (Score:3, Insightful)
What I'd like to see is physical modeling of the speech apparatus - lungs, vocal cords, mouth, tongue, teeth, lips, where you can vary parameters such as articulation, etc. We have the computational power to drive such a simulation, witness the brain-dazzling graphics $200 3d cards can pump out. Couple that on-demand speech engine with a decent text to speech translator, and say goodbye to poor phoneme transitions and inappropriate articulations.
Of course, this technology would probably find it's first applications in interactive 3d porn... Grant applications anybody?
Re:This isn't really NEW (Score:5, Insightful)
Re:I hate to shoot your ego, but... (Score:2, Insightful)
But you are mostly correct unfortunately. Most Discovery/TLC programs use library or "needle drop" music. It is susually pretty crappy. The studio I work at writes new music for shows, and taylors it to scenes. There is a cymbal crash when "Dr. Brady" gets bitten by a viper. A bowed gong plays when the rodent gets squeezed to death by an Anaconda. This is a simplification, but it is all possible with computers, instead of a 1,000 dollar gong. But yes, most TV music is utter crap. I will shut up about now...
Re:Walk before you run, talk before you sing (Score:2, Insightful)
Not if you were wanting to create a virtual Kylie, Atomic Kitten, Gareth Gates, Westlife, Robbie Williams, Christina Aguilerra, Madonna, Cher, Britney Spears, J-Lo, Random Pop Muppet you wouldn't.
Re:Scary? (Score:1, Insightful)
I never post here, but I just had to respond to this.
Please. That same, impotent argument can be made against any tool, any technology. And it's always just as pointless and futile. Yes, it's possible that this technology may cost jobs; every new technology does. But they also create new jobs at the same time. But beyond that, there are two other important points to consider that are more specific to this particular technology:
Since the advent of recording and the mass distribution of recorded music it's been as much about who as what. Consumers often care more about who is creating their entertainment than they care about the quality or substance of the entertainment itself. If that wasn't the case, our culture certainly wouldn't have the star-crazy media landscape it does today: Where it must be some kind of odd coincidence that massively popular dreck is always presented by young, attractive people with intriguing and often scandalous personal lives. But this rule still holds true for music that isn't massively popular. Computers have been able to draw pictures for a long time, but they haven't been displacing human artists.
Secondly, this is an *amazing* tool for people who create music. Not because it allows them to kick the lead singer they've always hated out of their band, but because it opens new possabilities. Wow, I can add a quick background vocal in a few minutes. Or hey, I can whip up a quick prototype and see how the tune sounds before I use up my singer's time rehearsing and refining the track. I can use this for all sorts of interesting applications. I can't sing, but I can track songs in Reason or Cakewalk. Now I can add some cool vocals. Wow, I've just added a new dimension to my one-man-band. For the independent home-recording artist in particular, this is an exciting new technology.
So drop the Aldous Huxley BS (no offense to a great author intended) and just accept it. This is here, it has lots of valuable applications, it's never going to replace flesh-and-blood humans, which are inherently social entities.
There's nothing "scary" about it.
Re:Jazz (Score:3, Insightful)
Re:At last, the ultimate weapon against the RIAA (Score:3, Insightful)
Basically, that means that the copyright owner must have released the song for sale in some form...if it's on an album you could have bought at some point, the artist HAS to let you cover it for the stated fees--that's the point of compulsory licensing, the songwriter doesn't get a choice.
The clause you bolded is to prevent me from doing something like singing a previously unreleased Johnny Cash (for example) song without permission by citing the compulsory licensing law.