Synthesized Singers 383
ctwxman writes "Over the past few decades, advances in computer hardware and software have eliminated many jobs... some technical, some menial, but none artistic. As an on-camera performer in television, I've always was believed that I was 'bulletproof' as far as replacement through technology was concerned. Not so fast. Recently, The Sinclair television stations began using 'central casting' to bring news and weather anchors from a central location (near Baltimore) to the local outlets. Still, real people are needed, just not as many. But now, even real performers may be replaced. The New York Times (inhalation of airplane glue required) reports on a new technology which allows synthesized singers to sing. Imagine having a singer with a world-class voice at your disposal, any hour of any day. She's just standing at the ready, game to perform whatever silly song you might make up for her: a ballad about her love for you, a tribute to your best friend's golf game, a stirring rendition of the evening's dinner menu. Scary."
Google Link (Score:5, Informative)
This isn't really NEW (Score:5, Insightful)
It sounds like they've gone to much greater lengths on this project than any I'm aware of in the past, but the basic thing here has been out for a long time. Most any keyboard you can buy has human voices. A single sample can be spread out over your keyboard and sing any pitch you want, even glides and stuff, pretty easily. But it's generally fairly rudimentary - 'ahh' and 'ohh' or similar, you can actually do some nice sounding background vocals but not sing verses.
From the description in the article, this 'new' thing is really just an inevitable extension of that - they spend about 5 days with a singer, recording her singing many different phonemes and different effects, so that you can then piece together the words to your own song and put it to your own melody in her voice. And, for the moment, they're still aiming at producing background vocals, just more complex ones with the ability to do actual lyrics instead of a oohs and aaahs. Could be kind of cool, but it definately doesn't sound like a 'quantum leap' - just an extension of long-existing technology. I've been expecting to see someone do this for well over 10 years now, ever since I first got to play around with a digital synthesizer.
Re: (Score:2, Interesting)
Re:This isn't really NEW (Score:3, Interesting)
Re:This isn't really NEW (Score:2)
Re:This isn't really NEW (Score:5, Insightful)
Re:This isn't really NEW (Score:3, Interesting)
Given that singing and speech aren't mediated by the same parts of the brain. In this book [amazon.com], there's examples of people who can sing sentences, but can't speak them.
So combine.... (Score:5, Funny)
Re:So combine.... (Score:5, Funny)
Re:So combine.... (Score:2, Funny)
Electric Monk (Score:5, Funny)
Re:So combine.... (Score:2)
Actually, I just found my PHD Project! (Score:3, Funny)
-theGreater Ponderer.
Re:Actually, I just found my PHD Project! (Score:2)
-theGreater Embarrassment.
PS: Now I'm just going to type for awhile, until my two minute timer expires. It's unfortunate that I'm rather a speedy typer, though. Luckily for you all, I happen to have rewritten this spiel several times in an attempt to make myself look smarter. It's not working, is it? Ah well, time to push Submi...
Re:So combine.... (Score:3, Funny)
Here is the demo MP3 (Score:5, Informative)
LOLA Demo 1 -Little Bird (MP3) [zero-g.co.uk]
Demo 1: "Little Bird".
(NOTE - the lead vocal line on this demo is NOT by Vocaloid - it is a real singer. Please listen to the backing vocals!). This demo illustrates well how LOLA has been used to create a simple backing vocal arrangement for a personally-produced song. The song was written and performed by one of the Zero-G singing synthesis development team, Andy Power. Andy is singing the lead vocal himself, with his real voice, but he was able to add the backing vocals to his song purely by creating them all using LOLA. Although this is only a very simple example, it immediately illustrates LOLA's usefulness in an everyday situation.
Re:Here is the demo MP3 (Score:5, Informative)
Forget backing vocals, here's a sample [online.fr] of "Amazing Grace" mentioned in the article. Not perfect, but quite impressive.
Re:Here is the demo MP3 (Score:2)
Re:Here is the demo MP3 (Score:4, Funny)
Oh hay watch out, ... out of ... the way
I'm back-ing up
please get
Oh please get lost
or you'll be crushed
Scoot and live an-other dat.
Vibrato (Score:3, Informative)
Having said that though, I'm sure they'll have tweaks for that sort of thing in no time.
Re:Here is the demo MP3 (Score:3, Informative)
This must be a dupe... (Score:5, Funny)
Scary? (Score:2, Insightful)
Imagine a composer getting up in the middle of the night, going to his newfangled magical "keyboard" and whipping up an entire symphony without the need for a full orchestra..... ooooh... scary.
Man, for a bunch of geeks sometimes the
Re:Scary? (Score:2)
Thats not scary, thats dreamlike. I'm an artist by hobby (its a minor, not a full degree), and one of the largest frustrations is the variety of ways you need help for things - in my case, sculpture often requires assistance in casting objects, for example - and in music its only worse: if you're a composer, yo
Re:Scary? (Score:5, Insightful)
Re:Scary? (Score:3, Insightful)
Aw, c'mon. They said the same thing about player pianos.
I, as a geek, like tech to the extent that it reduces the tedium and frees us to be creative. This is realizing that the very thing we love can be used to work against us. And that is the realization that is truly and deeply scary.
This sort of artistic Luddism has no place in today's world. If you're worthy of the self-applied title "geek," you'll find ways to use this tec
Re:Scary? (Score:2)
I'm not willing to label the people who sing advertisements for soap and automobiles as artists or creative. (The guy who wrote the song, maybe). They're little different from the legions of portratists unemployed by Kodak.
In fact, it was only in recent years that singers and musicians have been able to convince the media to start referring to them by the loftier word "artist".
This is not new technology... (Score:3, Funny)
I suspect Milli Vanilli, BROS, Christina, Brittney and N*sync may be suing for prior art.
Re:This is not new technology... (Score:2, Insightful)
I'm not a fan of her music, but credit where credit is due!
Macintosh speech synthesis (Score:5, Funny)
I'm almost considering getting a mac just to listen to her.
Re:Macintosh speech synthesis (Score:2, Funny)
Re:Macintosh speech synthesis (Score:2)
Just as long as you don't get too into Princes. OK?
Re:Macintosh speech synthesis (Score:3, Insightful)
What I'd like to see is physical modeling of the speech apparatus - lungs, vocal cords, mouth, tongue, teeth, lips, where you can vary parameters such as articulation, etc. We have the computational power to drive such a simulation, witness the
Re:Macintosh speech synthesis (Score:4, Informative)
http://www.danamania.com/temp/victoria.mp3 is the old
http://www.danamania.com/temp/vicki.mp3 is the new
Hrmm (Score:5, Funny)
And in the middle of the stage, a beige computer tower with a monitor, keyboard and mouse and a technician on hand to wiggle the mouse every 10 minutes so the flying windows screensaver doesnt come on.
Re:Hrmm (Score:2)
"And in the middle of the stage, a beige computer tower"
(That picture is a recreation from Macross Plus [imdb.com]. Although the idea of computerized [amazon.com] singers [schlockmercenary.com] is not uncommon in scifi)
Re:Hrmm (Score:3, Funny)
Re:Hrmm (Score:2)
I'm personally very interested in computer generated artists. I have a CD single of each of Sharon Apple and Kyoko Date -- both sung by a human voice and generally ficticious as far as being a Virtual Idol is concerned, but none the less interesting as projects go. I also enjoy Idoru by William Gibson, in which the central character is an AI performer. Much as computer generated art has been, historically, underwhelming it is only a matter of time be
the future of music (Score:4, Interesting)
Re:the future of music (Score:3, Interesting)
You might be interested in this album, Farinelli: Original Soundtrack [amazon.co.uk] in which computers where used to simulate the sound of a castrati singer.
Scary, or progress? (Score:2)
It's ironic that the very tools the music industry uses today to guarantee pitch-perfection are tomorrow going to undermine their own success, much as people giving away software are doing in many ways for the software industry. Perhaps the only thing guaranteed is acting, as Final Fantasy: The Spirits Within demonstrated, altho
Re:Scary, or progress? (Score:2)
In another way, it demonstrates the value of real music: much higher than recorded music. Have you been to a concert lately? One where you could see the eyes of the people performing? There's more to music then sound.
Re:Scary, or progress? (Score:4, Insightful)
When pitch perfection and standardised voices are available from a $300.00 software package, music made by people with interesting voices and offbeat musical philosophies will be that much more valuable.
After all, it seems unlikely that there'll be a software Tom Waits or a digital Johnny Rotten in our immediate future. Punk revival anyone?
Ah (Score:2, Funny)
Is her name Sharon Apple?
Copyrighting one's voice? (Score:3, Interesting)
As an example, Harley Davidson (the motorcycle company), tried to get it's unique motorcycle engine sound copyrighted and failed. Will this change the copyright office's stance?
Same rules as sampling (Score:2)
Some friends and I tried to do something like this about 15 years ago, but the state of the art was not there (we were having to develop custom hardware with multiple DSPs, and it would have been at least 2U in size), and a couple of lawsuits scared us off. The nail in the coffin was one lawsuit that really seemed to be on point for us, was a Mercury Sable
Ehhhh... (Score:2, Funny)
They already have been. Who would call Spears, Aguilera, or Milli Vanilli "real"?
a stirring rendition of the evening's dinner menu.
Sorry, but "Pasta Roni" sung is going to be underwhelming, no matter how good the voice is.
new "html" (Score:2, Interesting)
eg
I didn't say he did it.
I didn't say he did it.
I didn't say he did it.
listen to it here (Score:2)
the have some samples here [soundonsound.com]
sounds synthetic to me
Who would do this? (Score:2, Insightful)
Re:Who would do this? (Score:2)
It only takes one "traitor" to spoil it for all the rest. (Ok, you'd really need singers to represent 5-20 categories). It's unlikely that ALL singers are forward-thinking enough to understand that completing this one particular job might destroy their profession.
And even if vocal performers some how come to a shared, rational decision not to submit voices to machines, how long could they hold out? On one side, th
Re:Who would do this? (Score:2)
Yes, of course. That permission is naturally included in the employment contract for the aforementioned one week of work. The hourly pay will be much higher than a typical gig...
(The producers will start off by picking singers who are skilled but unattractive- usual "studio backup" types- who haven't the clout to get redisuals on the set)
What would hurt the industry bad would be if enough disgruntled artists decided to put their voice sets
I hate to shoot your ego, but... (Score:2, Interesting)
Your time is coming to an end, but I will say that synths and sample
Q: What do you call someone... (Score:2, Funny)
A: A drummer.
ba dum dum chsst
Re:I hate to shoot your ego, but... (Score:2)
The submitter is a weatherman, BTW. A career that's ripe to be surrendered to machinery. All newreading "talking heads" might be replaced with CGI within a decade, but on-air "meteorologists" will be the first to go, since the content they read is the least varying.
As far as music, though, the first big use of this specific tech will be advertising jingles. Seriously, hardly any other kind of TV production desires to have any
Mac Speech Voices (Score:2)
The light you see at the end of the tunnel...
is the headlamp of a fast approaching train!
I find it interesting... (Score:2, Funny)
Here are some samples (Score:3, Informative)
Player [music-eclub.com]
The samples are very good and worth the trouble if you're interested in this. While not perfect it is better that I was expecting and I could see how it could be passable for a real person in certain situations.. Here are some direct links to the samples:
Kimi no uwasa / Male lead vocal (Japanese song) [yamaha.co.jp]
Sarasara yukigeshiki / Chorus (Japanese) [yamaha.co.jp]
Amazing Grace / English example [yamaha.co.jp]
Synthesize Politicians? (Score:2)
Little would you know that while your local senator or rep are being televised bellowing out meaningless torrents of weasel-words on CSPAN, they may well actually be off porking an intern or on a lobbyist-paid junket.
When potentially used on one end of a "live" webcast or other broadcast, the possibility of creating "digital alabais" rears its head.
This is one mode of media where it may be necessary and de
Endless Listening (Score:4, Interesting)
Re:Endless Listening (Score:2, Funny)
Soundtrack for my life (Score:3, Interesting)
Just What I Want (Score:2)
So
nytimes registration not bad, but.. (Score:3, Funny)
Now excuse me while I go try find where my brain cells went.
This is a science fiction novel... (Score:3, Informative)
ttyl
Farrell
Re:This is a science fiction novel... (Score:2)
I'm not sure I really agree with your analogy though. Shockwave Rider really doesn't have much to do with modern day hacking, but from what I remember Little Heroes is fairly accurate with what the music industry has become (algorithms to predict music popularity, etc).
Finally! (Score:2, Funny)
Virtual Singer (Score:2)
Fun toy.
Can it sing it as... (Score:2)
Rant coming... (Score:2)
I keep hoping for the next N.W.A. or Nirvana and I start thinking we might never get fresh, exciting music again.
Not really buying it. (Score:3, Insightful)
Now, "popular music" notwithstanding, it takes more than just hitting the right notes and holding them to make music. This applies muchly to instruments, and doubly so for voices.
First of all, just any combination of notes are not what makes music... artists have to play with hundreds of variations of tones to find "that perfect sequence," the collection of tones in a specific order, length, and style that produce a pleasing arrangement. Once that has been found, further arrengments of music are patterned and fitted to that sequence. You can have a synthesizer, but someone's still programming it... and not with numbers, either.
Voices are many times more complex than musical instruments, because not only is there tone, volume, and length, but there is, for lack of a better term (in my own knowledge), shape of the sound. The artist Karl Jenkins (of "Adiemus" fame) used singers and a nonsensical language specifically to capitalize on that very set of qualities... using the human voice and speech as another "Instrument," rather than as lyrics.
Now, you could synth using the phonemes and vocal qualities of a singer, but ultimately, without the feeling behind the voice, no amount of coding will put any life to it.
Re:Not really buying it. (Score:2)
Or to put it another way, we have to wait for AI to catch up. It'll get there. It's probably possible to code a machine to put together aesthetically pleasing tunes already. Sort of an audio version of AARON [viewingspace.com] the painting robot.
I find it interesting when people talk about computers like that. It gets into the AI arena, but they always stop at say 2050 when computers have finally reached the capacity of the human brain. I find it more fascinating to ponder what happens after that, when they have surpassed us
Re:Not really buying it. (Score:4, Insightful)
Artificial intelligence isn't truly artificial sentience until it has the capability of experiencing it's own existance. Living organisms that posess such self-awareness have thousands of input devices, known as nerve receptors, which alert them to the presence of anything to their immediate position. By this, one must learn to recognize the receptors' data. After a long time of learning the abilities of those receptors, and their cousins, the motor nerves (which activate muscle groups for the purpose of movement), self-awareness becomes available, because everywhere on the human body has such receptors, and what doesn't isn't really the human body.
With this knowledge, the person then begins to learn what is a pleasant experience to those receptors, and what is pain. With pleasure/pain, over time, the person begins to develop affections and apprehensions, which give way to full emotional response. Some additional functions in the body help this along, such as endorphins which improve the pleasure state in the brain, and thus, the body... further enhancing the personal experiences.
Now, a computer would have to have MASSIVE amounts of electric and processing power to activate and stimulate such receptors, should miniturization ever allow such devices to be manufactured cheaply and at such quantity to compare with the human's nervous system. And without that system, a computer cannot develop the deep, intricate levels of affection/apprehensions that would allow for emotional responses.
Add to this the fact that a computer would have to be able to process all of this in realtime, over approximately 12-18 years to truly mature into a true artificial sentience.
Now, what does this have to do with music?
Music is all about experience. People write what they know, and they sing how they feel. Experience is a byproduct of sentience, which most definitely means that computerized music, which can please and FOOL audiences, is yet a long time in coming.
replication vs. recognition (Score:2, Interesting)
Re: (Score:2)
Hmm... (Score:4, Informative)
Personally, I think the best examples to download are "The Easy Way" (song 15) and "K'ai - Eyes swim" (song 16).
While no where near perfect, Flinger and the samples really show where things are heading - I have said it before, but this type stuff (perfected, of course), plus tech like machinima (once again, as it becomes better) are truely going to alter what we think of movies, acting, etc - virtual actors, virtual singers, virtual movies...
This is Good News... (Score:2, Informative)
Never mind that the Mac had singing text to speech for a long time...
There are more artists than performance artists (Score:5, Insightful)
Ever hear of a cel animator?
Re:There are more artists than performance artists (Score:4, Interesting)
Slashdot - Home of Depressing news for musicians (Score:2)
The only bright side is that I could eventually have the chance to delete Celine Dion.
Samples (Score:2, Informative)
Although pretty convincing, i think it has a way to go before it would be perfect.
Jazz (Score:5, Interesting)
Classic singers stay as close as possible to the "absolute" quality line - it's perfect for being mathematically modeled and it's a matter of time such models will be apparead, even if their implementations will take some hardware resources.
Pop singers make sound anyway far away from being called as an art. It's perfect for being implemented in embeded solutions. It's a matter of time first cyber-singers will be cloned like cheap "made-in-China" electronic (sorry, my oriental friends, although nothing personal or racial in this comment).
Jazz is still an art, like classical music, but its improvizations are very unpredictable. Jazz singers will be last ones to go. Even more - Jazz improvizators will be eventually involved to prototype new cyber-singers. Hmm, I can even imagine special programming languages for singer-modelling: "bebop", "blues", "swing" :)
Re:Jazz (Score:3, Insightful)
phone sex industry (Score:5, Funny)
Re:phone sex industry (Score:3, Funny)
Nah, you could just do a quick and dirty 'search and replace' on an Eliza script, and go from there.
Sexual Eliza: Hello, my name is Candi. What would you like me to do?
Customer: Strip for me, baby.
Sexual Eliza: I think you want me to strip for you. Why is that?
Customer: Oh, baby, cuz it's so hot.
Sexual Eliza: What is hot about strip for you?
And so on....
Macross Plus (Score:5, Funny)
Not there yet (Score:2, Interesting)
One area that really needs quite a bit more work is the vowels. When singers sing "ee" (as in "saved a wretch like me," for example), they usually soften it so it sounds a bit more like "meh." When I used synthesized voices before
At last, the ultimate weapon against the RIAA (Score:5, Informative)
But this works for anybody. If you can synthesize music from MIDI and vocal models, you can use that deal. The RIAA can't stop you from doing this.
A synthesized music web site could even buy blanket ASCAP and BMI licenses, which aren't too expensive, and allow music downloads. The going rate [ascap.com] seems to be about $5000 per million downloads, or about $0.005 per song.
This is a real threat to the RIAA. If the technology works.
Re:At last, the ultimate weapon against the RIAA (Score:4, Informative)
Re:At last, the ultimate weapon against the RIAA (Score:3, Insightful)
Basically, that means that the copyright owner must have released the song for sale in some form...if it's on an album you could have bought at some point, the artist HAS to let you cover it for the stated fees--that's the point of compulsory licensing, the songwriter doesn't get a choice.
The clause you bolded is to prevent me from doing something like singing a previously unreleased Johnny Cash (for example) song without permission by citing the compulsory licensing
Yeah, but will these computational singers... (Score:5, Funny)
Reminds me... (Score:2)
Walk before you run, talk before you sing (Score:4, Insightful)
Furthermore, this tech is likely not going to be what you think. What makes a singer good is their INTERPRETATION of the notes. Even with proper synthesis, at its best, it will be like computer animation. It could be very good and maybe even perfect but it would be TIME CONSUMING. Watch the making of Making Nemo on the DVD to get an idea of how hard it is to understand emoting.
You would really need to spend a large amount of time figuring out how to make the voice sound EMOTIONAL.
Pah, old tech (Score:4, Funny)
Try Fruity Loops for a simple example (Score:3, Informative)
The music studio Fruity Loops has had a singing plug-in for a little while now; you don't have a lot of (easy) control over the pitch, so it's really more of a toy most of the time. But combined with some simple audio processing, you can easily get results like these:
http://www.antics.org.uk/mp3/green/ntk_copyright.m p3 [antics.org.uk] (1mb MP3)
http://www.antics.org.uk/mp3/green/ntk_eod.mp3 [antics.org.uk] (646kb MP3)
I'm sure it's nowhere near the league of the featured developments, but it's still a very impressive feature in an affordable package...
Synthesized Idol (Score:3, Funny)
Re:More like a sampler than a synth (Score:2)
Re:More like a sampler than a synth (Score:2)
Is it that different to actors' faces and movements being sampled in order to recreate them in today's SFX movies?
Johnny Cab (Score:2)
Best sample (Score:2)