Zap2It Labs Discontinuing Free TV Guide Service 569
QuijiboIsAWord writes "Zap2It Labs, which provides free TV listing data for personal use, has long been the main source of program guide information for users in the US and beyond. They've announced via their webpage that,
due to abuse of the service, data will no longer be available after September 1st. There is no other direct source, and no option to pay for the service even if the users wanted to. Without a data feed of this type, users will be reduced to scraping websites at best. Is this going to be a killing blow for MythTV?"
This is troubling (Score:4, Interesting)
Anybody know what the "abuse" was? (Score:4, Interesting)
Achilles Heel (Score:4, Interesting)
Looks like five years later, it's still the only plan.
Re:This is troubling (Score:3, Interesting)
I'm a little shocked that these guys didn't just go commercial, though, and build a MythTV add-on that allows you to subscribe to their product.
Re:Anybody know what the "abuse" was? (Score:3, Interesting)
It seems fair to start charging, but odd that they're just shutting it off. They say they're willing to license to other companies, so presumably they're hoping someone will come along and offer a package to the MythTV folks by licensing the data and re-selling subscriptions.
Going, going, gone? (Score:5, Interesting)
At first, they made me fill out a big online survey as "payment" for the service. The first time it was about 30 questions.
The third time (this is like every 3-6 months) they only asked one question.
For the last year, the survey has been "click here to renew."
What's with that? I'm willing to give up some personal time and info to pay for this service, and they can't even think of a way to leverage that?
I don't see the problem (Score:4, Interesting)
There's multiple sites out there with TV listings: Yahoo TV, Zap2It, MeeTV (the one I use), etc. Just write perl scripts to capture the listing information from these sites, and modify MythTV to allow the user to choose any service he wishes. Of course, some of these sites may (stupidly) screw with their HTML in order to throw off these scripts, but that's easily worked around with regular updates. So we just need to have a "myth-scripts" distribution site where your Myth box automatically checks for updates to the perl scripts every day and downloads them if necessary, just like we already do with many other things.
No, it's not quite as reliable and efficient as a static interface to this data, but if these companies are stupid enough to remove static interfaces, thinking we're just going to go back to doing everything manually and looking at all the ads, this seems like a reasonable solution. There's no way of preventing automated scripts from downloading webpages.
Re:Myth will survive (Score:1, Interesting)
How? I had my DirecTivo installed in 2005 and haven't touched it since. When it was installed I didn't have to do anything, someone came out and hooked it up and turned it on. Ever since it's been working like it should w/o me having to fuck with it in the least.
Until MythTV boxes come ready to plug and play for less than $100 (mine was free b/c DirecTV's new firmware (at the time) put up a screensaver that my standalone TiVo recorded instead of the show) it won't "beat the pants off of TiVo".
Re:This is quite bad :( (Score:5, Interesting)
it's the new management, they hate that OSS people are getting access to the data and want to stop it.
What if I was to write a web service? (Score:4, Interesting)
Any takers?
Re:Anybody know what the "abuse" was? (Score:1, Interesting)
Maybe they abused it by not hitting it often enough.
Remember, a few months ago, after a long beta, the website version of zap2it finally killed its old tables-and-HTML look to force everyone to use the styles-and-CSS look? And how the biggest difference wasn't the increased standards compliance, but the fact that you suddenly needed to "register" to get a 6-hour view rather than the default 3-hour view, and how all the things that had formerly been done with form submit buttons stopped working unless Javashit was turned on? Gosh, what a coincidence.
It's all about the ad impressions. 6-hour-view website users, hitting SaveAs four times a day (perhaps as infrequently as 28 hits, once a week)... don't make enough impressions to sell banner ads. So no 6-hour-view unless you give 'em an email address they can spam. MythTV users were only hitting the feed once a day, and without banner ads, and producing even less ad revenue.
Why not use Guide+? (Score:3, Interesting)
Could a computer not do the same thing?
Re:What if I was to write a web service? (Score:2, Interesting)
The difference comes down to OSS and Free Services. The same rules don't really apply.
However... we could create a project that would allow for smaller ad-hoc communities and not have to have one site serving the entire internet, just 100 or so users per site...
I'm on board if you want to discuss more (and you got the skillz to pay the billz).
Re:Anybody know what the "abuse" was? (Score:1, Interesting)
Re:I don't see the problem (Score:3, Interesting)
There's multiple sites out there with TV listings: Yahoo TV, Zap2It, MeeTV (the one I use), etc. Just write perl scripts to capture the listing information from these sites, and modify MythTV to allow the user to choose any service he wishes. Of course, some of these sites may (stupidly) screw with their HTML in order to throw off these scripts, but that's easily worked around with regular updates. So we just need to have a "myth-scripts" distribution site where your Myth box automatically checks for updates to the perl scripts every day and downloads them if necessary, just like we already do with many other things.
No, it's not quite as reliable and efficient as a static interface to this data, but if these companies are stupid enough to remove static interfaces, thinking we're just going to go back to doing everything manually and looking at all the ads, this seems like a reasonable solution. There's no way of preventing automated scripts from downloading webpages.
For instance, they could replace their nice little HTML table with some flash code - that's not going to be impossible for you to read with scripts, but it'll be a lot harder.
Another option would be to use images instead of text - possibly even breaking up the images into smaller images, to make it harder for automated tools to pull it and OCR it.
They could load their page with bogus invisible text.
They could provide the listing data in a funky character encoding.
So if these TV listing sites decide that the impact from web scrapers is becoming significant, they have options that can make it very difficult for you.
What happened to Mythtv's paid service, TechnoVera (Score:3, Interesting)
Re:TV stations are hardly interested in helping... (Score:4, Interesting)
Re:Myth will survive (Score:3, Interesting)
Re:Make it a paid service (Score:5, Interesting)
Let's say that MythTV implemented your paid service plan and began charging the princely sum of $2 per month for the data.
I would give it all of 7 days before that paid for data became available for free. Someone, somewhere, would buy the data for $2 per month and load it up for others to have free of charge. It would be a daily torrent that you could pull, or a streamed RSS feed, a static layout site with a downloadable screen scraper, or any one of a dozen other ways I can think.
So now instead of a million dollar revenue stream you'd get a thousand dollar revenue stream coming from the 500 users who would actually be wiling to pay when a free source is available.
If you can answer the question of how to prevent the above scenario from happening I can put you in touch with some content providers who will pay REAL money for your idea. The kind of money that allows people to retire for life...at the age of twenty.
Re:This is troubling (Score:5, Interesting)
Re:So what? (Score:3, Interesting)
It'll be the same way with IPTV for a very long time to come.
Re:Myth will survive (Score:5, Interesting)
Tivo used to allow a really nice FF feature to skip commercials. Now they overlay advertisements on top of the advertisements you are fast forwarding through. Not to mention the advertising in the rest of the UI. If you own one, you should know what I'm talking about. If not, Google found me someone's blog with pictures [blogspot.com]
Re:Myth will survive (Score:3, Interesting)
Every time a feed is lost and a new one is found, every time the guide feed changes, every user has to manually upgrade the software, and it's sure to happen many times in the future..
That is inefficient.
What would be ideal would be if myth boxes could connect to each other to form a peer-to-peer network and share some of their guide data with each other.
Let the authenticity of an entry update (in case of conflicting stories) be determined by the reported source of the update, a vote, and the "age" and reputation of the mythtv installations reporting the listing.
In that manner, when one feed dies, any mythtv user who can would be able to scrape or find listings data and provide it for the benefit of all mythtv users.
Also, no one mythtv node would necessarily need the full listings, they would only need to submit updates randomly based on newly acquired data, and to submit queries based on listings they're interested in.
I.E. channels the user is currently trying to lookup listings for, or channels in the channel lineup for the next few days, and searches for program names the user is interested in or wants recorded every time.
Re: how to prevent data "stealing"? (Score:1, Interesting)
Your argument then relies only on the idea that the hacker's effort wouldn't be worth it. That's quite debatable, but flip it around. Where is the incentive for zap2it to sell this data to you and me? Well they'd have to hire some programmers to interface it with their new database system (which is probably the main reason for discontinuation, they refuse to spend money to keep a free service running), programmers to create the subscription payment and distribution framework, programmers for the encryption and mythtv modules. All for what? To have someone within a few days subvert their hard work? Not going to happen. It's obvious the new management over at zap2it (TMS) wants nothing to do with the community or it's free use of their data. They see no incentive to create a paid service for individuals when they obviously already fear piracy of their data. They wish to continue making money from businesses and businesses alone.
The real reason why TMS is killing Zap2It (Score:3, Interesting)
Basically, what they're saying is that they will be changing their database structure, and can't be bothered to re-code the Labs.Zap2It part. They'd rather spend the time on other things, like increasing ad revenue.
Bear in mind that TMS charges each station (Somewhere in the region of $75/day) to collect their data, and charges their commercial end-users (somewhere in the region of $500/market/month) to provide the data. Quite a lucrative trade, wouldn't you say? Add to that advertising revenue from their site, and subtract bandwidth fees, and they still make a tidy profit.
It has been theorised that the main reason labs.zap2it was set up was to save bandwidth costs after XMLTV scrapers started hitting their public site. As the data was served up in chunks in the midst of a sea of other information (Links, advertisements, commentary, navigation etc) it cost considerably more in bandwidth, so it made more fiscal sense to offer the data for free, so they could track who was using it and where, along with only having to serve the data itself, with compression and selectivity. This is only a theory, mind, but considering how the Myth community (On it's own, without any of the other projects that were/are using Zap2It feeds) has grown, I think removing this option will drastically increase Zap2It's bandwidth bills without adding any ad revenue back into the pot (A scraper doesn't see ads, and doesn't care about them).
In other words, this could be a costly mistake for TMS. Here's hoping they see sense, and work out a way to work with MythTV and others.
Re:Rather than a Million Screen Scrapers... (Score:3, Interesting)
True. However, in the case of tv listings which have been published online, they're clearly not trade secrets. A contract is possible, but AFAIK none of the sites that have the listings bother with them. Specht v. Netscape would be instructive in such a case.