Amazon Plan Would Allow Text Search Of Books 193
emmastory writes "The New York Times is running a story (free registration required) about a new development at Amazon - they plan to assemble "a searchable online archive with the texts of tens of thousands of books of nonfiction." Users would only be able to read a certain portion of the text from any one book, but it sounds promising nonetheless. The Times article suggests that this is part of a larger strategy to compete with Google and Yahoo by making Amazon an authoritative source of information on everything book-related."
Re:Brilliant idea (Score:5, Insightful)
speaking of searching with Amazon (Score:5, Insightful)
This development may bite them back - when I look for something on Amazon now, I often find in their ads that other people have the item cheaper. Amazon may get a nickel or quarter for the referral, but they lose the dollars from the markup.
Re:Brilliant idea (Score:5, Insightful)
It would be very valuable to be able to open a chapter of the book and give a read over it, you know, like in a real fucking bookstore.
The problem being that stores [brick and mortar] like Chapters.ca stock only self-help dime-a-dozen whim-of-the-minute books. In fact when the local chapters first open you could walk in and buy TAOCP [I did
For the most part people have to blindly trust some review from "BigGuy4477" about the value of a 89$ textbook...
Tom
Good Data (Score:2, Insightful)
legal? (Score:4, Insightful)
Re:Patent this (Score:3, Insightful)
Re:Too bad ... (Score:5, Insightful)
This type of text searching has been around for a gazillion years and is not really that complex. It really depends on how flexible they want to make the searching. Case in point, wildcards. Google sacrifices flexibility by not allowing you to search on wildcards in their news searches in order to gain speed. Ditto for things like phrase searching, etc. The actual # of docs is pretty much irrelevant wrt search speed (at least directly). It depends more on the features you allow in your query language and the # of hits returned by each part of your query. Plus you're dealing with static data that can easily be distributed.
The tough part of all this is getting the stuff in digital format. I assume for most current books it won't be a problem. The hassle would be older books that you'd actually have to OCR. Though once they're done, they would have a pretty valuable asset.
Re:legal? (Score:4, Insightful)
Re:Patent this (Score:4, Insightful)
That's funny. Oh... you're not trying to be funny.
Have you missed the dozens of articles about people recently patenting things that've been around for 30+ years, then suing small businesses for cash?
The USPTO seems to grant a surprising amount of patents on things that "can't be patented".
Re:Too bad ... (Score:5, Insightful)
You are aware that Google's a proprietary solution, right?
Just because Slashdot loves Google doesn't mean it's all of a sudden non-proprietary!
==free online books? (Score:3, Insightful)
Research Humanity vs. P2P (Score:5, Insightful)
Besides the obvious copywrite problems, if the gov't was to get involved and Amazon (or whoever) was allowed to permit searching an entire book for concepts / keywords but not be able to view the entire book without paying for it this would both increase sales and usefulness.
If this was the origional model for online music, think of all the problems that would have been avoided. Perhaps a second look at this type of archiving will help the movie industry as bandwidth increases.
Re:O'Reilly on steroid? (Score:4, Insightful)
Safari is access to the whole content of the book on-line, as well as searching for text within that content as well as any other books they have available on-line. IOW, Safari is actually a superset of the Amazon thing, since you can pay to read the whole book, not just search through it for snippets and passages.
I love Safari as well - saves shelf space, trees and frustration (because of the search function). I wouldn't want to read a novel on-line, since a paper book is a better interface for that, but for reference material about programming/networking/Operating Systems etc., Safari works well, since you're in front of a machine anyway. And IIRC, errata in the books is applied directly to the text on-line, and you get the latest edition without having to get another book, just updated content.
The only time having all of your reference material on-line would be a problem is if you need ref. material to get your Cisco router that connects you to the Internet back on-line.
Soko
Re:Brilliant idea (Score:1, Insightful)
Re:OCR Be Damned! (Score:4, Insightful)
This would be very easy for publishers to accomodate, and they would do so more willingly if the book was old (e.g. Origin Of Species, etc).
RealLife? (Score:5, Insightful)
How authors will react is another question.
Isn't this what happens in the RealWorld? You walk into a bookstore, open it up, read a few pages and make a decision on whether or not you want to buy it?
I think publishers and authors would be rather short-sighted to not allow potential customers shop online the same way they shop in brick and mortar stores.
Re:Brilliant idea (Score:5, Insightful)
Besides, in college you usually don't have a choice about which textbook to use for the class. I guess you could always purchase supplemental books, but those are usually out of the price range/interest level/time scope of many college students.
Great idea. (Score:4, Insightful)
Just imagine if Amazon did some deal with the Library of Congress that allowed them to scan in nearly every book published in the United States. Once the information is digitally stored, it could be utilized in other ways as well:
Re:this could be huge... (Score:3, Insightful)
One example from current events: Bush said in his State of the Union address, "The British government has learned that Saddam Hussein recently sought significant quantities of uranium from Africa"
However, several news organizations excluded the first six words of that sentence, and then called the President a liar. The President's intelligence or honesty aside, intentionally excluding these words dramatically distorts the meaning of the phrase, to the detriment of those using the filter.
Re:Brilliant idea (Score:2, Insightful)
If you don't like how an online business does things, don't use the online business.
If you don't realize the difference between a brick and mortar store providing physical access the the product and an online store providing a digital copy of the product, you need to get your head examined.
Basically they would be giving the book away. My guess is that the publisher has a problem with that.
Original point, if you don't like the rules, don't play the game.