Deja For Sale 197
yet another coward writes: "According to a story in Internet Week, Deja.com is for sale. The company plans to sell the Usenet archive and the buying guide separately. This move might mean a comeback for the archive." So what's the going rate for 1,000,000 MMF posts?
VA Linux Needs to Buy Deja (Score:1)
So far VA hasn't screwed up Slashdot any worse, right? I think Deja's database would fit right in with VA's move to be a content provider.
Re:WTF is MMF? (Score:1)
Re:So you would rather... (Score:1)
Re:Spam database (Score:1)
Re:ebay (Score:1)
Re:Selling USENET Archive? (Score:2)
Collecting tapes with the old articles took work which should be worth something. I don't know, but they possibly even had to pay for them.
And it's not like it costs nothing for a news server to receive articles.
As I see it, Dejanews is like any usenet news provider, only with a much longer expire period. (Unless they'd do things like edit articles to make words in them become hyperlinks and the like.) So yes, of course they can sell the database just like any other provider could.
Re:I see good and bad (Score:1)
Re:Hopefully . . . (Score:1)
Yeah, like anyone would pay for something if a free, better thing wasn't available.
Uh, wait...
My .02,
I see good and bad (Score:1)
2 - a comback for dejanews.com which was a decent site without the portal junk.
Personally I'm voting for #2, because I have enough problem getting off the linux-ipsec list and getting off every spammers list with my muliple email address would surely drive me (more) insane...
Re:Open Content Usenet initiative (Score:1)
access to which is free for their
customers. Netcom, AT&T @HOME, come
to mind as examples.
I have nothing against WWW interface to USENET
as yet another access method. What I do not
like is being fed BS about expensive USENET
software. It is cost of storage and bandwith
which makes USENET expensive for ISPs. So,
you putting all your USENET articles into DB engine,
and serving them over HTTP will save you neither
disk space not bandwidth. (In fact it will take
more bandwith, due to HTML decorations and "sponsor messages").
AOL bids for Deja.Com (Score:1)
AOL would like to buy Deja.Com
Me too!
And me!
Add me to the list!
Me too!!!!
Me as well!!!
a mental picture... (Score:1)
One last defense of my gender on /. (Score:3)
There is a bigger problem, though. Go ahead and look at my previous comments. Nearly every one of them has one or five AC replies to the effect of "suck my dick" or "I want to fuck you in the ass". Throughout history, female authors have been denied recognition for their work, because it was commonly assumed that women were incapable of creating what they created. And throughout history, women have been spat upon, threatened, battered, and gangraped by the same men you'll find here on slashdot. For all I know, you yourself are one of those same ACs.
Ask yourself what you gain by contributing to this climate of fear and hate. Ask yourself that question when you scurry off for your nightly porn fix. Ask yourself that question when you insult and harass people on slashdot.
Re:The Cost of Maintaining an Archive (Score:1)
How do you search 10 terabytes of data in a few seconds? I dare you to do it with 50,000$ worth of equipment.
Even if you restrict to title and e-mail, say, 20 bytes/message, it's still 20 GB. The only way you can do this is to have many computers each searching a piece of the archive concurrently.
So, 200 computers each searching 100 MB of data? That still takes a few seconds. And that's a few seconds *per search*.
Yes. I'd love to buy a copy of the archives. Whoever buys deja may want to market DVDs full of usenet posts (they may run into copyright issues since all posts are copyright of the authors).
Building a fast search engine is always the problem.
Re:Not the same incentives (Score:1)
In general, there has been a move away from Usenet and towards other fertile discussion forums within the last four years© I expect this trend to continue well into the next five years© Today, Usenet is nothing like what it was ten years ago© It'll be even less so, tomorrow©
People keep predicting that usenet will die© People have been saying that usenet's been in decline for the last five years for at least as long as I've been on it ¥early 95©
Usenet won't die because it's easy to use ¥news readers are a very mature technology, it's informative and there's a real sense of community there© No matter what happens in terms of technological advances, you'll never get those communities shifting en-masse to somewhere else©
And ultimately that's what usenet is about, communities© I met my first girlfriend, my wife [wwwvelvetnet], a good portion of my close friends from one newsgroup© I learned Everything I Need To Know about programming from some uber-intelligent people on another©
I was off usenet for while recently, 6 months© I missed it terribly© I'd been checking usenet probably 355+ days a year© I missed the sense of community©
Until you kill that, until you remove all my friends internet connections, take away their newsreaders and burn the servers, news will survive© It's the most succesful form of online community, it always has been and it always will be© When IRC is ancient history, when ICQ, AIM etc no longer exist, I'll still be checking news©
- Aidan
Re:Older Archives? (Score:1)
Who else has usenet archives?!?!! Arg!!! I feel my complacency in letting them manage this has bit us all in the ass.
Re:One last defense of my gender on /. (Score:1)
Make it your new screensaver! (Score:1)
Buy specific branches (Score:5)
It strikes me that IBM in particular could use it as a show piece for their technologies: DB2 (scalability, speed etc), their storage farms, search engine frontend etc. Make it part of their developerWorks and keep it really fast to show off their stuff.
New URL (Score:4)
Re:So you would rather... (Score:1)
NOOOOOOOOO (Score:1)
My Offer (Score:4)
3 Cans of Spam
1 used Napkin (paper)
My Slashdot and ICQ accounts
The Head of Rob Malda
Re:One last defense of my gender on /. (Score:1)
DaPhreaker- Downloading usenet one piece or porn at a time
Re:Spam database (Score:1)
B1ood
Deja should respect privacy laws like toysmart did (Score:1)
Re:Spam database (Score:1)
Re:Profit vs Tragedy of the Commons (Score:1)
TANSTAAFL
LL
misanthropic? (Score:1)
Obviously, "misanthropic" would be the opposite of "nisgynistic" if we were prepared to abandon the stupid convention that "men" means "people" rather than "men" (believe me, I'm no happier than you with an abortion like "misandristic"), but since it seems difficult to get any force behind non-phallo-generic language, you need to distinguish between misanthropy and hatred of men.
"ebay up for auction" (Score:1)
Re:One last defense of my gender on /. (Score:1)
Look at the nicks used by pretty much every other poster here. The closest anyone else gets to revealing any information about their identity is to run their initials together with their surname, from which you cannot infer their sex. Your nick leaves no doubt what sex you are (or wish to portray yourself as).
I know someone who enjoys bloking, which is to say he assigns himself a female nick then logs on to IRC. He gets all the abuse, and finds it funny. There are a lot of people out there that do it, and it leads me to be skeptical about anyone who advertises their sex, as your nick does.
"On the Internet, nobody knows you are a dog" I believe the cartoon frame once said (probably Farside?). Your previous post was great, I really enjoyed it, I didn't look at your nick because it isn't and shouldn't be relevant. But to the trolls, its just an open invitation. If your nick was BlackGeek they would call you a nigger. If it was Gay&Proud they would call you a fag. If it was SchoolNerd they would torment you about your age. If it was AMWhateverYourSurnameIs they would only be able to abuse you for the contents of your posting.
What Next? eBay.com For Sale? (Score:2)
What Next? eBay.com For Sale? They could auction themselves off on... um... somewhere.
Copyright? D@mnright! (Score:2)
There are exemptions under copyright for the intended transmission, and reduced damages because I don't mark my posts (C), but they're still copyright. If DejaNews or whomever buys them starts charging, I want a cut!
That said, other posters have commented on how an archive should be funded, and I agree it's a thorny issue. Much as I dislike gov't involvement, this seems a natural for the Library of Congress. Or maybe you would like an ICANN spinoff?
Re:Older Archives? (Score:1)
Recently we moved the Deja.com servers to a new facility in order to provide greater reliability and performance. The move is now complete and we thank you for your patience.
Please note that currently our Usenet Discussion Service only retrieves messages from the past year (back through June 1999). As announced, we are reconfiguring the service that provides messages posted more than 1 year ago in order to provide greater reliability and performance. This will take some time though, possibly a few months. Have no fear: We're committed to bringing these messages back online as soon as possible.
... which doesn't say much really.
Regards, Ralph.
Re:One last defense of my gender on /. (Score:1)
If I'm not mistaking, you're saying that she should hide the fact that she's a woman, or else it's some kind of provocation...
Think about it for a minute, if some people call her name because "she advertises her gender" (as you wrote) and not because of the content of her posting, it's their guilt, not hers.
"On the Internet nobody knows you're a dog?" Not at all, on the Internet, everybody assumes you're white, male and lives in the US. Why should you hide the fact that you're not?
Search Deja with less spam (Score:1)
Re:Older Archives? (Score:1)
Alan Cox posted this link [linux.org.uk] to LKML a few months ago. It contains the early LKML posts, dating back to 1993. This prompted a post from tytso, who gave out this [mit.edu] link to even earlier posts.
Save DejaNews (Score:1)
As someone who has to constantly solve problems involving a wide assortment of hardware and software, I can't begin to estimate the value af being able to go to a single web site and find the answer to almost any problem within minutes.
It is the only web site I would consider paying a monthly fee to use.
On the other hand, their me-too product ratings have no value to me at all. No doubt it will sell for a lot more money and be around for ever.
The Pricelist (Score:5)
Error (Score:1)
To continue browsing the archives, please log back into the NYTimes.com website
(The Average Slashdotter's Nightmare)
-------
Our Fish Keep Dying! Try not to laugh at the results!
http://udel.edu/~jgephart/fishcounter.h
Hey.... (Score:4)
---
Re:What if MS buys it? Anti-windows articles remov (Score:1)
NOW WHERE DO I GO FOR KIDDIE PORN?
What about GPL'ed code sent on newsgroups? (Score:2)
To me, the simple fact that they ask that you pay to access copyrighted information (GPL or not) that they don't own, seems illegal.
Hopefully . . . (Score:3)
Deja is the buggiest major site I've ever come across. If you've tried to use deja.com to read anything other than the most recent day or two worth of traffic, you probably know what I mean. Follow a link to a specific post, and there's a good chance you'll be directed to a totally different post. This state of affairs has held for at least the last year, maybe longer.
Knowing that deja is up for sale, it now makes sense that they haven't put a lot of effort into fixing bugs. But whoever buys the usenet archive is going to have some serious work to do.
Re:Profit (Score:1)
I don't know just how hard a thing deja is to maintain. The code in itself seems like it hasn't undergone many changes in the last little while, including this #$@! bug that comes around every now and then asking it to search only for messages that contain the '*' character...
Re: (Score:1)
You don't do coding, do you? We need this! (Score:5)
Most of the books in the public library are crap, too, IMO, but I wouldn't once suggest that libraries are of limited use.
Almost every single coding problem I've come up against, or configuration problem, or hardware problem, or VCR-clock-setting problem, has been asked already. All I need to do is a Deja Power Search, some thoughtful keywords, and I have my answer, courtesy of someone the previous year.
Market value? Yeah, probably not. Usenet isn't there for market value; it's there to facilitate a huge meeting of the minds. And we need to preserve that information, so that those of us trying to write code and support the rest of you aren't forever asking the same questions.
How about "It blew goats!"? Deja should have stuck to what it did best -- archiving Usenet -- and left that "portal" crap to places that believe in such things.
Re:WTF is MMF? (Score:1)
Library of Congress (Score:3)
False Memory Syndrome? (Score:1)
FatPhil
Re:Article states that Usenet unit IS profitable (Score:2)
Then we were right all along, and deja.com is truly fuckedcompany.com material.
The reason their USENET archive was profitable (fuck, just think how many banner impressions get generated for a typical query - and I'm talking Deja Classic, not the "new" mode) was because it was useful, and people used it.
I'm actually very relieved to see that they'll be selling the USENET archive to someone who gives a damn about USENET. Deja sure as hell didn't.
And that the money-losing "product review" site will go to someone dumb enough to think that when I'm searching for "Frobozznitz 1996 specs", I want some FrobCo marketer's spiel about the latest and greatest, when the reality is merely that I found the circuit board for a Frobozznitz in a surplus store, the dates on the chips indicate it was made in 1996, and I wanna find out what it was!
I just hope that the buyer of the USENET archive gets the full source tree for their code, so they can go back and dump the ass-sucking "frames" look, the nonproportional text fonts, the goofy colors (ugly shit-beige on white!?!) the tracking URLs (www.deja.com/wewatch/whatlinksyouclick/thenweredi rectyouto/http://www.eatatjoes.com/oldfr obs), the spammish URLs (http://www.frobcoscompetitor.com) inserted into USENET posters' posts, and all the marketing shite they added to Deja's code over the past 3 years.
A USENET archive. Profitable. Kick ass.
Re:Profit (Score:1)
Paul Allen! [paulallen.com]
-jon
Re:Selling USENET Archive? (Score:1)
FatPhil
Re:One last defense of my gender on /. (Score:2)
Why do you do it all? I for one didn't take notice of what you *wrote* in your signature, before you defended yourself against that troll. I don't care wheter your male or female. I don't care wheter your short, tall, good-looking, ugly, black, white, male, female or *whatever* as long as you write intelligent things.
The only thing you should do when someone attacks you, is to *ignore the idiots*. Don't answer the obvious far too stupid trolls. They'll go away, eventually, hopefully.
Until then, ignore'em. They're not worth your time. Intelligent people don't care about your gender when they discuss with you.
--
Re:Profit (Score:1)
There is no such thing as a free lunch.
FatPhil
(A European who thinkgs that we Eurpoeans should pay our part to keep the archives up and running)
Copyright, GPL, and selling (out) (Score:2)
What I will discuss is the hypocrisy. Never mind that they're not actually selling your content, they're selling their business. (Technicality, but absolutely true, they're not really making money off the value of your content, which is what copyright is designed to protect, they're making money off its existence. Yeah, I ain't a lawyer, so I might be totally clueless there. It doesn't matter. I've got a real point here.)
How about the hypocrisy (boy, the tangents...)? Here we are arguing that Napster should be legal cause it's not violating copyright, it's "sharing," and then when a company, that has merely archived posts that we knew were going out into the public domain, we start screaming about our copyright. For shame!
Jeff
Being a woman ... (Score:1)
--
Re:So you would rather... (Score:1)
Re:New URL (Score:2)
Re:So you would rather... (Score:1)
Re:Copyright? D@mnright! (Score:1)
Re:Spam database (Score:1)
A curious point, for me, is the number of spam pieces sent to usernames that not only don't exist, but never existed. And not just easy guesses like "sales@...", but plausible-looking usernames. These addresses could not have been trawled. I haven't seen any of them repeat, so they're probably not on lists. But I still wonder what the utility of sending spam to an address guaranteed to bounce might be. Are they spamming the postmaster through the bounce log?
Roll your own archive... (Score:5)
I like my implementation much better the deja. The interface to deja was horrible, IMHO, and was one of the main reasons I decided to roll my own. I'd suggest archiving stuff you'll never think you will need. Back then, I didn't know that I would be running Sybase on Linux, but since I archived most of the comp.* groups, including comp.databases.sybase, I was able to use the information with relative ease.
oh dear (Score:2)
So, are they selling the entire archive? Or just submissions after May 15, 1999?
[0] okay, rhetorical.. I'll hack the fscking perl source just like I did when they started putting those fscking little arrows all over the place, but still... *sigh*
Re:Copyright, GPL, and selling (out) (Score:2)
If you'll recall when Slashdot wanted to try to publish a book involving our comments on Katz' Hellmouth series? Around fifty people, including myself, were extremely vocal in our protests against it. You'll generally not see these same people supporting Napster. I can chose to use the GPL on a throwaway program. The program is *not* under the GPL by default, until I say otherwise. Likewise with my writings and public domain. The post you are reading is copyrighted. However unlikely I am to enforce it, I certainly could - and probably would, were someone else using it in a way that I found objectionable, without permission. (ie, publishing it on a different website, in print, etc)
I'm not sure how long you've been reading this site, but the editorial opinion is not written into stone - perhaps the editors are fairly hypocritical, but the majority of the readership is not.
--
Re:Ha. (Score:2)
Example:
"Find great deals on KILLING YOUR PARENTS on deja.com!" Endless fun.
Re:One last defense of my gender on /. (Score:2)
A troll only wants one thing, to be fed. Replying to them or their ignorant remarks only feeds them and makes them grow. Browse at +1 or +2 and don't worry about it.
Out of curiousity (Score:2)
Granted, I'm not really interested in the older archives (which I'm sure they have on file).
Still, I use it every day (particularly when I'm having driver issues or want fan reviews of my favorite games.
Re:Usenet C-SPAN? (Score:2)
I'm an investigator. I followed a trail there.
Q.Tell me what the trail was.
I Dream of Google Usenet (Score:3)
Those old archives are the best part of deja... (Score:3)
I hope the archives get bought by someone who wants to make a usefully complete, freely-searchable USENET archive (my wet dream: Google buys the archives), but I fear that they'll just be snapped up by a company like Lexis-Nexis, who'll happily take the publically contributed works of thousands and resell access at kilobucks-a-year.
-Isaac
Not the same incentives (Score:4)
there'd be one fewer source of headaches for them. Usenet contributes to the perpetuation of spam, and it's the ISPs' majordomos that have to clean up when their users get led astray.
Many ISPs are trying their best to set up their own proprietary bulletin boards accessible through their own channels. Usenet is an unnecessary source of competition
Usenet is a big gaping sucking legal wound waiting to happen, what with all the copyright infringements, obscenity, and violations of the DMCA being tossed around. Any prudent ISP wary of tort suits should be wary of affiliating itself with such an anarchic beast.
Usenet still, above all, requires enormous resources to maintain. Especially binary groups.
In general, there has been a move away from Usenet and towards other fertile discussion forums within the last four years. I expect this trend to continue well into the next five years. Today, Usenet is nothing like what it was ten years ago. It'll be even less so, tomorrow.
Libraries? (was Re:Usenet C-SPAN?) (Score:2)
Google should buy it up. (Score:3)
Surely they can spare some of them 6,000 computers :).
Re:You don't do coding, do you? We need this! (Score:2)
No, what I'm saying is that I don't use usenet anymore and I wouldn't pay a nickel for Deja's service...neither would nayone else, evidently.
and they just transferred their email service, too (Score:2)
and it 'only' took them 3 weeks to restore the saved mail from the previous provider to the current one...
its to the point where, since its no longer spam-free, that the web-based email service is totally USELESS. sheesh - didn't they know that's what the draw was?
I loved the fact that dejanews.com (I prefer the old name to the shortened deja.com) archived usenet. and the fact that you could opt-out of it (x-no-archive: yes). who knows what level of service the new owner will exhibit. probably less than what is currently offered, I fear.
--
Lottery is actually an insurance policy... (Score:2)
Imagine you selected 6 lottery numbers, but didn't buy the ticket. Disaster strikes: your numbers come out. Too bad you're uninsured.
Re:Roll your own archive... (Score:2)
today, its cheap to build an fast system (athlon 1Ghz), add 10rpm discs and hardware raid (for speed/reliability), sprinkle with lotsa mem., and you have a local deja of your own.
'cache the info out towards the edges' is what I always say. power to the edges! praise be to he who caches reliably and frequently.
--
Re:Roll your own archive... (Score:2)
obviously a 10rpm drive wouldn't help much.
sed /rpm/Krpm/
--
Deja [vu] for sale... (Score:4)
No need to defend yourself (Score:2)
As a guy whose name (Jean-Michel) often causes USENET readers to mistakenly believe they are corresponding with a woman, I can relate in some small fashion. The behavior of some of the cretins on the internet (be it USENET, irc, or slashdot) is enough to make one ill.
Nevertheless, I would implore you to consider the source. These are Anonymous Cowards, the keyword being Coward. Were they confronted with an actual woman showing any interest in their sorry existence whatsoever, they would almost certainly soil their pants with fear before stuttering something inane and descending hopelessly into a seizure of insecurity and general social cluelessness.
You have no need to defend yourself or your gender. The offensive posts to which you refer speak for themselves and identify their posters as the penultimate losers of society, whose only chance at either a sexual or interpersonal relationship is limited to their right hand.
I know it probably doesn't make you feel any better to read this, being the target of such purile harassment, but it is nevertheless true: your value as a contibuter is in no way diminished or tarnished by these idiots.
And throughout history, women have been spat upon, threatened, battered, and gangraped by the same men you'll find here on slashdot.
I know you're angry, but this comment is very unfair to the vast, vast majority of men on slashdot.
Throughout history women have seduced, betrayed, and murdered men for cheap material gain, to grasp power, to avenge a wrong (real or imagined), or even out of simple spite and jealousy.
The sexes have a long history of using and abusing one another, just as they have an equally long history of nurturing and sustaining one another.
It would be as wrong for me to paint the women who post to slashdot as "gold digging cunts" (or some other equally offensive characterization) as it is for you to paint the men here as would-be rapists, batterers, etc.
Put simply, some human beings are scum, irrespective of sex. Most are not. And while I don't blame you for being angry, please try to resist the very natural, human tendency to overgeneralize about an entire population of people based upon the behavior of a few mysogenist losers who will almost certainly remain sexually frustrated for the duration of their small, pitiful lives.
In defense of pr0n (Score:2)
And yes, we're both quite addicted to sex.
For all we know, it may have been your friend's wife's intolerance of porn that ruined their relationship, not the husbands "addiciton." Even if it wasn't, the notion that a few losers can't control themselves or are so narcissistic that they can only get off to pornography does not even remotely imply that sych would be true for the rest of us, who enjoy healthy lives and relationships while enjoying a little hardcore from time to time.
Selling USENET Archive? (Score:2)
--
Chief Frog Inspector
Re:Buy specific branches (Score:2)
After all, that was the whole point of Altavista, back when it was still altavista.digital.com -- it was intended to show off DEC's hardware.
At least Altavista is still useful for its original function. Sure, they've crowded the window with crappy flashing ads, and put all the keyword crap in (search for "flying buttmonkeys", and it'll give you a link to "Comparison shop for flying buttmonkeys"!), but it still works as a web search site.
Re:You don't do coding, do you? We need this! (Score:2)
I wasn't saying usenet had no worth, I was saying that Deja's archives had little monetary worth. I'm glad you have found tidbits in there from time to time, but you aren't going to pony up $100 million for it either, are you??
Older Archives? (Score:4)
--
The Cost of Maintaining an Archive (Score:2)
Get it all into a fault-tolerant RAID system with hot swap and quadruple the cost. You still have only $2000/month debt service.
That's less than a fast food worker makes in Silicon Valley these days.
If your objection to these figures are that the traffic is so high that the costs are dominated by bandwidth, then the problem is you've got too much business -- a terrible problem with which you must learn to cope.
As the Buddha says: Life is suffering.
Open Content Usenet initiative (Score:4)
Creating a open network of Deja.com like servers is my dream and I already have stable running code for it...
At SourceForge.net the project UsenetWeb [sourceforge.net] is located that is the Open Source implementation of Deja.com
Currently the software is stand-alone, but is could be expanded to form a network of OpenContent deja.com like servers. With the sharing of news groups across several servers, it could become a volenteers only job... The software only supports text-only newsgroups for now.
Are there people here that would like to run this software and build a Open Usenet Network? ? ?
See a demonstration of Open Source deja.com at Usenet4free.com [usenet4free.com]
Johan.
Profit (Score:4)
I came to the conclusion that I would spend nothing at all. Why? Because I feel that ideally, these archives should be free to all and any attempt to charge for access would be somehow wrong. These are ideas in their purest form, when they were just first beginning to be transferred into digital format en masse. This stuff belongs in a museum, not a pay site.
On the other hand.. maintaining such a behemoth for no profit would suck, and would take someone far more idealistic than me.
In conclusion, I don't want Deja, and anyone who does want it will either be A) A zealot we admire but secretly resent; or B) A big businessman with a stupid business plan and no soul.
Feh (Score:2)
If I remember correctly they've flirted with doing the latter already- in that they are putting hotlinks in stuff as if I, in writing 'connect w box to x box and then to y and z boxes', had added a link like 'connect w box to x box [null.com] and then to y and z boxes'.
Re:The Cost of Maintaining an Archive (Score:2)
Grasshopper, when I made my little comment about "bandwidth" and the Buddha, I did neglect to mention that bandwidth includes everything that scales with usage: internal CPU to memory bandwidth and disk bandwidth as well as wide area network bandwidth. The point is that the incremental costs of maintaining the pre-1999 archive for online access are miniscule compared to the rest of Deja's other expenses, yet Deja has used the costs of providing the earlier archives as the reason they are offline. The CEO even admitted they get hardly any incremental increase in bandwidth utilization (see definition above) from the older archives. So, since the incremental costs of simply having them online is so low, how can the CEO use this reason? Where are the real incremental costs of keeping the older archives coming from?
Oh, also, I did quadruple the $50,000 to $200,000 for interfaces, etc.
User Information sold (Score:2)
Now that deja will be owned by someone else and that includes its 'customer' databse, what happens to my personal information and yours?
Anyone feel like playing a lawyer on
My perfect Deja... (Score:2)
It'd be a damn sight easier to use than what Deja has now.
--
Re:I Dream of Google Usenet (Score:2)
Re:Spam database (Score:2)
And I suppose being able to do that makes you a real programmer? Wow and all this time here I thought I was just an average college level programmer who slapped together a few numerical computations and simulations for computational physics courses. Now, I see, 3ye 4m 4n 31337 h4x0r... excuse me while i r3w7 j00r b0x0rz!
heh
Re:Spam database (Score:2)
Consult (deja.com :-) in news.admin.net-abuse.email for "dictionary attack".
Both chickenboner ("guy in a trailer park) and mainsleaze (i.e. "legitimate" - at least, companies that *pretend* to be legit) marketroids are spamming any plausible username at any SMTP server they can get their hands on.
The goal is, as your logs show, to cram spam down the throats of users who've never even used email. After all, the spammer just ignores the bounce, and your box has to consume bandwidth dealing with it. It's no skin off the spammer's nose if your box dies from the load.
If you're seeing this, report it as a DOS attack. Because frankly, that's what it is.
Usenet C-SPAN? (Score:5)
Just today I saw an online article that over half the households in the US are online in some capacity. According to the Census Bureau, that means around 50 million households are online. A buck a month per customer routed through ISPs and you're looking at six hundred million dollars a year -- enough to cover an archive without even asking the rest of the world to kick in. We could pay for it ourselves as a token gesture of reconciliation for "Americanizing" the rest of the net through brute force.
You run into the issue of censorship almost before the proposal hits paper. For every newsgroup there will almost certainly be someone or many someones who wants the content sifted or outright not in the archive. Beating these people into submission so that they will be silent forever will be difficult.
Just a notion, make of it what you will. I'm sure there's a vast array of technical issues that would have to be worked out up front, but I'm absolutely convinced that this could work. Further, I think this is the only way a Usenet archive _can_ work (barring some well-funded philanthropic gesture from a dead billionaire).
Comments?
The net does not want its own history (Score:2)
Or something like that.
I highly doubt that anyone will want to pay for an archive of usenet postings. Frankly, they are of limited use - most post threads offer very little useful information.
Deja's archives may be of interest to an educational institution looking at the historical value of the posts, but the useful market value of the posts is zilch.
As for Deja as a product review site - what can you say? It lost the race.
Epinions, Yahoo, and Amazon's product reviews are far out ahead, and Deja never really made a meaningful transition from being a usenet archive.
The bigger question is whether NNTP is kaput altogether at this stage.
Re:The Cost of Maintaining an Archive (Score:2)
You ignored my comment about bandwidth which includes everything that scales with usage -- not storage. Furthermore, you say "no way this will cost 50k" when I quadrupled "the simple cost of the drives" to $200,000 to get the drives into a interface with some redundancy for hot-swap.
As you, yourself, admit, inclusion of organizational costs is the cost of a from-scratch startup, not "the cost of maintaining an archive".
When you want to go high availability (presumably when you want to go high performance), you may as well just duplicate the system in geographically remote locations -- this gets back to the scalability/bandwidth point I already made. The "transaction journal" for such a system is simply the Usenet feed. You don't need Oracle or even want it since it doesn't do the right kind of indexing anyway. The only backup you need is another server added to provide bandwidth. Recovery takes time, of course, even with a T3 retransmitting the archive from the redundant system(s), but it isn't too bad. At only 1Mbyte/sec you can recover a fully redundant nuked site in 4 months. You don't need or want a system like Veritas for high availability. When one system goes down, the other(s) keep(s) serving and feeds the log to the one being brought back online. This isn't up-front capital.
At the limit, this gets to the real solution to the Usenet archiving problem:
Peer-to-peer redundant archiving.
The main problem to be solved with peer-to-peer redundant archiving is query optimization, decomposition and routing within a distributed redundant index. Know of any good work in this area?
Re-De-centralized Usenet (Score:2)
Distributed full-text searching, possibly with some sort of centralized assistance, but truly distributed access would make for a pretty mighty technology demo. I wonder if they're up to the task.