CenturyLink Outage Led To a 3.5% Drop in Global Web Traffic (zdnet.com) 27
US internet service provider CenturyLink has suffered a major technical outage on Sunday after a misconfiguration in one of its data centers created havoc all over the internet. From a report Due to the technical nature of the outage -- involving both firewall and BGP routing -- the error spread outward from CenturyLink's network and also impacted other internet service providers, ending up causing connectivity problems for many more other companies. The list of tech giants who had services go down today because of the CenturyLink outage includes big names like Amazon, Twitter, Microsoft (Xbox Live), EA, Blizzard, Steam, Discord, Reddit, Hulu, Duo Security, Imperva, NameCheap, OpenDNS, and many more. Cloudflare, which was also severely impacted today, said CenturyLink's outward-propagating issue led to a 3.5% drop in global internet traffic, which would make this one of the biggest internet outages ever recorded.
And nothing of value was lost? (Score:3)
I didn't even get my laptop out of it's bag on Sunday, so I didn't even know.. Nothing of value was lost.
Re: (Score:2)
There was a lot down.
Not to mention if CenturyLink was your ISP, then everything was lost.
Re: (Score:2)
Re: (Score:1)
Re: (Score:3)
Meanwhile the call center for the COVID-19 / Nurse Advice Hotline in my state was completely packed on Sunday as most of the employees that normally work from home had to come on-site at the last minute.
Re: And nothing of value was lost? (Score:1)
Re: (Score:2)
There are still 35,000+ new cases every day, of course they're busy
Yes... But.. That's DOWN from it's peak in mid July of about 65K cases pre day, meaning that the number of ACTIVE cases is falling at this point too. Also, your number is wrong.. It's currently looking at the 7 day moving average about 41K and if you look at the daily reported number for Sunday (which will be understandably low) it's still 37K. (as of 8/30 from: https://covid.cdc.gov/covid-da... [cdc.gov] )
I'm not saying the concern is past, only that the situation is getting better as it is apparent that the rate
Re: (Score:2)
Many B2B cloud services were also down due to this. We had a debrief call earlier today and a many of our customers were affected all over the world. Thankfully it wasn't exactly peak time but some businesses do need to keep running on the weekends to.
I gues this is where on-prem would be safer but the customers are definitely capable of screwing that up all by themselves anyway :)
Re: (Score:2)
I didn't even notice anything weird that day as an Internet addict.
Are CDNs overhyped? (Score:1)
Re: (Score:3)
CF routed around it pretty quickly, but only continued to have errors for destinations whose only peer was CenturyLink.
The old days... (Score:3)
Gosh I miss the old days....
This isn't the internet I thought we would get. Most of the backhaul providers are incompetent. Consumer ISPs even worse.
We did a lot better when the main routing point for the east coast was in a closet in a parking garage. A lot better.
Re: (Score:2)
Now, we are looking at 10-60. Why? Because IPv4 routing has been bastardized, as well as scaling.
With IPv6, we return back to the simple days and simple routing.
This is really why we need to get off IPv4 and over to IPv6.
A lot of routing issues will go to the by-side with that. No more need for NAT, and I say thank god.
Re: (Score:2)
But, the bigger issue was that back then, the routing was very simple. Now, with IPv4, basically being classless, it means tonnes of lookup tables, etc. That slows the heck out of everything. IPv6's larger address does slow things, but, it allows for nicely dividing up the IP space, and makes routing fairly trivial.
Of course, we just have to get everybody to switch.
Just Like Slashdot... (Score:2)
A day and a half later, and only now were they down.
CenturyLink Fiber (Score:1)
CenturyLink: sad (Score:5, Interesting)
There are/were a lot of great people at CenturyLink (CTL). And they have/had a great network. (I used to work there)
But they made a series of expensive acquisitions (not including Level3) over the last several years that were not in their best interest. (ie the acquired company's products were far from CTL's core strengths and, I believe, were geared toward making investors think they could be everything to everybody) With the L3 acquisition, a few people walked away with a lot of money. (I believe one top exec cashed a check for something in the neighborhood of $300M) And, though publicly it was CTL that acquired L3, most of the CTL upper management is gone replaced by L3 management. Many CTL products have been EOL'd, replaced with L3 products. (I understand that some product line consolidation must occur, but it has been somewhat lopsided, considering who officially acquired who)
Today, they stand at $36B in debt (https://www.fool.com/investing/2019/03/19/where-will-centurylink-be-in-5-years.aspx) They have had a number of layoffs in the last couple years that they have managed to keep mostly out of the press. Lots of experienced people left, including a lot of engineers. Though I cannot confirm this, I assume that some of their cost-cutting may ultimately affect quality of service or at least response to outages.
I wish them well, but they're in a tough spot with a lot of competition (often cable companies) and a lot of debt.
Re: (Score:2)
Inconceivable! (Score:2)
"Amazon, Twitter, Microsoft (Xbox Live), EA, Blizzard, Steam, Discord, Reddit, Hulu,..."
I can't imagine how people have suffered. The atrocity, the hopelessness...
How to make routing changes (Score:1)
1. Announce changes in routing policy 30 days in advance if possible, 24 hrs if not.
2. Have a testing methodology in place to ensure routes advertised before the change are advertised after.
3. Have a rollback strategy so if the test is unsuccessful roll back to previous configure and try again tomorrow.
4. Communicate to customers change time, change purpose, and follow up with "done, all hands off" or "not done"
5. Don't schedule it in the middle of the day.
6. Have peer review of everything above and at leas
CenturyLink and Comcast are NIGHTMARES. (Score:2)
Not using juniper! (Score:2)
animating the outage (Score:1)
We observed the outage in Trinocular, an Internet-wide outage detection system---the animation is at https://ant.isi.edu/outage/ani/centurylink202008/index.html [isi.edu]. This is one of the largest U.S. outages in some time!