Recurly's Backup Mess Takes Days to Clean Up 21

Posted by samzenpus on Monday September 10, 2012 @02:58PM from the best-practices dept.

A cascading hardware outage struck subscription payment provider Recurly last week, and that started a long example in how not to manage critical infrastructure. From the article: "Last Monday, the payment provider suffered an intermittent hardware failure, which prevented the company from processing either payments or refunds. The company says it serves over 1,000 customers, including Adobe, BrightCove, and Fox News Radio, processing recurring payments for subscriptions. By Friday, the company still hadn’t completely straightened out the mess, providing updates to customers using payment gateways such as Authorize.net and LinkPoint/First Data."

Recurly's Backup Mess Takes Days to Clean Up

This discussion has been archived. No new comments can be posted.

Load All Comments

Search 21 Comments Log In/Create an Account

Comments Filter:

Reminds me of Authorize.net (Score:4, Funny)

by Mr. Kinky ( 2726685 ) writes: on Monday September 10, 2012 @02:58PM (#41291297)

This case reminds me of our payment processor Authorize.net in 2009, when a fire took down the whole network and infrastructure for many days. It was only solved when one of the guys over at Authorize.net literally

- Re:Reminds me of Authorize.net (Score:5, Funny)
  
  by MetalliQaZ ( 539913 ) writes: on Monday September 10, 2012 @03:00PM (#41291331)
  
  He would have finished the story but he had a cascading hardware failure that took out his network...
  
  - Re: (Score:1)
    
    by maxwell demon ( 590494 ) writes:
    
    Yeah, that never could happen with me, because I
- Re: (Score:1)
  
  by carlos92 ( 682924 ) writes:
  
  Literally what? The suspense is killing me!
- Re: (Score:1)
  
  by tstrunk ( 2562139 ) writes:
  
  I know that technician! His name was Candlejack, right?
  When he came to
- - Re: (Score:2)
    
    by arglebargle_xiv ( 2212710 ) writes:
    
    Pretty common practice to half-ass everything, they don't care about supporting the customers just getting their percentage off your transactions..
    A friend of mine runs a networking services company who got called into a medium-sized payment processor a few months back to upgrade a server, about an afternoon's work. After several months of 10-12 hour days he's now got them up to the level where they're about quarter-arsed. With another few months' work they'll be at the level of half-arsed. When he described the original setup he found I thought he was making it up, it was just fail layered upon fail layered upon fail, like something a bunch of dru
    - Re: (Score:2)
      
      by cusco ( 717999 ) writes:
      
      Makes me glad that I pay cash for everything possible.
I would've been leery of... (Score:5, Funny)

by Anonymous Coward writes: on Monday September 10, 2012 @03:23PM (#41291625)

...a service provider named Recurly in the first place.
Same goes for any provider named Relarry, Remoe or Reshemp either for that matter.

- Re: (Score:3)
  
  by Alien Being ( 18488 ) writes:
  
  I'm Honest Moe, that's Honest Shemp, and that's... that's Larry.
No backups (Score:3, Interesting)

by Anonymous Coward writes: on Monday September 10, 2012 @03:51PM (#41292099)

This is a perfect example of redundancy not being the same as backups. They had redundant encryption devices, but the failure of one rolled over into the other. They had no backups (that's right, none at all) that they could restore from. From what they've told us, they intend to resolve this issue by adding more redundancy.
Yes, really.

- Re: (Score:3, Funny)
  
  by Anonymous Coward writes:
  
  They should have used RAID.
- Re:No backups (Score:4, Informative)
  
  by tlhIngan ( 30335 ) writes: <slashdot@NOspaM.worf.net> on Monday September 10, 2012 @05:18PM (#41293193)
  
  This is a perfect example of redundancy not being the same as backups. They had redundant encryption devices, but the failure of one rolled over into the other. They had no backups (that's right, none at all) that they could restore from. From what they've told us, they intend to resolve this issue by adding more redundancy.
  Correction, they have no backups of the keys that the encryption accellerators used. End result is now they have a bunch of encrypted data, with little in the way of being able to recover it because the keys used are lost or corrupted.
  Sounds like they need to be hacked and their information "liberated" so they can recover it :).

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

Recurly's Backup Mess Takes Days to Clean Up 21

Recurly's Backup Mess Takes Days to Clean Up More Login

Recurly's Backup Mess Takes Days to Clean Up

Reminds me of Authorize.net (Score:4, Funny)

Re:Reminds me of Authorize.net (Score:5, Funny)

Re: (Score:1)

Re: (Score:1)

Re: (Score:1)

Re: (Score:2)

Re: (Score:2)

I would've been leery of... (Score:5, Funny)

Re: (Score:3)

No backups (Score:3, Interesting)

Re: (Score:3, Funny)

Re:No backups (Score:4, Informative)

Related Links Top of the: day, week, month.

Slashdot Top Deals

Slashdot