

Coolrunning Outages
#1
Posted 25 July 2009 - 08:34 PM
we did an upgrade to our database system approx 4.30am this morning (sydney time).
some people reported problems this morning but it was fine for me this morning.
this afternoon we had some database issue - It seemed to affect mysql across the whole of coolrunning as our hompage, wiki, calendar, forums were all bung.
then our isp had a power failure approx 4pm - this is very very rare.
this caused cr to go down the tubes completely and obviously everything else on our server (all other domain names), but it came back pretty quick - total outage was maybe 15mins. this is approx the first time in 2yrs I reckon.
when it did come back everything else was sorted except the cr forums. there was one table that had crashed. however I had the kids and dog to the vet. when I got back it was still buggered (I had left emails for people that might have been able to help).
I then googled and tried and repaired the cactus database. I was impressed with this. All back approx 30mins ago.
Support our Australian advertisers:
#2
Posted 25 July 2009 - 08:42 PM
JoggerKev, on Jul 25 2009, 05:34 AM, said:
we did an upgrade to our database system approx 4.30am this morning (sydney time).
some people reported problems this morning but it was fine for me this morning.
this afternoon we had some database issue - It seemed to affect mysql across the whole of coolrunning as our hompage, wiki, calendar, forums were all bung.
then our isp had a power failure approx 4pm - this is very very rare.
this caused cr to go down the tubes completely and obviously everything else on our server (all other domain names), but it came back pretty quick - total outage was maybe 15mins. this is approx the first time in 2yrs I reckon.
when it did come back everything else was sorted except the cr forums. there was one table that had crashed. however I had the kids and dog to the vet. when I got back it was still buggered (I had left emails for people that might have been able to help).
I then googled and tried and repaired the cactus database. I was impressed with this. All back approx 30mins ago.
Hope the vet had as much luck fixing the kids (and dog),
Funrunner aka Craig
#3
Posted 25 July 2009 - 08:49 PM
Seriously though, I know the dramas with being a volunteer administrator of a forum, so I can completely understand the forum being down sometimes.
#4
Posted 25 July 2009 - 10:10 PM

#5
Posted 26 July 2009 - 07:36 AM
IE8 couldn't resolve the address.
Being a geek, did an nslookup and it timed out. Tried it from web based nslookup and got the IP for the domain, hit this and got the CR Control Pane. Presume needed header to get to real site, so added hosts file entry and can now post this.
So seems there is a problem with not all DNS servers being able to resolve. I'm currently using Telstra NextG.
Or maybe it's just Windows7, but I can get to everything else ...
#6
Posted 26 July 2009 - 08:49 AM

Thankfully this is not the case. Good on you Kev!

edit. spelling
Edited by Trick, 26 July 2009 - 08:50 AM.
#7
Posted 26 July 2009 - 10:08 AM
#8
Posted 26 July 2009 - 10:47 AM
Whippet Man, on Jul 25 2009, 10:10 PM, said:

Funny you should say that because that was exactly where I was on the forum at the time CR crashed! I kept trying to get back in to finish reading thread and just couldn't. So happy when all was well again

No problems for me so far this morning

#9
Posted 26 July 2009 - 11:14 AM
Trick, on Jul 25 2009, 05:49 PM, said:

Thankfully this is not the case. Good on you Kev!

edit. spelling
Is FECK the correct spelling???????
All these years I've had it wrong!!
#10
Posted 26 July 2009 - 11:39 AM
#11
Posted 26 July 2009 - 12:49 PM
I accept your apology!
The outages made it that little bit harder to get through my night-shift last night!
#12
Posted 26 July 2009 - 07:08 PM

#13
Posted 28 July 2009 - 06:45 AM
From now on we will list any outages here:
http://docs.google.c...u...24&hl=en_GB
so tech geeks can read the whole sorry saga.
Unfortunately we may have lost much of yesterday's posts.
I haven't had time to check yet.
We may be able to recover some.
Kevin
#14
Posted 28 July 2009 - 07:44 AM
they are looking to see if we can recover but its looks doubtful.
Kevin
#15
Posted 28 July 2009 - 08:00 AM

#16
Posted 28 July 2009 - 09:05 AM
CoolRunning Admin, on Jul 28 2009, 06:45 AM, said:
From now on we will list any outages here:
http://docs.google.c...u...24&hl=en_GB
so tech geeks can read the whole sorry saga.

#17
Posted 28 July 2009 - 09:12 AM

With such a great event on the calendar to rival 6foot the system was taken down to reflect all the feedback that may topple the 6foot as one of the most iconic events on the calendar

- I still havent got it - any chance it may surface in time for the summer

just incase the smileys dont serve the intended purpose, best I add - just kidding, oh except for my singlet, I still want it
#18
Posted 28 July 2009 - 09:47 AM
Quote
Sounds like a crock to me. Saturday outage probably screwed up the file system. Generally UNIX systems don't reboot just because the filesystem requires an fsck. Certain things might stop working if the drive dies but not normally a reboot.
#19
Posted 28 July 2009 - 12:40 PM
FakePlasticTrees, on Jul 27 2009, 05:47 PM, said:
Sounds like a crock to me. Saturday outage probably screwed up the file system. Generally UNIX systems don't reboot just because the filesystem requires an fsck. Certain things might stop working if the drive dies but not normally a reboot.
Funrunner aka Cyber-Illiterate
#22
Posted 28 July 2009 - 02:30 PM
#23
Posted 21 August 2009 - 10:17 AM
We had another disk fail, and have lost 44hrs since 2pm Weds when the last backup was successful. The site was wonky ever since. Not only have we got a new disk but we have 2 setup in a raid-array so sort of backed-up/mirrored.
All the gory details here
Please bookmark that link as we will use it for any updates for future outages!
Kevin
#24
Posted 21 August 2009 - 10:20 AM

last post seems to be 1:58pm 19th August
Edited by Colin, 21 August 2009 - 10:23 AM.
#25
Posted 21 August 2009 - 10:22 AM
Good to have CR back though!
#26
Posted 21 August 2009 - 10:25 AM
Colin, on Aug 21 2009, 10:20 AM, said:

last post seems to be 1:58pm 19th August
Yes. That's what Kevin said:
CoolRunning Admin, on Aug 21 2009, 10:17 AM, said:
#27
Posted 21 August 2009 - 10:26 AM

#28
Posted 21 August 2009 - 10:31 AM
Great to have it back!
Rachael
#29
Posted 21 August 2009 - 11:01 AM
Good to see all that technical babble by those IT geeks has been removed from this thread. Those guys seriously need to get a life.
#30
Posted 21 August 2009 - 11:01 AM
CoolRunning Admin, on Aug 21 2009, 10:17 AM, said:
Hey Kevin!!! Thanks a million mate! I haven't said it before (and I should have) but you guys put such a big effort into the site and at no cost to the user! I work in IT myself and can understand how frustrating it's been for you!!!!
When you provide such an fantastic free and valuable website like this, any outage is NOT an inconvenience simply an 'outage'. You don't appreciated what you've got until it's not there! :-)
Your time, effort and dedication is very very much appreciated!
THANK YOU ! THANK YOU!
Cheers.
benjamin
ps. Where can I donate towards the site???
IronMan001, on Aug 21 2009, 10:19 AM, said:
#31
Posted 21 August 2009 - 11:05 AM
#32
Posted 21 August 2009 - 11:07 AM
#33
Posted 21 August 2009 - 11:08 AM
balri, on Aug 21 2009, 10:25 AM, said:
Thanks...thought he referred to CR being 'off' for 44hrs ...not losing the posts ...which actually were there as late as yesterday.
...or perhaps they just tried to kill some threads off...now there's an idea.


Edited by Colin, 21 August 2009 - 11:11 AM.
#34
Posted 21 August 2009 - 11:11 AM
CR withdrawals now subsiding.

#35
Posted 21 August 2009 - 11:11 AM
#36
Posted 21 August 2009 - 11:21 AM
http://docs.google.c...u...24&hl=en_GB for details
edit: we are all working day (real) jobs at the moment. I have been posting updates on twitter as well - @benscarf
#37
Posted 21 August 2009 - 11:29 AM
I want to especially thank Ellie80 who so graciously answered my email late last night explaining what was happening & who then followed it up later with an update email. I don't know of too many other people who would have done the same thing. Thank you Ellie80 & all who got Cool Running back to us today. Wonderfully generous people...proud to be a fellow member! LL
#38
Posted 21 August 2009 - 11:34 AM
Vurt, on Aug 21 2009, 11:21 AM, said:
http://docs.google.c...u...24&hl=en_GB for details
edit: we are all woring day (real) jobs at the moment. I have been posting updates on twitter as well - @benscarf
Thank you to all in getting the site back. My work however doesn't thank you as my productivity has just dropped off now that CR is back up again.
BobbyS
#39
Posted 21 August 2009 - 11:36 AM
It's been tough, but I think I have my twitch under control now.
#40
Posted 21 August 2009 - 11:38 AM
IronMan001, on Aug 21 2009, 10:19 AM, said:
You gotta be kidding IK ?
How about " Thanks to the team at CR for getting things back "
Ungratefull ....................................
b
#41
Posted 21 August 2009 - 11:50 AM
CL
#42
Posted 21 August 2009 - 11:52 AM

#43
Posted 21 August 2009 - 11:59 AM
IronMan001, on Aug 21 2009, 10:19 AM, said:
Pull your head in IM. I'm sure these guys have been working their ar*** off to get it fixed - just remember, they do it for nothing, so they don't need that sort of attitude.
On a more positive note - well done guys on getting things up & running again. The majority of us ( as seen by many other posts ) really appreciate your efforts.

#44
Posted 21 August 2009 - 12:11 PM
Well done guys and thank you!!

Hopefully you can all relax now and get back to having a real life.

#45
Posted 21 August 2009 - 12:15 PM

LuckyLegs I am glad I could give you an update - it was about all I could do as I am utterly useless in the technical department.
There is definitely something about wanting things you can't have isn't there?
#46
Posted 21 August 2009 - 12:17 PM
Didge, on Aug 21 2009, 11:59 AM, said:
For god's sake he is a 15 year old kid who probably doesn't have the same command of the English language that the rest of us do.
He doesn't need you giving him grief about his "poor choice" of words or sentence building.
Maybe if we helped and supported him it would be far more productive than continually taking a giant dump on him from a great height.
*Rant over*
Back on topic - well done Kev and Vurt getting everything back in working order/
edit for spelling
Edited by blkbox, 21 August 2009 - 12:17 PM.
#47
Posted 21 August 2009 - 12:27 PM
bennie, on Aug 20 2009, 08:01 PM, said:
Outage, surely you mean Outrage!!

Just kidding thanks to Kev and the whole team, for getting everything back up and running(pardon the pun).
I was beginning to miss everyone.
#48
Posted 21 August 2009 - 12:33 PM
HillsAths1, on Aug 21 2009, 12:27 PM, said:

hehehehehe.
I went to bed last night around 11pm and did one last check on the computer to see if CR was back! Nothing like a 'Severity #1' that keeps you on your toes....
HillsAths1, on Aug 21 2009, 12:27 PM, said:
*big hug to everyone!* :-)
now: let's get out there running!!!
#49
Posted 21 August 2009 - 12:34 PM

#50
Posted 21 August 2009 - 12:46 PM
blkbox, on Aug 21 2009, 12:17 PM, said:
He doesn't need you giving him grief about his "poor choice" of words or sentence building.
Maybe if we helped and supported him it would be far more productive than continually taking a giant dump on him from a great height.
So why do you single me out from others who have also said similar?
P.S. I don't need to comment about the amount of help & support IM receives on here.......so that point is irrelevent.
*My rant over*
Back to topic.....
Edited by Didge, 21 August 2009 - 12:49 PM.