I'm getting drive errors on one of the SSD drives in the database server. Going to remotely unmount it and fsck it and see if that fixes it. :(
Not been having much luck with hardware recently.
Think I need a miracle :( It's not responding so it's going to be down for a while. I do have one spare SSD which I'll plug in over the weekend when I get home and see what can be done.
Does the server use RAID so there is a backup copy?
RAID slows it down too much and it's not that great with SSDs anyway.
I do my own custom replication instead. Backups are on separate drives. This is the web-facing database anyway, which is read-only, so it's 'mostly' duplicated from the main one. Means it gets LOTS of writes though and I guess it's finally taken its toll.
Have you looked into PCIe drives at all? They're faster and have more capacity.
A write-caching program can cut down on writes by an order of magnitude or more. I have used both FancyCache (Romex Software) and PerfectCache 5.0 (Raxco), and they both give very good results for limiting the writes to my SSD when I run the CEP2 project on World Community Grid. Writes will typically go from 80 GB/day down to less than 1 GB/day, but that depends a lot on the statistics of what you are doing of course, and the size of the DRAM cache you set.
FancyCache is currently in Beta (free 90 day trial), and PerfectCache 5.0 is $80 (30 day free trial), but well worth it if you have the right situation.
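To put the 80 GB/day versus 1 GB/day figures in perspective, here is a rough back-of-envelope sketch in Python. The 70 TB endurance budget is a made-up assumption for illustration only, not a rating for any drive in this thread, and real wear also depends on write amplification and the controller.

```python
def days_until_worn(endurance_tb: float, writes_gb_per_day: float) -> float:
    """Days until a hypothetical write-endurance budget is used up."""
    # 1 TB is treated as 1000 GB here.
    return endurance_tb * 1000 / writes_gb_per_day


# Hypothetical endurance budget, chosen only to make the arithmetic concrete.
ASSUMED_ENDURANCE_TB = 70

uncached = days_until_worn(ASSUMED_ENDURANCE_TB, 80)  # 80 GB/day without caching
cached = days_until_worn(ASSUMED_ENDURANCE_TB, 1)     # about 1 GB/day with caching
print(f"uncached: {uncached / 365:.1f} years, cached: {cached / 365:.1f} years")
```

Whatever the real budget is, the ratio is the point: cutting writes by 80x stretches the same budget 80x further.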
I only got home a few hours ago after travelling overnight, afraid to look at it right now when I'm still so tired. I *think* statstool is gone unless I can recover the drive though, as well as a few other things, possibly historical milestones too. It looks like the backups were failing for the last 10 days or so and I didn't get warned. I only keep the last 5 days. :(
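Backups that fail silently are the sort of thing a small freshness check can catch. The following is only a sketch, not the script actually used on this server: the function names and the 26-hour threshold are assumptions, and it simply looks at file modification times in a backup directory.

```python
import os
import time


def newest_age_hours(mtimes, now):
    """Age in hours of the most recent backup, or None if there are none."""
    if not mtimes:
        return None
    return (now - max(mtimes)) / 3600


def backups_look_stale(mtimes, now, max_age_hours=26):
    """True when there is no backup at all or the newest one is too old."""
    age = newest_age_hours(mtimes, now)
    return age is None or age > max_age_hours


def check_backup_dir(backup_dir, max_age_hours=26):
    """Collect modification times of regular files in backup_dir and test them."""
    mtimes = [e.stat().st_mtime for e in os.scandir(backup_dir) if e.is_file()]
    return backups_look_stale(mtimes, time.time(), max_age_hours)
```

Run from cron once a day and mailed out whenever `check_backup_dir` returns True, something like this would flag the problem on day one rather than day ten. It only checks that a recent file exists, not that the dump inside it is valid.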
Looks like it got your sig too.
Well, if you need offline storage,
I have over 20 terabytes of open space here.
Bok
When you get her going again, if you don't use Diskeeper, go to Diskeeper - Hyperfast.
I have used Diskeeper for years, and it's excellent for massive defrag and disc I/O savings. However, with the onset of SSDs, they brought out Hyperfast, and the difference that's made to my SSDs is astonishing, not only in performance but, arguably more important, in disc life, as it prevents a huge number of disc writes/IO. As SSD lifespan is hugely sensitive to the number of disc writes, Hyperfast hits the button superbly.
I recently crunched some WUs that are very heavy on disc writes - this particular WU type creates literally thousands of directories as part of its routine. During my time crunching them, Hyperfast saved an average of 600,000 disc I/Os per day, compared to a WU that is not disc dependent, where Hyperfast saved only 25,000 or so disc writes per day. It works brilliantly, and frankly is a must-have for server SSDs - sprint, don't walk, and get it together with the main Diskeeper program for the servers.
Regards
Zy
hmm dang :( I guess you aren't meant to leave the house for more than a day... :(
Let me know if there is anything I can help with...
:idea: I bet gopher broke it trying to install my long range radar
come on confess :whip:
Hey Lauren not me....
The Long range radar is on my radar just got a few real life things to get done first my dear friend:).
It's all on Linux, so products like this wouldn't be of any use. RAID 10 arrays might work, but multiple SSDs with good controllers start getting expensive. I really should have been checking the backups a bit better. I plan to pull the server apart tomorrow and see what can be done. At worst I'll plug in the extra SSD drive I have. (Have to pull apart my Windows machine for that, but I'll just use my laptop in the meantime.)
Good news, looks like I've recovered all of the data as far as I can tell. A few tables gave me errors but they are replicated versions anyway so no big deal. Running tests which will take some time but I think it will be back up no later than noon tomorrow. I'll have to put the drives in correctly as they are just loose right now.
I'm going to look into getting 2 external drives (500GB or so would be fine) and running parallel backups to them.
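One way parallel backups to two drives could look, as a minimal Python sketch. This is an illustration only: `copy_to_all` is a hypothetical helper, and the real backups on this server may well be done differently (rsync, mysqldump piped straight to disk, and so on).

```python
import shutil
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def copy_to_all(src, dest_dirs):
    """Copy one backup file to every destination directory in parallel.

    Returns a dict mapping each destination to None on success or to the
    exception raised, so one failed drive does not hide behind the other.
    """
    src = Path(src)

    def copy_one(dest_dir):
        try:
            shutil.copy2(src, Path(dest_dir) / src.name)
            return None
        except OSError as exc:
            return exc

    dest_dirs = list(dest_dirs)
    with ThreadPoolExecutor(max_workers=max(1, len(dest_dirs))) as pool:
        results = list(pool.map(copy_one, dest_dirs))
    return dict(zip(dest_dirs, results))
```

With two external drives mounted at, say, `/mnt/backup1` and `/mnt/backup2` (made-up mount points), `copy_to_all("dump.sql", ["/mnt/backup1", "/mnt/backup2"])` reports per-drive success or failure, so a dead drive gets noticed instead of silently skipped.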
:) Good job man, thank goodness for that....
Hey bok, good luck with the re-build and recovery process. Take your time and get it right. Now is a good opportunity for that housekeeping on the server that always got postponed :)
Hurrah! Back in business :) Just in time for me to see a milestone I've been after for *ages* :)
Not had much luck with hardware here either. Lost 2 HDs in the last few weeks, and now another machine is randomly freezing and failing to find an OS. Running out of bits to cannibalise from old machines :(
Anyway, hadn't realised how much I relied on Free-DC. Might have to make another donation soon :)
Al.
I think everyone should realize how much they use Free-DC and how much they missed Free-DC Stats when they were down.
:idea: It MIGHT make them WANT to give a little back to Free-DC. :Pokes:
I am very SURE Bok is not too happy about having to strip down other working PCs just to get the stats UP again.
..:allhail:
Ah, just put some small change in the pot. Would be more, but I'm running out of HDs here and need to order some new ones. BOINC can seriously ruin your hardware :(
Anywoo, just pleased that the best stats site is back again. And Free-DC really is the best. By a Loooooooong way. Just sayin' like ;)
Al.
Stats on the Main Page are totally Boked errr Borked lol this morning ... :coffee:
:( Amazingly, when I woke up this morning, a 2nd SSD was exhibiting exactly the same behavior as the last one.
It's not the new one I put in, but surely this can't be a coincidence? More likely it points to a motherboard problem. I do not know what to do at this stage. I have one more SSD I could cannibalize from another machine, but my head tells me the machine itself is the problem now. I'll plug the first failed SSD into another Linux machine and see if it exhibits any errors at all.
Yeah Bok, have to say that is pointing at a hardware controller error - shame you don't have a SATA PCI card ;) to see if it's the controller or the whole mobo...
Can you boot to DOS off a floppy and run scandisk?
Linux has its own version - fsck, which is running now, but if it's a controller that's causing it, fsck won't know.
I guess it's none of the obvious things like the PSU being underrated, or going on the blink and over/under voltaging the drives?
Don't know if the RAM could be causing strange problems. Have had a few oddities over the years that were RAM related.
If it helps, I could ship you my old Sinclair ZX81 and/or my Nascom-1 ;)
Al.
Bok, these are SATA drives, right?
On a quad here that has a SATA SSD, it would just stop every 2 to 3 days.
It rebooted OK with a scandisk.
I found the drive controller chip and glued an old VGA chip cooler to it,
and it has not crashed since.
Find your drive/SATA chip and put your finger on it. If it feels hot, try this.
Normally it is right next to the SATA ports.
Hmm I should check mine, i get a click every so often from a drive but I got a fan on 'em ...
fsck was totally clean, and the stats have been running just fine for the last 8 hours or so. If they are still OK in the morning, I'll re-open the web side connection.
It felt just fine Lauren, these are all 120GB SSDs. Still running just fine, so I'll open it up soon.
Looks like stats are down again :eek:
Al.
yup. Drive not responding again. I'm going to tear down my windows machine and use that.
What do you think will make this go away?
P.S.
Without the need to tear apart your network.
Hmm, a PC ice box, nice water cooling, sealed box to prevent dust ingress, with Gigabit Ethernet too, with plenty of RAM and some method of minimizing writing to the drives too...
Would something like this work better than SATA?
http://www.microcenter.com/single_pr...uct_id=0371589
It has a big conch
PCIe has a larger pipe, if I am not wrong.
I have a new Iceberg W/cooling system I never installed
And or a pirzo system for working at -0 F
P.S.
This looks cool too, a hybrid:
http://www.microcenter.com/single_pr...uct_id=0374294
I think it ranges from trying out a vga cooler like you mentioned on the SATA controller chip to replacing the mobo (which would mean a new CPU too).
I intend to try the first option as soon as I get a chance, given the drive came back up just fine. Otherwise I'll sacrifice my windows machine which has a very good mobo+cpu
Most of my MBs have some kind of cooler on the chip, and like I said, adding the heat sink solved it (the can't-read error).
If you've got an old dead MB, take the VGA cooler off and use that. But use some heat sink paste in the middle and a dab or ball
of silicone at the 4 corners to lock it to the MB. Let it cure lying down for 6 hrs and you're good to go.
The silicone will not harm or short the MB.
OK try these
http://www.ocztechnology.com/ocz-rev...press-ssd.html
http://www.guru3d.com/article/ocz-re...120gb-review/7
Now I'm starting to want one haha
That last review is from 2010. I didn't notice until I saw it up against an OCZ Vertex II and was curious why they didn't compare it to the Vertex III instead, which has similar performance at ~550MB/s though is a lot cheaper.
Well, with a run of 343 years between failures it seems like a bargain.
Is the Vertex 3 the one that is not responding? There was a known problem with "panic lock" with the Vertex 2 drives (Sandforce controller), and it may have carried over to some extent to the Vertex 3.
http://www.youtube.com/watch?v=S0CJ0l1BUGI
I have a Vertex 2 that has never locked up in operation, but if I change video cards on the motherboard, the drive is not recognized on the next reboot and I have to go into the BIOS to find it again. So they are definitely temperamental, but work well on the right motherboard.
Hey Bok, I posted a message in your profile page regarding a possible solution to the Free-DC site man.
Actually no, it was one of the older ones and I don't think it was a problem with the SSD itself given it comes back up just fine. Convinced it's the SATA controller. I spent some time trying to get a VGA cooler on it today but didn't have much luck. See below for further outcome.
I appreciate the offer, but this is the database server. It's the main reason I gave up on a hosted dedicated server many years ago. It *NEEDS* to be a dedicated server. Whilst raw speed doesn't matter that much, it certainly helps, so a decent Core i7 is a must (current one is a Core i7-920). Memory is also a necessity to keep the database running well. Current server has 12GB (which was the max available at the time I built it). It does not run the webpages at all, those are on a separate server. It does all the downloads of stats files, parsing of the XML, MySQL updates, ranking etc. in a schema running on a dedicated SSD. It then replicates all of this into a separate schema running on a different SSD. And it does this a lot.
As an update for all. I tore apart my windows machine today and replaced the components of the database server with it. Was kind of hoping the OS (CentOS 6.2) would come back up without too much effort but alas it did not and I had to re-install it. Same went for the transplant back to my windows server but then I was expecting that one...
I'm typing on the windows box right now, so that went fairly smooth.
The database server is now running a Core i7-2600K with 16GB RAM (and it will handle 32GB). And it's running the SSDs on SATA III fully now, which is good. I have most of the packages I need installed, just a few more to go. So tomorrow if all goes well, I'll be pointing the site back here once more and fingers crossed it all works well. I have no spares of anything now, which is a little disconcerting as it's the first time in years :) But I'll deal with that in the coming weeks. I've had about $100 in donations in the last week or so, that will go towards another spare SSD and then the 8GB RAM chips I think. I did pick up a 1TB external drive which I have hooked up for secondary backups, maybe get one more of those.
Talking of donations, how can I donate?
Until I open the site up again, you can go to www.free-dc.org/paypal.php
Appreciate it.
Another update before I hit the sack. Stats are actually running right now and do appear to be quite a bit faster, mostly. One part is definitely slower and I'm pretty sure that's down to a mysql config I need to correct. Going to let them run overnight and check everything out first thing. After a few more tests then I'll open it up again.. Fingers crossed!
OK Phil, fifty bucks on its way.:thumbs:
Alan