Results 1 to 24 of 24

Thread: Cosmology Update

  1. #1
    Administrator Bok's Avatar
    Join Date
    Oct 2003
    Location
    Wake Forest, North Carolina, United States
    Posts
    24,596
    Blog Entries
    13

    Cosmology Update

    This email was forwarded to me from NFlight over at AmdUsers


    From the Cosmology Admin....


    We had a sudden unforeseen hardware failure Friday night - we have been working to get the server back up and running. Since it is the week-end it is harder to get replacement parts, so the system may continue to be down for of a couple of days before things will be back on track.

    It would be great if you could get a message to our users about this - we were trying to think of a way, but I could not even post on my webpage because it is running on the same server that is hosting cosmology@home.

    All the best,

    Ben


  2. #2
    Junior Member gatekeeper53's Avatar
    Join Date
    Nov 2007
    Location
    Pierron, IL
    Posts
    19

    Update from Ben at Cos

    Dear Terry -

    That's very kind of you. I have forwarded your message
    to Scott Kruger,
    who has been tracking down exactly what the issue is.
    It appears to be a
    motherboard failure. Cosmology@home is running on a
    dual Opteron platform.

    It does not appear to be a hard drive problem. Even if
    it had been , we
    store all our programs and data on that server
    completely redundantly,
    plus we backed up daily so all the scientific results
    of cosmology@home
    so far are protected. If you do have a way of
    communicating with at
    least some of our users, this would also be a good
    message to get out there.

    All the best,
    Ben

  3. #3
    Junior Member Gamma^Ray's Avatar
    Join Date
    Jul 2006
    Location
    USA
    Posts
    9
    Thanks for the info !

    I added the info to the Synergy Mainpage news for Cosmo to help spread the word.

  4. #4
    =>Team Joker<= LAURENU2's Avatar
    Join Date
    Dec 2004
    Location
    Chicago IL USA
    Posts
    5,478
    Blog Entries
    1
    Is there a ETA for it to be back online

  5. #5
    Administrator Bok's Avatar
    Join Date
    Oct 2003
    Location
    Wake Forest, North Carolina, United States
    Posts
    24,596
    Blog Entries
    13
    nope. but the homepage at least gives a quick status now...

    Bok

  6. #6
    =>Team Joker<= LAURENU2's Avatar
    Join Date
    Dec 2004
    Location
    Chicago IL USA
    Posts
    5,478
    Blog Entries
    1
    Quote Originally Posted by Bok
    nope. but the homepage at least gives a quick status now...

    Bok
    Nope not yet Still The page cannot be displayed
    Must be all the Calls form nodes trying to send in old WU's ___

  7. #7
    Junior Member gatekeeper53's Avatar
    Join Date
    Nov 2007
    Location
    Pierron, IL
    Posts
    19

    Cos update

    I just talked to Ben Wandelt on the phone and he seems to believe that Cosmo should be up and running sometime today. It appears that they had trouble getting the new server to mesh all the software properly. He tells me that they are working around the clock to solve the problem. Also he assures me that no matter how much you have in your machines that you will recieve credit for them. He told me that this is the first real unscheduded downtime that the project has suffered and, that as with all of us, him and his people have learned much from it. He hopes that people will return once the project is back up, also he is sorry for the downtime. He would appreciate any help in getting the word out as to updates on the progress and especially when the project comes back online. He seems like a very nice guy.

  8. #8
    Administrator Bok's Avatar
    Join Date
    Oct 2003
    Location
    Wake Forest, North Carolina, United States
    Posts
    24,596
    Blog Entries
    13
    Thanks for the update!

    Bok

  9. #9
    Minister of Propaganda Fozzie's Avatar
    Join Date
    Jul 2003
    Location
    Bristol,UK
    Posts
    3,609

    Project admins

    that keep in touch and are reasonable always get the backing their project deserves.

    It's when they are arrogant that they lose their support.

    to Ben
    Alas poor Borg, I knew it Horatio



    http://www.butlersurvey.com/

  10. #10
    Administrator Bok's Avatar
    Join Date
    Oct 2003
    Location
    Wake Forest, North Carolina, United States
    Posts
    24,596
    Blog Entries
    13
    From the main page you now get this..

    UPDATE: We've got a new machine and I'm in the process of getting it up and running (we're having some trouble with the ethernet driver ). I'll be sitting in in front of the computer for the rest of the day trying to get everything running.

    I look forward to seeing everybody again after we're back up!

    -Scott

  11. #11
    Junior Member Gamma^Ray's Avatar
    Join Date
    Jul 2006
    Location
    USA
    Posts
    9
    Scott also told me that there was some sort of failure on cosmos which destroyed the power supply and motherboard.

    Pretty bad hit whatever caused it, Which I would guess started with or before the Power Supply. Maybe a power surge or lull not noticed that did it ? Happened to my system not too long ago that fried my P/S and almost got my M/board also.

    But I do agree that the Admins at Cosmo are and have been very good about communications and all.

  12. #12
    Ancient Programmer Paratima's Avatar
    Join Date
    Dec 2001
    Location
    West Central Florida
    Posts
    3,296
    Now we'll see if they're smart enough to learn from all this. Like isolating the P/S and routers from line problems. Like having spares. Murphy lives! Either you're prepared or you're not.

    I guess every young admin has to learn this the hard way.
    HOME: A physical construct for keeping rain off your computers.

  13. #13
    =>Team Joker<= LAURENU2's Avatar
    Join Date
    Dec 2004
    Location
    Chicago IL USA
    Posts
    5,478
    Blog Entries
    1
    Redundancy Redundancy Redundancy I have lots of Redundancy except for my wife
    Not only in data backup but also in servers to share the load

  14. #14
    Administrator Bok's Avatar
    Join Date
    Oct 2003
    Location
    Wake Forest, North Carolina, United States
    Posts
    24,596
    Blog Entries
    13
    site is back but the various services aren't running yet...

    Bok

  15. #15
    Junior Member gatekeeper53's Avatar
    Join Date
    Nov 2007
    Location
    Pierron, IL
    Posts
    19
    Ben Wandelt just stopped by the AMD Users chatroom and let us know that they are getting close to resuming full operations (they hope). He brought up the idea of doing a Q&A about Cosomology@home. We are going to try and set something up and when we get it firmed up I'll post something here so that if any of you would like to get in on it you can.

    Terry

  16. #16
    =>Team Joker<= LAURENU2's Avatar
    Join Date
    Dec 2004
    Location
    Chicago IL USA
    Posts
    5,478
    Blog Entries
    1
    Quote Originally Posted by gatekeeper53
    Ben Wandelt just stopped by the AMD Users chatroom and let us know that they are getting close to resuming full operations (they hope). He brought up the idea of doing a Q&A about Cosomology@home. We are going to try and set something up and when we get it firmed up I'll post something here so that if any of you would like to get in on it you can.

    Terry
    OK Now where did I put that List Nope nope that one is for X-mas

  17. #17
    Junior Member Gamma^Ray's Avatar
    Join Date
    Jul 2006
    Location
    USA
    Posts
    9
    Thanks gatekeeper53.

  18. #18
    =>Team Joker<= LAURENU2's Avatar
    Join Date
    Dec 2004
    Location
    Chicago IL USA
    Posts
    5,478
    Blog Entries
    1
    Bok What do we need to do to get Our (your) Cosmo Stats back updateing Again

  19. #19
    Administrator Bok's Avatar
    Join Date
    Oct 2003
    Location
    Wake Forest, North Carolina, United States
    Posts
    24,596
    Blog Entries
    13
    The need to switch on their job for creating the xml stats...

    I posted about it a couple of hours ago

    Bok

  20. #20
    Ancient Programmer Paratima's Avatar
    Join Date
    Dec 2001
    Location
    West Central Florida
    Posts
    3,296
    It was getting no official notice, so I pinged it. We'll see what happens. They may have more pressing issues. Or sometimes you just have to squeak a little louder.
    HOME: A physical construct for keeping rain off your computers.

  21. #21
    Junior Member Gamma^Ray's Avatar
    Join Date
    Jul 2006
    Location
    USA
    Posts
    9
    Or maybe they are trying to make up for the loss of sleep for the past few days.

  22. #22
    Well, if they dont't get the proflidator/affliataidator or what it is working.....

    Me have about 300 results there pending

  23. #23
    =>Team Joker<= LAURENU2's Avatar
    Join Date
    Dec 2004
    Location
    Chicago IL USA
    Posts
    5,478
    Blog Entries
    1
    Quote Originally Posted by Paratima
    It was getting no official notice, so I pinged it. We'll see what happens. They may have more pressing issues. Or sometimes you just have to squeak a little louder.
    He said they fixed it but just checked all stats no one but Cosmo is updating
    So I went over there and

    Dam I have a Bigger RAC then my wife over there

  24. #24
    Ancient Programmer Paratima's Avatar
    Join Date
    Dec 2001
    Location
    West Central Florida
    Posts
    3,296
    Same is true over at Milkyway@home. No xml stats updates. I've pitched a over there, as well, with about the same result.
    HOME: A physical construct for keeping rain off your computers.

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •