PDA

View Full Version : Server having problems?



rshepard
04-20-2004, 07:58 AM
For a couple of days I've been getting the "server is restarting" msg every time each of my clients tries to connect (both from home and at work) Now it looks as if the stats are broken; also; the clients have continued to do work but it looks like nothing is being recorded at the website. :Pokes:

PlanetScorp
04-20-2004, 08:33 AM
that would be an answer for my first-time-run problems decribed here (in the middle somewhere).

http://www.free-dc.org/forum/showthread.php?s=&threadid=5658

michaelgarvie
04-20-2004, 12:19 PM
Firewall unblocked. Thanks for your patience!
:smoking:

rshepard
04-20-2004, 12:26 PM
Thanks!
:notworthy

PlanetScorp
04-20-2004, 01:21 PM
thank you, connection works fine now! :blush:

michaelgarvie
04-25-2004, 06:02 AM
Web server down today due to migration from department web server to university web server.

The evolution server is running though and stats are being collected.

The new server is hopefully compatible with PHP PNG graphics and would allow graphs to be drawn from the statistics.

rshepard
04-28-2004, 07:44 AM
Some of my clients are showing "timed-out" msgs when trying to get work from the server,
and it looks like the stats on the website have stopped updating.

*hunts for server-beating stick :Pokes:

VirtualAdept
04-28-2004, 09:03 AM
Originally posted by rshepard
Some of my clients are showing "timed-out" msgs when trying to get work from the server

My client has been getting timeouts only (java.rmi.ConnectIOException, java.net.SocketTimeoutException: Read timed out) from the server since ~9:30 CEST this morning, too. Just when I had this remarkable species... :cry:

michaelgarvie
04-28-2004, 09:40 AM
.

rshepard
05-10-2004, 07:50 AM
Timed-out errors are back today, and the stats are hung up again:cry:

michaelgarvie
05-10-2004, 01:15 PM
Thanks :blush:

rshepard
05-13-2004, 08:52 AM
stats are stuck again :Pokes:

michaelgarvie
05-13-2004, 09:52 AM
Thanks again.. There must be some Linux/Java limit the server is hitting..

rshepard
05-24-2004, 12:03 PM
stuck again :bang:

michaelgarvie
05-24-2004, 02:41 PM
And back with some optimisations which should mean you should see generations go by quicker.

rshepard
05-27-2004, 08:44 AM
looks like it is down again :cry:

michaelgarvie
05-27-2004, 10:40 AM
Power cut yesterday 7pm... :swear:

rshepard
06-17-2004, 10:18 AM
stats page is stuck today :help:

HitchHiker
06-18-2004, 08:29 AM
Can't get any jobs either...:bang:


Trying to get task from server //139.184.166.27/KingKong
java.rmi.ConnectIOException: error during JRMP connection establishment; nested exception is:
java.net.SocketTimeoutException: Read timed out

dreplogle
06-18-2004, 09:41 AM
I don't suppose any of the moderators could suppy an ETA (Estimated Time of Access?). :trash:

prokaryote
06-18-2004, 09:55 AM
:( Sorry, can't say since I don't have any info regarding U of S systems. I'm thinking that since Miguel hasn't responded yet that it's system wide over there.

dreplogle
06-19-2004, 12:04 AM
I've still got two windows open processing something. Generation times are increasing. I wonder though if I'm just spinning my wheels and getting nowhere? I can't open any more windows on the other pc's I use. Anyone wondering the same thing?:taz:

rshepard
06-19-2004, 06:13 AM
Islands that are running when the server goes down should continue to evolve - they just won't interact with other islands until the server comes back. You can't start up new islands without the server to hand out work, however. At least,that is how I understand it.

prokaryote
07-02-2004, 10:18 AM
:( Looks like server issues again? Hope things get sorted before the weekend.

prokaryote
07-06-2004, 12:58 PM
Looks like somemore issues....

Any chance that the project could get a dedicated server?

adream
07-07-2004, 05:03 AM
Originally posted by prokaryote
Looks like somemore issues....

Any chance that the project could get a dedicated server?


i second that, what are the bandwidth requirements of the project server ? (not the web server)

regards

adream

:elephant: :elephant:

michaelgarvie
07-08-2004, 02:33 PM
This time there's an interesting story:
It looks like some naughty stats-climber installed a client on some company's machine without their permission. This company then issued a complaint to the university of abnormal traffic between the DHEP server and their machine thinking there was a trojan involved. No one belonging to the company has owned up to being involved with DHEP!

Naughty naughty, who could this have been? :spank: :haddock: :bonk: :whip:

Server is back online now.

adream
07-08-2004, 06:16 PM
wasn't me !!! :bouncy: :bouncy: :bouncy: :bouncy:

thanks for the info miguel

thats bloody naughhty aint it ;)



adream

HitchHiker
07-15-2004, 05:46 PM
Is the server in trouble again? :help:
One of my machines hasn't got a task for more than a day now, it may be longer than that. Stats in ranking indicate I'm not the only one suffering. :D
My two other machines are still slowly but happily churning away though.

rshepard
08-24-2004, 06:30 PM
stats are frozen again

jfk
08-24-2004, 10:48 PM
I am still getting the Java time out error. Anyone else?

michaelgarvie
08-25-2004, 05:01 AM
Sorry about this. Back up again. :bang:

em99010pepe
08-25-2004, 05:17 AM
I like this project but why the server is always down? It's funny but the server goes down everytime I have to restart the computer.

Carlos

jfk
08-25-2004, 11:32 AM
Probably what is happening is that the servers go down and your island continues to evolve without communicating to other servers or islands, then when u restart and it becomes necessary to contact the server to get new work, you become aware of the down server.

em99010pepe
08-25-2004, 12:04 PM
I ran all night and beginning of morning (about 14 hours) but didn't get credit for circuits evaluated. Strange!

Is Sun java J2SE 5.0 Beta 2 quicker than Sun java J2SE 1.4.2 SDK?

rshepard
08-25-2004, 03:03 PM
hung up again-- seems to always break down when the topology map is 33x33 :confused:

Also, what is up with the Marvin island?? It seems to be eating up all the empty space and not returning any info???????

nickth
08-25-2004, 03:30 PM
Originally posted by rshepard
hung up again-- seems to always break down when the topology map is 33x33 :confused:

Also, what is up with the Marvin island?? It seems to be eating up all the empty space and not returning any info???????

I agree with r shepard. the server is down again :trash:

nickth
08-25-2004, 04:18 PM
Originally posted by nickth
I agree with r shepard. the server is down again :trash:

Now an hour latter its back up again :bang:

Very intresting :confused:

em99010pepe
08-25-2004, 04:22 PM
Originally posted by nickth
Now an hour later its back up again :bang:

Very interesting :confused:

Are you sure?

nickth
08-25-2004, 04:28 PM
yes i am for the moment anyway

michaelgarvie
08-26-2004, 04:55 AM
rshepard was right, the problem was Marvin. This island was just creating hundreds of spurious connections. :haddock: I this kind of thing keeps happening I'll have to write some code to handle it. Sorry about this, server is running fine now.

jfk
08-26-2004, 08:47 AM
The stats show Island Not currently active, eventhough I am still chugging away. Are we down again?

rshepard
09-02-2004, 07:42 AM
time to kick the server again, Miguel :D

Also, would you look at the overall rankings ( Total Circuits Evaluated)-- I seem to have disappeared from the list :cry:

Thanks

michaelgarvie
09-02-2004, 08:28 AM
Server up again. :spank:

rshepard, you're in position 306. Is this not correct?

You're still in top 5 for effort.

rshepard
09-02-2004, 08:36 AM
No- I was #1 for quite a while, just waiting for diGriz to run over me. I think it fell off sometime after I broke through 2,000,000,000 circuits. Some sort of overflow condition maybe???

michaelgarvie
09-02-2004, 01:19 PM
Hmm.. Funny that the overflow didn't hid Bolivar. Do the stats look better now?

rshepard
09-02-2004, 01:20 PM
Much Better!!

Thanks :thumbs:

prokaryote
10-13-2004, 01:25 AM
:bonk: Server needs some coercive violence to get it going again. :D

deranged128[OCAU]
10-13-2004, 06:17 AM
A quick dose of percussion treatment appears to be in order. :trash:

This brings me back to a question in another thread I started 'Significant Downtime" where I asked about possibly saving completed work to disk during extended server outtages and the ability to view the java output to know if significant amounts of work are waiting to be uploaded.

I'd also like to get an idea on how many servers we are communicating with for uploads. Is there are need for perhaps additional upload points with the ability to cache work on them before finally reconciling with the main server? Sort of like the old seti-q idea.

I'm keen to keep supporting this project, which I consider to have very worthwhile aims, but am finding I have to divert CPU resources to other projects due to an inability to keep the client running at full capacity. :rolleyes:

ace_dent
10-14-2004, 05:29 AM
Server down again?...

michaelgarvie
10-14-2004, 05:36 AM
Server going for full restart 10:37 GMT.:bang:

prokaryote
10-14-2004, 11:45 PM
:bang: Blinkin Server is down again... It's not running Windoze Server 3 is it? Or maybe someone should give the little hamster in his wheel some water!:crazy:

P.S. Miguel, American_Maid_2 is trying to connect to the server (been doing so for about the last 4 days) Can get to the rest of the internet no probs. Thanks, prok.

michaelgarvie
10-18-2004, 05:28 AM
Server was shut down yesterday by Sussex IT people without warning. They say it was attacking another computer on campus with a DDOS.

deranged128[OCAU]
10-18-2004, 05:48 AM
I might have to come back to this project when the server situation is resolved. :rolleyes: I'll keep watching this thread for news. :cool:

ace_dent
10-18-2004, 12:22 PM
I was surprised that Island topology was not reset when the server was fixed. (Forgive me, I'm new to this project). With such an empty 'world' (only edge islands), wont migration and evolution be hindered?

Regards,
Andrew

michaelgarvie
10-18-2004, 01:48 PM
Server back on-line. Looks like the attack thing was just a scare :haddock: . Thanks for your patience.

The ring topology is because the server was never actually turned off, they just disconnected the network socket and filtered it out of the univiersity firewall. I'll just be double checking for any worms and everything should be normal.

Migration should be going round the ring like on a motorway.

:cheers: Miguel

ace_dent
10-18-2004, 07:29 PM
... and it's down again... :Pokes:

BTW- Is posting server outages useful?

michaelgarvie
10-19-2004, 02:55 AM
And up again. We'll now be focusing on ways to stabilise the server.

deranged128[OCAU]
10-19-2004, 05:47 AM
Originally posted by miguelgarvie
And up again. We'll now be focusing on ways to stabilise the server. I asked a question earlier in this thread about ways of getting more server points available. Is that possible?

michaelgarvie
10-20-2004, 09:02 AM
It would be a nice extension to http://distrit.sf.net . If we get a good proposal from a coder on how they would do this, we would gladly accept it into the mainstream code. :cheers: Miguel

ace_dent
11-02-2004, 05:03 AM
Down again...

Cheers.

em99010pepe
11-13-2004, 04:31 AM
There's some kind of problem with the stats. Look at the numbers?

Carlos

EDIT

Problem fixed

ace_dent
11-16-2004, 06:05 AM
Cannot connect...

Cheers,
Andrew

michaelgarvie
11-16-2004, 06:59 AM
Up again. Power had been cut in the lab.

ace_dent
11-18-2004, 12:19 PM
Hmmm... just what is going on with that pesky server?

After going down this morning, all of the crunchin' of the previous day was in vain- is there no way to recover to the last evolved state? Also, after the last reboot, the server time was actually correct- now we are back to +1:00 (well in the UK anyway).

Cheers,
Andrew.

michaelgarvie
11-18-2004, 01:42 PM
Hi Ace,

Thanks for your message about the time. It's been corrected now.

The run has been restarted deliberately because it had been converged for two days and nothing was happening. A new measure has been introduced to avoid this and seems to be worked very nicely. Basically an immigrant won't be allowed into an island unless it is 0.08 better at fitness p1. This creates speciation as several islands are close to the best global best become immune to be overrun by immigrants. This avoids premature convergence on local optima. By watching the cluster it seems this is a pretty good strategy because it happens often that the global best comes from one of the top five global best, so its good to keep the diversity there.

:cheers: Miguel

ace_dent
11-18-2004, 08:52 PM
A very interesting answer- thanks for the reply.

Andrew

ace_dent
11-25-2004, 07:04 PM
No very important, but server time is +1:00 again.

prokaryote
11-27-2004, 01:39 PM
:eek: Looks like IT shut down the server again this weekend!

Getting kind of old, these problems almost always happen on a Friday and aren't resolved till the following Monday. Is there anyway to check to see if the server is up over the weekend and give someone a call? I've got a couple of systems that I have to reboot and the project will endup losing that work since they couldn't check into the system. :rolleyes: :bang:

michaelgarvie
11-28-2004, 10:26 AM
Server is up. There was power failure in the department over the weekend. They still haven't sorted out the webserver so the dhep website is down. However the DHEP server itself is up and running and collecting stats as I write.
:cheers: Miguel

ace_dent
12-06-2004, 10:18 AM
Server down again?...

HitchHiker
12-06-2004, 04:08 PM
Again and again, what kind of server is this?

Miguel, would you be so kind to answer ace_dent's question of 6 weeks ago: "BTW- Is posting server outages useful?" and react to the remark of prokaryote that this always seems to happen on a friday (except today of course). :help:

You seem to be a man of little words. Spend a bit more of those please.

ace_dent
12-20-2004, 10:34 AM
No crunching this Christmas? :Pokes:

stanray
12-22-2004, 11:26 AM
Well, seems to be lot's of questions about status and no answers to them given. Let me know when it is a go with this project again. See ya!

Stan:swear: