Hi all,
once again F@H is up to its old tricks.
This time it's the SMP client that refuses to send its results back to Stanford.
Here is the log:
Code:
[16:00:47] Completed 175000 out of 500000 steps (35 percent)
[16:10:48] Timered checkpoint triggered.
[16:20:48] Timered checkpoint triggered.
[16:26:55] Writing local files
[16:26:55] Completed 180000 out of 500000 steps (36 percent)
[16:36:56] Timered checkpoint triggered.
[16:46:56] Timered checkpoint triggered.
[16:48:58] - Autosending finished units... [October 27 16:48:58 UTC]
[16:48:58] Trying to send all finished work units
[16:48:58] Project: 2665 (Run 2, Clone 219, Gen 60)
[16:48:58] + Attempting to send results [October 27 16:48:58 UTC]
[16:48:58] - Reading file work/wuresults_01.dat from core
[16:48:58] (Read 22443409 bytes from disk)
[16:48:58] Connecting to http://171.64.65.64:8080/
[16:53:07] Writing local files
[16:53:07] Completed 185000 out of 500000 steps (37 percent)
[17:03:07] Timered checkpoint triggered.
[17:08:59] Posted data.
[17:13:07] Timered checkpoint triggered.
[17:19:20] Writing local files
[17:19:20] Completed 190000 out of 500000 steps (38 percent)
[17:28:59] Initial: 0000; Timered checkpoint triggered.
[17:39:20] Timered checkpoint triggered.
[17:45:36] Writing local files
[17:45:37] Completed 195000 out of 500000 steps (39 percent)
[17:48:59] + Could not connect to Work Server (results)
[17:48:59] (171.64.65.64:8080)
[17:48:59] + Retrying using alternative port
[17:48:59] Connecting to http://171.64.65.64:80/
[17:55:36] Timered checkpoint triggered.
[18:05:36] Timered checkpoint triggered.
[18:09:01] Posted data.
[18:11:53] Writing local files
[18:11:53] Completed 200000 out of 500000 steps (40 percent)
[18:21:53] Timered checkpoint triggered.
[18:29:01] Initial: 0000; Timered checkpoint triggered.
[18:38:14] Writing local files
[18:38:14] Completed 205000 out of 500000 steps (41 percent)
[18:48:14] Timered checkpoint triggered.
[18:49:01] + Could not connect to Work Server (results)
[18:49:01] (171.64.65.64:80)
[18:49:01] - Error: Could not transmit unit 01 (completed October 26) to work server.
[18:49:01] - 9 failed uploads of this unit.
[18:49:01] + Attempting to send results [October 27 18:49:01 UTC]
[18:49:01] - Reading file work/wuresults_01.dat from core
[18:49:01] (Read 22443409 bytes from disk)
[18:49:01] Connecting to http://171.64.122.86:8080/
[18:49:03] - Couldn't send HTTP request to server
[18:49:03] + Could not connect to Work Server (results)
[18:49:03] (171.64.122.86:8080)
[18:49:03] + Retrying using alternative port
[18:49:03] Connecting to http://171.64.122.86:80/
[18:49:03] - Couldn't send HTTP request to server
[18:49:03] (Got status 503)
[18:49:03] + Could not connect to Work Server (results)
[18:49:03] (171.64.122.86:80)
[18:49:03] Could not transmit unit 01 to Collection server; keeping in queue.
[18:49:03] + Sent 0 of 1 completed units to the server
[18:49:03] - Autosend completed
[18:58:14] Timered checkpoint triggered.
[19:01:35] CoreStatus = 63 (99)
[19:01:35] + Error starting Folding@Home core.
[19:01:40]
[19:01:40] + Processing work unit
[19:01:40] Work type a1 not eligible for variable processors
[19:01:40] Core required: FahCore_a1.exe
[19:01:40] Core found.
[19:01:40] Working on queue slot 02 [October 27 19:01:40 UTC]
[19:01:40] + Working ...
[19:01:40] - Calling 'mpiexec -np 4 -channel shm -env MPICH_USE_SMP_OPTIMIZATIONS 1 -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 02 -checkpoint 10 -verbose -lifeline 2756 -version 622'
[19:01:42]
[19:01:42] *------------------------------*
[19:01:42] Folding@Home Gromacs SMP Core
[19:01:42] Version 1.76 (February 23, 2008)
[19:01:42]
[19:01:42] Preparing to commence simulation
[19:01:42] - Ensuring status. Please wait.
[19:01:59] - Looking at optimizations...
[19:01:59] - Working with standard loops on this execution.
[19:01:59] Examination of work files indicates 8 consecutive improper terminations of core.
[19:02:07] - Expanded 2444160 -> 12895909 (decompressed 527.6 percent)
[19:02:08]
[19:02:08] Project: 2653 (Run 23, Clone 42, Gen 87)
[19:02:08]
[19:02:09] Entering M.D.
[19:02:16] Calling FAH init
[19:02:17] in POPC
[19:02:17] Writing local files
[19:02:17] Completed 208798 out of 500000 steps (41 percent)
[19:02:17] PC
[19:02:17] Writing local files
[19:02:17] Completed 208798 out of 500000 steps (41 percent)
[19:02:19] Extra SSE boost OK.
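To make the failed uploads easier to spot in a log like the one above, here is a rough Python sketch that pairs each "Connecting to" line with the matching "Could not connect" address line. The names `upload_attempts`, `CONNECT_RE` and `FAILED_RE` are made up for this illustration, and the patterns are based only on the line formats visible in this log, so other client versions may need different ones.

```python
import re

# Matches lines such as "[16:48:58] Connecting to http://171.64.65.64:8080/"
CONNECT_RE = re.compile(
    r"^\[(\d\d:\d\d:\d\d)\] Connecting to http://([\d.]+):(\d+)/"
)
# Matches the address line printed after a failure, e.g. "[17:48:59] (171.64.65.64:8080)"
FAILED_RE = re.compile(r"^\[(\d\d:\d\d:\d\d)\]\s+\(([\d.]+):(\d+)\)\s*$")


def upload_attempts(log_text):
    """Return (host, port, started, failed_at) for every failed connection.

    Times are returned as the "HH:MM:SS" strings found in the log;
    `started` is None if no matching "Connecting to" line was seen.
    """
    pending = {}   # (host, port) -> time the connection attempt started
    attempts = []
    for raw in log_text.splitlines():
        line = raw.strip()
        m = CONNECT_RE.match(line)
        if m:
            pending[(m.group(2), int(m.group(3)))] = m.group(1)
            continue
        m = FAILED_RE.match(line)
        if m:
            key = (m.group(2), int(m.group(3)))
            attempts.append((key[0], key[1], pending.pop(key, None), m.group(1)))
    return attempts
```

Calling `upload_attempts` on the text of the log and printing the result should show, for this log, that both attempts against 171.64.65.64 sit for about an hour before failing, while 171.64.122.86 fails within a couple of seconds. The log file name (I assume `FAHlog.txt` in the client folder) may differ on your setup.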
Well, this has been dragging on for about a week now, so I've already lost at least 10 GPU WUs and about 5 SMP ones...
If anyone knows what's going on...
I'd appreciate it.
Regards.