Project: 2665 (Run 1, Clone 888, Gen 117)

Moderators: Site Moderators, FAHC Science Team

Post Reply
poiuyut
Posts: 8
Joined: Wed Apr 30, 2008 10:50 pm
Location: Vancouver, BC

Project: 2665 (Run 1, Clone 888, Gen 117)

Post by poiuyut »

BAD_CORE_FILES error immediately after starting this unit.

Code: Select all

[16:16:00] + Processing work unit
[16:16:00] Work type a1 not eligible for variable processors
[16:16:00] Core required: FahCore_a1.exe
[16:16:00] Core found.
[16:16:00] Working on queue slot 03 [October 11 16:16:00 UTC]
[16:16:00] + Working ...
[16:16:00] - Calling 'mpiexec -np 4 -channel shm -env MPICH_USE_SMP_OPTIMIZATIONS 1 -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 03 -checkpoint 15 -verbose -lifeline 3080 -version 624'

[16:16:00] 
[16:16:00] *------------------------------*
[16:16:00] Folding@Home Gromacs SMP Core
[16:16:00] Version 1.76 (February 23, 2008)
[16:16:00] 
[16:16:00] Preparing to commence simulation
[16:16:00] - Looking at optimizations...
[16:16:00] - Created dyn
[16:16:00] - Files status OK
[16:16:04] - Expanded 4697175 -> 24111057 (decompressed 513.3 percent)
[16:16:04] - Starting from initial work packet
[16:16:04] 
[16:16:04] Project: 2665 (Run 1, Clone 888, Gen 117)
[16:16:04] 
[16:16:05] Assembly optimizations on if available.
[16:16:05] Entering M.D.
[16:16:11] Rejecting checkpoint
[16:16:12] NaN detected: x[7904][0]=4.82564 v[7904][0]=NaN
[16:16:12] utdown: BAD_CORE_FILES
[16:16:12] Finalizing output
[16:18:12] NaN detected: x[12294][0]=-3159739.00000 v[12294][0]=NaN
[16:18:12] 
[16:18:12] Folding@home Core Shutdown: BAD_CORE_FILES
[16:18:12] Finalizing output
[16:20:17] CoreStatus = 1 (1)
[16:20:17] Sending work to server
[16:20:17] Project: 2665 (Run 1, Clone 888, Gen 117)
[16:20:17] - Error: Could not get length of results file work/wuresults_03.dat
[16:20:17] - Error: Could not read unit 03 file. Removing from queue.
[16:20:17] Trying to send all finished work units
[16:20:17] + No unsent completed units remaining.
[16:20:17] - Preparing to get new work unit...
[16:20:17] Cleaning up work directory
[16:20:17] + Attempting to get work packet
[16:20:17] - Will indicate memory of 3063 MB
[16:20:17] - Connecting to assignment server
[16:20:17] Connecting to http://assign.stanford.edu:8080/
[16:20:17] Posted data.
[16:20:17] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[16:20:17] + News From Folding@Home: Welcome to Folding@Home
[16:20:17] Loaded queue successfully.
[16:20:17] Connecting to http://171.64.65.64:8080/
[16:20:22] Posted data.
[16:20:22] Initial: 0000; - Receiving payload (expected size: 4697687)
[16:20:31] - Downloaded at ~509 kB/s
[16:20:31] - Averaged speed for that direction ~344 kB/s
[16:20:31] + Received work.
[16:20:31] Trying to send all finished work units
[16:20:31] + No unsent completed units remaining.
[16:20:31] + Closed connections
[16:20:36] 
[16:20:36] + Processing work unit
[16:20:36] Work type a1 not eligible for variable processors
[16:20:36] Core required: FahCore_a1.exe
[16:20:36] Core found.
[16:20:36] Working on queue slot 04 [October 11 16:20:36 UTC]
[16:20:36] + Working ...
[16:20:36] - Calling 'mpiexec -np 4 -channel shm -env MPICH_USE_SMP_OPTIMIZATIONS 1 -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 04 -checkpoint 15 -verbose -lifeline 3080 -version 624'

[16:20:36] 
[16:20:36] *------------------------------*
[16:20:36] Folding@Home Gromacs SMP Core
[16:20:36] Version 1.76 (February 23, 2008)
[16:20:36] 
[16:20:36] Preparing to commence simulation
[16:20:36] - Looking at optimizations...
[16:20:36] - Created dyn
[16:20:36] - Files status OK
[16:20:40] - Expanded 4697175 -> 24111057 (decompressed 513.3 percent)
[16:20:40] - Starting from initial work packet
[16:20:40] 
[16:20:40] Project: 2665 (Run 1, Clone 888, Gen 117)
[16:20:40] 
[16:20:41] Assembly optimizations on if available.
[16:20:41] Entering M.D.
[16:20:47] Rejecting checkpoint
[16:20:48] NaN detected: x[7904][0]=4.82564 v[7904][0]=NaN
[16:20:48] utdown: BAD_CORE_FILES
[16:20:48] Finalizing output
[16:20:52] Killing all core threads
[16:20:52] Killing 4 cores
[16:20:52] Killing core 0
[16:20:52] Killing core 1
[16:20:52] Killing core 2
[16:20:52] Killing core 3

Folding@Home Client Shutdown at user request.
[16:20:52] ***** Got a SIGTERM signal (2)
[16:20:52] Killing all core threads
[16:20:52] Killing 4 cores
[16:20:52] Killing core 0
[16:20:52] Killing core 1
[16:20:52] Killing core 2
[16:20:52] Killing core 3

Folding@Home Client Shutdown.
Bobby-Uschi
Posts: 70
Joined: Thu Jul 31, 2008 3:26 pm
Hardware configuration: PC1//C2Q-Q9450,GA-X48-DS5-NinjaMini,GTX285,2x160GB Western Sata2,2x1GB Geil800,Tagan 800W;XP Pro SP3-32Bit;
PC2//C2Q-Q2600k.GB-P67UD4-Freezer 7Pro,GTX285Leadtek,260 GB Western Sata2,4x2GB GeilPC3,OCZ600W;Win7-64Bit;Siemens 22"
Location: Deutschland

Re: Project: 2665 (Run 1, Clone 888, Gen 117)

Post by Bobby-Uschi »

for me too is faulty
PC1//C2Q-Q9450,GA-X48-DS5-,2xGTX285,2x160GB Western Sata2,2x1GB Geil800,Tagan 800W;XP Pro SP3-32Bit
PC2//C2Q-Q2600k.GB-P67UD4-Freezer 7Pro,GTX285Leadtek,260 GB WeSata2,4x2GB GeilPC3,OCZ600W;Win7-64Bit;Siemens 22"stern
anko1
Posts: 438
Joined: Mon Dec 03, 2007 1:31 am
Hardware configuration: Old Faithful CPU: Windows Graphical 5.03; Intel Pentium 4 Processor 540
(3.2GHz) HT;Windows XP
Big Red: Windows SMP Console 6.29; Windows GPU console 6.20r1; Intel Q9450 2.66G; ASUS P5Q 775 P45; [BFG 9800GTX+ old graphics card] NVidia GeForce 8800 GTX [as of 5/9/09]; Windows XP Pro SP3
Lenovo Think Pad: Windows 6.29 w/ SMP; Windows GPU Console 6.20r1 systray; Intel QX9300; NVIDIA Quadro FX-3700M; Windows XP Professional
Location: SF Peninsula

Re: Project: 2665 (Run 1, Clone 888, Gen 117)

Post by anko1 »

Just catching up on my logs. I received this one, too, three times in a row:

Code: Select all

[15:25:12] Working on queue slot 06 [September 29 15:25:12 UTC]
[15:25:12] + Working ...
[15:25:12] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 06 -checkpoint 15 -verbose -lifeline 3680 -version 624'

[15:25:13] 
[15:25:13] *------------------------------*
[15:25:13] Folding@Home Gromacs SMP Core
[15:25:13] Version 1.74 (March 10, 2007)
[15:25:13] 
[15:25:13] Preparing to commence simulation
[15:25:13] - Ensuring status. Please wait.
[15:25:17] - Starting from initial work packet
[15:25:17] 
[15:25:17] Project: 2665 (Run 1, Clone 888, Gen 117)
[15:25:17] 
[15:25:18] Assembly optimizations on if available.
[15:25:18] Entering M.D.
[15:25:40] percent)
[15:25:40] - Starting from initial work packet
[15:25:40] 
[15:25:40] Project: 2665 (Run 1, Clone 888, Gen 117)
[15:25:40] 
[15:25:41] Entering M.D.
[15:25:47] Rejecting checkpoint
[15:25:48] NaN detected: x[7904][0]=4.82564 v[7904][0]=NaN
[15:25:48] ng output
[15:27:48] ome Core Shutdown: BAD_CORE_FILES
[15:27:48] 
[15:27:48] Folding@home Core Shutdown: BAD_CORE_FILES
[15:27:49] utdown: BAD_CORE_FILES
[15:27:49] Finalizing output
[15:29:53] CoreStatus = 1 (1)
[15:29:53] Sending work to server
[15:29:53] Project: 2665 (Run 1, Clone 888, Gen 117)
[15:29:53] - Error: Could not get length of results file work/wuresults_06.dat
[15:29:53] - Error: Could not read unit 06 file. Removing from queue.
ThunderRd
Posts: 78
Joined: Sun Dec 02, 2007 5:30 am
Location: Nong Khai, Thailand

Re: Project: 2665 (Run 1, Clone 888, Gen 117)

Post by ThunderRd »

Yep, confirmed here as well earlier tonight. I tried it twice, it immediately NaNs and fails with a log identical to the OP.

I had to dump it several times before I got another one that worked. Can we get it out of circulation?
Bobby-Uschi
Posts: 70
Joined: Thu Jul 31, 2008 3:26 pm
Hardware configuration: PC1//C2Q-Q9450,GA-X48-DS5-NinjaMini,GTX285,2x160GB Western Sata2,2x1GB Geil800,Tagan 800W;XP Pro SP3-32Bit;
PC2//C2Q-Q2600k.GB-P67UD4-Freezer 7Pro,GTX285Leadtek,260 GB Western Sata2,4x2GB GeilPC3,OCZ600W;Win7-64Bit;Siemens 22"
Location: Deutschland

Re: Project: 2665 (Run 1, Clone 888, Gen 117)

Post by Bobby-Uschi »

And again I have the same "Bade WU bekommenKann one does not benefit from the traffic?
roject: 2665 (Run 1, Clone 888, Gen 117)
[23:39:33]
[23:39:34] Assembly optimizations on if available.
[23:39:34] Entering M.D.
[23:39:59] percent)
[23:39:59] - Starting from initial work packet
[23:39:59]
[23:39:59] Project: 2665 (Run 1, Clone 888, Gen 117)
[23:39:59]
[23:40:02] Entering M.D.
[23:40:10] Rejecting checkpoint
[23:40:11] [0]=4.82564 v[7904][0]=NaN
[23:40:11]
[23:40:11] Folding@home Core Shutdown: BAD_CORE_FILES
[23:40:11] Finalizing output
[23:42:12] ES
[23:42:12]
[23:42:12] Folding@home Core Shutdown: BAD_CORE_FILES
[23:42:12] 4][0]=NaN
[23:42:12]
[23:42:12] Folding@home Core Shutdown: BAD_CORE_FILES
[23:42:12] Finalizing output
[23:44:14] CoreStatus = 1 (
Bob
PC1//C2Q-Q9450,GA-X48-DS5-,2xGTX285,2x160GB Western Sata2,2x1GB Geil800,Tagan 800W;XP Pro SP3-32Bit
PC2//C2Q-Q2600k.GB-P67UD4-Freezer 7Pro,GTX285Leadtek,260 GB WeSata2,4x2GB GeilPC3,OCZ600W;Win7-64Bit;Siemens 22"stern
Treefrog
Posts: 17
Joined: Thu Jun 19, 2008 1:41 pm
Hardware configuration: Windows 7 Professional 64 SP1
Intel Core2Quad 2.66Ghz
nVidia GeForce 275GTX (ForceWare 285.62)
4 GB RAM
Sound Blaster Audigy 2
FAH v7

Re: Project: 2665 (Run 1, Clone 888, Gen 117)

Post by Treefrog »

Code: Select all

--- Opening Log file [November 22 03:07:56 UTC] 


# Windows SMP Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.24R3

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Users\DannyG\FAH
Executable: C:\Users\DannyG\FAH\Folding@home-Win32-x86.exe
Arguments: -smp -local -verbosity 9 

[03:07:56] - Ask before connecting: No
[03:07:56] - User name: DannyG (Team 54376)
[03:07:56] - User ID: 7DF514D07F0EC14D
[03:07:56] - Machine ID: 1
[03:07:56] 
[03:07:57] Loaded queue successfully.
[03:07:57] 
[03:07:57] - Autosending finished units... [November 22 03:07:57 UTC]
[03:07:57] + Processing work unit
[03:07:57] Trying to send all finished work units
[03:07:57] Work type a1 not eligible for variable processors
[03:07:57] + No unsent completed units remaining.
[03:07:57] Core required: FahCore_a1.exe
[03:07:57] - Autosend completed
[03:07:57] Core found.
[03:07:57] Using generic mpiexec calls
[03:07:57] Working on queue slot 07 [November 22 03:07:57 UTC]
[03:07:57] + Working ...
[03:07:57] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 07 -checkpoint 10 -verbose -lifeline 5312 -version 624'

[03:07:57] 
[03:07:57] *------------------------------*
[03:07:57] Folding@Home Gromacs SMP Core
[03:07:57] Version 1.74 (March 10, 2007)
[03:07:57] 
[03:07:57] Preparing to commence simulation
[03:07:57] - Ensuring status. Please wait.
[03:08:14] - Looking at optimizations...
[03:08:14] - Working with standard loops on this execution.
[03:08:14] - Previous termination of core was improper.
[03:08:14] - Going to use standard loops.
[03:08:14] - Files status OK
[03:08:38] - Expanded 4697175 -> 24111057 (decompressed 513.3 percent)
[03:08:39] - Starting from initial work packet
[03:08:39] 
[03:08:39] Project: 2665 (Run 1, Clone 888, Gen 117)
[03:08:39] 
[03:08:40] Entering M.D.
[03:08:53] Rejecting checkpoint
[03:08:56] NaN detected: x[7904][0]=4.82564 v[7904][0]=NaN
[03:08:56] utdown: BAD_CORE_FILES
[03:08:56] Finalizing output
[03:10:56] NaN detected: x[12294][0]=-3159739.00000 v[12294][0]=NaN
[03:10:56] 
[03:10:56] Folding@home Core Shutdown: BAD_CORE_FILES
[03:10:56] Finalizing output
[03:12:38] Killing all core threads
[03:12:38] Killing 2 cores
[03:12:38] Killing core 0
[03:12:38] Killing core 1

Folding@Home Client Shutdown at user request.
[03:12:38] ***** Got a SIGTERM signal (2)
[03:12:38] Killing all core threads
[03:12:38] Killing 2 cores
[03:12:38] Killing core 0
[03:12:38] Killing core 1

Folding@Home Client Shutdown.


--- Opening Log file [November 22 13:36:47 UTC] 


# Windows SMP Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.24R3

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Users\DannyG\FAH
Executable: C:\Users\DannyG\FAH\Folding@home-Win32-x86.exe
Arguments: -smp -local -verbosity 9 

[13:36:47] - Ask before connecting: No
[13:36:47] - User name: DannyG (Team 54376)
[13:36:47] - User ID: 7DF514D07F0EC14D
[13:36:47] - Machine ID: 1
[13:36:47] 
[13:36:47] Loaded queue successfully.
[13:36:47] 
[13:36:47] + Processing work unit
[13:36:47] - Autosending finished units... [November 22 13:36:47 UTC]
[13:36:47] Work type a1 not eligible for variable processors
[13:36:47] Trying to send all finished work units
[13:36:47] Core required: FahCore_a1.exe
[13:36:47] + No unsent completed units remaining.
[13:36:47] Core found.
[13:36:47] - Autosend completed
[13:36:47] Using generic mpiexec calls
[13:36:47] Working on queue slot 07 [November 22 13:36:47 UTC]
[13:36:47] + Working ...
[13:36:47] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 07 -checkpoint 10 -verbose -lifeline 2540 -version 624'

[13:36:47] 
[13:36:47] *------------------------------*
[13:36:47] Folding@Home Gromacs SMP Core
[13:36:47] Version 1.74 (March 10, 2007)
[13:36:47] 
[13:36:47] Preparing to commence simulation
[13:36:47] - Looking at optimizations...
[13:36:48] .
[13:36:54] - Starting from initial work packet
[13:36:54] 
[13:36:54] Project: 2665 (Run 1, Clone 888, Gen 117)
[13:36:54] 
[13:36:54] Assembly optimizations on if available.
[13:36:54] Entering M.D.
[13:37:22] percent)
[13:37:23] cket
[13:37:23] 
[13:37:23] Project: 2665 (Run 1, Clone 888, Gen 117)
[13:37:23] 
[13:37:23] 5 (Run 1, Clone 888, Gen 117)
[13:37:23] 
[13:37:24] Entering M.D.
[13:37:33] Rejecting checkpoint
[13:37:35] NaN detected: x[7904][0]=4.82564 v[7904][0]=NaN
[13:37:35] ng output
[13:39:35] ome Core Shutdown: BAD_CORE_FILES
[13:39:35] 
[13:39:35] Folding@home Core Shutdown: BAD_CORE_FILES
[13:39:37] utdown: BAD_CORE_FILES
[13:39:37] Finalizing output
[13:41:29] Killing all core threads
[13:41:29] Killing 2 cores
[13:41:29] Killing core 0
[13:41:29] Killing core 1

Folding@Home Client Shutdown at user request.
[13:41:29] ***** Got a SIGTERM signal (2)
[13:41:29] Killing all core threads
[13:41:29] Killing 2 cores
[13:41:29] Killing core 0
[13:41:29] Killing core 1

Folding@Home Client Shutdown.


--- Opening Log file [November 22 13:41:56 UTC] 


# Windows SMP Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.24R3

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Users\DannyG\FAH
Executable: C:\Users\DannyG\FAH\Folding@home-Win32-x86.exe
Arguments: -smp -local -verbosity 9 

[13:41:56] - Ask before connecting: No
[13:41:56] - User name: DannyG (Team 54376)
[13:41:56] - User ID: 7DF514D07F0EC14D
[13:41:56] - Machine ID: 1
[13:41:56] 
[13:41:56] Loaded queue successfully.
[13:41:56] 
[13:41:56] - Autosending finished units... [November 22 13:41:56 UTC]
[13:41:56] + Processing work unit
[13:41:56] Work type a1 not eligible for variable processors
[13:41:56] Core required: FahCore_a1.exe
[13:41:56] Trying to send all finished work units
[13:41:56] Core found.
[13:41:56] + No unsent completed units remaining.
[13:41:56] Using generic mpiexec calls
[13:41:56] - Autosend completed
[13:41:56] Working on queue slot 07 [November 22 13:41:56 UTC]
[13:41:56] + Working ...
[13:41:56] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 07 -checkpoint 10 -verbose -lifeline 4632 -version 624'

[13:41:56] 
[13:41:56] *------------------------------*
[13:41:56] Folding@Home Gromacs SMP Core
[13:41:56] Version 1.74 (March 10, 2007)
[13:41:56] 
[13:41:56] Preparing to commence simulation
[13:41:56] - Ensuring status. Please wait.
[13:42:13] - Looking at optimizations...
[13:42:13] - Working with standard loops on this execution.
[13:42:13] - Previous termination of core was improper.
[13:42:13] - Going to use standard loops.
[13:42:13] - Files status OK
[13:44:14] 
[13:44:14] Folding@home Core Shutdown: MISSING_WORK_FILES
[13:44:14] Finalizing output
[13:44:18] CoreStatus = 1 (1)
[13:44:18] Sending work to server
[13:44:18] Project: 2665 (Run 1, Clone 888, Gen 117)
[13:44:18] - Error: Could not get length of results file work/wuresults_07.dat
[13:44:18] - Error: Could not read unit 07 file. Removing from queue.
[13:44:18] Trying to send all finished work units
[13:44:18] + No unsent completed units remaining.
[13:44:18] - Preparing to get new work unit...
[13:44:18] Cleaning up work directory
[13:44:18] + Attempting to get work packet
[13:44:18] - Will indicate memory of 2045 MB
[13:44:18] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 7, Stepping: 10
[13:44:18] - Connecting to assignment server
[13:44:18] Connecting to http://assign.stanford.edu:8080/
[13:44:19] Posted data.
[13:44:19] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[13:44:19] + News From Folding@Home: Welcome to Folding@Home
[13:44:19] Loaded queue successfully.
[13:44:19] Connecting to http://171.64.65.64:8080/
[13:44:25] Posted data.
[13:44:25] Initial: 0000; - Receiving payload (expected size: 4697687)
[13:45:39] - Downloaded at ~61 kB/s
[13:45:39] - Averaged speed for that direction ~70 kB/s
[13:45:39] + Received work.
[13:45:39] Trying to send all finished work units
[13:45:39] + No unsent completed units remaining.
[13:45:39] + Closed connections
[13:45:44] 
[13:45:44] + Processing work unit
[13:45:44] Work type a1 not eligible for variable processors
[13:45:44] Core required: FahCore_a1.exe
[13:45:44] Core found.
[13:45:44] Using generic mpiexec calls
[13:45:44] Working on queue slot 08 [November 22 13:45:44 UTC]
[13:45:44] + Working ...
[13:45:44] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 08 -checkpoint 10 -verbose -lifeline 4632 -version 624'

[13:45:44] 
[13:45:44] *------------------------------*
[13:45:44] Folding@Home Gromacs SMP Core
[13:45:44] Version 1.74 (March 10, 2007)
[13:45:44] 
[13:45:44] Preparing to commence simulation
[13:45:44] - Ensuring status. Please wait.
[13:46:01] - Looking at optimizations...
[13:46:01] - Working with standard loops on this execution.
[13:46:01] - Previous termination of core was improper.
[13:46:01] - Going to use standard loops.
[13:46:01] - Files status OK
[13:46:26] Starting from initial work packet
[13:46:26] 
[13:46:26] Project: 2665 (Run 1, Clone 888, Gen 117)
[13:46:26] 
[13:46:27] itial work packet
[13:46:27] 
[13:46:27] Project: 2665 (Run 1, Clone 888, Gen 117)
[13:46:27] 
[13:46:28] Entering M.D.
[13:46:35] Killing all core threads
[13:46:35] Killing 2 cores
[13:46:35] Killing core 0
[13:46:35] Killing core 1

Folding@Home Client Shutdown at user request.
[13:46:35] ***** Got a SIGTERM signal (2)
[13:46:35] Killing all core threads
[13:46:35] Killing 2 cores
[13:46:35] Killing core 0
[13:46:35] Killing core 1

Folding@Home Client Shutdown.


--- Opening Log file [November 22 13:46:42 UTC] 


# Windows SMP Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.24R3

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Users\DannyG\FAH
Executable: C:\Users\DannyG\FAH\Folding@home-Win32-x86.exe
Arguments: -smp -local -verbosity 9 

[13:46:42] - Ask before connecting: No
[13:46:42] - User name: DannyG (Team 54376)
[13:46:42] - User ID: 7DF514D07F0EC14D
[13:46:42] - Machine ID: 1
[13:46:42] 
[13:46:42] Loaded queue successfully.
[13:46:42] 
[13:46:42] - Autosending finished units... [November 22 13:46:42 UTC]
[13:46:42] + Processing work unit
[13:46:42] Trying to send all finished work units
[13:46:42] Work type a1 not eligible for variable processors
[13:46:42] + No unsent completed units remaining.
[13:46:42] Core required: FahCore_a1.exe
[13:46:42] - Autosend completed
[13:46:42] Core found.
[13:46:42] Using generic mpiexec calls
[13:46:42] Working on queue slot 08 [November 22 13:46:42 UTC]
[13:46:42] + Working ...
[13:46:42] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 08 -checkpoint 10 -verbose -lifeline 2432 -version 624'

[13:46:43] 
[13:46:43] *------------------------------*
[13:46:43] Folding@Home Gromacs SMP Core
[13:46:43] Version 1.74 (March 10, 2007)
[13:46:43] 
[13:46:43] Preparing to commence simulation
[13:46:43] - Looking at optimizations...
[13:46:43] - Created dyn
[13:46:43] - Files status OK
[13:46:43] 
[13:46:43] Folding@home Core Shutdown: MISSING_WORK_FILES
[13:46:43] Finalizing output
[13:47:00] ation of core was improper.
[13:47:00] - Going to use standard loops.
[13:47:00] - Files status OK
[13:49:00] 
[13:49:00] Folding@home Core Shutdown: MISSING_WORK_FILES
[13:49:00] Finalizing output
[13:49:04] CoreStatus = 1 (1)
[13:49:04] Sending work to server
[13:49:04] Project: 2665 (Run 1, Clone 888, Gen 117)
[13:49:04] - Error: Could not get length of results file work/wuresults_08.dat
[13:49:04] - Error: Could not read unit 08 file. Removing from queue.
[13:49:04] Trying to send all finished work units
[13:49:04] + No unsent completed units remaining.
[13:49:04] - Preparing to get new work unit...
[13:49:04] Cleaning up work directory
[13:49:04] + Attempting to get work packet
[13:49:04] - Will indicate memory of 2045 MB
[13:49:04] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 7, Stepping: 10
[13:49:04] - Connecting to assignment server
[13:49:04] Connecting to http://assign.stanford.edu:8080/
[13:49:05] Posted data.
[13:49:05] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[13:49:05] + News From Folding@Home: Welcome to Folding@Home
[13:49:05] Loaded queue successfully.
[13:49:05] Connecting to http://171.64.65.64:8080/
[13:49:10] Posted data.
[13:49:11] Initial: 0000; - Receiving payload (expected size: 4697687)
[13:51:07] - Downloaded at ~39 kB/s
[13:51:07] - Averaged speed for that direction ~63 kB/s
[13:51:07] + Received work.
[13:51:07] Trying to send all finished work units
[13:51:07] + No unsent completed units remaining.
[13:51:07] + Closed connections
[13:51:12] 
[13:51:12] + Processing work unit
[13:51:12] Work type a1 not eligible for variable processors
[13:51:12] Core required: FahCore_a1.exe
[13:51:12] Core found.
[13:51:12] Using generic mpiexec calls
[13:51:12] Working on queue slot 09 [November 22 13:51:12 UTC]
[13:51:12] + Working ...
[13:51:12] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 09 -checkpoint 10 -verbose -lifeline 2432 -version 624'

[13:51:12] 
[13:51:12] *------------------------------*
[13:51:12] Folding@Home Gromacs SMP Core
[13:51:12] Version 1.74 (March 10, 2007)
[13:51:12] 
[13:51:12] Preparing to commence simulation
[13:51:12] - Ensuring status. Please wait.
[13:51:29] - Looking at optimizations...
[13:51:29] - Working with standard loops on this execution.
[13:51:29] - Previous termination of core was improper.
[13:51:29] - Going to use standard loops.
[13:51:29] - Files status OK
[13:51:52] - Expanded 4697175 -> 24111057 (decompressed 513.3 percent)
[13:51:53] - Starting from initial work packet
[13:51:53] 
[13:51:53] Project: 2665 (Run 1, Clone 888, Gen 117)
[13:51:53] 
[13:51:54] Entering M.D.
[13:52:04] Rejecting checkpoint
[13:52:05] Killing all core threads
[13:52:05] Killing 2 cores
[13:52:05] Killing core 0
[13:52:05] Killing core 1

Folding@Home Client Shutdown at user request.
[13:52:05] ***** Got a SIGTERM signal (2)
[13:52:05] Killing all core threads
[13:52:05] Killing 2 cores
[13:52:05] Killing core 0
[13:52:05] Killing core 1

Folding@Home Client Shutdown.
I too got the same issue, took me a few attempts to get a new working WU.
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Project: 2665 (Run 1, Clone 888, Gen 117)

Post by bruce »

I've notified the project owner, in case he hasn't noticed this topic.
eliot1785
Posts: 78
Joined: Sat Dec 15, 2007 11:51 pm
Location: Cambridge, MA

Re: Project: 2665 (Run 1, Clone 888, Gen 117)

Post by eliot1785 »

In case it helps, I got the same issue, here are my logs:

Code: Select all

[05:22:40] + Attempting to get work packet
[05:22:40] - Will indicate memory of 3581 MB
[05:22:40] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 7, Stepping: 6
[05:22:40] - Connecting to assignment server
[05:22:40] Connecting to http://assign.stanford.edu:8080/
[05:22:41] Posted data.
[05:22:41] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[05:22:41] + News From Folding@Home: Welcome to Folding@Home
[05:22:41] Loaded queue successfully.
[05:22:41] Connecting to http://171.64.65.64:8080/
[05:22:46] Posted data.
[05:22:46] Initial: 0000; - Receiving payload (expected size: 4697687)
[05:22:58] - Downloaded at ~382 kB/s
[05:22:58] - Averaged speed for that direction ~344 kB/s
[05:22:58] + Received work.
[05:22:59] Trying to send all finished work units
[05:22:59] + No unsent completed units remaining.
[05:22:59] + Closed connections
[05:22:59] 
[05:22:59] + Processing work unit
[05:22:59] Work type a1 not eligible for variable processors
[05:22:59] Core required: FahCore_a1.exe
[05:22:59] Core found.
[05:22:59] Working on queue slot 09 [November 25 05:22:59 UTC]
[05:22:59] + Working ...
[05:22:59] - Calling 'mpiexec -np 4 -channel shm -env MPICH_USE_SMP_OPTIMIZATIONS 1 -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 09 -checkpoint 10 -verbose -lifeline 2620 -version 624'

[05:23:04] 
[05:23:04] *------------------------------*
[05:23:04] Folding@Home Gromacs SMP Core
[05:23:04] Version 1.76 (February 23, 2008)
[05:23:04] 
[05:23:04] Preparing to commence simulation
[05:23:04] - Looking at optimizations...
[05:23:04] - Created dyn
[05:23:04] - Files status OK
[05:23:04]  this execution.
[05:23:04] - Previous termination of core was improper.
[05:23:04] - Going to use standard loops.
[05:23:04] - Files status OK
[05:23:23] (decompressed 513.3 percent)
[05:23:23] 7 (decompressed 513.3 percent)
[05:23:23] -keta
[05:23:24] Project: 2665 (Run 1, Clone 888, Gen 117)
[05:23:24] 
[05:23:24] 5 (Run 1, Clone 888, Gen 117)
[05:23:24] 
[05:23:35] Entering M.D.
[05:23:44] NaN detected: x[7904][0]=4.82564 v[7904][0]=NaN
[05:23:44] D_CORE_FILES
[05:23:44] Finalizing output
[05:25:44] _FILES
[05:25:44] FILES
[05:25:44] Finalizing output
[05:26:11] 7aN detected: x[12294][0]=-315973g@home Core Shutdown: BAD_CORE_FILES
[05:26:11] Finalizing output
[05:28:11] D_CORE_FILES
[05:28:11] Finalizing output
[05:28:14] CoreStatus = 1 (1)
[05:28:14] Sending work to server
[05:28:14] Project: 2665 (Run 1, Clone 888, Gen 117)
[05:28:14] - Error: Could not get length of results file work/wuresults_09.dat
[05:28:14] - Error: Could not read unit 09 file. Removing from queue.
[05:28:14] Trying to send all finished work units
[05:28:14] + No unsent completed units remaining.
[05:28:14] - Preparing to get new work unit...
[05:28:14] Cleaning up work directory
[05:28:14] + Attempting to get work packet
[05:28:14] - Will indicate memory of 3581 MB
[05:28:14] - Connecting to assignment server
[05:28:14] Connecting to http://assign.stanford.edu:8080/
[05:28:14] Posted data.
[05:28:14] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[05:28:14] + News From Folding@Home: Welcome to Folding@Home
[05:28:15] Loaded queue successfully.
[05:28:15] Connecting to http://171.64.65.64:8080/
[05:28:20] Posted data.
[05:28:20] Initial: 0000; - Receiving payload (expected size: 4697687)
[05:28:34] - Downloaded at ~327 kB/s
[05:28:34] - Averaged speed for that direction ~341 kB/s
[05:28:34] + Received work.
[05:28:34] Trying to send all finished work units
[05:28:34] + No unsent completed units remaining.
[05:28:34] + Closed connections
[05:28:39] 
[05:28:39] + Processing work unit
[05:28:39] Work type a1 not eligible for variable processors
[05:28:39] Core required: FahCore_a1.exe
[05:28:39] Core found.
[05:28:39] Working on queue slot 00 [November 25 05:28:39 UTC]
[05:28:39] + Working ...
[05:28:39] - Calling 'mpiexec -np 4 -channel shm -env MPICH_USE_SMP_OPTIMIZATIONS 1 -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 00 -checkpoint 10 -verbose -lifeline 2620 -version 624'

[05:28:43] 
[05:28:43] *------------------------------*
[05:28:43] Folding@Home Gromacs SMP Core
[05:28:43] Version 1.76 (February 23, 2008)
[05:28:43] 
[05:28:43] Preparing to commence simulation
[05:28:43] - Ensuring status. Please wait.
[05:28:50] - Starting from initial work packet
[05:28:50] 
[05:28:50] Project: 2665 (Run 1, Clone 888, Gen 117)
[05:28:50] 
[05:28:50] Assembly optimizations on if available.
[05:28:50] Entering M.D.
[05:29:11] l work packet
[05:29:11] 
[05:29:11] Project: 2665 (Run 1, Clone 888, Gen 117)
[05:29:11] 
[05:29:21] 5 (Run 1, Clone 888, Gen 117)
[05:29:21] 
[05:29:27] Entering M.D.
[05:29:46] 4 v[7904][0]=NaN
[05:29:46] 
[05:29:46] Folding@home Core Shutdown: BAD_CORE_FILES
[05:29:46] Finalizing output
[05:31:48] D_CORE_FILES
[05:31:48] Finalizing output
[05:31:53] NaN detected: x[12294][0]=-3159739.00000 v[12294][0]=NaN
[05:31:53] D_CORE_FILES
[05:31:53] Finalizing output
[05:33:53] D_CORE_FILES
[05:33:53] Finalizing output
[05:33:58] CoreStatus = 1 (1)
[05:33:58] Sending work to server
[05:33:58] Project: 2665 (Run 1, Clone 888, Gen 117)
[05:33:58] - Error: Could not get length of results file work/wuresults_00.dat
[05:33:58] - Error: Could not read unit 00 file. Removing from queue.
[05:33:58] Trying to send all finished work units
[05:33:58] + No unsent completed units remaining.
[05:33:58] - Preparing to get new work unit...
[05:33:58] Cleaning up work directory
[05:33:58] + Attempting to get work packet
[05:33:58] - Will indicate memory of 3581 MB
[05:33:58] - Connecting to assignment server
[05:33:58] Connecting to http://assign.stanford.edu:8080/
[05:33:59] Posted data.
[05:33:59] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[05:33:59] + News From Folding@Home: Welcome to Folding@Home
[05:33:59] Loaded queue successfully.
[05:33:59] Connecting to http://171.64.65.64:8080/
[05:34:04] Posted data.
[05:34:04] Initial: 0000; - Receiving payload (expected size: 4697687)
[05:34:17] - Downloaded at ~352 kB/s
[05:34:17] - Averaged speed for that direction ~343 kB/s
[05:34:17] + Received work.
[05:34:18] Trying to send all finished work units
[05:34:18] + No unsent completed units remaining.
[05:34:18] + Closed connections
[05:34:23] 
[05:34:23] + Processing work unit
[05:34:23] Work type a1 not eligible for variable processors
[05:34:23] Core required: FahCore_a1.exe
[05:34:23] Core found.
[05:34:23] Working on queue slot 01 [November 25 05:34:23 UTC]
[05:34:23] + Working ...
[05:34:23] - Calling 'mpiexec -np 4 -channel shm -env MPICH_USE_SMP_OPTIMIZATIONS 1 -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 01 -checkpoint 10 -verbose -lifeline 2620 -version 624'

[05:34:27] 
[05:34:27] *------------------------------*
[05:34:27] Folding@Home Gromacs SMP Core
[05:34:27] Version 1.76 (February 23, 2008)
[05:34:27] 
[05:34:27] Preparing to commence simulation
[05:34:27] - Looking at optimizations...
[05:34:27] - Working with standard loops on this execution.
[05:34:27] - Previous termination of core was improper.
[05:34:27] - Going to use standard loops.
[05:34:27] - Files status OK
[05:34:45] - Expanded 4697175 -> 24111057 (decompressed 513.3 percent)
[05:34:52] - Starting from initial work packet
[05:34:53]  (Run 1, Clone 888, Gen 117)
[05:34:53] 
[05:34:53] 88, Gen 117)
[05:34:53] 
[05:34:53] e 888, Gen 117)
[05:34:53] 
[05:34:56] Entering M.D.
[05:35:04] ed: x[7904][0]=4.82564 v[7904][0]=NaN
[05:35:04] 4][0]=NaN
[05:35:04] 
[05:35:04] Folding@home Core Shutdown: BAD_CORE_FILES
[05:35:04] Finalizing output
[05:37:04]  BAD_CORE_FILES
[05:37:05] 9.00000 v[12294][0]=NaN
[05:37:05]  v[12294][0]=NaN
[05:37:05] 
[05:37:05] Folding@home Core Shutdown: BAD_CORE_FILES
[05:37:05] Finalizing output
[05:39:09] CoreStatus = 1 (1)
[05:39:09] Sending work to server
[05:39:09] Project: 2665 (Run 1, Clone 888, Gen 117)
[05:39:09] - Error: Could not get length of results file work/wuresults_01.dat
[05:39:09] - Error: Could not read unit 01 file. Removing from queue.
[05:39:09] Trying to send all finished work units
[05:39:09] + No unsent completed units remaining.
[05:39:09] - Preparing to get new work unit...
[05:39:09] Cleaning up work directory
[05:39:09] + Attempting to get work packet
[05:39:09] - Will indicate memory of 3581 MB
[05:39:09] - Connecting to assignment server
[05:39:09] Connecting to http://assign.stanford.edu:8080/
[05:39:12] Posted data.
[05:39:12] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[05:39:12] + News From Folding@Home: Welcome to Folding@Home
[05:39:12] Loaded queue successfully.
[05:39:12] Connecting to http://171.64.65.64:8080/
[05:39:14] Posted data.
[05:39:14] Initial: 0000; - Error: Bad packet type from server, expected work assignment
[05:39:15] - Attempt #1  to get work failed, and no other work to do.
Waiting before retry.
[05:39:31] + Attempting to get work packet
[05:39:31] - Will indicate memory of 3581 MB
[05:39:31] - Connecting to assignment server
[05:39:31] Connecting to http://assign.stanford.edu:8080/
[05:39:32] Posted data.
[05:39:32] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[05:39:32] + News From Folding@Home: Welcome to Folding@Home
[05:39:32] Loaded queue successfully.
[05:39:32] Connecting to http://171.64.65.64:8080/
[05:39:38] Posted data.
[05:39:38] Initial: 0000; - Receiving payload (expected size: 4736301)
[05:39:48] - Downloaded at ~462 kB/s
[05:39:48] - Averaged speed for that direction ~367 kB/s
[05:39:48] + Received work.
[05:39:48] Trying to send all finished work units
[05:39:48] + No unsent completed units remaining.
[05:39:48] + Closed connections
[05:39:53] 
[05:39:53] + Processing work unit
[05:39:53] Work type a1 not eligible for variable processors
[05:39:53] Core required: FahCore_a1.exe
[05:39:53] Core found.
[05:39:53] Working on queue slot 02 [November 25 05:39:53 UTC]
[05:39:53] + Working ...
[05:39:53] - Calling 'mpiexec -np 4 -channel shm -env MPICH_USE_SMP_OPTIMIZATIONS 1 -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 02 -checkpoint 10 -verbose -lifeline 2620 -version 624'

[05:39:57] 
[05:39:57] *------------------------------*
[05:39:57] Folding@Home Gromacs SMP Core
[05:39:57] Version 1.76 (February 23, 2008)
[05:39:57] 
[05:39:57] Preparing to commence simulation
[05:39:57] - Ensuring status. Please wait.
[05:40:03] - Starting from initial work packet
[05:40:03] 
[05:40:03] Project: 2665 (Run 3, Clone 540, Gen 149)
[05:40:03] 
[05:40:03] Assembly optimizations on if available.
[05:40:03] Entering M.D.
[05:40:26] percent)
[05:40:26] - Starting from initial work packet
[05:40:26] 
[05:40:26] Project: 2665 (Run 3, Clone 540, Gen 149)
[05:40:26] 
[05:40:30] Entering M.D.
[05:40:39] GG in water
[05:40:39] Writing local files
[05:40:39] cal files
[05:40:40] Extra SSE boost OK.
[05:40:48] cal files
[05:40:49] Completed 0 out of 250000 steps  (0 percent)
[05:50:48] Timered checkpoint triggered.
[06:00:49] Timered checkpoint triggered.
[06:01:24] Writing local files
[06:01:24] Completed 2500 out of 250000 steps  (1 percent)
kasson
Pande Group Member
Posts: 1459
Joined: Thu Nov 29, 2007 9:37 pm

Re: Project: 2665 (Run 1, Clone 888, Gen 117)

Post by kasson »

This one has been stopped.
Post Reply