#636 FAH fails to start core 15 to process workunit on GPU

Moderators: Site Moderators, PandeGroup

#636 FAH fails to start core 15 to process workunit on GPU

Postby anon1 » Mon May 02, 2011 11:04 am

Hello all, FAH can't start a task on my GTX 550 Ti, driver 270.61 using core15. Any suggestions on how to get it work? Thanks. Here is the log:

Code: Select all
10:54:36:Added folding slot
10:54:36:Saving configuration to config.xml
10:54:36:<config>
10:54:36:  <!-- FahCore Control -->
10:54:36:  <checkpoint v='3'/>
10:54:36:
10:54:36:  <!-- Folding Slot Configuration -->
10:54:36:  <gpu v='true'/>
10:54:36:
10:54:36:  <!-- Logging -->
10:54:36:  <verbosity v='5'/>
10:54:36:
10:54:36:  <!-- Network -->
10:54:36:  <proxy v=':8080'/>
10:54:36:
10:54:36:  <!-- User Information -->
10:54:36:  <passkey v='********************************'/>
10:54:36:  <team v='111065'/>
10:54:36:  <user v='anon1'/>
10:54:36:
10:54:36:  <!-- Folding Slots -->
10:54:36:  <slot id='0' type='GPU'/>
10:54:36:  <slot id='1' type='SMP'/>
10:54:36:</config>
10:54:36:Connecting to assign3.stanford.edu:8080
10:54:37:News: Welcome to Folding@Home
10:54:37:Assigned to work server 128.143.199.96
10:54:37:Requesting new work unit for slot 01: READY smp:2 from 128.143.199.96
10:54:37:Connecting to 128.143.199.96:8080
10:54:39:Slot 01: Downloading 1.69MiB
10:54:45:Slot 01: 86.10%
10:54:45:Slot 01: Download complete
10:54:45:Received Unit: id:01 state:DOWNLOAD project:6956 run:0 clone:81 gen:58 core:0xa3 unit:0x00000040fbcb017c4d80cbde159fe0c2
10:54:45:Downloading core from http://www.stanford.edu/~pande/Win32/x86/Core_a3.fah
10:54:45:Connecting to www.stanford.edu:80
10:54:46:WARNING: FahCore type in core package seems to be in wrong byte order.
10:54:46:FahCore a3: Downloading 2.89MiB
10:54:52:FahCore a3: 29.62%
10:54:58:FahCore a3: 54.38%
10:55:04:FahCore a3: 84.92%
10:55:06:FahCore a3: Download complete
10:55:06:Valid core signature
10:55:07:Unpacked 9.59MiB to cores/www.stanford.edu/~pande/Win32/x86/Core_a3.fah/FahCore_a3.exe
10:55:07:Starting Unit 01
10:55:07:Running core: C:/Users/Lim/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/Core_a3.fah/FahCore_a3.exe -dir 01 -suffix 01 -lifeline 3516 -version 701 -checkpoint 3 -np 2
10:55:10:Started core on PID 4708
10:55:10:FahCore 0xa3 started
10:55:10:Started thread 13 on PID 3516
10:55:10:Unit 01:
10:55:10:Unit 01:*------------------------------*
10:55:10:Unit 01:Folding@Home Gromacs SMP Core
10:55:10:Unit 01:Version 2.27 (Dec. 15, 2010)
10:55:10:Unit 01:
10:55:10:Unit 01:Preparing to commence simulation
10:55:10:Unit 01:- Looking at optimizations...
10:55:10:Unit 01:- Created dyn
10:55:10:Unit 01:- Files status OK
10:55:10:Unit 01:- Expanded 1769123 -> 1957708 (decompressed 110.6 percent)
10:55:10:Unit 01:Called DecompressByteArray: compressed_data_size=1769123 data_size=1957708, decompressed_data_size=1957708 diff=0
10:55:10:Unit 01:- Digital signature verified
10:55:10:Unit 01:
10:55:10:Unit 01:Project: 6956 (Run 0, Clone 81, Gen 58)
10:55:10:Unit 01:
10:55:10:Unit 01:Assembly optimizations on if available.
10:55:10:Unit 01:Entering M.D.
10:55:17:Unit 01:Mapping NT from 2 to 2
10:55:17:Unit 01:Completed 0 out of 500000 steps  (0%)
10:55:55:Server connection id=2 on 0.0.0.0:36330 from 127.0.0.1
10:55:55:Started thread 14 on PID 3516
10:56:09:Downloading core from http://www.stanford.edu/~pande/Win32/x86/NVIDIA/Fermi/Core_15.fah
10:56:09:Connecting to www.stanford.edu:80
10:56:09:FahCore 15: Downloading 1.36MiB
10:56:14:Server connection id=2 ended
10:56:16:FahCore 15: 66.83%
10:56:18:FahCore 15: Download complete
10:56:18:Valid core signature
10:56:18:WARNING: FahCore has not changed since last download, aborting core update
10:56:56:Starting Unit 00
10:56:56:Running core: C:/Users/Lim/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/NVIDIA/Fermi/Core_15.fah/FahCore_15.exe -dir 00 -suffix 01 -lifeline 3516 -version 701 -checkpoint 3 -gpu 0
10:56:56:Started core on PID 1652
10:56:56:FahCore 0x15 started
10:56:56:Started thread 15 on PID 3516
10:56:57:Unit 00:
10:56:57:Unit 00:*------------------------------*
10:56:57:Unit 00:Folding@Home GPU Core
10:56:57:Unit 00:Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
10:56:57:Unit 00:
10:56:57:Unit 00:Build host: SimbiosNvdWin7
10:56:57:Unit 00:Board Type: NVIDIA/CUDA
10:56:57:Unit 00:Core      : x=15
10:56:57:Unit 00: Window's signal control handler registered.
10:56:57:Unit 00:Preparing to commence simulation
10:56:57:Unit 00:- Ensuring status. Please wait.
10:57:06:Unit 00:- Looking at optimizations...
10:57:06:Unit 00:- Working with standard loops on this execution.
10:57:06:Unit 00:- Previous termination of core was improper.
10:57:06:Unit 00:- Going to use standard loops.
10:57:06:Unit 00:- Files status OK
10:57:06:Unit 00:sizeof(CORE_PACKET_HDR) = 512 file=<>
10:57:06:Unit 00:- Expanded 43606 -> 172159 (decompressed 394.8 percent)
10:57:06:Unit 00:Called DecompressByteArray: compressed_data_size=43606 data_size=172159, decompressed_data_size=172159 diff=0
10:57:06:Unit 00:- Digital signature verified
10:57:06:Unit 00:
10:57:06:Unit 00:Project: 6806 (Run 6506, Clone 2, Gen 22)
10:57:06:Unit 00:
10:57:06:Unit 00:Entering M.D.
10:57:08:Unit 00:Tpr hash 00/wudata_01.tpr:  3942867997 3152637917 3192897429 3589460235 883589577
10:57:08:Unit 00:Working on 2 PEPTIDE (1-42)
10:57:08:Unit 00:Client config unavailable.
10:57:08:FahCore, running Unit 00, returned: UNKNOWN_ENUM (-1)
10:57:08:WARNING: Unit 00 Too many errors, failing
10:57:08:Sending unit results: id:00 state:SEND project:6806 run:6506 clone:2 gen:22 core:0x15 unit:0x000000160a3b1e644d94c83ec8fe3c05
10:57:08:Connecting to 171.64.65.64:8080
10:57:09:Connecting to assign-GPU.stanford.edu:80
10:57:09:Server responded WORK_QUIT (404)
10:57:09:WARNING: Server did not like results, dumping
10:57:09:Cleaning up Unit 00
10:57:09:News: Welcome to Folding@Home
10:57:09:Assigned to work server 171.64.65.64
10:57:09:Requesting new work unit for slot 00: READY gpu:0:"GF110 [GeForce GTX 590 Ti]" from 171.64.65.64
10:57:09:Connecting to 171.64.65.64:8080
10:57:10:Slot 00: Downloading 41.07KiB
10:57:11:Slot 00: Download complete
10:57:11:Received Unit: id:02 state:DOWNLOAD project:6805 run:7007 clone:0 gen:17 core:0x15 unit:0x000000110a3b1e644d8d333cfe1c7267
10:57:11:Starting Unit 02
10:57:11:Running core: C:/Users/Lim/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/NVIDIA/Fermi/Core_15.fah/FahCore_15.exe -dir 02 -suffix 01 -lifeline 3516 -version 701 -checkpoint 3 -gpu 0
10:57:11:Started core on PID 4328
10:57:11:FahCore 0x15 started
10:57:11:Started thread 16 on PID 3516
10:57:12:Unit 02:
10:57:12:Unit 02:*------------------------------*
10:57:12:Unit 02:Folding@Home GPU Core
10:57:12:Unit 02:Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
10:57:12:Unit 02:
10:57:12:Unit 02:Build host: SimbiosNvdWin7
10:57:12:Unit 02:Board Type: NVIDIA/CUDA
10:57:12:Unit 02:Core      : x=15
10:57:12:Unit 02: Window's signal control handler registered.
10:57:12:Unit 02:Preparing to commence simulation
10:57:12:Unit 02:- Looking at optimizations...
10:57:12:Unit 02:DeleteFrameFiles: successfully deleted file=02/wudata_01.ckp
10:57:12:Unit 02:- Created dyn
10:57:12:Unit 02:- Files status OK
10:57:12:Unit 02:sizeof(CORE_PACKET_HDR) = 512 file=<>
10:57:12:Unit 02:- Expanded 41541 -> 162639 (decompressed 391.5 percent)
10:57:12:Unit 02:Called DecompressByteArray: compressed_data_size=41541 data_size=162639, decompressed_data_size=162639 diff=0
10:57:12:Unit 02:- Digital signature verified
10:57:12:Unit 02:
10:57:12:Unit 02:Project: 6805 (Run 7007, Clone 0, Gen 17)
10:57:12:Unit 02:
10:57:12:Unit 02:Assembly optimizations on if available.
10:57:12:Unit 02:Entering M.D.
10:57:14:Unit 02:Tpr hash 02/wudata_01.tpr:  2067223090 1468347899 1626797930 2826755208 1313848930
10:57:14:Unit 02:Working on ALZHEIMER'S DISEASE AMYLOID
10:57:14:Unit 02:Client config unavailable.
10:57:14:FahCore, running Unit 02, returned: UNKNOWN_ENUM (-1)
10:57:14:Starting Unit 02
10:57:14:Running core: C:/Users/Lim/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/NVIDIA/Fermi/Core_15.fah/FahCore_15.exe -dir 02 -suffix 01 -lifeline 3516 -version 701 -checkpoint 3 -gpu 0
10:57:14:Started core on PID 2372
10:57:14:FahCore 0x15 started
10:57:14:Started thread 17 on PID 3516
10:57:15:Unit 02:
10:57:15:Unit 02:*------------------------------*
10:57:15:Unit 02:Folding@Home GPU Core
10:57:15:Unit 02:Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
10:57:15:Unit 02:
10:57:15:Unit 02:Build host: SimbiosNvdWin7
10:57:15:Unit 02:Board Type: NVIDIA/CUDA
10:57:15:Unit 02:Core      : x=15
10:57:15:Unit 02: Window's signal control handler registered.
10:57:15:Unit 02:Preparing to commence simulation
10:57:15:Unit 02:- Ensuring status. Please wait.
10:57:24:Unit 02:- Looking at optimizations...
10:57:24:Unit 02:- Working with standard loops on this execution.
10:57:24:Unit 02:- Previous termination of core was improper.
10:57:24:Unit 02:- Files status OK
10:57:24:Unit 02:sizeof(CORE_PACKET_HDR) = 512 file=<>
10:57:24:Unit 02:- Expanded 41541 -> 162639 (decompressed 391.5 percent)
10:57:24:Unit 02:Called DecompressByteArray: compressed_data_size=41541 data_size=162639, decompressed_data_size=162639 diff=0
10:57:24:Unit 02:- Digital signature verified
10:57:24:Unit 02:
10:57:24:Unit 02:Project: 6805 (Run 7007, Clone 0, Gen 17)
10:57:24:Unit 02:
10:57:24:Unit 02:Entering M.D.
10:57:26:Unit 02:Tpr hash 02/wudata_01.tpr:  2067223090 1468347899 1626797930 2826755208 1313848930
10:57:26:Unit 02:Working on ALZHEIMER'S DISEASE AMYLOID
10:57:26:Unit 02:Client config unavailable.
10:57:26:FahCore, running Unit 02, returned: UNKNOWN_ENUM (-1)
10:58:14:Starting Unit 02
10:58:14:Running core: C:/Users/Lim/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/NVIDIA/Fermi/Core_15.fah/FahCore_15.exe -dir 02 -suffix 01 -lifeline 3516 -version 701 -checkpoint 3 -gpu 0
10:58:14:Started core on PID 1884
10:58:14:FahCore 0x15 started
10:58:14:Started thread 18 on PID 3516
10:58:15:Unit 02:
10:58:15:Unit 02:*------------------------------*
10:58:15:Unit 02:Folding@Home GPU Core
10:58:15:Unit 02:Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
10:58:15:Unit 02:
10:58:15:Unit 02:Build host: SimbiosNvdWin7
10:58:15:Unit 02:Board Type: NVIDIA/CUDA
10:58:15:Unit 02:Core      : x=15
10:58:15:Unit 02: Window's signal control handler registered.
10:58:15:Unit 02:Preparing to commence simulation
10:58:15:Unit 02:- Ensuring status. Please wait.
10:58:24:Unit 02:- Looking at optimizations...
10:58:24:Unit 02:- Working with standard loops on this execution.
10:58:24:Unit 02:- Previous termination of core was improper.
10:58:24:Unit 02:- Going to use standard loops.
10:58:24:Unit 02:- Files status OK
10:58:24:Unit 02:sizeof(CORE_PACKET_HDR) = 512 file=<>
10:58:24:Unit 02:- Expanded 41541 -> 162639 (decompressed 391.5 percent)
10:58:24:Unit 02:Called DecompressByteArray: compressed_data_size=41541 data_size=162639, decompressed_data_size=162639 diff=0
10:58:24:Unit 02:- Digital signature verified
10:58:24:Unit 02:
10:58:24:Unit 02:Project: 6805 (Run 7007, Clone 0, Gen 17)
10:58:24:Unit 02:
10:58:24:Unit 02:Entering M.D.
10:58:26:Unit 02:Tpr hash 02/wudata_01.tpr:  2067223090 1468347899 1626797930 2826755208 1313848930
10:58:26:Unit 02:Working on ALZHEIMER'S DISEASE AMYLOID
10:58:26:Unit 02:Client config unavailable.
10:58:26:Unit 02:Starting GUI Server
10:58:26:FahCore, running Unit 02, returned: UNKNOWN_ENUM (-1)
10:59:52:Starting Unit 02
10:59:52:Running core: C:/Users/Lim/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/NVIDIA/Fermi/Core_15.fah/FahCore_15.exe -dir 02 -suffix 01 -lifeline 3516 -version 701 -checkpoint 3 -gpu 0
10:59:52:Started core on PID 3536
10:59:52:FahCore 0x15 started
10:59:52:Started thread 19 on PID 3516
10:59:52:Unit 02:
10:59:52:Unit 02:*------------------------------*
10:59:52:Unit 02:Folding@Home GPU Core
10:59:52:Unit 02:Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
10:59:52:Unit 02:
10:59:52:Unit 02:Build host: SimbiosNvdWin7
10:59:52:Unit 02:Board Type: NVIDIA/CUDA
10:59:52:Unit 02:Core      : x=15
10:59:52:Unit 02: Window's signal control handler registered.
10:59:52:Unit 02:Preparing to commence simulation
10:59:52:Unit 02:- Ensuring status. Please wait.
11:00:02:Unit 02:- Looking at optimizations...
11:00:02:Unit 02:- Working with standard loops on this execution.
11:00:02:Unit 02:- Previous termination of core was improper.
11:00:02:Unit 02:- Going to use standard loops.
11:00:02:Unit 02:- Files status OK
11:00:02:Unit 02:sizeof(CORE_PACKET_HDR) = 512 file=<>
11:00:02:Unit 02:- Expanded 41541 -> 162639 (decompressed 391.5 percent)
11:00:02:Unit 02:Called DecompressByteArray: compressed_data_size=41541 data_size=162639, decompressed_data_size=162639 diff=0
11:00:02:Unit 02:- Digital signature verified
11:00:02:Unit 02:
11:00:02:Unit 02:Project: 6805 (Run 7007, Clone 0, Gen 17)
11:00:02:Unit 02:
11:00:02:Unit 02:Entering M.D.
11:00:04:Unit 02:Tpr hash 02/wudata_01.tpr:  2067223090 1468347899 1626797930 2826755208 1313848930
11:00:04:Unit 02:Working on ALZHEIMER'S DISEASE AMYLOID
11:00:04:Unit 02:Client config unavailable.
11:00:04:FahCore, running Unit 02, returned: UNKNOWN_ENUM (-1)
11:01:42:Downloading core from http://www.stanford.edu/~pande/Win32/x86/NVIDIA/Fermi/Core_15.fah
11:01:42:Connecting to www.stanford.edu:80
11:01:42:FahCore 15: Downloading 1.36MiB
11:01:48:FahCore 15: 60.49%
11:01:51:FahCore 15: Download complete
11:01:51:Valid core signature
11:01:51:WARNING: FahCore has not changed since last download, aborting core update
11:02:29:Starting Unit 02
11:02:29:Running core: C:/Users/Lim/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/NVIDIA/Fermi/Core_15.fah/FahCore_15.exe -dir 02 -suffix 01 -lifeline 3516 -version 701 -checkpoint 3 -gpu 0
11:02:29:Started core on PID 1248
11:02:29:FahCore 0x15 started
11:02:29:Started thread 20 on PID 3516
11:02:29:Unit 02:
11:02:29:Unit 02:*------------------------------*
11:02:29:Unit 02:Folding@Home GPU Core
11:02:29:Unit 02:Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
11:02:29:Unit 02:
11:02:29:Unit 02:Build host: SimbiosNvdWin7
11:02:29:Unit 02:Board Type: NVIDIA/CUDA
11:02:29:Unit 02:Core      : x=15
11:02:29:Unit 02: Window's signal control handler registered.
11:02:29:Unit 02:Preparing to commence simulation
11:02:29:Unit 02:- Ensuring status. Please wait.
11:02:39:Unit 02:- Looking at optimizations...
11:02:39:Unit 02:- Working with standard loops on this execution.
11:02:39:Unit 02:- Previous termination of core was improper.
11:02:39:Unit 02:- Going to use standard loops.
11:02:39:Unit 02:- Files status OK
11:02:39:Unit 02:sizeof(CORE_PACKET_HDR) = 512 file=<>
11:02:39:Unit 02:- Expanded 41541 -> 162639 (decompressed 391.5 percent)
11:02:39:Unit 02:Called DecompressByteArray: compressed_data_size=41541 data_size=162639, decompressed_data_size=162639 diff=0
11:02:39:Unit 02:- Digital signature verified
11:02:39:Unit 02:
11:02:39:Unit 02:Project: 6805 (Run 7007, Clone 0, Gen 17)
11:02:39:Unit 02:
11:02:39:Unit 02:Entering M.D.
11:02:41:Unit 02:Tpr hash 02/wudata_01.tpr:  2067223090 1468347899 1626797930 2826755208 1313848930
11:02:41:Unit 02:Working on ALZHEIMER'S DISEASE AMYLOID
11:02:41:Unit 02:Client config unavailable.
11:02:41:FahCore, running Unit 02, returned: UNKNOWN_ENUM (-1)
11:02:41:WARNING: Unit 02 Too many errors, failing
11:02:41:Sending unit results: id:02 state:SEND project:6805 run:7007 clone:0 gen:17 core:0x15 unit:0x000000110a3b1e644d8d333cfe1c7267
11:02:41:Connecting to 171.64.65.64:8080
11:02:41:Connecting to assign-GPU.stanford.edu:80
11:02:42:Server responded WORK_QUIT (404)
11:02:42:WARNING: Server did not like results, dumping
11:02:42:Cleaning up Unit 02
11:02:42:News: Welcome to Folding@Home
11:02:42:Assigned to work server 171.64.65.64
11:02:42:Requesting new work unit for slot 00: READY gpu:0:"GF110 [GeForce GTX 590 Ti]" from 171.64.65.64
11:02:42:Connecting to 171.64.65.64:8080
11:02:43:Slot 00: Downloading 43.30KiB
11:02:44:Slot 00: Download complete
11:02:44:Received Unit: id:00 state:DOWNLOAD project:6806 run:6586 clone:2 gen:22 core:0x15 unit:0x000000170a3b1e644d94c86c98ab6099
11:02:44:Starting Unit 00
11:02:44:Running core: C:/Users/Lim/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/NVIDIA/Fermi/Core_15.fah/FahCore_15.exe -dir 00 -suffix 01 -lifeline 3516 -version 701 -checkpoint 3 -gpu 0
11:02:44:Started core on PID 6056
11:02:44:FahCore 0x15 started
11:02:44:Started thread 21 on PID 3516
11:02:45:Unit 00:
11:02:45:Unit 00:*------------------------------*
11:02:45:Unit 00:Folding@Home GPU Core
11:02:45:Unit 00:Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
11:02:45:Unit 00:
11:02:45:Unit 00:Build host: SimbiosNvdWin7
11:02:45:Unit 00:Board Type: NVIDIA/CUDA
11:02:45:Unit 00:Core      : x=15
11:02:45:Unit 00: Window's signal control handler registered.
11:02:45:Unit 00:Preparing to commence simulation
11:02:45:Unit 00:- Looking at optimizations...
11:02:45:Unit 00:DeleteFrameFiles: successfully deleted file=00/wudata_01.ckp
11:02:45:Unit 00:- Created dyn
11:02:45:Unit 00:- Files status OK
11:02:45:Unit 00:sizeof(CORE_PACKET_HDR) = 512 file=<>
11:02:45:Unit 00:- Expanded 43824 -> 172159 (decompressed 392.8 percent)
11:02:45:Unit 00:Called DecompressByteArray: compressed_data_size=43824 data_size=172159, decompressed_data_size=172159 diff=0
11:02:45:Unit 00:- Digital signature verified
11:02:45:Unit 00:
11:02:45:Unit 00:Project: 6806 (Run 6586, Clone 2, Gen 22)
11:02:45:Unit 00:
11:02:45:Unit 00:Assembly optimizations on if available.
11:02:45:Unit 00:Entering M.D.
11:02:47:Unit 00:Tpr hash 00/wudata_01.tpr:  67189824 1788190166 2638984836 3171554805 3172820352
11:02:47:Unit 00:Working on 2 PEPTIDE (1-42)
11:02:47:Unit 00:Client config unavailable.
11:02:47:FahCore, running Unit 00, returned: UNKNOWN_ENUM (-1)
11:02:47:Starting Unit 00
11:02:47:Running core: C:/Users/Lim/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/NVIDIA/Fermi/Core_15.fah/FahCore_15.exe -dir 00 -suffix 01 -lifeline 3516 -version 701 -checkpoint 3 -gpu 0
11:02:47:Started core on PID 1120
11:02:47:FahCore 0x15 started
11:02:47:Started thread 22 on PID 3516
11:02:47:Unit 00:
11:02:47:Unit 00:*------------------------------*
11:02:47:Unit 00:Folding@Home GPU Core
11:02:47:Unit 00:Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
11:02:47:Unit 00:
11:02:47:Unit 00:Build host: SimbiosNvdWin7
11:02:47:Unit 00:Board Type: NVIDIA/CUDA
11:02:47:Unit 00:Core      : x=15
11:02:47:Unit 00: Window's signal control handler registered.
11:02:47:Unit 00:Preparing to commence simulation
11:02:47:Unit 00:- Ensuring status. Please wait.
11:02:57:Unit 00:- Looking at optimizations...
11:02:57:Unit 00:- Working with standard loops on this execution.
11:02:57:Unit 00:- Previous termination of core was improper.
11:02:57:Unit 00:- Files status OK
11:02:57:Unit 00:sizeof(CORE_PACKET_HDR) = 512 file=<>
11:02:57:Unit 00:- Expanded 43824 -> 172159 (decompressed 392.8 percent)
11:02:57:Unit 00:Called DecompressByteArray: compressed_data_size=43824 data_size=172159, decompressed_data_size=172159 diff=0
11:02:57:Unit 00:- Digital signature verified
11:02:57:Unit 00:
11:02:57:Unit 00:Project: 6806 (Run 6586, Clone 2, Gen 22)
11:02:57:Unit 00:
11:02:57:Unit 00:Entering M.D.
11:02:59:Unit 00:Tpr hash 00/wudata_01.tpr:  67189824 1788190166 2638984836 3171554805 3172820352
11:02:59:Unit 00:Working on 2 PEPTIDE (1-42)
11:02:59:Unit 00:Client config unavailable.
11:02:59:FahCore, running Unit 00, returned: UNKNOWN_ENUM (-1)
anon1
 
Posts: 8
Joined: Sun May 01, 2011 6:40 am

Re: FAH fails to start core 15 to process workunit on GPU

Postby 7im » Mon May 02, 2011 5:45 pm

Hello anon1, welcome to the forum.

Please post the ***System*** section of the log file. Thanks.
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
User avatar
7im
 
Posts: 15147
Joined: Thu Nov 29, 2007 4:30 pm
Location: Arizona

Re: FAH fails to start core 15 to process workunit on GPU

Postby anon1 » Tue May 03, 2011 7:07 am

Hello, I am posting a screenshot of the system info tab for FAH, I think that is what you have requested, correct me if I am wrong. I would also like to bring up the fact that FAH v7 recognises my graphic card incorrectly, it thinks that my GPU is a GTX 590 Ti but in reality I have a GTX 550 Ti installed in this system. My correct system details can be found at this link, http://setiathome.berkeley.edu/show_hos ... id=5761910

Image

Also I would like to add that I installed it for a single user only and I did not install FAH as a service.

Also, I have uninstalled FAH and the data folder, reinstalled FAH v7 about 9 times already and this problem still occurs.

I would like to know, whether there is any link available, that has directions to completely remove FAH, FAH data files and the registry entries as I want to do a clean install again, I do not know whther I am missing something, cause I didn't remove any FAH registry entries and that could be the cause of the problem.

Any suggestions to get the GPU crunching workunits?

Thanks!
anon1
 
Posts: 8
Joined: Sun May 01, 2011 6:40 am

Re: FAH fails to start core 15 to process workunit on GPU

Postby 7im » Tue May 03, 2011 4:31 pm

The v270.61 drivers don't seem to work that well. http://foldingforum.org/viewtopic.php?f=59&t=18121
User avatar
7im
 
Posts: 15147
Joined: Thu Nov 29, 2007 4:30 pm
Location: Arizona

Re: FAH fails to start core 15 to process workunit on GPU

Postby bruce » Tue May 03, 2011 4:37 pm

Please open the CMD window and run FAHClient --lspci. Post the output.

I agree that the problem is either because of a bug in the drivers or because your GPU is being misidentified. That's not something that can be fixed by uninstalling/reinstalling FAHClient.

I do not know of any registry entries that are used by V7. As far as I know, if you marking the Data box when uninstalling, it will completely uninstall it.
bruce
Site Admin
 
Posts: 20181
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: FAH fails to start core 15 to process workunit on GPU

Postby anon1 » Thu May 05, 2011 1:23 pm

Helllo Bruce, here is the output of FAHClient --lspci

Sorry about the delay of getting back to you about this matter. I have been coming home late recently and yesterday I forgot about this.

Also, the GTX 550 Ti GPU in my computer is incorrectly recognised as a GTX 590 Ti by the Folding@Home v7.1.24 software. I also think that there are compatibility issues between the Folding@home v7.1.24 software and the Nvidia driver 270.61. BOINC projects using CUDA like SETI@home, PrimeGrid and GPUgrid have been running flawlessly on my PC.

I hope the developers of the FAH v7 software can look into this problem and fix the bugs. I can't process any GPU workunits and the bugs are a showstopper.

If you need me to provide any further information or run diagnostics I will gladly assist you. Anymore suggestions if you can think of any to fix the issues so that the GPU can start crunching workunits? I am open to any suggestions and will try them.

Thank you for your attention.

Image
anon1
 
Posts: 8
Joined: Sun May 01, 2011 6:40 am

Re: #636 FAH fails to start core 15 to process workunit on G

Postby 7im » Thu May 05, 2011 5:13 pm

Where does --lspci get this information from? Windows? or what the fahclient has? Other?

Because while the Device ID matches the 550 designation, the name listed does not. Almost seems like the driver added the wrong settings, or the card bios was changed...? Shouldn't say 590.
User avatar
7im
 
Posts: 15147
Joined: Thu Nov 29, 2007 4:30 pm
Location: Arizona

Re: #636 FAH fails to start core 15 to process workunit on G

Postby anon1 » Fri May 06, 2011 2:37 pm

Hello all, just want to update you all that my GPU has started crunching workunits using FAHcore_15, 4 hours ago. One workunit has been returned so far and has validated succesfully. Another workunit is being processed now. I would also like to add that this is the first time that the FAH v7 software has connected to get workunits from that project. It is crunching the workunits from the project, http://fah-web.stanford.edu/cgi-bin/fah ... ed?p=10950
with the collection server at http://fah-web.stanford.edu/logs/171.67.108.32.log.html

There was a previous project that I had workunits from but it failed to initalise the fahcore 15 to process the workunits, I can't remember what the project number is but it had something to do with Alzheimer's disease research. It failed with the message, UNKNOWN ENUM (-1)

Also to help to fix the bug in the GPU identification, I am posting the screenshot of GPU-Z for my system. As seen, the GPU identifier is GF116 not GF110!

Image

Thanks for all the help with this problem 7im and bruce!
anon1
 
Posts: 8
Joined: Sun May 01, 2011 6:40 am

Re: #636 FAH fails to start core 15 to process workunit on G

Postby anon1 » Sat May 07, 2011 5:04 am

Hello all, I have found the project where I can't process wokunits, it is at,

Project description:
http://fah-web.stanford.edu/cgi-bin/fah ... ned?p=6805

Work server:
http://fah-web.stanford.edu/logs/171.64.65.64.log.html

Collection server:
http://fah-web.stanford.edu/logs/171.67.108.26.log.html

The error log in FAH v7.1.24 as attached below and the error just repeats itself iteratively everytime FAH core 15 tries to run until there are too many errors and FAH just goes to sleep.

Code: Select all
04:38:04:Starting Unit 01
04:38:04:Running core: C:/Users/Lim/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/NVIDIA/Fermi/Core_15.fah/FahCore_15.exe -dir 01 -suffix 01 -lifeline 4300 -version 701 -checkpoint 3 -gpu 0
04:38:04:Started core on PID 1840
04:38:04:FahCore 0x15 started
04:38:04:Started thread 12 on PID 4300
04:38:05:Unit 01:
04:38:05:Unit 01:*------------------------------*
04:38:05:Unit 01:Folding@Home GPU Core
04:38:05:Unit 01:Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
04:38:05:Unit 01:
04:38:05:Unit 01:Build host: SimbiosNvdWin7
04:38:05:Unit 01:Board Type: NVIDIA/CUDA
04:38:05:Unit 01:Core      : x=15
04:38:05:Unit 01: Window's signal control handler registered.
04:38:05:Unit 01:Preparing to commence simulation
04:38:05:Unit 01:- Ensuring status. Please wait.
04:38:14:Unit 01:- Looking at optimizations...
04:38:14:Unit 01:- Working with standard loops on this execution.
04:38:14:Unit 01:- Previous termination of core was improper.
04:38:14:Unit 01:- Going to use standard loops.
04:38:14:Unit 01:- Files status OK
04:38:14:Unit 01:sizeof(CORE_PACKET_HDR) = 512 file=<>
04:38:14:Unit 01:- Expanded 43595 -> 172159 (decompressed 394.9 percent)
04:38:14:Unit 01:Called DecompressByteArray: compressed_data_size=43595 data_size=172159, decompressed_data_size=172159 diff=0
04:38:14:Unit 01:- Digital signature verified
04:38:14:Unit 01:
04:38:14:Unit 01:Project: 6806 (Run 9241, Clone 0, Gen 26)
04:38:14:Unit 01:
04:38:14:Unit 01:Entering M.D.
04:38:16:Unit 01:Tpr hash 01/wudata_01.tpr:  1379189590 4034461979 3367153790 3767140349 1076606847
04:38:16:Unit 01:Working on 2 PEPTIDE (1-42)
04:38:16:Unit 01:Client config unavailable.
04:38:16:Unit 01:Starting GUI Server
04:38:16:FahCore, running Unit 01, returned: UNKNOWN_ENUM (-1)
04:45:14:Starting Unit 02
04:45:14:Running core: C:/Users/Lim/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/NVIDIA/Fermi/Core_15.fah/FahCore_15.exe -dir 02 -suffix 01 -lifeline 4300 -version 701 -checkpoint 3 -gpu 0
04:45:14:Started core on PID 1584
04:45:14:FahCore 0x15 started
04:45:14:Started thread 18 on PID 4300
04:45:14:Unit 02:
04:45:14:Unit 02:*------------------------------*
04:45:14:Unit 02:Folding@Home GPU Core
04:45:14:Unit 02:Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
04:45:14:Unit 02:
04:45:14:Unit 02:Build host: SimbiosNvdWin7
04:45:14:Unit 02:Board Type: NVIDIA/CUDA
04:45:14:Unit 02:Core      : x=15
04:45:14:Unit 02: Window's signal control handler registered.
04:45:14:Unit 02:Preparing to commence simulation
04:45:14:Unit 02:- Ensuring status. Please wait.
04:45:23:Unit 02:- Looking at optimizations...
04:45:23:Unit 02:- Working with standard loops on this execution.
04:45:23:Unit 02:- Previous termination of core was improper.
04:45:24:Unit 02:- Going to use standard loops.
04:45:24:Unit 02:- Files status OK
04:45:24:Unit 02:sizeof(CORE_PACKET_HDR) = 512 file=<>
04:45:24:Unit 02:- Expanded 41869 -> 162639 (decompressed 388.4 percent)
04:45:24:Unit 02:Called DecompressByteArray: compressed_data_size=41869 data_size=162639, decompressed_data_size=162639 diff=0
04:45:24:Unit 02:- Digital signature verified
04:45:24:Unit 02:
04:45:24:Unit 02:Project: 6805 (Run 9519, Clone 2, Gen 19)
04:45:24:Unit 02:
04:45:24:Unit 02:Entering M.D.
04:45:26:Unit 02:Tpr hash 02/wudata_01.tpr:  3499989629 263059290 1865608348 1378537185 2351634676
04:45:26:Unit 02:Working on ALZHEIMER'S DISEASE AMYLOID
04:45:26:Unit 02:Client config unavailable.
04:45:26:FahCore, running Unit 02, returned: UNKNOWN_ENUM (-1)


However, the latest beta project: Test simulations of Protein-G peptide with gpu openmm-gromacs (Fermi boards), http://fah-web.stanford.edu/cgi-bin/fah ... ed?p=10950
processes workunits fine on my GTX 550 Ti.

I think this problem with FAH core failing to process workunits for project 6805 is due to the project not being updating to work with newer Fermi GPU's in the 500 series, I am not sure whether Fermi 400 series GPUs can process the workunits. However, in the new project 10950 using a new beta openmm-gromacs core specially designed for Fermi GPUs works perfectly on my Fermi GPU.

Perhaps, someone in charge could look into updating project 6805 to work with Fermi GPU's.

Also, I would like to know whether I can set an option for FAH v7 to not ask for workunits for project 6805. Currently, my GPU is getting work from project 6805 and it just keeps failing to process the workunits. How do I make FAH v7 just get workunits from the project, http://fah-web.stanford.edu/cgi-bin/fah ... ed?p=10950

I would like my GPU to keep processing units continuously from project 10950. It is now requesting workunits from project 6805 and fails to process them. GPU is idling now and processing power going to waste...

Thanks.

I will also attach a screenshot of the BOINC message log here to help with the GPU identification bug.

Image
anon1
 
Posts: 8
Joined: Sun May 01, 2011 6:40 am

Re: #636 FAH fails to start core 15 to process workunit on G

Postby 7im » Sat May 07, 2011 9:09 pm

So how does that Manager determine GPU identity. It's not a tool we are familiar with in this forum. Not that helpful if we don't know how it works.

Does it fail the same way at stock speeds?
User avatar
7im
 
Posts: 15147
Joined: Thu Nov 29, 2007 4:30 pm
Location: Arizona

Re: #636 FAH fails to start core 15 to process workunit on G

Postby anon1 » Sun May 08, 2011 12:46 am

At stock speeds it also fails the same way. With the UNKNOWN_ENUM (-1) error, could it be due to my GPU not being in the whitelist, thats why FAHcore15 keeps failing to start?
anon1
 
Posts: 8
Joined: Sun May 01, 2011 6:40 am

Re: #636 FAH fails to start core 15 to process workunit on G

Postby 7im » Sun May 08, 2011 3:09 pm

The "Fermi:1" in the ***System *** shows it has been whitelisted, at least as far as I can tell. This project isn't known for it's communication skills, no matter how hard I try to remedy that. No one from PG has come forward to let us know what that Enum error means.
User avatar
7im
 
Posts: 15147
Joined: Thu Nov 29, 2007 4:30 pm
Location: Arizona

Re: #636 FAH fails to start core 15 to process workunit on G

Postby mephistopheles » Sun May 08, 2011 4:48 pm

7im wrote:So how does that Manager determine GPU identity.

It's apparently worth around 1500 lines of source code but as far as I can tell from scanning it:
* Nvidia device names are retrived from the CUDA driver
* AMD device names are hard coded, based on the "target ID" (one per architecture) from the CAL driver

(A bit more on how Boinc handles mixes of different GPUs here - they identify the "most capable" GPU present, and ignore any GPUs less powerful than this.)

I don't know how the FAHClient detects GPUs, but based on the --lspci argument I guess it's using libpci which apparently also exists for Windows.

It's not clear to me that the client misidentifies the GTX 550 Ti in any way that matters.
Core 15 would run on GTX 550 as well as GTX 590, wouldn't it? Does GF110 vs GF116 make a difference?

There are a couple of minor bugs here - the GPU info is wrong, and "UNKNOWN_ENUM" could be slightly more informative - but I can't immediately see anything from the v7 client that would break folding.

Given that project 6805 fails with the v6 client too, using the same card and the same drivers, I would think that the problem is with the core/driver combination rather than the v7 client.

@anon1 It looks like your options are limited to either change the driver or ignore the errors until the client gets a WU it can handle. AFAIK there's no way to exclude certain projects from folding.
mephistopheles
 
Posts: 195
Joined: Tue Apr 07, 2009 7:51 am

Re: #636 FAH fails to start core 15 to process workunit on G

Postby bruce » Mon May 09, 2011 6:07 am

Has that slot completed any WUs?
bruce
Site Admin
 
Posts: 20181
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: #636 FAH fails to start core 15 to process workunit on G

Postby anon1 » Mon May 09, 2011 3:15 pm

Hello mephistopheles, thank you for your explanation and links relating to my problem. Seems to me that the problem is due to the Nvidia driver 270.61 and FAHcore. However, my GPU has successfully crunched workunits from the project, http://fah-web.stanford.edu/cgi-bin/fah ... ed?p=10950

@bruce, yes that GPU slot has completed workunits from the project, http://fah-web.stanford.edu/cgi-bin/fah ... ed?p=10950
Three workunits from project 10950 completed successfully.
The GPU is using slot 1.
Slot 0 is used by the CPU.
anon1
 
Posts: 8
Joined: Sun May 01, 2011 6:40 am

Next

Return to V7.1.52 Windows/Linux

Who is online

Users browsing this forum: Yahoo [Bot] and 3 guests

cron