Project: 13001 Run 264, Clone 7, Gen 0 Error

Moderators: Site Moderators, FAHC Science Team

davidcoton
Posts: 1102
Joined: Wed Nov 05, 2008 3:19 pm
Location: Cambridge, UK

Re: Project: 13001 Run 264, Clone 7, Gen 0 Error

Post by davidcoton »

Looks to me like Stanford are still issuing 13000 and 13001 to Maxwell cards. Until they stop doing that, it's going to fall over. Probably the best option is to pause or delete the GPU slot until PG indicate that the assignment is fixed. (Of course, it may get fixed and not announced. :))
Image
Nert
Posts: 162
Joined: Wed Mar 26, 2014 7:46 pm

Re: Project: 13001 Run 264, Clone 7, Gen 0 Error

Post by Nert »

I'll suspend until it's working.
Image
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Project: 13001 Run 264, Clone 7, Gen 0 Error

Post by bruce »

An attempt to fix this was installed last night. Was it fixed?
Nert
Posts: 162
Joined: Wed Mar 26, 2014 7:46 pm

Re: Project: 13001 Run 264, Clone 7, Gen 0 Error

Post by Nert »

I just started the GPU slot again, and it appears to be working. I don't know if I got lucky and didn't get one of those i7 units, or whether the change fixed this. I'll monitor and let you know if there's a problem.

Here's the log for the GPU:

Code: Select all

23:38:22:Adding folding slot 00: READY gpu:0:GM107 [GeForce GTX 750 Ti]
23:38:22:Saving configuration to config.xml
23:38:22:<config>
23:38:22:  <!-- Folding Slot Configuration -->
23:38:22:  <power v='FULL'/>
23:38:22:
23:38:22:  <!-- Network -->
23:38:22:  <proxy v=':8080'/>
23:38:22:
23:38:22:  <!-- User Information -->
23:38:22:  <passkey v='********************************'/>
23:38:22:  <team v='165780'/>
23:38:22:  <user v='nert'/>
23:38:22:
23:38:22:  <!-- Folding Slots -->
23:38:22:  <slot id='1' type='CPU'>
23:38:22:    <cpus v='-1'/>
23:38:22:  </slot>
23:38:22:  <slot id='0' type='GPU'/>
23:38:22:</config>
23:38:23:WU02:FS00:Connecting to assign-GPU.stanford.edu:80
23:38:23:WU02:FS00:News: Welcome to Folding@Home
23:38:23:WU02:FS00:Assigned to work server 171.67.108.142
23:38:23:WU02:FS00:Requesting new work unit for slot 00: READY gpu:0:GM107 [GeForce GTX 750 Ti] from 171.67.108.142
23:38:23:WU02:FS00:Connecting to 171.67.108.142:8080
23:38:24:WU02:FS00:Downloading 143.09KiB
23:38:24:WU02:FS00:Download complete
23:38:24:WU02:FS00:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:8018 run:809 clone:0 gen:327 core:0x15 unit:0x0000019b6953ee2e500f1e1dfc0aec3f
23:38:24:WU02:FS00:Starting
23:38:24:WU02:FS00:Running FahCore: "C:\Program Files\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/NVIDIA/Fermi/Core_15.fah/FahCore_15.exe -dir 02 -suffix 01 -version 703 -lifeline 4376 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
23:38:24:WU02:FS00:Started FahCore on PID 7708
23:38:25:WU02:FS00:Core PID:3860
23:38:25:WU02:FS00:FahCore 0x15 started
23:38:27:WU02:FS00:0x15:
23:38:27:WU02:FS00:0x15:*------------------------------*
23:38:27:WU02:FS00:0x15:Folding@Home GPU Core
23:38:27:WU02:FS00:0x15:Version                2.25 (Wed May 9 17:03:01 EDT 2012)
23:38:27:WU02:FS00:0x15:Build host             AmoebaRemote
23:38:27:WU02:FS00:0x15:Board Type             NVIDIA/CUDA
23:38:27:WU02:FS00:0x15:Core                   15
23:38:27:WU02:FS00:0x15:
23:38:27:WU02:FS00:0x15:Window's signal control handler registered.
23:38:27:WU02:FS00:0x15:Preparing to commence simulation
23:38:27:WU02:FS00:0x15:- Looking at optimizations...
23:38:27:WU02:FS00:0x15:DeleteFrameFiles: successfully deleted file=02/wudata_01.ckp
23:38:27:WU02:FS00:0x15:- Created dyn
23:38:27:WU02:FS00:0x15:- Files status OK
23:38:27:WU02:FS00:0x15:sizeof(CORE_PACKET_HDR) = 512 file=<>
23:38:27:WU02:FS00:0x15:- Expanded 146009 -> 660986 (decompressed 452.7 percent)
23:38:27:WU02:FS00:0x15:Called DecompressByteArray: compressed_data_size=146009 data_size=660986, decompressed_data_size=660986 diff=0
23:38:27:WU02:FS00:0x15:- Digital signature verified
23:38:27:WU02:FS00:0x15:
23:38:27:WU02:FS00:0x15:Project: 8018 (Run 809, Clone 0, Gen 327)
23:38:27:WU02:FS00:0x15:
23:38:27:WU02:FS00:0x15:Assembly optimizations on if available.
23:38:27:WU02:FS00:0x15:Entering M.D.
23:38:29:WU02:FS00:0x15:Tpr hash 02/wudata_01.tpr:  839719591 1274400753 2993979261 3916300958 2354353590
23:38:29:WU02:FS00:0x15:GPU device id=0
23:38:29:WU02:FS00:0x15:Working on GRowing Old MAkes el Chrono Sweat
23:38:29:WU02:FS00:0x15:Client config unavailable.
23:38:29:WU02:FS00:0x15:Starting GUI Server
23:39:37:WU02:FS00:0x15:Setting checkpoint frequency: 250000
23:39:37:WU02:FS00:0x15:Completed         3 out of 25000000 steps (0%).
Image
davidcoton
Posts: 1102
Joined: Wed Nov 05, 2008 3:19 pm
Location: Cambridge, UK

Re: Project: 13001 Run 264, Clone 7, Gen 0 Error

Post by davidcoton »

Just for clarification: the two types of unit in question are Core_15 and Core_17 -- referring to the version of the software that actually does the work. Look for core:0x15 in the log above, or FahCore: 0x15 in FAHControl Status to indicate you're getting the "right" ones. It's actually a hexadecimal identification number -- you might also see a3 and a4 for CPU-based folding. i5 and i7 refer to versions of Intel CPUs.

David
Image
Nert
Posts: 162
Joined: Wed Mar 26, 2014 7:46 pm

Re: Project: 13001 Run 264, Clone 7, Gen 0 Error

Post by Nert »

64 year old eyes can't distinguish between an "i" and a "1" :(
Image
Post Reply