Log reading

If you're new to FAH and need help getting started or you have very basic questions, start here.

Moderators: Site Moderators, FAHC Science Team

Ricorocks
Posts: 475
Joined: Thu Aug 04, 2016 1:49 pm
Location: Georgetown, Texas

Log reading

Post by Ricorocks »

I had a stalled GPU, on a two GPU machine, circle 'yellow'
at 2200hrs entered the command for FAH log:

03:29:22:************************* Folding@home Client *************************
03:29:22: Website: http://folding.stanford.edu/
03:29:22: Copyright: (c) 2009-2016 Stanford University
03:29:22: Author: Joseph Coffland <joseph@cauldrondevelopment.com>
03:29:22: Args:
03:29:22: Config: C:\Users\rick\AppData\Roaming\FAHClient\config.xml

All six happened at the same time three twenty nine in the am?

end of same log

03:33:10:WU02:FS01:0x21:Completed 825000 out of 7500000 steps (11%)
03:33:36:WU01:FS00:0xa4:Completed 172500 out of 250000 steps (69%)

So this log is showing 4 minutes?

#1 03:29:22 is this three-twenty nine in the AM?

At approx 10 pm (this machine 2 gpu) found stalled 'yellow' circle GPU 960

Is this where the 960 stalled?

03:29:46:WU01:FS00:0xa4:- Previous termination of core was improper <and I discovered it at 10 pm> <strange as 0830 I checked this machine all GPU's green>

Code: Select all

*********************** Log Started 2017-01-12T03:29:22Z ***********************
03:29:22:************************* Folding@home Client *************************
03:29:22:        Website: http://folding.stanford.edu/
03:29:22:      Copyright: (c) 2009-2016 Stanford University
03:29:22:         Author: Joseph Coffland <joseph@cauldrondevelopment.com>
03:29:22:           Args: 
03:29:22:         Config: C:\Users\rick\AppData\Roaming\FAHClient\config.xml
03:29:22:******************************** Build ********************************
03:29:22:        Version: 7.4.15
03:29:22:           Date: Aug 17 2016
03:29:22:           Time: 04:33:41
03:29:22:     Repository: Git
03:29:22:       Revision: 4f3e0e25571a9f691719f0c273739294bde517dd
03:29:22:         Branch: master
03:29:22:       Compiler: GNU 5.3.1 20160205
03:29:22:        Options: -std=gnu++98 -I/mingw64/include -O3 -funroll-loops -ffast-math
03:29:22:                 -mfpmath=sse -fno-unsafe-math-optimizations -msse2
03:29:22:       Platform: linux2 4.6.0-1-amd64
03:29:22:           Bits: 64
03:29:22:           Mode: Release
03:29:22:******************************* System ********************************
03:29:22:            CPU: Intel(R) Core(TM) i7-3770 CPU @ 3.40GHz
03:29:22:         CPU ID: GenuineIntel Family 6 Model 58 Stepping 9
03:29:22:           CPUs: 8
03:29:22:         Memory: 7.94GiB
03:29:22:    Free Memory: 6.57GiB
03:29:22:        Threads: WINDOWS_THREADS
03:29:22:     OS Version: 6.2
03:29:22:    Has Battery: false
03:29:22:     On Battery: false
03:29:22:     UTC Offset: -6
03:29:22:            PID: 6964
03:29:22:            CWD: C:\Users\rick\AppData\Roaming\FAHClient
03:29:22:             OS: Windows 10 Pro
03:29:22:        OS Arch: AMD64
03:29:22:           GPUs: 2
03:29:22:          GPU 0: Bus:1 Slot:0 NVIDIA:5 GP106 [GeForce GTX 1060 6GB]
03:29:22:          GPU 1: Bus:5 Slot:0 NVIDIA:5 GM206 [GeForce GTX 960]
03:29:22:  CUDA Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:6.1 Driver:8.0
03:29:22:  CUDA Device 1: Platform:0 Device:1 Bus:5 Slot:0 Compute:5.2 Driver:8.0
03:29:22:OpenCL Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:1.2 Driver:376.48
03:29:22:OpenCL Device 1: Platform:0 Device:1 Bus:5 Slot:0 Compute:1.2 Driver:376.48
03:29:22:  Win32 Service: false
03:29:22:***********************************************************************
03:29:22:<config>
03:29:22:  <!-- Slot Control -->
03:29:22:  <power v='FULL'/>
03:29:22:
03:29:22:  <!-- User Information -->
03:29:22:  <passkey v='********************************'/>
03:29:22:  <user v='Ricoocks'/>
03:29:22:
03:29:22:  <!-- Folding Slots -->
03:29:22:  <slot id='0' type='CPU'>
03:29:22:    <paused v='true'/>
03:29:22:  </slot>
03:29:22:  <slot id='1' type='GPU'>
03:29:22:    <paused v='true'/>
03:29:22:  </slot>
03:29:22:  <slot id='2' type='GPU'>
03:29:22:    <paused v='true'/>
03:29:22:  </slot>
03:29:22:</config>
03:29:22:Trying to access database...
03:29:22:Successfully acquired database lock
03:29:22:Enabled folding slot 00: PAUSED cpu:6 (by user)
03:29:22:Enabled folding slot 01: PAUSED gpu:0:GP106 [GeForce GTX 1060 6GB] (by user)
03:29:22:Enabled folding slot 02: PAUSED gpu:1:GM206 [GeForce GTX 960] (by user)
03:29:35:FS00:Unpaused
03:29:35:FS01:Unpaused
03:29:35:FS02:Unpaused
03:29:35:WU02:FS01:Starting
03:29:35:WU02:FS01:Running FahCore: "C:\Program Files\FAHClient/FAHCoreWrapper.exe" C:\Users\rick\AppData\Roaming\FAHClient\cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21.exe -dir 02 -suffix 01 -version 704 -lifeline 6964 -checkpoint 15 -opencl-platform 0 -gpu-vendor nvidia -gpu 0
03:29:35:WU02:FS01:Started FahCore on PID 4524
03:29:35:WU02:FS01:Core PID:1460
03:29:35:WU02:FS01:FahCore 0x21 started
03:29:35:WU01:FS00:Starting
03:29:35:WU01:FS00:Running FahCore: "C:\Program Files\FAHClient/FAHCoreWrapper.exe" C:\Users\rick\AppData\Roaming\FAHClient\cores/web.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 01 -suffix 01 -version 704 -lifeline 6964 -checkpoint 15 -np 6
03:29:35:WU01:FS00:Started FahCore on PID 1404
03:29:36:WU02:FS01:0x21:*********************** Log Started 2017-01-12T03:29:35Z ***********************
03:29:36:WU02:FS01:0x21:Project: 11707 (Run 131, Clone 3, Gen 0)
03:29:36:WU02:FS01:0x21:Unit: 0x000000008ca304f35876a53926b89e3a
03:29:36:WU02:FS01:0x21:CPU: 0x00000000000000000000000000000000
03:29:36:WU02:FS01:0x21:Machine: 1
03:29:36:WU02:FS01:0x21:Digital signatures verified
03:29:36:WU02:FS01:0x21:Folding@home GPU Core21 Folding@home Core
03:29:36:WU02:FS01:0x21:Version 0.0.17
03:29:36:WU02:FS01:0x21:  Found a checkpoint file
03:29:36:WU01:FS00:Core PID:1372
03:29:36:WU01:FS00:FahCore 0xa4 started
03:29:36:WU01:FS00:0xa4:
03:29:36:WU01:FS00:0xa4:*------------------------------*
03:29:36:WU01:FS00:0xa4:Folding@Home Gromacs GB Core
03:29:36:WU01:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
03:29:36:WU01:FS00:0xa4:
03:29:36:WU01:FS00:0xa4:Preparing to commence simulation
03:29:36:WU01:FS00:0xa4:- Ensuring status. Please wait.
03:29:37:WU00:FS02:Connecting to 171.67.108.45:80
03:29:38:WU00:FS02:Assigned to work server 171.64.65.92
03:29:38:WU00:FS02:Requesting new work unit for slot 02: READY gpu:1:GM206 [GeForce GTX 960] from 171.64.65.92
03:29:38:WU00:FS02:Connecting to 171.64.65.92:8080
03:29:39:WU00:FS02:Downloading 3.16MiB
03:29:43:WU00:FS02:Download complete
03:29:43:WU00:FS02:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:9197 run:0 clone:68 gen:170 core:0x21 unit:0x000000fdab40415c57cb3f9474dec0c2
03:29:43:WU00:FS02:Starting
03:29:43:WU00:FS02:Running FahCore: "C:\Program Files\FAHClient/FAHCoreWrapper.exe" C:\Users\rick\AppData\Roaming\FAHClient\cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21.exe -dir 00 -suffix 01 -version 704 -lifeline 6964 -checkpoint 15 -opencl-platform 0 -gpu-vendor nvidia -gpu 1
03:29:43:WU00:FS02:Started FahCore on PID 3424
03:29:43:WU00:FS02:Core PID:5500
03:29:43:WU00:FS02:FahCore 0x21 started
03:29:44:WU00:FS02:0x21:*********************** Log Started 2017-01-12T03:29:43Z ***********************
03:29:44:WU00:FS02:0x21:Project: 9197 (Run 0, Clone 68, Gen 170)
03:29:44:WU00:FS02:0x21:Unit: 0x000000fdab40415c57cb3f9474dec0c2
03:29:44:WU00:FS02:0x21:CPU: 0x00000000000000000000000000000000
03:29:44:WU00:FS02:0x21:Machine: 2
03:29:44:WU00:FS02:0x21:Reading tar file core.xml
03:29:44:WU00:FS02:0x21:Reading tar file system.xml
03:29:44:WU00:FS02:0x21:Reading tar file integrator.xml
03:29:44:WU00:FS02:0x21:Reading tar file state.xml
03:29:44:WU00:FS02:0x21:Digital signatures verified
03:29:44:WU00:FS02:0x21:Folding@home GPU Core21 Folding@home Core
03:29:44:WU00:FS02:0x21:Version 0.0.17
03:29:45:WU02:FS01:0x21:Completed 750000 out of 7500000 steps (10%)
03:29:45:WU02:FS01:0x21:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
03:29:46:WU01:FS00:0xa4:- Looking at optimizations...
03:29:46:WU01:FS00:0xa4:- Working with standard loops on this execution.
03:29:46:WU01:FS00:0xa4:- Previous termination of core was improper.
03:29:46:WU01:FS00:0xa4:- Files status OK
03:29:46:WU01:FS00:0xa4:- Expanded 825200 -> 1398040 (decompressed 169.4 percent)
03:29:46:WU01:FS00:0xa4:Called DecompressByteArray: compressed_data_size=825200 data_size=1398040, decompressed_data_size=1398040 diff=0
03:29:46:WU01:FS00:0xa4:- Digital signature verified
03:29:46:WU01:FS00:0xa4:
03:29:46:WU01:FS00:0xa4:Project: 9039 (Run 826, Clone 0, Gen 382)
03:29:46:WU01:FS00:0xa4:
03:29:46:WU01:FS00:0xa4:Entering M.D.
03:29:53:WU01:FS00:0xa4:Using Gromacs checkpoints
03:29:54:WU01:FS00:0xa4:Mapping NT from 6 to 6 
03:29:57:WU01:FS00:0xa4:Resuming from checkpoint
03:29:57:WU01:FS00:0xa4:Verified 01/wudata_01.log
03:29:57:WU01:FS00:0xa4:Verified 01/wudata_01.trr
03:29:57:WU01:FS00:0xa4:Verified 01/wudata_01.xtc
03:29:57:WU01:FS00:0xa4:Verified 01/wudata_01.edr
03:29:57:WU01:FS00:0xa4:Completed 166020 out of 250000 steps  (66%)
03:30:03:20:127.0.0.1:New Web connection
03:30:04:WU00:FS02:0x21:Completed 0 out of 2500000 steps (0%)
03:30:04:WU00:FS02:0x21:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
03:30:23:Saving configuration to config.xml
03:30:23:<config>
03:30:23:  <!-- Slot Control -->
03:30:23:  <power v='FULL'/>
03:30:23:
03:30:23:  <!-- User Information -->
03:30:23:  <passkey v='********************************'/>
03:30:23:  <user v='Ricoocks'/>
03:30:23:
03:30:23:  <!-- Folding Slots -->
03:30:23:  <slot id='0' type='CPU'/>
03:30:23:  <slot id='1' type='GPU'/>
03:30:23:  <slot id='2' type='GPU'/>
03:30:23:</config>
03:30:50:WU01:FS00:0xa4:Completed 167500 out of 250000 steps  (67%)
03:32:13:WU01:FS00:0xa4:Completed 170000 out of 250000 steps  (68%)
03:32:19:WU00:FS02:0x21:Completed 25000 out of 2500000 steps (1%)
03:33:10:WU02:FS01:0x21:Completed 825000 out of 7500000 steps (11%)
03:33:36:WU01:FS00:0xa4:Completed 172500 out of 250000 steps  (69%)
Driver 'all time in Log' 376.48

NOTE: At 0830 daily I check: points, Wu's, Rank, & most important if all GPU's have green circles. If all GPU's green, I'll check again 6 hours later, & continue till bed time. Am I missing something?
rwh202
Posts: 425
Joined: Mon Nov 15, 2010 8:51 pm
Hardware configuration: 8x GTX 1080
3x GTX 1080 Ti
3x GTX 1060
Various other bits and pieces
Location: South Coast, UK

Re: Log reading

Post by rwh202 »

The first line of the log : Log Started 2017-01-12T03:29:22Z
The final 'Z' means 'Zulu' time, or more commonly known as GMT or UTC. If you're in Texas (Central time?) then I think you're GMT-6 so all those times were 21:30 in the evening.
I guess this is when you found the stalled client and restarted it.
The line '03:29:46:WU01:FS00:0xa4:- Previous termination of core was improper' is fairly normal when a client is restarted without fully pausing all slots first - is that what happened?

To get to the root cause of the 960 stalling, I think you'll need to look in the log folder and dig out the previous log.
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Log reading

Post by bruce »

One more fact:
I've seen GPU slots fail (rarely), often because the driver detected something it didn't like, but often for an undiagnosed reason. If your GPU does fail, FAHClient usually detects it and takes appropriate actions ... but not always. If that happens, I know that the FAH reports that progress continues to progress even when it's not progressing. (Search the forum for the string 99.99%)

In that rare instance, WebControl may continue to show a green circle --- I'm not sure.
Ricorocks
Posts: 475
Joined: Thu Aug 04, 2016 1:49 pm
Location: Georgetown, Texas

Re: Log reading

Post by Ricorocks »

Since 376.48 situation improvement, notably for single GPU machines.

Two GPU (1060, 960) machine can go perhaps as long as 3 days without fail, normally one or the other GPU is 'yellow' (won't start, till a reboot) every other day.

ALSO:

For some reason NOW! On this machine (2 gpu), I can stop FAH from the tray icon, but cannot re-start it, by dbl clicking the desktop icon, the only way to restart, works is reboot.
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Log reading

Post by bruce »

Does the desktop icon point to the shortcut/script that was included with your most recent install which does start FAH?

FAHControl has an exit button which tells FAHClient to shut down, but (naturally) it can't restart it. Is this the same functionality you're talking about.
davidcoton
Posts: 1102
Joined: Wed Nov 05, 2008 3:19 pm
Location: Cambridge, UK

Re: Log reading

Post by davidcoton »

Ricorocks wrote:For some reason NOW! On this machine (2 gpu), I can stop FAH from the tray icon, but cannot re-start it, by dbl clicking the desktop icon, the only way to restart, works is reboot.
Not sure I understand what you are saying. From the tray icon, clicking Pause works? And Pause then gets a tick. Clicking it again should undo the pause and remove the tick. Works for me on 7.4.16 (and previous versions) on Win Vista 32bit.
Image
Ricorocks
Posts: 475
Joined: Thu Aug 04, 2016 1:49 pm
Location: Georgetown, Texas

Re: Log reading

Post by Ricorocks »

No desktop icon <<<<dbl click>>>> does not restart FAH

Shortcut icon properties. "C:\Program Files\FAHClient\HideConsole.exe" "C:\Program Files\FAHClient\FAHClient.exe" --open-web-control.

Normally pause/un-pause does not start the idle GPU.

Choosing 'turn off FAH' then restarting FAH, has workded to restart the stalled GPU, HOWEVER, the shortcut does not restart FAH, but if I reboot FAH is running.
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Log reading

Post by bruce »

When you pause a WU, FAHClient continues to run, whether or not any folding is going on. The icon you're clicking will start FAHClient unless it's already running. There's a difference between a paused WU and terminating the execution of FAHClient.

When your unable to resume work, what does your log say? (Times in the log are UTC and your local time is 6 hours earlier.)
Ricorocks
Posts: 475
Joined: Thu Aug 04, 2016 1:49 pm
Location: Georgetown, Texas

Re: Log reading

Post by Ricorocks »

I'm not pressing pause! ONLY STOP! Then to restart MUST reboot.
rwh202
Posts: 425
Joined: Mon Nov 15, 2010 8:51 pm
Hardware configuration: 8x GTX 1080
3x GTX 1080 Ti
3x GTX 1060
Various other bits and pieces
Location: South Coast, UK

Re: Log reading

Post by rwh202 »

The only place I can find 'STOP' is in the web control interface. I believe that this button is functionally identical to the 'Pause' button in the traditional interface and in the task bar.
Once pressed, FAHClient will command the cores to stop, but will continue to run and doing what it does (upload/download and listening for commands).

If the problems you're having are to do with the FAHClient (difficult to tell without seeing the logs), then yes, a pause/unpause might not help, but it 'should' be possible to restart FAHClient (never tried it in Windows, but task manager should do it) without resorting to restarting the PC.

We still ought to diagnose the actual problem causing the GPUs to go idle.
Ricorocks
Posts: 475
Joined: Thu Aug 04, 2016 1:49 pm
Location: Georgetown, Texas

Re: Log reading

Post by Ricorocks »

My bad where I said 'Stop' replace that word with 'Quit'.

It's small tray icon (near time & date) left or right click, FAH icon, choose 'Quit' the only way to restart is reboot. As dbl click desktop icon, does not re-start FAH.

The stalling of a GPU's in machine, with more than one GPU is a known flaw & may or may not be addressed via a new release of the client, per Bruce, earlier post regarding this. The situation did not improve with the install of Nividia 376.48, or still occasional idle GPU, which requires Reboot
Ricorocks
Posts: 475
Joined: Thu Aug 04, 2016 1:49 pm
Location: Georgetown, Texas

Re: Log reading

Post by Ricorocks »

This should have info on latest idled GPU, within last 24 hrs, two gpu machine

Code: Select all

*********************** Log Started 2017-01-15T14:29:56Z ***********************
14:29:56:************************* Folding@home Client *************************
14:29:56:        Website: http://folding.stanford.edu/
14:29:56:      Copyright: (c) 2009-2016 Stanford University
14:29:56:         Author: Joseph Coffland <joseph@cauldrondevelopment.com>
14:29:56:           Args: 
14:29:56:         Config: C:\Users\rick\AppData\Roaming\FAHClient\config.xml
14:29:56:******************************** Build ********************************
14:29:56:        Version: 7.4.15
14:29:56:           Date: Aug 17 2016
14:29:56:           Time: 04:33:41
14:29:56:     Repository: Git
14:29:56:       Revision: 4f3e0e25571a9f691719f0c273739294bde517dd
14:29:56:         Branch: master
14:29:56:       Compiler: GNU 5.3.1 20160205
14:29:56:        Options: -std=gnu++98 -I/mingw64/include -O3 -funroll-loops -ffast-math
14:29:56:                 -mfpmath=sse -fno-unsafe-math-optimizations -msse2
14:29:56:       Platform: linux2 4.6.0-1-amd64
14:29:56:           Bits: 64
14:29:56:           Mode: Release
14:29:56:******************************* System ********************************
14:29:56:            CPU: Intel(R) Core(TM) i7-3770 CPU @ 3.40GHz
14:29:56:         CPU ID: GenuineIntel Family 6 Model 58 Stepping 9
14:29:56:           CPUs: 8
14:29:56:         Memory: 7.94GiB
14:29:56:    Free Memory: 6.40GiB
14:29:56:        Threads: WINDOWS_THREADS
14:29:56:     OS Version: 6.2
14:29:56:    Has Battery: false
14:29:56:     On Battery: false
14:29:56:     UTC Offset: -6
14:29:56:            PID: 6520
14:29:56:            CWD: C:\Users\rick\AppData\Roaming\FAHClient
14:29:56:             OS: Windows 10 Pro
14:29:56:        OS Arch: AMD64
14:29:56:           GPUs: 2
14:29:56:          GPU 0: Bus:1 Slot:0 NVIDIA:5 GP106 [GeForce GTX 1060 6GB]
14:29:56:          GPU 1: Bus:5 Slot:0 NVIDIA:5 GM206 [GeForce GTX 960]
14:29:56:  CUDA Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:6.1 Driver:8.0
14:29:56:  CUDA Device 1: Platform:0 Device:1 Bus:5 Slot:0 Compute:5.2 Driver:8.0
14:29:56:OpenCL Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:1.2 Driver:376.48
14:29:56:OpenCL Device 1: Platform:0 Device:1 Bus:5 Slot:0 Compute:1.2 Driver:376.48
14:29:56:  Win32 Service: false
14:29:56:***********************************************************************
14:29:56:<config>
14:29:56:  <!-- Slot Control -->
14:29:56:  <power v='FULL'/>
14:29:56:
14:29:56:  <!-- User Information -->
14:29:56:  <passkey v='********************************'/>
14:29:56:  <user v='Ricoocks'/>
14:29:56:
14:29:56:  <!-- Folding Slots -->
14:29:56:  <slot id='0' type='CPU'/>
14:29:56:  <slot id='1' type='GPU'/>
14:29:56:  <slot id='2' type='GPU'/>
14:29:56:</config>
14:29:56:Trying to access database...
14:29:56:Successfully acquired database lock
14:29:56:Enabled folding slot 00: READY cpu:6
14:29:56:Enabled folding slot 01: READY gpu:0:GP106 [GeForce GTX 1060 6GB]
14:29:56:Enabled folding slot 02: READY gpu:1:GM206 [GeForce GTX 960]
14:29:56:WU02:FS02:Starting
14:29:56:WU02:FS02:Running FahCore: "C:\Program Files\FAHClient/FAHCoreWrapper.exe" C:\Users\rick\AppData\Roaming\FAHClient\cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21.exe -dir 02 -suffix 01 -version 704 -lifeline 6520 -checkpoint 15 -opencl-platform 0 -gpu-vendor nvidia -gpu 1
14:29:57:WU02:FS02:Started FahCore on PID 6696
14:29:57:WU02:FS02:Core PID:6720
14:29:57:WU02:FS02:FahCore 0x21 started
14:29:57:WU01:FS00:Starting
14:29:57:WU01:FS00:Running FahCore: "C:\Program Files\FAHClient/FAHCoreWrapper.exe" C:\Users\rick\AppData\Roaming\FAHClient\cores/web.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 01 -suffix 01 -version 704 -lifeline 6520 -checkpoint 15 -np 6
14:29:57:WU01:FS00:Started FahCore on PID 6740
14:29:57:WU01:FS00:Core PID:6764
14:29:57:WU01:FS00:FahCore 0xa4 started
14:29:57:WU01:FS00:0xa4:
14:29:57:WU01:FS00:0xa4:*------------------------------*
14:29:57:WU01:FS00:0xa4:Folding@Home Gromacs GB Core
14:29:57:WU01:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
14:29:57:WU01:FS00:0xa4:
14:29:57:WU01:FS00:0xa4:Preparing to commence simulation
14:29:57:WU01:FS00:0xa4:- Ensuring status. Please wait.
14:29:58:WU02:FS02:0x21:*********************** Log Started 2017-01-15T14:29:57Z ***********************
14:29:58:WU02:FS02:0x21:Project: 9197 (Run 2, Clone 48, Gen 134)
14:29:58:WU02:FS02:0x21:Unit: 0x000000deab40415c57cb3fe14cdd7184
14:29:58:WU02:FS02:0x21:CPU: 0x00000000000000000000000000000000
14:29:58:WU02:FS02:0x21:Machine: 2
14:29:58:WU02:FS02:0x21:Digital signatures verified
14:29:58:WU02:FS02:0x21:Folding@home GPU Core21 Folding@home Core
14:29:58:WU02:FS02:0x21:Version 0.0.17
14:29:58:WU02:FS02:0x21:  Found a checkpoint file
14:30:01:WU00:FS01:Connecting to 171.67.108.45:80
14:30:02:WU00:FS01:Assigned to work server 171.64.65.84
14:30:02:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:GP106 [GeForce GTX 1060 6GB] from 171.64.65.84
14:30:02:WU00:FS01:Connecting to 171.64.65.84:8080
14:30:03:WU00:FS01:Downloading 3.36MiB
14:30:04:WU02:FS02:0x21:Completed 700000 out of 2500000 steps (28%)
14:30:04:WU02:FS02:0x21:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
14:30:07:WU01:FS00:0xa4:- Looking at optimizations...
14:30:07:WU01:FS00:0xa4:- Working with standard loops on this execution.
14:30:07:WU01:FS00:0xa4:- Previous termination of core was improper.
14:30:07:WU01:FS00:0xa4:- Files status OK
14:30:07:WU01:FS00:0xa4:- Expanded 825768 -> 1401112 (decompressed 169.6 percent)
14:30:07:WU01:FS00:0xa4:Called DecompressByteArray: compressed_data_size=825768 data_size=1401112, decompressed_data_size=1401112 diff=0
14:30:07:WU01:FS00:0xa4:- Digital signature verified
14:30:07:WU01:FS00:0xa4:
14:30:07:WU01:FS00:0xa4:Project: 9034 (Run 657, Clone 6, Gen 112)
14:30:07:WU01:FS00:0xa4:
14:30:07:WU01:FS00:0xa4:Entering M.D.
14:30:08:WU00:FS01:Download complete
14:30:08:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:9192 run:0 clone:32 gen:205 core:0x21 unit:0x00000148ab40415457cb2d116cafcaed
14:30:08:WU00:FS01:Starting
14:30:08:WU00:FS01:Running FahCore: "C:\Program Files\FAHClient/FAHCoreWrapper.exe" C:\Users\rick\AppData\Roaming\FAHClient\cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21.exe -dir 00 -suffix 01 -version 704 -lifeline 6520 -checkpoint 15 -opencl-platform 0 -gpu-vendor nvidia -gpu 0
14:30:08:WU00:FS01:Started FahCore on PID 6972
14:30:08:WU00:FS01:Core PID:6996
14:30:08:WU00:FS01:FahCore 0x21 started
14:30:09:WU00:FS01:0x21:*********************** Log Started 2017-01-15T14:30:08Z ***********************
14:30:09:WU00:FS01:0x21:Project: 9192 (Run 0, Clone 32, Gen 205)
14:30:09:WU00:FS01:0x21:Unit: 0x00000148ab40415457cb2d116cafcaed
14:30:09:WU00:FS01:0x21:CPU: 0x00000000000000000000000000000000
14:30:09:WU00:FS01:0x21:Machine: 1
14:30:09:WU00:FS01:0x21:Reading tar file core.xml
14:30:09:WU00:FS01:0x21:Reading tar file system.xml
14:30:09:WU00:FS01:0x21:Reading tar file integrator.xml
14:30:09:WU00:FS01:0x21:Reading tar file state.xml
14:30:09:WU00:FS01:0x21:Digital signatures verified
14:30:09:WU00:FS01:0x21:Folding@home GPU Core21 Folding@home Core
14:30:09:WU00:FS01:0x21:Version 0.0.17
14:30:13:WU01:FS00:0xa4:Mapping NT from 6 to 6 
14:30:14:WU01:FS00:0xa4:Completed 0 out of 250000 steps  (0%)
14:30:17:WU00:FS01:0x21:Completed 0 out of 2500000 steps (0%)
14:30:17:WU00:FS01:0x21:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
14:31:42:WU00:FS01:0x21:Completed 25000 out of 2500000 steps (1%)
14:31:42:WU01:FS00:0xa4:Completed 2500 out of 250000 steps  (1%)
14:32:08:WU02:FS02:0x21:Completed 725000 out of 2500000 steps (29%)
14:32:22:20:127.0.0.1:New Web connection
14:33:08:WU00:FS01:0x21:Completed 50000 out of 2500000 steps (2%)
14:33:12:WU01:FS00:0xa4:Completed 5000 out of 250000 steps  (2%)
Ricorocks
Posts: 475
Joined: Thu Aug 04, 2016 1:49 pm
Location: Georgetown, Texas

Re: Log reading

Post by Ricorocks »

Note on above log, @0830hrs. 1/13/17 both GPU's 1060 & 960 web control shows "GREEN", @0830 1/14/17 GPU 1060 shows "Yellow" circle. Rebooted the machine both 1060 & 960 both green at web control
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Log reading

Post by bruce »

Let me be sure we are all on the same page. Here's what I think is true. (Please correct me if I don't have it right.)

Assuming you do not intentionally shut down FAHClient, there are three ways that the Windows user can interact with FAHControl
1) FAHControl.exe (aka. Advanced Control)
2) Web-Control (which is generally started when FAHClient is started
3) The Icon running in systray (Lower right corner of the screen -- near the time and date)

The "Pause" function is part of 1 and 2. The colored spinners that you are talking about are in 2. The "stop" or "quit" you're taking about is in 3.

Your original question seems to have been: Option 3 provides a "quit" function but no "un-quit" function, so how do I get things running again? (other that by rebooting).

Do I understand the facts correctly and the question you've posed?
Ricorocks
Posts: 475
Joined: Thu Aug 04, 2016 1:49 pm
Location: Georgetown, Texas

Re: Log reading

Post by Ricorocks »

Hi Guys,
Here's my routine 0830hrs each morning:

Visit "web control" 3 machines, via VNC viewer @0830hrs., by clicking Sys tray icon & choosing "WEB CONTROL" NEXT!!!!!

determine if both GPU's are green - if so close web control & visit another machines web control, to insure green gpu.

If say one GPU shows green & the other shows yellow - reboot, after reboot re-visit web control & normally all GPU's will be green.

____________
____________

With one GPU not green, in the past I was able to restart it by:

1. System tray icon FAH >> choose "quit"

2. DESKTOP ICON """"""
Shortcut icon properties. "C:\Program Files\FAHClient\HideConsole.exe" "C:\Program Files\FAHClient\FAHClient.exe" --open-web-control.""""" double clicking it & FAH would start

Now dbl clicking the shortcut does NOT restart FAH, only reboot, restarts FAH.

Forcing FAH to restart itself usually accomplishes getting both GPU's green again.

On 1/15/17 @ 0830hrs the two gpu machine had a yellow circle for the 1060 reboot & green again
1/16/17 " AGG all gpu's green
1/17/17 AGG
1/18/17 AGG
Post Reply