#682 folding-2.pdc.kth.se: Noticeable V7 slowdown

Moderators: Site Moderators, FAHC Science Team

k1wi
Posts: 910
Joined: Tue Sep 22, 2009 10:48 pm

Re: folding-2.pdc.kth.se

Post by k1wi »

Is it possible to -oneunit the v7client and see whether a v6 client has the regular result? Or have you run 690*'s in the past on the v6 fine?
Image
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: folding-2.pdc.kth.se

Post by bruce »

K1wi:
FYI
In v7, the -oneunit option is called "Finish" and it can be done to all slots or selectively to any one if you're running more than one (such as SMP + GPU), so it's certainly possible.
k1wi
Posts: 910
Joined: Tue Sep 22, 2009 10:48 pm

Re: folding-2.pdc.kth.se

Post by k1wi »

Oops sorry, I meant to direct that at Grandpa, to see if he was willing to -oneunit it and switch over to the v6 client. :)
Image
Grandpa_01
Posts: 1122
Joined: Wed Mar 04, 2009 7:36 am
Hardware configuration: 3 - Supermicro H8QGi-F AMD MC 6174=144 cores 2.5Ghz, 96GB G.Skill DDR3 1333Mhz Ubuntu 10.10
2 - Asus P6X58D-E i7 980X 4.4Ghz 6GB DDR3 2000 A-Data 64GB SSD Ubuntu 10.10
1 - Asus Rampage Gene III 17 970 4.3Ghz DDR3 2000 2-500GB Segate 7200.11 0-Raid Ubuntu 10.10
1 - Asus G73JH Laptop i7 740QM 1.86Ghz ATI 5870M

Re: folding-2.pdc.kth.se

Post by Grandpa_01 »

Windows v6 had 10min. upload times on the 690x WU's. I will switch over to v6 on the next WU and check it out though it has been a long time since I ran v6 on Windows.
Image
2 - SM H8QGi-F AMD 6xxx=112 cores @ 3.2 & 3.9Ghz
5 - SM X9QRI-f+ Intel 4650 = 320 cores @ 3.15Ghz
2 - I7 980X 4.4Ghz 2-GTX680
1 - 2700k 4.4Ghz GTX680
Total = 464 cores folding
k1wi
Posts: 910
Joined: Tue Sep 22, 2009 10:48 pm

Re: folding-2.pdc.kth.se

Post by k1wi »

So far I've only had a regular SMP work unit report back using the v7 client. Upload speed to Stanford from here in New Zealand was comparable to v6. Ended up receiving the first 2689 I've had for a long time next however, so will have to wait further until I get a Swedish work unit.

k1wi
Image
Grandpa_01
Posts: 1122
Joined: Wed Mar 04, 2009 7:36 am
Hardware configuration: 3 - Supermicro H8QGi-F AMD MC 6174=144 cores 2.5Ghz, 96GB G.Skill DDR3 1333Mhz Ubuntu 10.10
2 - Asus P6X58D-E i7 980X 4.4Ghz 6GB DDR3 2000 A-Data 64GB SSD Ubuntu 10.10
1 - Asus Rampage Gene III 17 970 4.3Ghz DDR3 2000 2-500GB Segate 7200.11 0-Raid Ubuntu 10.10
1 - Asus G73JH Laptop i7 740QM 1.86Ghz ATI 5870M

Re: folding-2.pdc.kth.se

Post by Grandpa_01 »

I have a 6900 running on the same rig using Windows v6 it will complete in about 9 hours so we will see if there is any difference between v6 and v7 then.
Image
2 - SM H8QGi-F AMD 6xxx=112 cores @ 3.2 & 3.9Ghz
5 - SM X9QRI-f+ Intel 4650 = 320 cores @ 3.15Ghz
2 - I7 980X 4.4Ghz 2-GTX680
1 - 2700k 4.4Ghz GTX680
Total = 464 cores folding
Grandpa_01
Posts: 1122
Joined: Wed Mar 04, 2009 7:36 am
Hardware configuration: 3 - Supermicro H8QGi-F AMD MC 6174=144 cores 2.5Ghz, 96GB G.Skill DDR3 1333Mhz Ubuntu 10.10
2 - Asus P6X58D-E i7 980X 4.4Ghz 6GB DDR3 2000 A-Data 64GB SSD Ubuntu 10.10
1 - Asus Rampage Gene III 17 970 4.3Ghz DDR3 2000 2-500GB Segate 7200.11 0-Raid Ubuntu 10.10
1 - Asus G73JH Laptop i7 740QM 1.86Ghz ATI 5870M

Re: folding-2.pdc.kth.se

Post by Grandpa_01 »

It does not happen with Windows v6 it took 6 1/2 minutes to send a 6900 to the Sweedish server using the same machine.

Code: Select all

[13:02:15] Completed 245000 out of 250000 steps  (98%)
[13:22:50] Completed 247500 out of 250000 steps  (99%)
[13:43:23] Completed 250000 out of 250000 steps  (100%)
[13:43:35] DynamicWrapper: Finished Work Unit: sleep=10000
[13:43:45] 
[13:43:45] Finished Work Unit:
[13:43:45] - Reading up to 52713120 from "work/wudata_01.trr": Read 52713120
[13:43:45] trr file hash check passed.
[13:43:45] - Reading up to 42794504 from "work/wudata_01.xtc": Read 42794504
[13:43:45] xtc file hash check passed.
[13:43:45] edr file hash check passed.
[13:43:45] logfile size: 199658
[13:43:45] Leaving Run
[13:43:50] - Writing 95875222 bytes of core data to disk...
[13:43:51]   ... Done.
[13:44:06] - Shutting down core
[13:44:06] 
[13:44:06] Folding@home Core Shutdown: FINISHED_UNIT
[13:44:09] CoreStatus = 64 (100)
[13:44:09] Sending work to server
[13:44:09] Project: 6900 (Run 32, Clone 18, Gen 48)


[13:44:09] + Attempting to send results [May 19 13:44:09 UTC]
[13:50:41] + Results successfully sent
[13:50:41] Thank you for your contribution to Folding@Home.
[13:50:41] + Number of Units Completed: 132

[13:50:47] - Preparing to get new work unit...
[13:50:47] Cleaning up work directory
[13:50:47] + Attempting to get work packet
[13:50:47] Passkey found
[13:50:47] - Connecting to assignment server
[13:50:48] - Successful: assigned to (130.237.232.141).
[13:50:48] + News From Folding@Home: Welcome to Folding@Home
[13:50:48] Loaded queue successfully.
[13:51:35] + Closed connections
[13:51:35] 
[13:51:35] + Processing work unit
[13:51:35] Core required: FahCore_a5.exe
[13:51:35] Core found.
[13:51:35] Working on queue slot 02 [May 19 13:51:35 UTC]
[13:51:35] + Working ...
[13:51:35] 
[13:51:35] *------------------------------*
Image
2 - SM H8QGi-F AMD 6xxx=112 cores @ 3.2 & 3.9Ghz
5 - SM X9QRI-f+ Intel 4650 = 320 cores @ 3.15Ghz
2 - I7 980X 4.4Ghz 2-GTX680
1 - 2700k 4.4Ghz GTX680
Total = 464 cores folding
k1wi
Posts: 910
Joined: Tue Sep 22, 2009 10:48 pm

Re: folding-2.pdc.kth.se

Post by k1wi »

Interesting.

I guess what we need now (in lieu of me receiving and returning a v7 bigadv work unit) is some input from Kasson or jcoffland as to whether this is a result of a difference in the way v7 manages uploads, or a server issue just affecting v7.


Apart from me returning a v7 6900/6901 work unit, I think we've about isolated as much as we can regarding the issue.
Image
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: folding-2.pdc.kth.se

Post by bruce »

I think the only way we're going to be able to gather accurate data is to run both V6 and V7 simultaneously. I propose that somebody configure a Quad with both a V6 and a V7 client. Have them both set for -smp 2. Get lucky enough to be assigned two WUs from the same project. With a little creative settings of affinity, it should be possible to slow down whichever one is ahead of the other one so the can both finish at the same time and both upload concurrently.

That should provide a side-by-side comparison of uploading a result from V6 and a result from V7 with precisely the same internet connection. We'll be able to see if there is any systematic difference.

That's not an easy thing to do, and I'm certainly open to other suggestions, but it's the only think I can think of that really addresses the question about whether V6 and V7 are different. Everything else compares uploads under different conditions.
k1wi
Posts: 910
Joined: Tue Sep 22, 2009 10:48 pm

Re: folding-2.pdc.kth.se

Post by k1wi »

Notwithstanding my results (other side of the pacific to California and opposite side of the world to Sweden), I think that is going to take a 16 core machine, as only bigadv are being sent to Sweden... Plus I know the v6 can reduce all non-FAH internet traffic to a standstill while uploading without QOS, so you'd also have a significant potential interaction occurring between clients (plus a fast enough upload to accommodate both clients simultaneously).

I understand that "different conditions" exist, but if there is a consistent pattern, time after time as is appearing to occur, it should probably be investigated by Stanford. All we can do as users is basically continue what we already have done. I suspect that Stanford are probably the only people capable of the resources to undertake a test as you suggest... Hopefully our observations and tests have aided their trouble shooting...
Image
Napoleon
Posts: 887
Joined: Wed May 26, 2010 2:31 pm
Hardware configuration: Atom330 (overclocked):
Windows 7 Ultimate 64bit
Intel Atom330 dualcore (4 HyperThreads)
NVidia GT430, core_15 work
2x2GB Kingston KVR1333D3N9K2/4G 1333MHz memory kit
Asus AT3IONT-I Deluxe motherboard
Location: Finland

Re: folding-2.pdc.kth.se

Post by Napoleon »

Could it be that the developers are experimenting with some QoS features already?
Win7 64bit, FAH v7, OC'd
2C/4T Atom330 3x667MHz - GT430 2x832.5MHz - ION iGPU 3x466.7MHz
NaCl - Core_15 - display
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: folding-2.pdc.kth.se

Post by bruce »

k1wi: I realized we were talking about bigadv after I posted that so, yes, it's going to take a big machine, though perhaps not 16 cores. What it will take is someone who is willing to give up a bonus -- or most of it. When a WU finishes, the client can be blocked from uploading by either aborting the upload or by simply disconnecting from the internet. After a WU has finished on each client, but not uploaded, it's pretty simple to reconnect to the internet and start both clients at the same time. The V6 client can be started with -send all, which will prevent the download of a new WU. The V7 client can be started with pause-on-start=true which should still attempt to send results that are in queue. Setting up WUs that are ready to upload is moderately complicated, but it should be easy to start both clients within a second of each other. and if anything doesn't go as planned, yank the internet cable and start over.
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: folding-2.pdc.kth.se

Post by bruce »

Napoleon wrote:Could it be that the developers are experimenting with some QoS features already?
Not that I'm aware of. That ticket is listed as Milestone: Public Release which is still quite some time in the future. Things listed as Milestone: 7.1.25 are near enough that they are working on them in the lab (or maybe even Milestone Open Beta Phase 2) but certainly not in a client that has been distributed yet.
GreyWhiskers
Posts: 660
Joined: Mon Oct 25, 2010 5:57 am
Hardware configuration: a) Main unit
Sandybridge in HAF922 w/200 mm side fan
--i7 2600K@4.2 GHz
--ASUS P8P67 DeluxeB3
--4GB ADATA 1600 RAM
--750W Corsair PS
--2Seagate Hyb 750&500 GB--WD Caviar Black 1TB
--EVGA 660GTX-Ti FTW - Signature 2 GPU@ 1241 Boost
--MSI GTX560Ti @900MHz
--Win7Home64; FAH V7.3.2; 327.23 drivers

b) 2004 HP a475c desktop, 1 core Pent 4 HT@3.2 GHz; Mem 2GB;HDD 160 GB;Zotac GT430PCI@900 MHz
WinXP SP3-32 FAH v7.3.6 301.42 drivers - GPU slot only

c) 2005 Toshiba M45-S551 laptop w/2 GB mem, 160GB HDD;Pent M 740 CPU @ 1.73 GHz
WinXP SP3-32 FAH v7.3.6 [Receiving Core A4 work units]
d) 2011 lappy-15.6"-1920x1080;i7-2860QM,2.5;IC Diamond Thermal Compound;GTX 560M 1,536MB u/c@700;16GB-1333MHz RAM;HDD:500GBHyb w/ 4GB SSD;Win7HomePrem64;320.18 drivers FAH 7.4.2ß
Location: Saratoga, California USA

Re: folding-2.pdc.kth.se

Post by GreyWhiskers »

Don't we need to make sure we understand which side of the transfer has the bottleneck? If ONE bigadv transfer hits the sender's upload limits, then TWO simultaneous transfers would just force sharing the outgoing bandwidth.

I have a fairly fast Comcast cable internet connection. The upload speed is slower than the download speed, and there is an allowance for a higher speed "initial burst", which then evens out to whatever caps that Comcast has put on my modem.

HFM is showing that the average uplink for the GPU transfers is about 462 KiBytes/sec, and the average transfer time is about 6 seconds for about 2.5 Mbytes.

HFM shows the average uplink for bigadv transfers is about 272 KiBytes/sec, and the average transfer time is about 366 seconds for about 100 Mbytes

The difference in upload speed could be because my internet service provider throttles the uplink for the really large transfers - say after 10 seconds of max effort, or that Comcast would let me transfer at 462 KiBytes/sec, but the receiver is the bottleneck. I don't know.

Bottom line is that before we try to have someone simultaneously upload two 100 Mbyte data sets, they understand whether their local internet service is the throttle or the Stanford server.
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: folding-2.pdc.kth.se

Post by bruce »

GreyWhiskers wrote:Bottom line is that before we try to have someone simultaneously upload two 100 Mbyte data sets, they understand whether their local internet service is the throttle or the Stanford server.
I agree.

My point is that if we believe this is a V7 only problem, then we'll need to demonstrate that to be true rather than something else causing the problems. Demonstrating it can become rather complicated. There are many possibilities for places where throttling might occur and we really don't have a handle on which is causing the reported slow-downs.
Post Reply