Moderators: Site Moderators, PandeGroup
bruce wrote:I don't see any evidence that Stanford is "sending out tons of bad work units" -- only that something has happened to your system that we can only guess about. (Without the information requested in my sig, all we can do is guess.)
I don't know if this information will be useful, but scarlet_tech apparently is running 30 slots. (Counting twice for recent reinstalls.) The last WU returned from 27 of them all seem to have earned reasonable points. The last WU from three of them have received 0 points.
2015-09-05 22:05:50 p10495 r30 c2 g33
2015-09-28 04:07:00 p9835 r68 c6 g9
2015-10-06 18:07:55 p9430 r56 c2 g111
Only the last one appears to be recent. That particular WU was reassigned and successfully completed by someone else about 9.6 hours later so it's not a bad WU
scarlet_tech wrote:Could you please provide a link where you see this information, please?
bruce wrote:scarlet_tech wrote:Could you please provide a link where you see this information, please?
Sorry, no. The Pande Group has restricted that data to forum Moderators only.
scarlet_tech wrote:bruce wrote:I will share some posts from EVGA, since they aren't hidden and may be helpful.
Some are helpful, many are not.Since forum Moderators can look up information, my name here does not match my folding name. My folding name is Scarlet-Tech user 654307 according to extreme over clocking. The results you pulled were for scarlet_tech. I mistyped when entering my forum name.
There's no requirement that you name match, but when I found numerous reports from the name you gave me, I made a (reasonable?) assumption. My Bad.
If you want me to correct your mis-typed name, send me a PM.
[quote=Mekhed]
I'm gonna say that you're not wrong. Both of my machines have been rock solid for months folding. I had what I expected the first 3 days of the challenge and on day 4 I also dropped about 300k ppd. The last 7 days have been a struggle just to get WU's to finish and not be returned as "bad work units". I've changed drivers and lowered video card memory speeds and still having problems. You're not wrong Scarlet, something changed on day 4
*********************** Log Started 2015-11-09T20:53:12Z ***********************
20:54:09:WU00:FS01:0x21:ERROR:Potential energy error of 805.531, threshold of 10
20:54:09:WU00:FS01:0x21:ERROR:Reference Potential Energy: -1.23368e+006 | Given Potential Energy: -1.23287e+006
20:54:10:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
If there's a completion between one team's hardware which is overclocked and another team's hardware is stable, guess which one will win the competition.The information is available to show that isn't just one of two people experiencing the issue, but our team number dropped substantially with the same number of folders pushing out units. This started on November 4th,and has been continuous since then.
I will continue to post edits to this thread and provide more failed units from other members as they post them, so that it can't be ignored since our entire team and other teams are experiencing this issue.
11:25:34:WARNING:WU02:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
11:25:34:WU02:FS01:Sending unit results: id:02 state:SEND error:FAULTY project:9625 run:1 clone:1 gen:44
23:11:11:WU02:FS03:0x21:ERROR:exception: Error downloading array velm: clEnqueueReadBuffer (-5)
23:11:12:WARNING:WU02:FS03:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
23:11:12:WU02:FS03:Sending unit results: id:02 state:SEND error:FAULTY project:9704 run:64 clone:18 gen:72 core:0x21
There are reports of this sort of error with 171.64.65.56. Assignments from the server have been suspended until the problem can be resolved. (I'm assuming this was the server involved ... if that's not true, then I have no explanation.)11:53:58:WU00:FS01:Upload 95.92%
11:54:03:WU00:FS01:Upload complete
11:54:03:WU00:FS01:Server responded WORK_QUIT (404)
11:54:03:WARNING:WU00:FS01:Server did not like results, dumping
11:54:03:WU00:FS01:Cleaning up
12:21:14:WARNING:WU02:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
12:21:14:WU02:FS01:Sending unit results: id:02 state:SEND error:FAULTY project:9630 run:1 clone:23 gen:46 core:0x21
12:21:14:WU02:FS01:Uploading 9.50KiB to 171.67.108.155
12:21:14:WU02:FS01:Connecting to 171.67.108.155:8080
12:22:08:WARNING:WU00:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
12:22:08:WU00:FS00:Sending unit results: id:00 state:SEND error:FAULTY project:9629 run:0 clone:23 gen:37 core:0x21
15:03:29:WARNING:WU00:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
15:03:29:WU00:FS00:Sending unit results: id:00 state:SEND error:FAULTY project:9634 run:1 clone:40 gen:14 core:0x21
These are all just a tiny example of errors that are occurring now, and I am trying to get all EVGA folders on board to post every single bad unit that is received across all platforms.. The above listed platforms are nearly identical to my system.
bruce wrote:This research has taken me almost an hour, but it does seem to indicate that several machines are marginally stable and they can't handle the increased utilization that these projects are seeking. This certainly is not the first time that FAH has created a more stressful benchmark that the benchmark routines commonly used by overclockers.
scarlet_tech wrote:15:13:02:WU02:FS01:0x21:Version 0.0.12
15:13:41:WU02:FS01:0x21:ERROR:exception: bad allocation
15:13:41:WU02:FS01:0x21:Folding@home Core Shutdown: BAD_WORK_UNIT
15:13:42:WU02:FS01:Sending unit results: id:02 state:SEND error:FAULTY project:9205 run:16 clone:52 gen:7 core:0x21
15:14:39:WU03:FS01:0x21:ERROR:exception: bad allocation
15:14:39:WU03:FS01:0x21:Folding@home Core Shutdown: BAD_WORK_UNIT
15:14:40:WU03:FS01:Sending unit results: id:03 state:SEND error:FAULTY project:9205 run:3 clone:33 gen:9 core:0x21
20:37:20:WU02:FS02:0x21:ERROR:exception: bad allocation
20:37:20:WU02:FS02:0x21:Folding@home Core Shutdown: BAD_WORK_UNIT
20:37:20:WU02:FS02:Sending unit results: id:02 state:SEND error:FAULTY project:9206 run:0 clone:1351 gen:11 core:0x21
06:27:10:WU00:FS01:0x21:Folding@home Core Shutdown: BAD_WORK_UNIT
06:27:10:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
06:27:10:WU00:FS01:Sending unit results: id:00 state:SEND error:FAULTY project:9207 run:0 clone:22 gen:32 core:0x21
03:56:00:WU02:FS01:0x21:ERROR:exception: bad allocation
03:56:00:WU02:FS01:0x21:Folding@home Core Shutdown: BAD_WORK_UNIT
03:56:00:WU02:FS01:Sending unit results: id:02 state:SEND error:FAULTY project:9209 run:0 clone:50 gen:15 core:0x21
13:52:40:WU02:FS01:Upload complete
13:52:40:WU02:FS01:Server responded WORK_QUIT (404)
13:52:40:WARNING:WU02:FS01:Server did not like results, dumping
13:52:20:WU02:FS01:0x18:Folding@home Core Shutdown: FINISHED_UNIT
13:52:21:WU02:FS01:Sending unit results: id:02 state:SEND error:NO_ERROR project:9430 run:212 clone:9 gen:20 core:0x18
13:52:22:WU00:FS01:0x18:Project: 10486 (Run 0, Clone 22, Gen 56)
13:52:24:WU00:FS01:0x18:Version 0.0.4
Scarlet-Tech wrote:So, it is OK to post stuff like that, but not share the link to it. Makes sense. Wouldn't want the truth out there I guess.
Return to New Donors start here
Users browsing this forum: No registered users and 1 guest