Page 1 of 1

New psummary.html column: Kfactor

Posted: Wed Nov 18, 2009 8:44 pm
by smoking2000
Since two hours ago, the psummary pages contain a new column "Kfactor" (although the table header is not all pages).

What is the meaning of the values in this column? And will this column remain in the psummary pages, or was this just a test or hickup?

I would like to know this, as would all other 3rd party devs (I presume) who have to update their psummary parsers :)

Re: New psummary.html column: Kfactor

Posted: Wed Nov 18, 2009 8:49 pm
by ChasR
Could it be to calculate the -bigadv WUs? They have a k factor.

Re: New psummary.html column: Kfactor

Posted: Wed Nov 18, 2009 8:54 pm
by smoking2000
Heh, just after this post I read the following on the FAH blog:
We will be adding one new field to the psummary web pages (eg, http://fah-web.stanford.edu/psummary.html) to include a new part of FAH stats. We will give more info soon, but for now, please note the additional column on the far right.

For third party tools which parse these web pages, we have added the column at the far right in the hopes that this would not break current tools' parsing routines. Note that we are also looking in developing more systematic ways to interact with third party utilities, although that is still in the works. For now, we will give heads up to changes here in this blog.

Re: New psummary.html column: Kfactor

Posted: Wed Nov 18, 2009 9:31 pm
by smoking2000
Since the psummary is being updated, would this be the right time for some improvement requests?

- Move the <META> tags to the <head>, and not before the <html> as it is now, to make the code W3C valid.
- Don't use two different content formats for the Kfactor column (one uses: <font size=-1>1</font>, the other uses only: 3.30)

Re: New psummary.html column: Kfactor

Posted: Thu Nov 19, 2009 1:53 am
by VijayPande
We'll add more info on Kfactor pretty soon -- we wanted to get out ahead of things with the infrastructure though. Good point regarding the cleanup -- I'll pass that request along.

Re: New psummary.html column: Kfactor

Posted: Thu Nov 19, 2009 5:48 am
by smoking2000
Thanks

Re: New psummary.html column: Kfactor

Posted: Thu Nov 19, 2009 10:48 am
by toTOW
Since not all projects use this factor (yet ?), I guess it would be better to choose an escape character (like "-" or "NA") on projects that doesn't use the "BigAdv" formula instead of 1 (imho, no bonus scheme is different from bonus scheme with k=1) ... I guess it would help 3rd party developers generalize their code to all projects

Re: New psummary.html column: Kfactor

Posted: Thu Nov 19, 2009 10:56 am
by smoking2000
toTOW wrote:Since not all projects use this factor (yet ?), I guess it would be better to choose an escape character (like "-" or "NA") on projects that doesn't use the "BigAdv" formula instead of 1 (imho, no bonus scheme is different from bonus scheme with k=1) ... I guess it would help 3rd party developers generalize their code to all projects
Please don't. A factor of 1 is a valid factor too. Also the value in column is now either an integer (\d+) or floating point (\d+\.\d+), this is easy to parse with regular expressions. Not so if you mix in characters for string like "NA" or "-".

The current values can be used in code like: $some_number * $kfactor. Or to compare kfactors ($project_x_kfractor > $project_y_kfactor), this isn't possible when strings are also used in the kfactor column.

Re: New psummary.html column: Kfactor

Posted: Thu Nov 19, 2009 11:02 am
by toTOW
I guess you didn't get my point : if some projects are not using the bonus scheme (and I guess only BigAdv projects actually use it), it's not the same as projects using the bonus scheme with k = 1 ... so you can't set the Kfactor column to 1 by default as it is currently done ...

Re: New psummary.html column: Kfactor

Posted: Thu Nov 19, 2009 12:06 pm
by P5-133XL
I agree that a parseable number is a much better alternative to fill the column than some invalid characters like "NA" which would require some form of exception treatment. While a k=factor of 1 still produces a bonus using the bigadv equations, the k-factor of 0 won't so that is probably a safe default for non bonus WU's.

Really, the programmers are very likely to already understand these types of concepts and I trust them to do what is necessary and best for the project. I would contend that this level of programming detail does not need to be discussed here, until or unless they mess up.

Re: New psummary.html column: Kfactor

Posted: Thu Nov 19, 2009 3:28 pm
by VijayPande
I'll pass the default value issue along as well. This is easy to change in the psummary code, so this is more of an issue of what makes the most sense.

Re: New psummary.html column: Kfactor

Posted: Fri Nov 20, 2009 6:53 am
by harlam357
Thank you for adding it to the right. That definitely saved my code, as it's pretty "dumb" in regards to the psummary parsing. Other than being rather inflexible, it's worked great so I haven't messed with it much. This change gives me a reason to overhaul and make it smarter. :)

Based on these formulas:

Code: Select all

Total points = base points * bonus factor

The bonus factor is computed based on the time from when our server issues the work unit to when it receives the work unit (WU_time), the time from work issuance to when the deadline would expire (deadline_time), the time from issuance to when the work unit times out and is marked for reissuance (timeout_time), and a constant factor k.
If WU_time > timeout time, bonus factor = 1.
If WU_time <= timeout time, bonus factor = sqrt(deadline_time * k / WU_time)
.

I feel that if a Project does not qualify for a bonus, then a k value of 0 should be assigned... since, as someone else mentioned, a k value of 1 would be valid for Bonus Projects.

The addition of this k value also give us (I think) everything we need to correctly calculate bonus PPD! :)

A few other questions (since I want to try and be as accurate as possible with the calculation):

- Is there anywhere we can find the timeout_time? Is this the final deadline? I'm under the impression that the current bigadv trial projects are 3 days (neither the preferred or final deadline), but I imagine this value will vary with new projects later on. It will be essential to know this value to determine when to set the bonus factor back to 1 and effectively remove all bonus PPD from the calculation.

- To follow on that, is the deadline_time preferred or final deadline... I assume the former.

If there's anywhere I can find an example calculation using available information, that would be most helpful.

Re: New psummary.html column: Kfactor

Posted: Fri Nov 20, 2009 11:07 am
by ChasR
timeout_time = preferred deadline
deadtime_time = final deadline

YHM