Project Hamster

A word on statistics

Posted in News by Toms on June 20, 2009

Any visualization that abstracts data is a dialogue of trust between observer and creator. Statistics are used to expose and display information from angle that reveals new knowledge, that could not be seen just by looking on the whole haystack. Since we are now in the era of information overflow, they have become more essential than ever.

Now, that’s quite a mouthful, hahaha. However, you should always question the methodology used to produce results.

In open source we trust in masses of individuals – that somebody certainly has looked on the code of the program you are using to transfer sensible data, that somebody has made sure that your computer won’t fry up when carried in bag, because program suddenly decided to wake it up, or, at least, that somebody else has already been burned, and the bug has been fixed and your operating system contains it, and, that somebody has made sure that the visualizations don’t lie (at least not horribly).

Ok, i got carried away, but it’s all because i did some adjustments to show statistics more appropriately.

Before the recent commit by Patryk patrys Zawadski, we were splitting activities that overlap midnight in two. In the 2.27 cycle i made the split only happen virtually, but Patryk moved it further and now we have a concept called hamster_midnight, which corresponds to 5:30am. Activities before 5:30 fall into previous day, activities overlapping 5:30 tip to the end where the largest part of the activity is.

So, in the first iteration of stats, to just get things done, i did the same old midnight split. That certainly influenced average starts and ends. Now i just pushed to git master slightly better approach – we now can have 24h+ timespan in the starts and ends charts, and facts respect the hamster midnight.

Here is the resulting difference:

stats

How could you possibly tell which is the new (on the left) and which is the previous (on the right) version (hint – compare week end days, also the hacking now has scooted more to the end of the day).

Truth is – unless the data is totally opposite from your gut feeling – you can’t.

9 Responses

Subscribe to comments with RSS.

  1. jeblad said, on July 8, 2009 at 10:16 am

    Would it be possible to filter the statistics? I’m running version 2.26 packaged with Ubuntu 9.04. What if there were some way to select a subset of the overall logged time, either on a category, an action or a week or weekday, then let the other reports adapt to the new constraints. This would function as some sort of data mining tool.

    Assume that you have an activity “programming” and two categories “Google” and “Bing”, could also be the other way around, then you could choose to filter on Google to get the number of working hours on programming for Google and vica verca.

    Seems like a fairly easy concept to grasp, and it would be sufficient it seems.

  2. Dave said, on July 15, 2009 at 5:48 pm

    I’ve upgraded from version 2.26 to 2.27.5 and the new features look great! I have one question though and I’m not sure if it’s a feature or a bug.

    In the overview list, entries beginning early in the morning, like 3am – 5am, appear at the end of day before. Is this expected because it is before a ‘normal’ workday? I don’t see a setting to define the hours of the workday.

    On a side note, I have found this tool extremely useful since I first found it in the recent Fedora 11 release. It does exactly what I need from a time tracking program.

    Thanks for your work!

    • Toms said, on July 15, 2009 at 7:55 pm

      that’s one of things that has changed in 2.27 – activities before 5:30 fall into previous day. activities that overlap 5:30 will tip over to the side where largest part of activity is.

      • Dave said, on July 15, 2009 at 8:07 pm

        Can that time be changed in a config somewhere?

      • Toms said, on July 16, 2009 at 9:47 am

        there is no configuration parameter yet.
        we hoped that the set time works good enough.
        but sure we will introduce an option.

        still, i’d be interested in finding out your case.

      • Dave said, on July 16, 2009 at 1:07 pm

        I found the two places in your code where the 5:30 is set and changed it to suit my needs. It’s still the perfect solution for my needs. Thanks again!

  3. Northrop said, on July 16, 2009 at 11:32 pm

    I’m very happy about the 5:30 time. A option would be nicer, but midnight just didn’t work last time I had to rebuild the world and worked until 3am. Excellent idea.

  4. JDS said, on July 23, 2009 at 1:16 pm

    This a “me too” on the config option for this. I think Hamster is *great* but there might be a little too much hard-coding going on.

    Another config option I’d like to see is “Reports Directory” — where to save reports. Not crazy about the default.

    Thanks for your great tool!

  5. Webfarmer said, on November 3, 2009 at 5:36 am

    +1 on the config option

    I usually begin work at 2am, so it messes up my report :)

    Otherwise much kudos for this application, love it!!


Leave a Reply