Message boards :
Code and Servers :
Split Workunits
Message board moderation
Author | Message |
---|---|
Tomáš Brada Project administrator Volunteer developer Send message Joined: 3 Feb 19 Deprecated: Creation of dynamic property BoincUser::$nposts is deprecated in /var/boincadm/prj/html/inc/forum.inc on line 613 Posts: 667 Credit: 432,784 RAC: 0 |
Because some of the WUs are quite long, checkpoint are desired. But implementing checkpoints is not easy. Next I found out it is quite easy to split the WUs into smaller ones, see the padls2 test app. When the workunit is split, about 5 seconds are wasted per part. The next question is how many parts should the WU be split? Ideally so each would take roughly the same time to finish. But I do not know how big a WU is until it is started. Do you want such split WUs? And if so, How many parts? |
valap Send message Joined: 8 Feb 19 Deprecated: Creation of dynamic property BoincUser::$nposts is deprecated in /var/boincadm/prj/html/inc/forum.inc on line 613 Posts: 5 Credit: 1,634 RAC: 0 |
Once every 3-5 minutes Computing preferences Request tasks to checkpoint at most every 600 seconds |
Tomáš Brada Project administrator Volunteer developer Send message Joined: 3 Feb 19 Posts: 667 Credit: 432,784 RAC: 0 |
Please mind that implementing checkpoints is not easy: * notice when to checkpoint * implement saving of state ( search index, output set ) * implement loading of state * check on startup whether checkpoint exists and check that the checkpoint is valid Instead, I can make smaller workunits, so they finish faster and therefore do not need to checkpoint. Do you want such smaller workunits? (and therefore more of them) and How big should they be. 1/10 of the current size? |
Natalia Makarova Project scientist Send message Joined: 8 Feb 19 Deprecated: Creation of dynamic property BoincUser::$nposts is deprecated in /var/boincadm/prj/html/inc/forum.inc on line 613 Posts: 423 Credit: 0 RAC: 0 |
I think that WU is not required to be divided into parts. This is where the tasks by user XAVER are executed 1195 623 12 Feb 2019, 1:04:57 UTC 19 Feb 2019, 1:04:57 UTC В процессе --- --- --- Pseudo Associative DLS v1.04 I see the maximum execution time of 16,103.43 seconds. XAVER what do you suggest? |
Natalia Makarova Project scientist Send message Joined: 8 Feb 19 Posts: 423 Credit: 0 RAC: 0 |
We see here https://boinc.tbrada.eu/server_status.php Pseudo Associative DLS 1682 57 3.69 (1.91 - 6.68) 3 I think this is a normal time for WU. Compare ODLK https://boinc.progger.info/odlk/server_status.php odlk3@home 138986 19307 0.48 (0.01 - 9.97) 135 ODLK1 https://boinc.multi-pool.info/latinsquares/server_status.php odlk3@home 52667 292587 0.38 (0.01 - 16.25) 221 |
XAVER Send message Joined: 9 Feb 19 Deprecated: Creation of dynamic property BoincUser::$nposts is deprecated in /var/boincadm/prj/html/inc/forum.inc on line 613 Posts: 5 Credit: 65,265 RAC: 0 |
Splitting is not necssary. Execution times seem the same as running the batch file on my computer. I would like to know if you can use the first results without waiting for a second quorum which at the moment is difficult to obtain because of the lack of different computers necessary. Second quorum is only necessary for credits (or verification) in my opinion. |
Natalia Makarova Project scientist Send message Joined: 8 Feb 19 Posts: 423 Credit: 0 RAC: 0 |
I would like to know if you can use the first results without waiting for a second quorum which at the moment is difficult to obtain because of the lack of different computers necessary. As I understand it, Tomas Brada did automatic processing of the results in the project. What is required of me? |
Tomáš Brada Project administrator Volunteer developer Send message Joined: 3 Feb 19 Posts: 667 Credit: 432,784 RAC: 0 |
Run time is comparable to batch file, because the program is using the same algorithm. XAVER, If you notice significantly worse performance, please report, it would mean error in build of the program. If you restart PC or boinc-client, then the task progress will reset, because this app does not have checkpoints. I feel that significant computation power is being wasted this way. None of you want split workunits, so I guess it is not needed. But note that I can arrange that easily. About the quorum. I loaded the first batch with quorum of 2. Considering that there were no validation failures so far, the quorum can be dropped to 1. Yes, I can export the result even before quorum is reached. I will cancel those second replicas of wus shortly, so we do not waste power on the double checks. |
Tomáš Brada Project administrator Volunteer developer Send message Joined: 3 Feb 19 Posts: 667 Credit: 432,784 RAC: 0 |
Because the "tot5" padls total application has configurable runtime and uses checkpoints, splitting workunits so small is no longer necessary. |
©2024 Tomáš Brada