Forums : Technical Support : Cosmology@home tasks fail with Postponed: VM job unmanageable, restarting later if vboxsvc priority is set to idle

computezrmle
Joined: 4 Dec 08
Posts: 3
Credit: 12,475,043
RAC: 0
Message 22921 - Posted: 5 Dec 2021, 10:05:07 UTC

Volunteers frequently affected by the postponed issue may try a different vboxwrapper.

BOINC's wiki pages mention communication problems between vboxwrapper and VirtualBox 6.x, especially on Windows.
They offer premade executables that may solve the problems:
https://boinc.berkeley.edu/trac/wiki/VboxApps#Premadevboxwrapperexecutables

It would be the job of the project developers to test those vboxwrappers and distribute them to the clients.
As long as this is not done, volunteers could use the following steps as a workaround:

1. Download an alternative vboxwrapper from the page mentioned above (or use one you got from another project, e.g. LHC@home)
2. Start the BOINC client but suspend computing
3. Change to the project directory, e.g. projects/www.cosmologyathome.org, and replace the vboxwrapper there with the test version; the filename must be the name of the old vboxwrapper
4. Resume computing -> check the logfiles of tasks started after the patch
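
Step 3 above can be sketched in Python, for anyone who prefers a script to copying by hand. This is only an illustration under assumptions: the helper name swap_vboxwrapper is hypothetical, it assumes the project directory holds exactly one file whose name starts with "vboxwrapper", and it adds a ".bak" backup that the original steps do not mention.

```python
import shutil
from pathlib import Path


def swap_vboxwrapper(project_dir: Path, replacement: Path) -> Path:
    """Overwrite the project's vboxwrapper with `replacement`, keeping the old filename.

    Returns the path of the backup copy of the original wrapper.
    """
    # Assumption: the wrapper's filename starts with "vboxwrapper".
    # Earlier ".bak" backups made by this helper are ignored.
    candidates = sorted(
        p for p in project_dir.glob("vboxwrapper*")
        if p.is_file() and p.suffix != ".bak"
    )
    if len(candidates) != 1:
        raise RuntimeError(
            f"expected exactly one vboxwrapper in {project_dir}, "
            f"found {len(candidates)}"
        )
    original = candidates[0]
    backup = original.with_name(original.name + ".bak")
    shutil.copy2(original, backup)       # keep the original for an easy rollback
    shutil.copy2(replacement, original)  # new binary under the old filename
    return backup
```

It would be called with the project directory (e.g. projects/www.cosmologyathome.org inside the BOINC data directory) and the path of the downloaded wrapper, while computing is suspended as in step 2.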


Each restart of the BOINC client will replace the patch with the original vboxwrapper from the project server.
This can be avoided by setting <dont_check_file_sizes>1</dont_check_file_sizes> in cc_config.xml, but then all other automatic updates will also stop working.
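
For those who would rather not edit cc_config.xml by hand, a minimal Python sketch follows. It assumes the flag belongs inside the <options> section of cc_config.xml (worth checking against the BOINC client configuration documentation); the function name set_dont_check_file_sizes is hypothetical. Note that rewriting the file this way drops any XML comments it contained.

```python
import xml.etree.ElementTree as ET
from pathlib import Path


def set_dont_check_file_sizes(cc_config: Path, enabled: bool = True) -> None:
    """Set <dont_check_file_sizes> in cc_config.xml, creating the file if needed."""
    if cc_config.exists():
        tree = ET.parse(str(cc_config))
        root = tree.getroot()
    else:
        root = ET.Element("cc_config")
        tree = ET.ElementTree(root)

    # Assumption: the flag lives under <cc_config><options>.
    options = root.find("options")
    if options is None:
        options = ET.SubElement(root, "options")

    flag = options.find("dont_check_file_sizes")
    if flag is None:
        flag = ET.SubElement(options, "dont_check_file_sizes")
    flag.text = "1" if enabled else "0"

    # Other existing options are left untouched; XML comments are not preserved.
    tree.write(str(cc_config), encoding="utf-8")
```

The client has to re-read its config files (or be restarted) before the change takes effect, and the flag should be set back to 0 once the project ships a working wrapper so that normal file checks resume.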
ID: 22921
Jim1348
Joined: 17 Nov 14
Posts: 136
Credit: 5,413,463
RAC: 4
Message 22922 - Posted: 5 Dec 2021, 17:45:15 UTC - in response to Message 22921.  

That is a masterpiece of concision. I have taken the liberty of posting it on Rosetta.
They are in desperate (desperate) need of it for their python work units.

Thanks, it saves me various other work-arounds, mainly involving going back to VBox 5.2.44.
That is easy in Windows, but not so easy in Ubuntu.
ID: 22922
maeax
Joined: 21 Dec 17
Posts: 31
Credit: 8,394,413
RAC: 6
Message 22925 - Posted: 8 Dec 2021, 10:18:44 UTC - in response to Message 22921.  

1. Download an alternative vboxwrapper from the page mentioned above (or use one you got from another project, e.g. LHC@home)

There are three vboxwrappers (Theory, Atlas and CMS).
ID: 22925
Jonathan
Joined: 27 Sep 17
Posts: 190
Credit: 8,321,383
RAC: 6
Message 22926 - Posted: 8 Dec 2021, 13:41:34 UTC - in response to Message 22925.  

The wrapper version reported in the logs doesn't match the wrapper file name in a lot of the cases I have checked.

The log shows "2021-12-07 09:27:23 (10616): Detected: vboxwrapper 26202" but I was running 26203 downloaded from BOINC. I have seen this on other projects too. On Windows, open the wrapper's Properties and look at the Details tab. LHC Atlas ships its wrapper as vboxwrapper_26198ab7_windows_x86_64, and it reports its version as 26197 in the logs.
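
The mismatch described above can be checked mechanically. The sketch below is only an illustration: the helper names are hypothetical, and the two patterns are inferred from the single log line and the single filename quoted in this post, so other wrapper builds may need different patterns.

```python
import re
from typing import Optional


def version_from_log(line: str) -> Optional[str]:
    """Return the version from a 'Detected: vboxwrapper NNNNN' log line, if present."""
    match = re.search(r"Detected: vboxwrapper (\d+)", line)
    return match.group(1) if match else None


def version_from_filename(name: str) -> Optional[str]:
    """Return the leading digits after 'vboxwrapper_' in a wrapper filename, if present."""
    match = re.match(r"vboxwrapper_(\d+)", name)
    return match.group(1) if match else None


def versions_differ(log_line: str, filename: str) -> bool:
    """True when both versions could be read and they are not the same."""
    logged = version_from_log(log_line)
    named = version_from_filename(filename)
    return logged is not None and named is not None and logged != named
```

With the Atlas filename quoted above and a log line reporting 26197, versions_differ returns True, which matches the observation that the filename alone does not say which wrapper version is actually running.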

I was running 4 concurrent 2 core tasks on my Intel i7 920. 4 Cores, 8 with HT. I used 26203 and the machine was stable with no other projects running. I didn't use that machine for anything else as it was just crunching.
ID: 22926
Peter Hucker of the Scottish Boinc Team
Joined: 5 Jul 11
Posts: 27
Credit: 1,092,134
RAC: 9,457
Message 22936 - Posted: 29 Jan 2022, 1:39:57 UTC

Same nonsense happening here - VM job unmanageable, restarting later.

This is on 3 machines, all with plenty of RAM, running the latest VirtualBox. When I see loads of tasks stuck, I restart BOINC. Otherwise they apparently retry once a day.

Not prepared to do the workaround as it seems to have side effects.
ID: 22936
Peter Hucker of the Scottish Boinc Team
Joined: 5 Jul 11
Posts: 27
Credit: 1,092,134
RAC: 9,457
Message 22937 - Posted: 29 Jan 2022, 12:58:09 UTC - in response to Message 22936.  

And after giving them a shove, some produce computation errors.
ID: 22937
.clair.
Joined: 4 Nov 07
Posts: 633
Credit: 12,080,613
RAC: 1,132
Message 22938 - Posted: 30 Jan 2022, 0:58:59 UTC - in response to Message 22937.  

And after giving them a shove, some produce computation errors.

Is this the same three that will or will not run Rosetta Python,
or is virtual blox giving you a new set of problems?
ID: 22938
Peter Hucker of the Scottish Boinc Team
Joined: 5 Jul 11
Posts: 27
Credit: 1,092,134
RAC: 9,457
Message 22939 - Posted: 30 Jan 2022, 3:11:55 UTC - in response to Message 22938.  
Last modified: 30 Jan 2022, 3:13:06 UTC

And after giving them a shove, some produce computation errors.

Is this the same three that will or will not run Rosetta Python,
or is virtual blox giving you a new set of problems?

I have 7 PCs. 1 runs Rosetta Python, 6 fail. Cosmology fails on all of them about 50% of the time. Since other people have similar problems, I'm not blaming my end. LHC works perfectly, but they're the only project that knows how to use the buggy Oracle crap properly. Not sure why Oracle still exists, they've never written anything decent.
ID: 22939
Peter Hucker of the Scottish Boinc Team
Joined: 5 Jul 11
Posts: 27
Credit: 1,092,134
RAC: 9,457
Message 22940 - Posted: 1 Feb 2022, 15:43:25 UTC - in response to Message 22939.  

Maybe things have improved - I just got cosmology VB tasks for my oldest and newest computers. The oldest one failed them all, but the newest one is succeeding on 12/14 tasks.
ID: 22940
.clair.
Joined: 4 Nov 07
Posts: 633
Credit: 12,080,613
RAC: 1,132
Message 22941 - Posted: 1 Feb 2022, 20:43:51 UTC
Last modified: 1 Feb 2022, 20:45:37 UTC

I just had a look at your computers list to check the result files, and nothing shows.
Is that because of your team/group setup?
ID: 22941
Peter Hucker of the Scottish Boinc Team
Joined: 5 Jul 11
Posts: 27
Credit: 1,092,134
RAC: 9,457
Message 22942 - Posted: 1 Feb 2022, 20:56:19 UTC - in response to Message 22941.  

ID: 22942