STATUS_ACCESS_VIOLATION, padls rows 7 and 8

mmonnin

Joined: 16 Feb 19
Posts: 20
Credit: 1,456,814
RAC: 0
Message 3032 - Posted: 17 Feb 2019, 5:50:19 UTC

Over a third of the tasks fail with this error on Win7.

https://boinc.tbrada.eu/result.php?resultid=38595

<core_client_version>7.8.3</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code -1073741819 (0xc0000005)</message>
<stderr_txt>
PADLS Experiment. 8 1 4 6 9 7 2 5
ASS_DLK10A: 12496 in 0.625 s
PSEVDOASS_DLK_NEW: 0 in 3.405 s
KF DLK: 10304
Split 1/9 Offset 0 size 1144 t 0


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x000000000040edff read attempt to address 0xFFFFFFFF

Engaging BOINC Windows Runtime Debugger...
ID: 3032 · Rating: 0
Natalia Makarova
Project scientist
Joined: 8 Feb 19
Posts: 316
Credit: 0
RAC: 0
Message 3033 - Posted: 17 Feb 2019, 6:20:14 UTC - in response to Message 3032.  
Last modified: 17 Feb 2019, 7:52:34 UTC

Over a third of the tasks fail with this error on Win7.

https://boinc.tbrada.eu/result.php?resultid=38595

7.8.3

(unknown error) - exit code -1073741819 (0xc0000005)


PADLS Experiment. 8 1 4 6 9 7 2 5
ASS_DLK10A: 12496 in 0.625 s
PSEVDOASS_DLK_NEW: 0 in 3.405 s
KF DLK: 10304
Split 1/9 Offset 0 size 1144 t 0

That's my mistake!
Thanks.

Tomas Brada,
you must remove all rows starting with the number 8 from the file Rows_part2_8321.txt.
You must also remove all rows starting with the number 7 from the same file.

See
https://boinc.progger.info/odlk/forum_thread.php?id=104&postid=3126
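
For anyone wanting to reproduce that clean-up, here is a minimal Python sketch of the filtering step described above. It assumes, hypothetically, that Rows_part2_8321.txt holds one row per line with whitespace-separated numbers; the real file format is not shown in this thread. The function name drop_bad_rows and the sample rows are made up for illustration.

```python
from typing import Iterable, List

# First numbers named in the post above; rows led by either one are dropped.
BAD_FIRST_NUMBERS = {"7", "8"}


def drop_bad_rows(lines: Iterable[str]) -> List[str]:
    """Keep every row whose first whitespace-separated number is not 7 or 8."""
    kept = []
    for line in lines:
        fields = line.split()
        # Blank lines have no first number and are kept unchanged.
        if fields and fields[0] in BAD_FIRST_NUMBERS:
            continue
        kept.append(line)
    return kept


# Invented sample rows (assumption: one row per line, numbers separated by spaces).
sample = [
    "8 0 1 2\n",
    "7 3 2 1\n",
    "6 5 4 3\n",
    "1 7 8 0\n",
]
filtered = drop_bad_rows(sample)
print("".join(filtered), end="")
```

Only the first number of each row is checked, so a row that merely contains a 7 or an 8 further along is kept.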
ID: 3033 · Rating: 0
=Lupus=

Joined: 16 Feb 19
Posts: 4
Credit: 173,253
RAC: 156
Message 3038 - Posted: 17 Feb 2019, 16:54:41 UTC - in response to Message 3033.  
Last modified: 17 Feb 2019, 17:00:09 UTC

Hello,

I read about cancelling WUs with 8 or 7 as the first digit in the start row. Is it OK for me to cancel WUs named "padls2_7...", since they have a leading 7 in the start row?
They seem to either kill themselves within seconds or run for hours (6+ hours right now).

Or are the WUs that do not self-destruct OK, and do they really run that long?

Greetings,

=Lupus=

*EDIT* Never mind, I just saw that all WUs went into status "not needed anymore", but my client does not update because the BOINC scheduler sends me HTTP error 413.
ID: 3038 · Rating: 0
Tomáš Brada
Project administrator
Volunteer developer
Joined: 3 Feb 19
Posts: 592
Credit: 417,175
RAC: 0
Message 3039 - Posted: 17 Feb 2019, 17:26:54 UTC - in response to Message 3038.  

Just hit Update and the scheduler will abort any tasks that are not needed. That should not be necessary, because it is set to update twice an hour. If a task is not aborted after you update, that means the task is still needed. Check your account for the task status.
Regarding the scheduler HTTP error 413, it should be transient. Try again, and if the problem persists, open a new thread. But you contacted the scheduler 20 minutes ago, so I assume it is solved already.
ID: 3039 · Rating: 0
=Lupus=

Joined: 16 Feb 19
Posts: 4
Credit: 173,253
RAC: 156
Message 3040 - Posted: 17 Feb 2019, 18:57:44 UTC - in response to Message 3039.  
Last modified: 17 Feb 2019, 18:58:20 UTC

HTTP error 413 is still there,
so I don't really get the updated info from the server side.

I manually aborted the tasks, as all of them were from the 7.... batch that got cancelled.

It seems error 413 means "request entity too large": the server refuses the request because it is bigger than the server will accept.
That is something the BOINC developers should know about.
I will open a new thread.
ID: 3040 · Rating: 0
Tomáš Brada
Project administrator
Volunteer developer
Joined: 3 Feb 19
Posts: 592
Credit: 417,175
RAC: 0
Message 3054 - Posted: 18 Feb 2019, 9:36:42 UTC

Some workunits with first number 7 were resurrected, and they are already gone. Amazing power.
I also increased replication on the tasks still outstanding from the first batch (10). Those are gone too.
ID: 3054 · Rating: 0
STE\/E

Joined: 16 Feb 19
Posts: 7
Credit: 502,767
RAC: 0
Message 3056 - Posted: 18 Feb 2019, 10:27:20 UTC

Exit status -1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION: still a 100% error rate this morning on my Win 7 boxes.
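
For reference, the two numbers in that exit status are the same value written two ways. A minimal Python sketch (the helper name to_ntstatus is made up for illustration) that reinterprets the signed 32-bit exit code as the unsigned hexadecimal status code:

```python
def to_ntstatus(exit_code: int) -> str:
    """Render a signed 32-bit process exit code as an unsigned hex status value."""
    # Masking with 0xFFFFFFFF reinterprets the two's-complement value as unsigned.
    return f"0x{exit_code & 0xFFFFFFFF:08X}"


status = to_ntstatus(-1073741819)
print(status)  # prints 0xC0000005
```

The arithmetic behind it: 2^32 - 1073741819 = 3221225477, which is 0xC0000005 in hexadecimal.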
ID: 3056 · Rating: 0
mmonnin

Joined: 16 Feb 19
Posts: 20
Credit: 1,456,814
RAC: 0
Message 3065 - Posted: 22 Feb 2019, 1:48:12 UTC

Still being issued.
https://boinc.tbrada.eu/workunit.php?wuid=28994
ID: 3065 · Rating: 0
Tomáš Brada
Project administrator
Volunteer developer
Joined: 3 Feb 19
Posts: 592
Credit: 417,175
RAC: 0
Message 3069 - Posted: 22 Feb 2019, 10:45:20 UTC - in response to Message 3065.  

It took 30 seconds altogether for the tasks to crash. It is not worth the hassle to cancel them. We will catch the failures in post-processing.
If they crash immediately, let the tasks crash.
If they take more than 2 minutes to crash, please report it.
ID: 3069 · Rating: 0
mmonnin

Joined: 16 Feb 19
Posts: 20
Credit: 1,456,814
RAC: 0
Message 3070 - Posted: 22 Feb 2019, 11:04:10 UTC - in response to Message 3069.  

It took 30 seconds altogether for the tasks to crash. It is not worth the hassle to cancel them. We will catch the failures in post-processing.
If they crash immediately, let the tasks crash.
If they take more than 2 minutes to crash, please report it.


Maybe it's not worth it for us to run your project then.
ID: 3070 · Rating: 0
STE\/E

Joined: 16 Feb 19
Posts: 7
Credit: 502,767
RAC: 0
Message 3071 - Posted: 22 Feb 2019, 11:32:24 UTC

lol
ID: 3071 · Rating: 0
Natalia Makarova
Project scientist
Joined: 8 Feb 19
Posts: 316
Credit: 0
RAC: 0
Message 3072 - Posted: 22 Feb 2019, 12:40:32 UTC - in response to Message 3070.  

Maybe it's not worth it for us to run your project then.

Let me note that the project is in test mode.

Errors always occur when starting a new project or a new experiment in an existing project.

For my part, I apologize for my mistake (the faulty WUs starting with 7 and 8).
ID: 3072 · Rating: 0
Tomáš Brada
Project administrator
Volunteer developer
Joined: 3 Feb 19
Posts: 592
Credit: 417,175
RAC: 0
Message 3073 - Posted: 22 Feb 2019, 12:55:20 UTC - in response to Message 3070.  

It took 30 seconds altogether for the tasks to crash. It is not worth the hassle to cancel them. We will catch the failures in post-processing.
If they crash immediately, let the tasks crash.
If they take more than 2 minutes to crash, please report it.


Maybe it's not worth it for us to run your project then.


Sorry. The failing tasks take only a few seconds to finish, right? So very little CPU time is wasted. It is not easy to gather the WU IDs to cancel. Even a failure is a result. Maybe we will grant some credit for the failed tasks, if that is what you want.

We have identified the reason why they fail, so it is now OK to let the failing tasks crash, since they use very little CPU time.

I cancelled the tasks starting with 7 and 8 as soon as the problem was found. Then I received a list of tasks to resurrect, but it seems that either I made a mistake or the list was wrong, and I resurrected tasks that still crash. We will let this batch (12) finish and then figure out which rows are missing and re-issue good tasks.
Batch 10 is finished, and most likely we will have more tasks to send after it is analysed by Ms. Makarova.
ID: 3073 · Rating: 0
STE\/E

Joined: 16 Feb 19
Posts: 7
Credit: 502,767
RAC: 0
Message 3080 - Posted: 22 Feb 2019, 15:54:04 UTC - in response to Message 3073.  

We will let this batch (12) finish and then figure out which rows are missing and re-issue good tasks.
Batch 10 is finished, and most likely we will have more tasks to send after it is analysed by Ms. Makarova.


+1 ... waiting for more
ID: 3080 · Rating: 0
Natalia Makarova
Project scientist
Joined: 8 Feb 19
Posts: 316
Credit: 0
RAC: 0
Message 3087 - Posted: 22 Feb 2019, 16:44:17 UTC - in response to Message 3080.  
Last modified: 22 Feb 2019, 16:44:36 UTC

+1 ... waiting for more

Tasks in progress: 4540

https://boinc.tbrada.eu/server_status.php

When that reaches 0 :)
ID: 3087 · Rating: 0
Tomáš Brada
Project administrator
Volunteer developer
Joined: 3 Feb 19
Posts: 592
Credit: 417,175
RAC: 0
Message 3089 - Posted: 22 Feb 2019, 18:52:40 UTC

There are no running results for rows beginning with 8, because all of them have been cancelled.
Results for rows beginning with 7 were cancelled, but then some of them were resurrected because Natalia sent me a new version of batch 12.
There are, however, no successful results for rows beginning with 7 from batch 12.
ID: 3089 · Rating: 0
Tomáš Brada
Project administrator
Volunteer developer
Joined: 3 Feb 19
Posts: 592
Credit: 417,175
RAC: 0
Message 3090 - Posted: 22 Feb 2019, 18:57:47 UTC - in response to Message 3087.  

When that reaches 0 :)

I thought you were going to analyse the results and issue a new batch afterwards, possibly once the generator is fixed.
ID: 3090 · Rating: 0
Natalia Makarova
Project scientist
Joined: 8 Feb 19
Posts: 316
Credit: 0
RAC: 0
Message 3092 - Posted: 22 Feb 2019, 19:31:03 UTC - in response to Message 3090.  
Last modified: 22 Feb 2019, 19:52:58 UTC

I do not want to launch a new branch of the experiment while the current branch is still running.

I can send you sources of generators 1 and 2 to prepare them for a new branch.
This is a minor modification due to a change in the DLS diagonal.

Then I will send you an array of rows.

PS. All WUs in the second installment, starting with 7 and 8, must be canceled.
We misunderstood each other.
At first, I only saw rows starting with 8.
I deleted these rows and sent you a new list.
But! Then I saw incorrect rows starting with 7.
I asked you to remove these rows from the list.

Tomas Brada,
you must remove all rows starting with the number 8 from the file Rows_part2_8321.txt.
You must also remove all rows starting with the number 7 from the same file.

But you did not understand me. I had not sent you the third list yet; I thought you would delete the rows beginning with 7 yourself.

We hurried. No need to hurry!
ID: 3092 · Rating: 0
Natalia Makarova
Project scientist
Joined: 8 Feb 19
Posts: 316
Credit: 0
RAC: 0
Message 3093 - Posted: 22 Feb 2019, 19:42:16 UTC
Last modified: 22 Feb 2019, 19:42:39 UTC

The new branch of the PADLS experiment is discussed here:
https://boinc.progger.info/odlk/forum_thread.php?id=107

Unfortunately, in Russian.
ID: 3093 · Rating: 0
Tomáš Brada
Project administrator
Volunteer developer
Joined: 3 Feb 19
Posts: 592
Credit: 417,175
RAC: 0
Message 3095 - Posted: 22 Feb 2019, 20:07:17 UTC - in response to Message 3092.  

I can send you sources of generators 1 and 2 to prepare them for a new branch. This is a minor modification due to a change in the DLS diagonal. Then I will send you an array of rows.

Good. You can send anytime.

PS. All WUs in the second installment, starting with 7 and 8, must be cancelled.

Done! Sorry for the misunderstanding.
ID: 3095 · Rating: 0

©2020 Tomáš Brada