
New Pointer Analysis #616

Merged
merged 36 commits into from
Apr 29, 2024

Conversation

xeren
Collaborator

@xeren xeren commented Feb 15, 2024

This PR started as a rework of FieldSensitiveAlias. It now features a third implementation, InclusionBasedPointerAnalysis.

  • It manages pointer modifiers of the form (int staticOffset, List<Integer> dynamicOffsets). This should make its precision robust against optimizations of the following form. (FieldSensitiveAndersen uses (int, int) and thus loses precision here.)
      w = x + y
      address = w + z
      ↓
      address = x + y + z
  • It stores inclusions, loads, stores and pointer sets in a compact format, such that its memory usage scales better with large arrays.
  • Instead of a node for each register or allocated byte, this version manages a node for each complex expression, register writer and phi node. This means this version is not only field-sensitive, but also flow-sensitive (with the exception of assumptions). It now requires a prior Dependency Analysis.
  • It implements a unification rule: If a variable has only one include edge, it gets merged into that other variable. This enhances the mustAlias queries.
  • It implements an acceleration rule: If a variable directly or indirectly includes itself with a static offset of c != 0, then c is promoted to a dynamic offset. Such cycles propagate at most once in total, instead of at most once per allocated byte. This should also allow the algorithm to terminate even if the program has arrays of unknown/nondeterministic size (if such a feature is ever added in the future).
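The modifier representation from the first bullet can be sketched as follows (a minimal Python illustration, not the actual Dat3M classes; all names here are hypothetical):

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Modifier:
    """Summary of a pointer derivation: a static offset plus the
    alignments of any dynamic (unknown) summands."""
    static: int
    dynamic: tuple = ()

    def compose(self, other):
        # Sequential composition adds static offsets and keeps all dynamic
        # alignments, so deriving an address via w = x + y; address = w + z
        # yields the same modifier as address = x + y + z.
        return Modifier(self.static + other.static, self.dynamic + other.dynamic)
```

With a flat (int, int) summary as in FieldSensitiveAndersen, the two dynamic parts would have to be collapsed into a single alignment, which is where precision is lost.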

@hernanponcedeleon
Owner

Does the first bullet point mean the new analysis is always more precise than the old ones?
Do we have any concrete example where we get noticeable better performance?

@ThomasHaas
Collaborator

@xeren showed me improved results on EBR with substantial speed up.

Btw. I think this PR does not set the new analysis as default, so the CI still runs with the old one.
I think you should put the new analysis as default to catch any problems.

@hernanponcedeleon
Owner

@ThomasHaas would it make sense to add EBR to our CI? IIRC it is supposed to be buggy wrt IMM but correct under hardware models

@ThomasHaas
Collaborator

> @ThomasHaas would it make sense to add EBR to our CI? IIRC it is supposed to be buggy wrt IMM but correct under hardware models

I don't know how much time it takes with Yices2. With Z3, running EBR under multiple models is expensive.

@ThomasHaas
Collaborator

I have just tested this analysis on the RCU tree benchmark: with B=1 it terminates in 2 seconds and with B=2 in 20 seconds. The old analysis took multiple hours, IIRC.
However, it computes quite a few empty pointer sets. I also cannot tell how precise it is, because the fast termination may be due to imprecision: RCU keeps the whole memory in arrays.

Signed-off-by: Hernan Ponce de Leon <[email protected]>
@hernanponcedeleon
Owner

It seems that when I rebased the branch, I overwrote the default alias type. It should be fixed now.

BTW, as Thomas said, the new alias analysis performs better than the old one on the benchmarks added in #621; here are the statistics:

```
[20.02.2024] 11:56:47 [INFO] AliasAnalysis.fromConfig - Selected alias analysis: FULL
======== RelationAnalysis summary ========
        #Relations: 104
        #Axioms: 4
        #may-edges removed (extended): 86907
        #must-edges added (extended): 156
        total #must|may|exclusive edges: 1689106|2239829|0
        #must|may rf edges: 4|18859
        #must|may co edges: 474|486
===========================================
```

```
[20.02.2024] 11:58:00 [INFO] AliasAnalysis.fromConfig - Selected alias analysis: FIELD_SENSITIVE
======== RelationAnalysis summary ========
        #Relations: 104
        #Axioms: 4
        #may-edges removed (extended): 94306
        #must-edges added (extended): 156
        total #must|may|exclusive edges: 1686959|2287321|0
        #must|may rf edges: 4|21522
        #must|may co edges: 474|486
===========================================
```

@ThomasHaas
Collaborator

> It seems that when I rebased the branch, I overwrote the default alias type. It should be fixed now.

I think the default you have in mind was not changed by @xeren. We have two defaults, and the one you changed was only for the UI, I think.

@hernanponcedeleon
Owner

It seems you are right (even though I'm pretty sure I saw field_sensitive in the log when I called dartagnan from the console with no alias options).

We should be able to use Alias.getDefault() similarly to what we do with method, target, ... right?

@ThomasHaas
Collaborator

> We should be able to use Alias.getDefault() similarly to what we do with method, target, ... right?

I think so. Just change the initializer in AliasAnalysis.Config to call Alias.getDefault().

xeren and others added 3 commits February 26, 2024 17:14
Add missing initial communication tests when a store relationship is added and a load uses a constant memory object in an indirect manner, like
```
s = *(havoc ? x : y)
*x = 1
```
@hernanponcedeleon
Owner

I am testing this branch on some new benchmarks and I get a bunch of warnings like this:

[04.03.2024] 00:14:56 [WARN] InclusionBasedPointerAnalysis.postProcess - empty pointer set for bv64 117:r24 = load(*bv64 117:r23)

How should I interpret this, @xeren? It sounds like this load cannot read from anywhere.

@ThomasHaas
Collaborator

Yes, this means there are no possible targets to read from. It could be a bug in the analysis, but it can also be a correct result.
For example, this warning also appeared in tree.c but it turned out that the code does indeed access a NULL pointer.

@hernanponcedeleon
Owner

> Yes, this means there are no possible targets to read from. It could be a bug in the analysis but also a correct result. For example, this warning also appeared in tree.c but it turned out that the code does indeed access a NULL pointer.

But then this is a bug in the code being analyzed ...

@ThomasHaas
Collaborator

>> Yes, this means there are no possible targets to read from. It could be a bug in the analysis but also a correct result. For example, this warning also appeared in tree.c but it turned out that the code does indeed access a NULL pointer.
>
> But then this is a bug in the code being analyzed ...

Yes, it might be if that memory access is reachable. @xeren Can you add source location information into the warning so we can more easily check what is wrong?

@hernanponcedeleon
Owner

@xeren any progress on this PR? I would like to get this merged soon.

xeren added 2 commits March 14, 2024 15:42
Use IntMath.gcd(int,int)
Add more logging
Fix SyntacticContextAnalysis, where an assertion was violated when an event was both the start and the end of a loop iteration.
@ThomasHaas
Collaborator

I feel like your newest change has just shifted the variable vs. edge problem, no?
Previously, you used edges that sometimes represented variables. Now you use variables that sometimes represent edges.

@xeren
Collaborator Author

xeren commented Apr 4, 2024

I fixed a bug in the analysis, where the communication rule was not correctly implemented.

The rule states that from $(X+o_1\xleftarrow{\text{stores}}W+o_2)$, $(W\supseteq A+o_3)$, $(A+o_4\subseteq R)$, $(R+o_5\xrightarrow{\text{loads}}Y)$ and $((o_2+o_3)\cap(o_4+o_5)\ne\emptyset)$, you can derive $X+o_1\subseteq Y$.

Either $o_2$ or $o_5$ was skipped in the intersection test (depending on where the recently-learned inclusion edge is), resulting in both false positives and false negatives.
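The fixed check can be sketched in a few lines (hypothetical Python, with offsets modeled as sets of concrete values): both the store-side sum o2 + o3 and the load-side sum o4 + o5 must enter the intersection test.

```python
def offset_sum(a, b):
    """Pointwise sum of two offset sets."""
    return {x + y for x in a for y in b}

def may_communicate(o2, o3, o4, o5):
    # The rule fires (deriving X + o1 ⊆ Y) only if the summed store-side
    # offsets and the summed load-side offsets share a value. Dropping o2
    # or o5 here yields both false positives and false negatives.
    return bool(offset_sum(o2, o3) & offset_sum(o4, o5))
```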

> I feel like your newest change has just shifted the variable vs. edge problem, no?
> Previously, you used edges that sometimes represented variables. Now you use variables that sometimes represent edges.

Maybe you were referring to inclusion edges being typed as DerivedVariable. I have now separated both use cases into separate classes.

@ThomasHaas
Collaborator

> I fixed a bug in the analysis, where the communication rule was not correctly implemented.
>
> The rule states that from $(X+o_1\xleftarrow{\text{stores}}W+o_2)$, $(W\supseteq A+o_3)$, $(A+o_4\subseteq R)$, $(R+o_5\xrightarrow{\text{loads}}Y)$ and $((o_2+o_3)\cap(o_4+o_5)\ne\emptyset)$, you can derive $X+o_1\subseteq Y$.
>
> Either $o_2$ or $o_5$ was skipped in the intersection test (depending on where the recently-learned inclusion edge is), resulting in both false positives and false negatives.

Do you know of an instance of wrong behaviour? For example, does this affect qspinlock or ck_epoch or RCU?

>> I feel like your newest change has just shifted the variable vs. edge problem, no?
>> Previously, you used edges that sometimes represented variables. Now you use variables that sometimes represent edges.
>
> Maybe you were referring to inclusion edges being typed as DerivedVariable. I have now separated both use cases into separate classes.

Yes, that is what I was referring to.

Collaborator

@ThomasHaas left a comment

Overall, I like this newer version a lot more. I will do a proper review the next week.

@ThomasHaas
Collaborator

ThomasHaas commented Apr 8, 2024

EDIT: I was wrong. The code uses join for sequential composition and addInto for parallel composition.
The latter does not merge edges, which contradicts the class description/comment, so the description should be updated.

I think you are conflating parallel joining and sequential joining (composition) in your algorithm. These are two different operations with different results, and it is only accidental that your algorithm remains correct, because you always join parallel edges where one of them is composed.

For example, parallel joining 2x + 2 and 4x + 2 should result in 2x + 2, i.e., the gcd is used for the dynamic offset and the offsets are not added.
Sequential joining, on the other hand, would give 2x + 4y + 4 (or 2x + 4 if you combine multiple offsets) because the modifiers just get added. In the case where you know that both dynamic offsets are the same variable, you would even get 6x + 4.
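Under the abstraction of a modifier as a stride/offset pair a·x + c, the two joins could look like this (a hedged Python sketch of the distinction, not Dat3M's actual join/addInto code):

```python
from math import gcd

def join_parallel(m1, m2):
    # Over-approximate the union of {a1*k + c1} and {a2*k + c2}: take the
    # gcd of both strides and of the offset difference; offsets are not added.
    (a1, c1), (a2, c2) = m1, m2
    return (gcd(gcd(a1, a2), abs(c1 - c2)), c1)

def join_sequential(m1, m2):
    # Compose two derivations: offsets add; folding both dynamic parts
    # into a single stride uses their gcd.
    (a1, c1), (a2, c2) = m1, m2
    return (gcd(a1, a2), c1 + c2)
```

For the example above, join_parallel((2, 2), (4, 2)) gives (2, 2), i.e. 2x + 2, while join_sequential((2, 2), (4, 2)) gives (2, 4), i.e. 2x + 4 with the offsets combined.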

@ThomasHaas
Collaborator

Did you intentionally revert your recent changes? If so, you also reverted the changes to the handling of unary expressions, so they are still wrong.

@hernanponcedeleon
Owner

I tested this on the libvsync benchmarks and all results look good. Unless there are further comments, I will merge.

Collaborator

@ThomasHaas left a comment

LGTM

@hernanponcedeleon hernanponcedeleon merged commit 506ed2f into development Apr 29, 2024
1 check passed
@hernanponcedeleon hernanponcedeleon deleted the alias branch April 29, 2024 19:27
tonghaining pushed a commit to tonghaining/Dat3M that referenced this pull request May 24, 2024
Signed-off-by: Hernan Ponce de Leon <[email protected]>
Co-authored-by: Hernan Ponce de Leon <[email protected]>
Co-authored-by: Thomas Haas <[email protected]>