[prev in list] [next in list] [prev in thread] [next in thread] 

List:       cfe-dev
Subject:    Re: [cfe-dev] [analyzer] Tracking values for generating bug reports
From:       Artem Dergachev via cfe-dev <cfe-dev () lists ! llvm ! org>
Date:       2019-06-25 1:59:09
Message-ID: 847bd70e-82ac-adc7-2365-fba72c52144f () gmail ! com
[Download RAW message or body]

[Attachment #2 (multipart/alternative)]


Yup, looks much better! Let's actually dig through other projects a bit, 
but results on LLVM look fairly perfect to me.


 > ASTDiagnostic.cpp

Why wasn't the extra tracking removed in the moderate mode? There 
doesn't seem to be an obvious overwrite for `kind`. Is it reacting to an 
invalidation?


 > DominanceFrontierImpl.h

Why does it track so much more expressions in the moderate mode (even if 
not producing any actual nodes for them)? Is it because you changed the 
de-duplication method from by-expr to by-node in between?


 > CGObjCGNU.cpp

I'd like to think more about this one. It doesn't look to me that the 
new notes are helpful here, for the reason that the problem is pretty 
much entirely between the definition of OID and the use, while all of 
the notes are outside that range. The report wouldn't become less valid 
(for a certain definition of valid) if we simply omit everything except 
the last three notes:
   - 'OID' is initialized to a null pointer value
   - Loop body skipped when range is empty
   - Called ++ object pointer is null
I don't see any immediate solutions here; i guess the right thing to do 
would be to simply learn how to chop off the irrelevant beginning of the 
path.


On 6/24/19 4:48 PM, Kristóf Umann wrote:
> I went ahead and (although admittedly with a very crude 
> implementation) I enhanced condition tracking with the following:
> 
> * All notes about their initialization are discarded
> * No longer tracking the right hand side of the last store
> * Last stores are discarded unless it happened in another stack frame
> 
> I evaluated the prototype solution on LLVM+Clang+Clang-tools-extra, 
> Xerces and Bitcoin. I might add rtags and YouCompleteMe later, but I 
> think this is sufficient for now:
> 
> http://cc.elte.hu:15001/Default/#
> 
> Due to the lack of better categorization, I used the following (so 
> this has in fact no relation to whether the report is correct or not):
> * Intentional reports (green x) are the ones where the bugreport got 
> unquestionably worse.
> * Confirmed reports (red check mark) got definitely better
> * False positives (grey line) are in between.
> 
> I didn't have time to look at them all, but here are some reports I 
> found interesting on LLVM:
> 
> http://cc.elte.hu:15001/Default/#is-unique=on&run=LLVM%2FClang%2FClang-tools-extra%2 \
> 0AFTER%20tracking%20conditions&review-status=Confirmed&review-status=False%20positiv \
> e&review-status=Intentional&detection-status=New&detection-status=Reopened&detection \
> -status=Unresolved&tab=LLVM%2FClang%2FClang-tools-extra%20AFTER%20tracking%20conditions
>  
> By clicking on a bug report, and changing the run with the dropdown 
> menu found in the upper right corner (next to "Also found in:"), you 
> can see that in some cases these enhancements kept the amount of extra 
> notes at bay quite well.
> 
> .* BEFORE condition tracking: Analysis made on LLVM monorepo commit 
> 4cc6d72bb4d
> .* AFTER condition tracking: With https://reviews.llvm.org/D62883 
> applied and condition tracking enabled.
> .* AFTER condition tracking + DEBUG notes: With 
> https://reviews.llvm.org/D63642 applied and condition tracking with 
> debug notes enabled.
> .* AFTER condition tracking WITH moderate tracking: With the above 
> mentioned additions applied on top of https://reviews.llvm.org/D62883 
> and condition tracking enabled.
> .* AFTER condition tracking WITH moderate tracking + DEBUG notes: With 
> the above mentioned additions applied on top of 
> https://reviews.llvm.org/D63642 and condition tracking with debug 
> notes enabled.
> 
> On Wed, 19 Jun 2019 at 02:08, Artem Dergachev <adergachev@apple.com 
> <mailto:adergachev@apple.com>> wrote:
> 
> 
> 
> > On Jun 18, 2019, at 5:02 PM, Kristóf Umann <dkszelethus@gmail.com
> > <mailto:dkszelethus@gmail.com>> wrote:
> > 
> > 
> > 
> > On Wed, 19 Jun 2019 at 01:14, Artem Dergachev
> > <noqnoqneo@gmail.com <mailto:noqnoqneo@gmail.com>> wrote:
> > 
> > 
> > 
> > On 6/18/19 3:44 PM, Gábor Horváth wrote:
> > > Hi Kristóf,
> > > 
> > > Thanks for the report. I have been thinking about what kind
> > > of user experience should we pursue. While my ideas might be
> > > a bit opinionated, see my comments inline.
> > > 
> > > On Tue, 18 Jun 2019 at 20:47, Kristóf Umann
> > > <dkszelethus@gmail.com <mailto:dkszelethus@gmail.com>> wrote:
> > > 
> > > Hi!
> > > 
> > > This is an update on my GSoC project, "Enhancing bug
> > > reports in the Clang Static Analyzer". We had some
> > > discussions on phabricator, and some in private
> > > meetings, and I think it's due to share how things are
> > > looking right now.
> > > 
> > > In my previous mail [1], I detailed the two distinct
> > > categories we defined, the "must-have-happened" case
> > > where every information is already in the bug path, and
> > > the "should-not-have-happened" case when the information
> > > isn't found in the bugpath. I divided the latter into
> > > two subcategories, the "Inlined category" (initially
> > > referred to as "category 1"), and the "Not inlined
> > > category" (initally referred to as "category 2").
> > > 
> > > These categorizations are used to teach the analyzer
> > > incrementally about which nodes in the bugpath deserve
> > > special attention. For now, I plan to use this
> > > information exclusively for finding control dependencies
> > > to these nodes, and tracking their condition. Now, what
> > > we mean under "tracking an expression value", is that to
> > > add notes to the bug report relevant to that expression
> > > value.
> > > 
> > > Ultimately, this means that my project depends greatly
> > > on condition tracking yielding meaningful addition of
> > > notes to the bug report, without adding unimportant
> > > ones. Since I more-or-less finished my work on the
> > > must-have-happened case (meaning that the analyzer can
> > > now figure out control dependencies to nodes contained
> > > in the bugpath), I'd like to detail how I plan to work
> > > on this.
> > > 
> > > While evaluating an early prototype solution to the
> > > "must-have-happened" case where the same expression
> > > value tracking was used for both the bug-causing
> > > variable and for the conditions, I found that in many
> > > cases, the growth of bug length was intolerable. This
> > > is, in part, caused by conditions being tracked to a
> > > condition recursively, the conditions of asserts being
> > > tracked, and that notes about a condition are not as
> > > interesting as notes about the bug causing variable
> > > (calls to operator bool for instance).
> > > 
> > > Fixing any of these requires me to teach the analyzer
> > > the difference in between "THE value" and "just a
> > > condition". The details are a little more complicated,
> > > so I'll show some examples that point out certain cases:
> > > *
> > > *
> > > *Example 1.:*
> > > 
> > > 01 int flag;
> > > 02 bool coin();
> > > 03
> > > 04 void foo() {
> > > 05   flag = coin();
> > > 06 }
> > > 07
> > > 08 int main() {
> > > 09   int *x = 0;
> > > 10   foo();
> > > 11   if (flag) // assumed true
> > > 12     *x = 5; // warn
> > > 13 }
> > > *
> > > *
> > > In this example, it'd be great to see notes placed on
> > > line 10 and 5, because if flag wasn't invalidated, the
> > > bug would not have occurred (since flag is global, and
> > > is initialized to 0). The analyzer normally wouldn't
> > > place notes there, so we definitely should track flag up
> > > to line 5.
> > > *
> > > *
> > > *Example 2.:*
> > > *
> > > *
> > > 01 int flag;
> > > 02 bool coin();
> > > 03
> > > 04 void foo() {
> > > 05   coin();
> > > 06 }
> > > 07
> > > 08 int main() {
> > > 09   int *x = 0;
> > > 10   foo();
> > > 11   if (flag)// assumed true
> > > 12     *x = 5;// warn
> > > 13 }
> > > 
> > > This case is very similar, with the only difference
> > > being that the analyzer conservatively assumed that coin
> > > may have written flag (as it's a global variable). We
> > > should track until line 5.
> > > 
> > > 
> > > I am not sure about this example. While the invalidation of
> > > the global flag is definitely playing a role in this bug
> > > report the point where the flag is actually invalidated
> > > could seem quite arbitrary for the user. We might get the
> > > invalidation when we first see a function that is not
> > > defined in this TU, when we reach the maximum call stack
> > > depth and so on. My point is, I am not sure if we actually
> > > help the user by pinpointing the first function call where
> > > the invalidation happens. A more user friendly way to think
> > > about this problem is how could the user solve this false
> > > positive? As far as I understand the cannonical solution
> > > would be to add an assertion, so it would be better to put a
> > > node close to the place where the user would change the
> > > code, so I would put it in the stack frame of the bug. For
> > > example we could generate a note for the call foo(), that
> > > somewhere in that call stack the analyzer could no longer
> > > track the value of flag, thus invalidated its contents.
> > > @Artem Dergachev <mailto:adergachev@apple.com>what do you think?
> > 
> > This is definitely one of the most annoying kinds of false
> > positives we have due to infeasible paths and over-eager
> > assumption chains. Like, you can add an assertion here, but
> > it's going to look super ugly:
> > 
> > bool tmp_flag = flag;
> > coin(); // why would anybody believe it touches flag in the
> > first place???
> > assert(flag == tmp_flag && "sanity check");
> > 
> > And even if the user would actually like to add something
> > like that to his code, it would require a powerful constraint
> > solver to even handle this assertion correctly.
> > 
> > These are not very common, but they're so annoying that i
> > probably wouldn't be against marking reports as invalid
> > immediately when we see that an important control dependency
> > relies on such invalidation (we'll have to gather
> > experimental data on that, of course).
> > 
> > 
> > Hmm, yea, good point. But if I coin were to take flag as a
> > non-const reference (other than the fact that flag is global) I
> > guess we would like to keep the report and place notes all the
> > way there? I agree with Artem, we really need to see the real
> > world impact of this.
> 
> Yup.
> 
> 
> > > 
> > > *Example 3.:*
> > > *
> > > *
> > > 01 void f(int flag) {
> > > 02   int *x = 0;
> > > 03   if (flag)// assumed true
> > > 04     *x = 5;// warn
> > > 05 }
> > > 
> > > Here, the user could simply follow the arrows that shows
> > > the path of execution: it isn't really important what
> > > flag was initialized on the first line.
> > > 
> > > *Example 4.:*
> > > *
> > > *
> > > 01 int flag;
> > > 02
> > > 03 int getInt();
> > > 04
> > > 05 int main() {
> > > 06   int *x = 0;
> > > 07   int y = getInt();
> > > 08   flag = y;
> > > 09   if (flag)// assumed true
> > > 10     *x = 5;// warn
> > > 11 }
> > > 
> > > Again, the user could see there was a write made to flag
> > > on line 8 -- the question is, whether we should explain
> > > y better. Right now, we're thinking that we shouldn't,
> > > leading to the conclusion that flag here shouldn't be
> > > tracked at all.
> > > 
> > > 
> > > Finding the place where flag was modified in a really big
> > > function can still be challenging. While generating a note
> > > would be an overkill I wonder if some middle ground like
> > > highlighting the statement without additional text would
> > > actually make sense for this case.
> > 
> > I think its a matter of a simple comparison of the stack frames
> > that would decide whether we only want to track the condition if
> > it was in between the collapse point and the initialization in a
> > nested stack frame, or regardless of the stack frame. Let's see
> > just see which yields the better result!
> > 
> > 
> > When it's in the same function, you can simply search for the
> > variable name to see all writes into it. It's not that hard.
> > Even if the variable is written into by pointer, you can see
> > that its address is taken and search for the pointer variable
> > as well (though annoying if there are too many levels of
> > indirection).
> > 
> > But, hmm, if the address is taken in a different stack frame,
> > this becomes much harder. I guess, writes that are made
> > through pointers should be highlighted, and we should also do
> > our tracking to explain why do we think that this pointer
> > points to that variable.
> > 
> > 
> > Ah yes, our arch enemy, pointer analysis showed up again. I guess
> > I agree with you, but I plan on approaching aliasing related
> > issues when I got most of the other stuff working.
> 
> In this case it's much more trivial because you're simply relying
> on the information about pointers that's already contained in the
> graph.
> 
> > 
> > > 
> > > *Example 5.:*
> > > *
> > > *
> > > 01 int flag;
> > > 02
> > > 03 int getInt();
> > > 04
> > > 05 void foo() {
> > > 06   int y = getInt();
> > > 07   flag = y;
> > > 08 }
> > > 09
> > > 10 int main() {
> > > 11   int *x = 0;
> > > 12 foo();
> > > 13   if (flag)// assumed true
> > > 14     *x = 5;// warn
> > > 15 }
> > > 
> > > Like Example 1-2, we should explain that flag was
> > > written on line 7, but like in Example 4., we shouldn't
> > > track y.
> > > 
> > > 
> > > Again, from the user's point of view, I think the ultimate
> > > solution would be to have an interface, where we does not
> > > present the part where y was tracked by default, but the
> > > user could click on a button to actually expand that part of
> > > the path in case she consider it interesting. This, of
> > > course, is not part of you GSoC, I am just wondering what is
> > > the ideal that we would like to pursue in the long run.
> > 
> > A more powerful UI could have indeed solved a lot of these
> > problems by telling users to do our work. But given that it
> > comes with a fairly large cost of developing such UIs for
> > every tool that integrates the Static Analyzer, i'd
> > definitely try to push our own effort of stuffing exactly as
> > much information as necessary into the report as far as possible.
> > 
> > 
> > I actually had a number of discussions with Dániel Krupp about
> > this, and folks in Ericsson seem to be very interested in this
> > direction. This could be a great compliment to my GSoC as a
> > followup project.
> > 
> > > *
> > > Example 6.:*
> > > 01 void f(int flag) {
> > > 02   int *x = 0;
> > > 03 assert(flag);
> > > 04   if (flag)// assumed true
> > > 05     *x = 5;// warn
> > > 06 }
> > > 
> > > Currently, we mention nothing about line 3, yet we say
> > > "Taking the true branch" on line 4, rather then
> > > "Assuming the condition is true". This is because the
> > > collapse point (point at which we constrain flag's value
> > > to be true or false) isn't on line 4: the analysis only
> > > continued past the assert since the analyzer assumed
> > > flag to be non-zero. In this case, we would like to
> > > track flag to the assert to explain why we are so
> > > confident on line 5 that flag is non-zero.
> > > 
> > > 
> > > What do you mean by tracking tha flag back to the assert?
> > > The reason why a note is useful because in this case either
> > > the assert is wrong or the bug is a true positive. But
> > > again, when the assert is in the same function we might not
> > > want to generate additional note for this as this info might
> > > easily be inferred by following the arrows in the report
> > > (but we might highlight the assertion line, see my previous
> > > comments). In case the assert is in a separate (inlined)
> > > function it would make much more sense to generate a note.
> > 
> > Wouldn't the assert already contain a (prunable) note saying
> > "Assuming 'flag' is true"? I guess the only problem here is
> > that the note is prunable, so it won't be shown if the assert
> > is wrapped into a nested function call.
> > 
> > Yup, with-analyzer-config prune-paths=false, you get the
> > following report, but it's still not explained well why flag is
> > non-zero (the thing that I'm missing is the connection in between
> > b and flag):
> > 
> > $ cat example_6.cpp
> > 
> > [[noreturn]] void halt();
> > 
> > void assert(int b) {
> > if (!b)
> > halt();
> > }
> > 
> > void f(int flag) {
> > int *x = 0;
> > assert(flag);
> > if (flag) // assumed true
> > *x = 5; // warn
> > }
> > 
> > $ clang -cc1 -analyze -analyzer-output=text
> > -analyzer-checker=core example_6.cpp -analyzer-config
> > prune-paths=false
> > 
> > example_6.cpp:10:3: note: 'x' initialized to a null pointer value
> > int *x = 0;
> > ^~~~~~
> > example_6.cpp:11:3: note: Calling 'assert'
> > assert(flag);
> > ^~~~~~~~~~~~
> > example_6.cpp:5:7: note: Assuming 'b' is not equal to 0
> > if (!b)
> > ^~
> > example_6.cpp:5:3: note: Taking false branch
> > if (!b)
> > ^
> > example_6.cpp:11:3: note: Returning from 'assert'
> > assert(flag);
> > ^~~~~~~~~~~~
> > example_6.cpp:12:7: note: 'flag' is not equal to 0
> > if (flag) // assumed true
> > ^~~~
> > example_6.cpp:12:3: note: Taking true branch
> > if (flag) // assumed true
> > ^
> > example_6.cpp:13:8: note: Dereference of null pointer (loaded
> > from variable 'x')
> > *x = 5; // warn
> > ~ ^
> > 
> > As you said, this example is interesting because assert isn't a
> > macro here, and flag's collapse point really happens in a nested
> > stack frame. Am I understanding correctly that maybe we shouldn't
> > pursue tracking a variable if the collapse point happened in the
> > same stack frame?
> > 
> > > *
> > > Example 7.:*
> > > 
> > > 01 int getInt();
> > > 02 03 void f(int flag) {
> > > 04   int *x = 0;
> > > 05 flag = getInt();
> > > 06 assert(flag);
> > > 07   if (flag)// assumed true
> > > 08     *x = 5;// warn
> > > 09 }
> > > 
> > > Like Example 6, we should explain why know that flag is
> > > non-zero on line 7 by tracking it back to line 6. Like
> > > in the case of Example 4., the user could see where flag
> > > was written, so we wouldn't like to see a note on line 5.
> > > 
> > > So what's the takeaway?
> > > 
> > > After teaching the analyzer the difference in between a
> > > condition and a "regularly tracked expression", I plan
> > > to implement the following two rules:
> > > 
> > > Track a condition only if
> > > a.) The collapse point doesn't coincide with the
> > > condition point
> > > b.) It was written in a nested stack frame.
> > > 
> > > 
> > > Do you mane a) or b), or a) and b)? Also, what to do with
> > > "multiple collapse point" events like:
> > 
> > If either happens.
> > 
> > > 
> > > if ( a < 10 ) ;
> > > if ( a < 5) // Here we do add new constraints to a, but we
> > > also had other constraints before. Do you consider this to
> > > coincide or not?
> > > ...
> > 
> > I'd rather track the condition *until* we cover those points
> > (which implies not tracking it at all when there are no
> > b.)-points and the a.)-point coincide with the current
> > terminator), rather than track it forever if we have those
> > points.
> > 
> > By collapse point i meant the point where the (bool)condition
> > collapses to a constant-false or a constant-true (in the true
> > case the condition itself doesn't necessarily collapse to a
> > constant 1). It doesn't happen every time we introduce a
> > constraint.
> > 
> > Yup, that's what I was planning to go for.
> > 
> > 
> > > 
> > > We hope that by implementing this, tracking conditions
> > > to conditions would be kept at bay without a counter to
> > > limit the depth of recursion, and the intolerable growth
> > > of bug length with drastically shorten. I do expect
> > > skeletons to fall out of the closet, but I am confident
> > > that this is a good initial approach.
> > > 
> > > As explained earlier, this is mission number one, so
> > > I'll prioritize getting it right before pursuing the
> > > "should-not-have-happened" case.
> > > 
> > > One thing I did not touch on just yet, is the case where
> > > an assert was (correctly, by the way) regarded as a
> > > control dependency, and it's condition was tracked. This
> > > is undoubtedly undesirable, but figuring out whether the
> > > condition is found in an assert is rather difficult.
> > > Asserts are often implemented as a macro, and could have
> > > a very, for a lack of a better word, esoteric
> > > implementations on certain platforms. We discussed
> > > trying to tinker with the control dependency calculator,
> > > namely skipping over nodes that have two successors and
> > > one of them leads to noreturn node, but I still need
> > > time to figure out something reliable.
> > > 
> > > 
> > > Some asserts consists of more basic blocks. Having two
> > > successors where one of them is anoreturnblock is not a
> > > sufficient condition. Consider for exampleassert(a &&
> > > b);Here the basic block for evaluatingawill either go to the
> > > next line after assertion or to the evaluation ofbwhich is
> > > not anoreturnblock. While it is true that from node a we
> > > will either go to the next non-assert block or end up in
> > > anoreturnblock, thenoreturnblock might not be an immediate
> > > successor of nodea.
> > 
> > Forgot to mention, i have a history with the CFG for asserts
> > that's mostly contained
> > inhttps://reviews.llvm.org/D28023andhttps://reviews.llvm.org/D35673.
> > This is the reason why i believe that some implementation of
> > assert() have a lot of CFG blocks on their own, regardless of
> > how many CFG blocks does it take to evaluate the assert
> > condition.
> > 
> > Thanks! :D Right now, incorrect tracking of assert conditions
> > isn't on the top of my priority list, but I'll think about this
> > when I'm sitting on the bus or watering the garden.
> > 
> > > 
> > > Regards,
> > > Gabor
> > > 
> > > 
> > > Thanks to everyone who helped me along: Artem Dergachev,
> > > Gábor Horváth, Ádám Balogh, Jakub Kuderski, and Zoltán
> > > Porkoláb!
> > > 
> > > Cheers,
> > > Kristóf
> > > 
> > > [1]
> > > http://lists.llvm.org/pipermail/cfe-dev/2019-June/062535.html
> > > 
> 


[Attachment #5 (text/html)]

<html>
  <head>
    <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
  </head>
  <body text="#000000" bgcolor="#FFFFFF">
    Yup, looks much better! Let's actually dig through other projects a
    bit, but results on LLVM look fairly perfect to me.<br>
    <br>
    <br>
    &gt; ASTDiagnostic.cpp<br>
    <br>
    Why wasn't the extra tracking removed in the moderate mode? There
    doesn't seem to be an obvious overwrite for `kind`. Is it reacting
    to an invalidation?<br>
    <br>
    <br>
    &gt; DominanceFrontierImpl.h<br>
    <br>
    Why does it track so much more expressions in the moderate mode
    (even if not producing any actual nodes for them)? Is it because you
    changed the de-duplication method from by-expr to by-node in
    between?<br>
    <br>
    <br>
    &gt; CGObjCGNU.cpp<br>
    <br>
    I'd like to think more about this one. It doesn't look to me that
    the new notes are helpful here, for the reason that the problem is
    pretty much entirely between the definition of OID and the use,
    while all of the notes are outside that range. The report wouldn't
    become less valid (for a certain definition of valid) if we simply
    omit everything except the last three notes:<br>
      - 'OID' is initialized to a null pointer value<br>
      - Loop body skipped when range is empty<br>
      - Called ++ object pointer is null<br>
    I don't see any immediate solutions here; i guess the right thing to
    do would be to simply learn how to chop off the irrelevant beginning
    of the path.<br>
    <br>
    <br>
    On 6/24/19 4:48 PM, Kristóf Umann wrote:<br>
    <blockquote type="cite"
cite="mid:CAGcXOD7Pzs0BhbxkPMALzAnTEFDwRiKHhVQQkz6_PWHhWrVxfg@mail.gmail.com">
      <meta http-equiv="content-type" content="text/html; charset=UTF-8">
      <div dir="ltr">I went ahead and (although admittedly with a very
        crude implementation) I enhanced condition tracking with the
        following:
        <div><br>
        </div>
        <div>* All notes about their initialization are discarded</div>
        <div>* No longer tracking the right hand side of the last store</div>
        <div>* Last stores are discarded unless it happened in another
          stack frame</div>
        <div><br>
        </div>
        <div>I evaluated the prototype solution on
          LLVM+Clang+Clang-tools-extra, Xerces and Bitcoin. I might add
          rtags and YouCompleteMe later, but I think this is sufficient
          for now:</div>
        <div><br>
        </div>
        <div><a href="http://cc.elte.hu:15001/Default/#"
            moz-do-not-send="true">http://cc.elte.hu:15001/Default/#</a><br>
        </div>
        <div><br>
        </div>
        Due to the lack of better categorization, I used the following
        (so this has in fact no relation to whether the report is
        correct or not):<br>
        * Intentional reports (green x) are the ones where the bugreport
        got unquestionably worse.<br>
        * Confirmed reports (red check mark) got definitely better<br>
        * False positives (grey line) are in between.
        <div><br>
        </div>
        <div>
          <div>I didn't have time to look at them all, but here are some
            reports I found interesting on LLVM:<br>
            <br>
            <a
href="http://cc.elte.hu:15001/Default/#is-unique=on&amp;run=LLVM%2FClang%2FClang-tools \
-extra%20AFTER%20tracking%20conditions&amp;review-status=Confirmed&amp;review-status=F \
alse%20positive&amp;review-status=Intentional&amp;detection-status=New&amp;detection-s \
tatus=Reopened&amp;detection-status=Unresolved&amp;tab=LLVM%2FClang%2FClang-tools-extra%20AFTER%20tracking%20conditions"
  moz-do-not-send="true">http://cc.elte.hu:15001/Default/#is-unique=on&amp;run=LLVM%2F \
Clang%2FClang-tools-extra%20AFTER%20tracking%20conditions&amp;review-status=Confirmed& \
amp;review-status=False%20positive&amp;review-status=Intentional&amp;detection-status= \
New&amp;detection-status=Reopened&amp;detection-status=Unresolved&amp;tab=LLVM%2FClang%2FClang-tools-extra%20AFTER%20tracking%20conditions</a><br>
  </div>
          <div><br>
          </div>
          <div>By clicking on a bug report, and changing the run with
            the dropdown menu found in the upper right corner (next to
            "Also found in:"), you can see that in some cases these
            enhancements kept the amount of extra notes at bay quite
            well.</div>
        </div>
        <div><br>
        </div>
        <div><font face="courier new, monospace">.* BEFORE condition
            tracking</font>: Analysis made on LLVM monorepo commit
          4cc6d72bb4d</div>
        <font face="courier new, monospace">.* AFTER condition tracking</font>:
        With <a href="https://reviews.llvm.org/D62883"
          moz-do-not-send="true">https://reviews.llvm.org/D62883</a>
        applied and condition tracking enabled.<br
          class="gmail-Apple-interchange-newline">
        <font face="courier new, monospace">.* AFTER condition tracking
          + DEBUG notes</font>: With <a
          href="https://reviews.llvm.org/D63642" \
moz-do-not-send="true">https://reviews.llvm.org/D63642</a>  applied and condition \
tracking with debug notes enabled.<br  class="gmail-Apple-interchange-newline">
        <font face="courier new, monospace">.* AFTER condition tracking
          WITH moderate tracking</font>: With the above mentioned
        additions applied on top of <a
          href="https://reviews.llvm.org/D62883" \
moz-do-not-send="true">https://reviews.llvm.org/D62883</a>  and condition tracking \
enabled.<br  class="gmail-Apple-interchange-newline">
        <font face="courier new, monospace">.* AFTER condition
          tracking WITH moderate tracking + DEBUG notes</font>: With the
        above mentioned additions applied on top of <a
          href="https://reviews.llvm.org/D63642" \
moz-do-not-send="true">https://reviews.llvm.org/D63642</a> and  condition tracking \
with debug notes enabled.</div>  <br>
      <div class="gmail_quote">
        <div dir="ltr" class="gmail_attr">On Wed, 19 Jun 2019 at 02:08,
          Artem Dergachev &lt;<a href="mailto:adergachev@apple.com"
            target="_blank" moz-do-not-send="true">adergachev@apple.com</a>&gt;
          wrote:<br>
        </div>
        <blockquote class="gmail_quote" style="margin:0px 0px 0px
          0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
          <div><br>
            <div><br>
              <blockquote type="cite">
                <div>On Jun 18, 2019, at 5:02 PM, Kristóf Umann &lt;<a
                    href="mailto:dkszelethus@gmail.com" target="_blank"
                    moz-do-not-send="true">dkszelethus@gmail.com</a>&gt;
                  wrote:</div>
                <br
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-interchange-newline">
  <div><br
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-interchange-newline">
  <br
style="font-family:Helvetica;font-size:12px;font-style:normal;font-variant-caps:normal \
;font-weight:normal;letter-spacing:normal;text-align:start;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px;text-decoration:none">
  <div class="gmail_quote"
style="font-family:Helvetica;font-size:12px;font-style:normal;font-variant-caps:normal \
;font-weight:normal;letter-spacing:normal;text-align:start;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px;text-decoration:none">
  <div dir="ltr" class="gmail_attr">On Wed, 19 Jun
                      2019 at 01:14, Artem Dergachev &lt;<a
                        href="mailto:noqnoqneo@gmail.com"
                        target="_blank" \
moz-do-not-send="true">noqnoqneo@gmail.com</a>&gt;  wrote:<br>
                    </div>
                    <blockquote class="gmail_quote" style="margin:0px
                      0px 0px 0.8ex;border-left:1px solid
                      rgb(204,204,204);padding-left:1ex">
                      <div bgcolor="#FFFFFF"><br>
                        <br>
                        <div
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786gmail-m_-4639077540367978006moz-cite-prefix">On
  6/18/19 3:44 PM, Gábor Horváth wrote:<br>
                        </div>
                        <blockquote type="cite">
                          <div dir="ltr">
                            <div dir="ltr">
                              <div>Hi Kristóf,</div>
                              <div><br>
                              </div>
                              <div>Thanks for the report. I have been
                                thinking about what kind of user
                                experience should we pursue. While my
                                ideas might be a bit opinionated, see my
                                comments inline.<span
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span><br>  </div>
                            </div>
                            <br>
                            <div class="gmail_quote">
                              <div dir="ltr" class="gmail_attr">On Tue,
                                18 Jun 2019 at 20:47, Kristóf Umann &lt;<a
                                  href="mailto:dkszelethus@gmail.com"
                                  target="_blank" \
moz-do-not-send="true">dkszelethus@gmail.com</a>&gt;  wrote:<br>
                              </div>
                              <blockquote class="gmail_quote"
                                style="margin:0px 0px 0px
                                0.8ex;border-left:1px solid
                                rgb(204,204,204);padding-left:1ex">
                                <div dir="ltr">Hi!<br>
                                  <br>
                                  This is an update on my GSoC project,
                                  "Enhancing bug reports in the Clang
                                  Static Analyzer". We had some
                                  discussions on phabricator, and some
                                  in private meetings, and I think it's
                                  due to share how things are looking
                                  right now.
                                  <div><br>
                                  </div>
                                  <div>In my previous mail [1], I
                                    detailed the two distinct categories
                                    we defined, the "must-have-happened"
                                    case where every information is
                                    already in the bug path, and the
                                    "should-not-have-happened" case when
                                    the information isn't found in the
                                    bugpath. I divided the latter into
                                    two subcategories, the "Inlined
                                    category" (initially referred to as
                                    "category 1"), and the "Not inlined
                                    category" (initally referred to as
                                    "category 2").</div>
                                  <div><br>
                                  </div>
                                  <div>These categorizations are used to
                                    teach the analyzer incrementally
                                    about which nodes in the bugpath
                                    deserve special attention. For now,
                                    I plan to use this information
                                    exclusively for finding control
                                    dependencies to these nodes, and
                                    tracking their condition. Now, what
                                    we mean under "tracking an
                                    expression value", is that to add
                                    notes to the bug report relevant to
                                    that expression value.</div>
                                  <div><br>
                                  </div>
                                  <div>Ultimately, this means that my
                                    project depends greatly on condition
                                    tracking yielding meaningful
                                    addition of notes to the bug report,
                                    without adding unimportant ones.
                                    Since I more-or-less finished my
                                    work on the must-have-happened case
                                    (meaning that the analyzer can now
                                    figure out control dependencies to
                                    nodes contained in the bugpath), I'd
                                    like to detail how I plan to work on
                                    this.</div>
                                  <div><br>
                                  </div>
                                  <div>While evaluating an early
                                    prototype solution to the
                                    "must-have-happened" case where the
                                    same expression value tracking was
                                    used for both the bug-causing
                                    variable and for the conditions, I
                                    found that in many cases, the growth
                                    of bug length was intolerable. This
                                    is, in part, caused by conditions
                                    being tracked to a condition
                                    recursively, the conditions of
                                    asserts being tracked, and that
                                    notes about a condition are not as
                                    interesting as notes about the bug
                                    causing variable (calls to operator
                                    bool for instance).</div>
                                  <div><br>
                                  </div>
                                  <div>Fixing any of these requires me
                                    to teach the analyzer the difference
                                    in between "THE value" and "just a
                                    condition". The details are a little
                                    more complicated, so I'll show some
                                    examples that point out certain
                                    cases:</div>
                                  <div><b><br>
                                    </b></div>
                                  <div><b>Example 1.:</b></div>
                                  <div><br>
                                  </div>
                                  <div><span
id="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-88547581698791967 \
86gmail-m_-4639077540367978006gmail-m_8684261205520343446gmail-m_-3381093670081869542gmail-docs-internal-guid-4371bc51-7fff-dc4f-3cfb-1e3c0d0fbc66">
  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">01 \
int flag;</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">02 \
bool coin();</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">03</span></div>
  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">04 \
void foo() {</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">05 \
flag = coin();</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">06 \
}</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">07</span></div>
  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">08 \
int main() {</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">09 \
int *x = 0;</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">10 \
foo();</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">11 \
if (flag) // assumed true</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">12 \
*x = 5; // warn</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">13 \
}</span></div>  </span><b><br
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-88547581698791 \
96786gmail-m_-4639077540367978006gmail-m_8684261205520343446gmail-m_-3381093670081869542gmail-Apple-interchange-newline">
  </b></div>
                                  <div>In this example, it'd be great to
                                    see notes placed on line 10 and 5,
                                    because if flag wasn't invalidated,
                                    the bug would not have occurred
                                    (since flag is global, and is
                                    initialized to 0). The analyzer
                                    normally wouldn't place notes there,
                                    so we definitely should track flag
                                    up to line 5.</div>
                                  <div><b><br>
                                    </b></div>
                                  <div><b>Example 2.:</b></div>
                                  <div><b><br>
                                    </b></div>
                                  <div><span \
style="background-color:transparent;font-family:&quot;Courier \
New&quot;;white-space:pre-wrap">01 int flag;</span></div>  <span
id="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-88547581698791967 \
86gmail-m_-4639077540367978006gmail-m_8684261205520343446gmail-m_-3381093670081869542gmail-docs-internal-guid-af141fcb-7fff-5805-158d-0af705abc2d5">
  <div
                                      \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">02 \
bool coin();</span></div>  <div
                                      \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">03</span></div>
  <div
                                      \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">04 \
void foo() {</span></div>  <div
                                      \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">05 \
coin();</span></div>  <div
                                      \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">06 \
}</span></div>  <div
                                      \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">07</span></div>
  <div
                                      \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">08 \
int main() {</span></div>  <div
                                      \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">09 \
int *x = 0;</span></div>  <div
                                      \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">10 \
foo();</span></div>  <div
                                      \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">11 \
if (flag)</span><span style="font-family:&quot;Courier \
New&quot;;white-space:pre-wrap"> // assumed true</span></div>  <div
                                      \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">12 \
*x = 5;</span><span style="font-family:&quot;Courier New&quot;;white-space:pre-wrap"> \
// warn</span></div>  <div
                                      \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">13 \
}</span></div>  </span>
                                  <div><br>
                                  </div>
                                  <div>This case is very similar, with
                                    the only difference being that the
                                    analyzer conservatively assumed that
                                    coin may have written flag (as it's
                                    a global variable). We should track
                                    until line 5.</div>
                                </div>
                              </blockquote>
                              <div><br>
                              </div>
                              <div>I am not sure about this example.
                                While the invalidation of the global
                                flag is definitely playing a role in
                                this bug report the point where the flag
                                is actually invalidated could seem quite
                                arbitrary for the user. We might get the
                                invalidation when we first see a
                                function that is not defined in this TU,
                                when we reach the maximum call stack
                                depth and so on. My point is, I am not
                                sure if we actually help the user by
                                pinpointing the first function call
                                where the invalidation happens. A more
                                user friendly way to think about this
                                problem is how could the user solve this
                                false positive? As far as I understand
                                the cannonical solution would be to add
                                an assertion, so it would be better to
                                put a node close to the place where the
                                user would change the code, so I would
                                put it in the stack frame of the bug.
                                For example we could generate a note for
                                the call foo(), that somewhere in that
                                call stack the analyzer could no longer
                                track the value of flag, thus
                                invalidated its contents. <a
                                  class="gmail_plusreply"
id="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786gmail-m_-4639077540367978006plusReplyChip-0"
  href="mailto:adergachev@apple.com"
                                  target="_blank" moz-do-not-send="true">@Artem
                                  Dergachev</a><span
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span>what  do you think?<br>
                              </div>
                            </div>
                          </div>
                        </blockquote>
                        <br>
                        This is definitely one of the most annoying
                        kinds of false positives we have due to
                        infeasible paths and over-eager assumption
                        chains. Like, you can add an assertion here, but
                        it's going to look super ugly:<br>
                        <br>
                         <span
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span>bool  tmp_flag = flag;<br>
                         <span
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span>coin();  // why would anybody believe it touches flag in
                        the first place???<br>
                         <span
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span>assert(flag  == tmp_flag &amp;&amp; "sanity check");<br>
                        <br>
                        And even if the user would actually like to add
                        something like that to his code, it would
                        require a powerful constraint solver to even
                        handle this assertion correctly.<br>
                        <br>
                        These are not very common, but they're so
                        annoying that i probably wouldn't be against
                        marking reports as invalid immediately when we
                        see that an important control dependency relies
                        on such invalidation (we'll have to gather
                        experimental data on that, of course).<br>
                        <br>
                      </div>
                    </blockquote>
                    <div><br>
                    </div>
                    <div>Hmm, yea, good point. But if I coin were to
                      take flag as a non-const reference (other than the
                      fact that flag is global) I guess we would like to
                      keep the report and place notes all the way there?
                      I agree with Artem, we really need to see the real
                      world impact of this.</div>
                  </div>
                </div>
              </blockquote>
              <div><br>
              </div>
              <div>Yup.</div>
              <div><br>
              </div>
              <br>
              <blockquote type="cite">
                <div>
                  <div class="gmail_quote"
style="font-family:Helvetica;font-size:12px;font-style:normal;font-variant-caps:normal \
;font-weight:normal;letter-spacing:normal;text-align:start;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px;text-decoration:none">
  <blockquote class="gmail_quote" style="margin:0px
                      0px 0px 0.8ex;border-left:1px solid
                      rgb(204,204,204);padding-left:1ex">
                      <div bgcolor="#FFFFFF">
                        <blockquote type="cite">
                          <div dir="ltr">
                            <div class="gmail_quote">
                              <div> </div>
                              <blockquote class="gmail_quote"
                                style="margin:0px 0px 0px
                                0.8ex;border-left:1px solid
                                rgb(204,204,204);padding-left:1ex">
                                <div dir="ltr"><br
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-88547581698791 \
96786gmail-m_-4639077540367978006gmail-m_8684261205520343446gmail-m_-3381093670081869542gmail-Apple-interchange-newline">
  <div><b>Example 3.:</b></div>
                                  <div><b><br>
                                    </b></div>
                                  <div><span
id="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-88547581698791967 \
86gmail-m_-4639077540367978006gmail-m_8684261205520343446gmail-m_-3381093670081869542gmail-docs-internal-guid-7904e5bc-7fff-83e0-6794-7060f31972aa">
  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="background-color:transparent;font-family:&quot;Courier \
New&quot;;white-space:pre-wrap">01 void f(int flag) {</span><br>  </div>
                                      <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">02 \
int *x = 0;</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">03 \
if (flag)</span><span style="font-family:&quot;Courier \
New&quot;;white-space:pre-wrap"> // assumed true</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">04 \
*x = 5;</span><span style="font-family:&quot;Courier New&quot;;white-space:pre-wrap"> \
// warn</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">05 \
}</span></div>  </span><br
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-88547581698791 \
96786gmail-m_-4639077540367978006gmail-m_8684261205520343446gmail-m_-3381093670081869542gmail-Apple-interchange-newline">
  </div>
                                  <div>Here, the user could simply
                                    follow the arrows that shows the
                                    path of execution: it isn't really
                                    important what flag was initialized
                                    on the first line.</div>
                                  <div><br>
                                  </div>
                                  <div><b>Example 4.:</b></div>
                                  <div><b><br>
                                    </b></div>
                                  <div><span
id="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-88547581698791967 \
86gmail-m_-4639077540367978006gmail-m_8684261205520343446gmail-m_-3381093670081869542gmail-docs-internal-guid-6d80ef0f-7fff-a248-86b1-e6e82a5757ba">
  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">01 \
int flag;</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">02</span></div>
  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">03 \
int getInt();</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">04</span></div>
  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">05 \
int main() {</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">06 \
int *x = 0;</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">07 \
int y = getInt();</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">08 \
flag = y;</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">09 \
if (flag)</span><span style="font-family:&quot;Courier \
New&quot;;white-space:pre-wrap"> // assumed true</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">10 \
*x = 5;</span><span style="font-family:&quot;Courier New&quot;;white-space:pre-wrap"> \
// warn</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">11 \
}</span></div>  </span><br
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-88547581698791 \
96786gmail-m_-4639077540367978006gmail-m_8684261205520343446gmail-m_-3381093670081869542gmail-Apple-interchange-newline">
  </div>
                                  <div>Again, the user could see there
                                    was a write made to flag on line 8
                                    -- the question is, whether we
                                    should explain y better. Right now,
                                    we're thinking that we shouldn't,
                                    leading to the conclusion that flag
                                    here shouldn't be tracked at all.</div>
                                </div>
                              </blockquote>
                              <div><br>
                              </div>
                              <div>Finding the place where flag was
                                modified in a really big function can
                                still be challenging. While generating a
                                note would be an overkill I wonder if
                                some middle ground like highlighting the
                                statement without additional text would
                                actually make sense for this case.</div>
                            </div>
                          </div>
                        </blockquote>
                      </div>
                    </blockquote>
                    <div>I think its a matter of a simple comparison of
                      the stack frames that would decide whether we only
                      want to track the condition if it was in between
                      the collapse point and the initialization in a
                      nested stack frame, or regardless of the stack
                      frame. Let's see just see which yields the better
                      result!</div>
                    <blockquote class="gmail_quote" style="margin:0px
                      0px 0px 0.8ex;border-left:1px solid
                      rgb(204,204,204);padding-left:1ex">
                      <div bgcolor="#FFFFFF"><br>
                        When it's in the same function, you can simply
                        search for the variable name to see all writes
                        into it. It's not that hard. Even if the
                        variable is written into by pointer, you can see
                        that its address is taken and search for the
                        pointer variable as well (though annoying if
                        there are too many levels of indirection).<br>
                        <br>
                        But, hmm, if the address is taken in a different
                        stack frame, this becomes much harder. I guess,
                        writes that are made through pointers should be
                        highlighted, and we should also do our tracking
                        to explain why do we think that this pointer
                        points to that variable.<br>
                        <br>
                      </div>
                    </blockquote>
                    <div><br>
                    </div>
                    <div>Ah yes, our arch enemy, pointer analysis showed
                      up again. I guess I agree with you, but I plan on
                      approaching aliasing related issues when I got
                      most of the other stuff working.</div>
                  </div>
                </div>
              </blockquote>
              <div><br>
              </div>
              <div>In this case it's much more trivial because you're
                simply relying on the information about pointers that's
                already contained in the graph.</div>
              <br>
              <blockquote type="cite">
                <div>
                  <div class="gmail_quote"
style="font-family:Helvetica;font-size:12px;font-style:normal;font-variant-caps:normal \
;font-weight:normal;letter-spacing:normal;text-align:start;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px;text-decoration:none">
  <blockquote class="gmail_quote" style="margin:0px
                      0px 0px 0.8ex;border-left:1px solid
                      rgb(204,204,204);padding-left:1ex">
                      <div bgcolor="#FFFFFF"><br>
                        <blockquote type="cite">
                          <div dir="ltr">
                            <div class="gmail_quote">
                              <div> </div>
                              <blockquote class="gmail_quote"
                                style="margin:0px 0px 0px
                                0.8ex;border-left:1px solid
                                rgb(204,204,204);padding-left:1ex">
                                <div dir="ltr">
                                  <div><br>
                                  </div>
                                  <div><b>Example 5.:</b></div>
                                  <div><b><br>
                                    </b></div>
                                  <div><span
id="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-88547581698791967 \
86gmail-m_-4639077540367978006gmail-m_8684261205520343446gmail-m_-3381093670081869542gmail-docs-internal-guid-6d80ef0f-7fff-a248-86b1-e6e82a5757ba">
  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">01 \
int flag;</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">02</span></div>
  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">03 \
int getInt();</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">04</span></div>
  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">05 \
void foo() {</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">06 \
int y = getInt();</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">07 \
flag = y;</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">08 \
}</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">09</span></div>
  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">10 \
int main() {</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">11 \
int *x = 0;</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">12 \
foo();</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">13 \
if (flag)</span><span style="font-family:&quot;Courier \
New&quot;;white-space:pre-wrap"> // assumed true</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">14 \
*x = 5;</span><span style="font-family:&quot;Courier New&quot;;white-space:pre-wrap"> \
// warn</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">15 \
}</span></div>  <p dir="ltr"
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">
 </span></p>
                                      <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap"><font \
face="arial, sans-serif">Like Example 1-2, we should explain that flag was written on \
line 7, but like in Example 4., we shouldn't track y.</font></span></div>  \
</span></div>  </div>
                              </blockquote>
                              <div><br>
                              </div>
                              <div>Again, from the user's point of view,
                                I think the ultimate solution would be
                                to have an interface, where we does not
                                present the part where y was tracked by
                                default, but the user could click on a
                                button to actually expand that part of
                                the path in case she consider it
                                interesting. This, of course, is not
                                part of you GSoC, I am just wondering
                                what is the ideal that we would like to
                                pursue in the long run.<span
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span><br>  </div>
                            </div>
                          </div>
                        </blockquote>
                        <br>
                        A more powerful UI could have indeed solved a
                        lot of these problems by telling users to do our
                        work. But given that it comes with a fairly
                        large cost of developing such UIs for every tool
                        that integrates the Static Analyzer, i'd
                        definitely try to push our own effort of
                        stuffing exactly as much information as
                        necessary into the report as far as possible.<br>
                        <br>
                      </div>
                    </blockquote>
                    <div><br>
                    </div>
                    <div>I actually had a number of discussions with
                      Dániel Krupp about this, and folks in Ericsson
                      seem to be very interested in this direction. This
                      could be a great compliment to my GSoC as a
                      followup project. <br>
                    </div>
                    <blockquote class="gmail_quote" style="margin:0px
                      0px 0px 0.8ex;border-left:1px solid
                      rgb(204,204,204);padding-left:1ex">
                      <div bgcolor="#FFFFFF">
                        <blockquote type="cite">
                          <div dir="ltr">
                            <div class="gmail_quote">
                              <div> </div>
                              <blockquote class="gmail_quote"
                                style="margin:0px 0px 0px
                                0.8ex;border-left:1px solid
                                rgb(204,204,204);padding-left:1ex">
                                <div dir="ltr">
                                  <div>
                                    <div><b><br
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-88547581698791 \
96786gmail-m_-4639077540367978006gmail-m_8684261205520343446gmail-m_-3381093670081869542gmail-Apple-interchange-newline">
  Example 6.:</b></div>
                                    <div>
                                      <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="background-color:transparent;font-family:&quot;Courier \
New&quot;;white-space:pre-wrap"> 01 void f(int flag) {</span><br>
                                      </div>
                                      <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">02 \
int *x = 0;</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">03 \
assert(flag);</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">04 \
if (flag)</span><span style="font-family:&quot;Courier \
New&quot;;white-space:pre-wrap"> // assumed true</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">05 \
*x = 5;</span><span style="font-family:&quot;Courier New&quot;;white-space:pre-wrap"> \
// warn</span></div>  <div
                                        \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">06 \
}</span></div>  </div>
                                  </div>
                                  <div><br>
                                  </div>
                                  <div>Currently, we mention nothing
                                    about line 3, yet we say "Taking the
                                    true branch" on line 4, rather then
                                    "Assuming the condition is true".
                                    This is because the collapse point
                                    (point at which we constrain flag's
                                    value to be true or false) isn't on
                                    line 4: the analysis only continued
                                    past the assert since the analyzer
                                    assumed flag to be non-zero. In this
                                    case, we would like to track flag to
                                    the assert to explain why we are so
                                    confident on line 5 that flag is
                                    non-zero.</div>
                                </div>
                              </blockquote>
                              <div><br>
                              </div>
                              <div>What do you mean by tracking tha flag
                                back to the assert? The reason why a
                                note is useful because in this case
                                either the assert is wrong or the bug is
                                a true positive. But again, when the
                                assert is in the same function we might
                                not want to generate additional note for
                                this as this info might easily be
                                inferred by following the arrows in the
                                report (but we might highlight the
                                assertion line, see my previous
                                comments). In case the assert is in a
                                separate (inlined) function it would
                                make much more sense to generate a note.<br>
                              </div>
                            </div>
                          </div>
                        </blockquote>
                        <br>
                        Wouldn't the assert already contain a (prunable)
                        note saying "Assuming 'flag' is true"? I guess
                        the only problem here is that the note is
                        prunable, so it won't be shown if the assert is
                        wrapped into a nested function call.<br>
                        <br>
                      </div>
                    </blockquote>
                    <div>Yup, with<span
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span><font  face="courier new, monospace">-analyzer-config
                        prune-paths=false</font><font face="arial,
                        sans-serif">, you get the following report, but
                        it's still not explained well why flag is
                        non-zero (the thing that I'm missing is the
                        connection in between b and flag):</font></div>
                    <div><font face="courier new, monospace"><br>
                      </font></div>
                    <div><font face="courier new, monospace">$ cat
                        example_6.cpp</font></div>
                    <div><font face="arial, sans-serif"><br>
                      </font></div>
                    <div><font face="courier new, monospace">[[noreturn]]
                        void halt();<br>
                        <br>
                        void assert(int b) {<br>
                          if (!b)<br>
                            halt();<br>
                        }<br>
                        <br>
                        void f(int flag) {<br>
                          int *x = 0;<br>
                          assert(flag);<br>
                          if (flag) // assumed true<br>
                            *x = 5; // warn<br>
                        }<br>
                      </font></div>
                    <div><font face="courier new, monospace"><br>
                      </font></div>
                    <div><font face="courier new, monospace">$ clang
                        -cc1 -analyze -analyzer-output=text
                        -analyzer-checker=core example_6.cpp
                        -analyzer-config prune-paths=false</font></div>
                    <div><font face="courier new, monospace"><br>
                      </font></div>
                    <div><font face="courier new, monospace">example_6.cpp:10:3:
                        note: 'x' initialized to a null pointer value<br>
                          int *x = 0;<br>
                          ^~~~~~<br>
                        example_6.cpp:11:3: note: Calling 'assert'<br>
                          assert(flag);<br>
                          ^~~~~~~~~~~~<br>
                        example_6.cpp:5:7: note: Assuming 'b' is not
                        equal to 0<br>
                          if (!b)<br>
                              ^~<br>
                        example_6.cpp:5:3: note: Taking false branch<br>
                          if (!b)<br>
                          ^<br>
                        example_6.cpp:11:3: note: Returning from
                        'assert'<br>
                          assert(flag);<br>
                          ^~~~~~~~~~~~<br>
                        example_6.cpp:12:7: note: 'flag' is not equal to
                        0<br>
                          if (flag) // assumed true<br>
                              ^~~~<br>
                        example_6.cpp:12:3: note: Taking true branch<br>
                          if (flag) // assumed true<br>
                          ^<br>
                        example_6.cpp:13:8: note: Dereference of null
                        pointer (loaded from variable 'x')<br>
                            *x = 5; // warn<br>
                             ~ ^</font><br>
                    </div>
                    <div><font face="courier new, monospace"><br>
                      </font></div>
                    <div><font face="arial, sans-serif">As you said,
                        this example is interesting because assert isn't
                        a macro here, and flag's collapse point really
                        happens in a nested stack frame. Am I
                        understanding correctly that maybe we shouldn't
                        pursue tracking a variable if the collapse point
                        happened in the same stack frame?</font></div>
                    <blockquote class="gmail_quote" style="margin:0px
                      0px 0px 0.8ex;border-left:1px solid
                      rgb(204,204,204);padding-left:1ex">
                      <div bgcolor="#FFFFFF">
                        <blockquote type="cite">
                          <div dir="ltr">
                            <div class="gmail_quote">
                              <div> </div>
                              <blockquote class="gmail_quote"
                                style="margin:0px 0px 0px
                                0.8ex;border-left:1px solid
                                rgb(204,204,204);padding-left:1ex">
                                <div dir="ltr">
                                  <div><b><br
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-88547581698791 \
96786gmail-m_-4639077540367978006gmail-m_8684261205520343446gmail-m_-3381093670081869542gmail-Apple-interchange-newline">
  Example 7.:</b></div>
                                  <div>
                                    <p dir="ltr"
                                      \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="background-color:transparent;font-family:&quot;Courier \
New&quot;;white-space:pre-wrap"> </span></p>
                                    <div
                                      \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="background-color:transparent;font-family:&quot;Courier \
New&quot;;white-space:pre-wrap">01 int getInt();</span></div>  <div
                                      \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="background-color:transparent;font-family:&quot;Courier \
New&quot;;white-space:pre-wrap">02 03 void f(int flag) {</span><br>
                                    </div>
                                    <div
                                      \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">04 \
int *x = 0;</span></div>  <div
                                      \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">05 \
flag = getInt();</span></div>  <div
                                      \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">06 \
assert(flag);</span></div>  <div
                                      \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">07 \
if (flag)</span><span style="font-family:&quot;Courier \
New&quot;;white-space:pre-wrap"> // assumed true</span></div>  <div
                                      \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">08 \
*x = 5;</span><span style="font-family:&quot;Courier New&quot;;white-space:pre-wrap"> \
// warn</span></div>  <div
                                      \
style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span \
style="font-family:&quot;Courier \
New&quot;;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">09 \
}</span></div>  </div>
                                  <div><br>
                                  </div>
                                  <div>Like Example 6, we should explain
                                    why know that flag is non-zero on
                                    line 7 by tracking it back to line
                                    6. Like in the case of Example 4.,
                                    the user could see where flag was
                                    written, so we wouldn't like to see
                                    a note on line 5.</div>
                                  <div><br>
                                  </div>
                                  <div>So what's the takeaway? </div>
                                  <div><br>
                                  </div>
                                  <div>After teaching the analyzer the
                                    difference in between a condition
                                    and a "regularly tracked
                                    expression", I plan to implement the
                                    following two rules:</div>
                                  <div><br>
                                  </div>
                                  <div>Track a condition only if</div>
                                  <div>a.) The collapse point doesn't
                                    coincide with the condition point</div>
                                  <div>b.) It was written in a nested
                                    stack frame.</div>
                                </div>
                              </blockquote>
                              <div><br>
                              </div>
                              <div>Do you mane a) or b), or a) and b)?
                                Also, what to do with "multiple collapse
                                point" events like:</div>
                            </div>
                          </div>
                        </blockquote>
                      </div>
                    </blockquote>
                    <div>If either happens. </div>
                    <blockquote class="gmail_quote" style="margin:0px
                      0px 0px 0.8ex;border-left:1px solid
                      rgb(204,204,204);padding-left:1ex">
                      <div bgcolor="#FFFFFF">
                        <blockquote type="cite">
                          <div dir="ltr">
                            <div class="gmail_quote">
                              <div><br>
                              </div>
                              <div><span
                                  style="font-family:&quot;courier
                                  new&quot;,monospace">if ( a &lt; 10 )
                                  ;</span></div>
                              <div><span
                                  style="font-family:&quot;courier
                                  new&quot;,monospace">if ( a &lt; 5) //
                                  Here we do add new constraints to a,
                                  but we also had other constraints
                                  before. Do you consider this to
                                  coincide or not?<br>
                                </span></div>
                              <div><span
                                  style="font-family:&quot;courier
                                  new&quot;,monospace"> ...</span><br>
                              </div>
                            </div>
                          </div>
                        </blockquote>
                        <br>
                        I'd rather track the condition *until* we cover
                        those points (which implies not tracking it at
                        all when there are no b.)-points and the
                        a.)-point coincide with the current terminator),
                        rather than track it forever if we have those
                        points.<br>
                        <br>
                        By collapse point i meant the point where the
                        (bool)condition collapses to a constant-false or
                        a constant-true (in the true case the condition
                        itself doesn't necessarily collapse to a
                        constant 1). It doesn't happen every time we
                        introduce a constraint.<br>
                        <br>
                      </div>
                    </blockquote>
                    <div>Yup, that's what I was planning to go for. </div>
                    <blockquote class="gmail_quote" style="margin:0px
                      0px 0px 0.8ex;border-left:1px solid
                      rgb(204,204,204);padding-left:1ex">
                      <div bgcolor="#FFFFFF"><br>
                        <blockquote type="cite">
                          <div dir="ltr">
                            <div class="gmail_quote">
                              <div> </div>
                              <blockquote class="gmail_quote"
                                style="margin:0px 0px 0px
                                0.8ex;border-left:1px solid
                                rgb(204,204,204);padding-left:1ex">
                                <div dir="ltr">
                                  <div><br>
                                  </div>
                                  <div>We hope that by implementing
                                    this, tracking conditions to
                                    conditions would be kept at bay
                                    without a counter to limit the depth
                                    of recursion, and the intolerable
                                    growth of bug length with
                                    drastically shorten. I do expect
                                    skeletons to fall out of the closet,
                                    but I am confident that this is a
                                    good initial approach.</div>
                                  <div><br>
                                  </div>
                                  <div>As explained earlier, this is
                                    mission number one, so I'll
                                    prioritize getting it right before
                                    pursuing the
                                    "should-not-have-happened" case.</div>
                                  <div><br>
                                  </div>
                                  <div>One thing I did not touch on just
                                    yet, is the case where an assert was
                                    (correctly, by the way) regarded as
                                    a control dependency, and it's
                                    condition was tracked. This is
                                    undoubtedly undesirable, but
                                    figuring out whether the condition
                                    is found in an assert is rather
                                    difficult. Asserts are often
                                    implemented as a macro, and could
                                    have a very, for a lack of a better
                                    word, esoteric implementations on
                                    certain platforms. We discussed
                                    trying to tinker with the control
                                    dependency calculator, namely
                                    skipping over nodes that have two
                                    successors and one of them leads to
                                    noreturn node, but I still need time
                                    to figure out something reliable.</div>
                                </div>
                              </blockquote>
                              <div><br>
                              </div>
                              <div>Some asserts consists of more basic
                                blocks. Having two successors where one
                                of them is a<span
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span><span  style="font-family:&quot;courier
                                  new&quot;,monospace">noreturn</span><span
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span>block  is not a sufficient condition. Consider
                                for example<span
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span><span  style="font-family:&quot;courier
                                  new&quot;,monospace">assert(a
                                  &amp;&amp; b);<span
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span></span>Here  the basic block for evaluating<span
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span><span  style="font-family:&quot;courier
                                  new&quot;,monospace">a</span><span
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span>will  either go to the next line after
                                assertion or to the evaluation of<span
                                  style="font-family:&quot;courier
                                  new&quot;,monospace"><span
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span>b</span><span \
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span>which  is not a<span
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span><span  style="font-family:&quot;courier
                                  new&quot;,monospace">noreturn</span><span
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span>block.  While it is true that from node a we
                                will either go to the next non-assert
                                block or end up in a<span
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span><span  style="font-family:&quot;courier
                                  new&quot;,monospace">noreturn</span><span
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span>block,  the<span
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span><span  style="font-family:&quot;courier
                                  new&quot;,monospace">noreturn</span><span
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span>block  might not be an immediate successor of
                                node<span
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span><span  style="font-family:&quot;courier
                                  new&quot;,monospace">a</span>.<span
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span><br>  </div>
                            </div>
                          </div>
                        </blockquote>
                        <br>
                        Forgot to mention, i have a history with the CFG
                        for asserts that's mostly contained in<span
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span><a class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786gmail-m_-4639077540367978006moz-txt-link-freetext"
  href="https://reviews.llvm.org/D28023"
                          target="_blank" \
moz-do-not-send="true">https://reviews.llvm.org/D28023</a><span \
class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span>and<span class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786Apple-converted-space"> \
</span><a class="gmail-m_-8988326703564405349gmail-m_-1125755170523841522gmail-m_-8854758169879196786gmail-m_-4639077540367978006moz-txt-link-freetext"
  href="https://reviews.llvm.org/D35673"
                          target="_blank" \
moz-do-not-send="true">https://reviews.llvm.org/D35673</a>.  This is the reason why i \
believe that some  implementation of assert() have a lot of CFG
                        blocks on their own, regardless of how many CFG
                        blocks does it take to evaluate the assert
                        condition.<br>
                        <br>
                      </div>
                    </blockquote>
                    <div>Thanks! :D Right now, incorrect tracking of
                      assert conditions isn't on the top of my priority
                      list, but I'll think about this when I'm sitting
                      on the bus or watering the garden. </div>
                    <blockquote class="gmail_quote" style="margin:0px
                      0px 0px 0.8ex;border-left:1px solid
                      rgb(204,204,204);padding-left:1ex">
                      <div bgcolor="#FFFFFF">
                        <blockquote type="cite">
                          <div dir="ltr">
                            <div class="gmail_quote">
                              <div><br>
                              </div>
                              <div>Regards,</div>
                              <div>Gabor<br>
                              </div>
                              <div> </div>
                              <blockquote class="gmail_quote"
                                style="margin:0px 0px 0px
                                0.8ex;border-left:1px solid
                                rgb(204,204,204);padding-left:1ex">
                                <div dir="ltr">
                                  <div><br>
                                  </div>
                                  <div>Thanks to everyone who helped me
                                    along: Artem Dergachev, Gábor
                                    Horváth, Ádám Balogh, Jakub
                                    Kuderski, and Zoltán Porkoláb!</div>
                                  <div><br>
                                  </div>
                                  <div>Cheers,</div>
                                  <div>Kristóf</div>
                                  <div><br>
                                  </div>
                                  <div>[1]  <a
                                      \
href="http://lists.llvm.org/pipermail/cfe-dev/2019-June/062535.html"  target="_blank"
                                      \
moz-do-not-send="true">http://lists.llvm.org/pipermail/cfe-dev/2019-June/062535.html</a></div>
  </div>
                              </blockquote>
                            </div>
                          </div>
                        </blockquote>
                      </div>
                    </blockquote>
                  </div>
                </div>
              </blockquote>
            </div>
            <br>
          </div>
        </blockquote>
      </div>
    </blockquote>
    <br>
  </body>
</html>


[Attachment #6 (text/plain)]

_______________________________________________
cfe-dev mailing list
cfe-dev@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev


[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic