infer_clone

Commit Graph

Author	SHA1	Message	Date
Radu Grigore	a6edb94450	Biabduction prover now logs inconsistency reason. Summary: I found this information useful in some debugging. Reviewed By: jvillard Differential Revision: D15875456 fbshipit-source-id: 4ca51068c	6 years ago
Jules Villard	7f12ced394	[pulse] move to SIL proper Summary: [apologies for the unreviewable diff...] Get rid of HIL expressions in pulse. This finishes the HIL -> SIL migration. The first step made pulse start from SIL instructions but would translate most accesses to HIL to re-use most of the existing pulse code. This diff gets rid of the intermediate translation of SIL expressions to HIL expressions. Big changes: 1. `PulseOperations` mostly rewritten, driven by using `Exp.t` instead of `HilExp.AccessExpression.t` for everything. 2. Stop trying to reverse-engineer what addresses mean in terms of access paths from program variables. Rely on the trace pointing at the right places in the code to be enough. This is because it wasn't that useful (and could even be misleading when wrong) but could be prohibitively expensive in degenerate cases (eg nodes with tens of thousands of successive array accesses...) 3. `PulseAbductiveDomain.apply_post` now returns the computed return value instead of recording it itself. 4. Change of vocabulary: `materialize` -> `eval`, `crumb` -> `event` 5. Function calls arguments are now evaluated prior to doing anything else, which saves everything else from having to (remember to) do that. In particular, this changes how models look quite a bit. Reviewed By: mbouaziz Differential Revision: D15986373 fbshipit-source-id: 1d79935de	6 years ago
Timotej Kapus	4ac252120b	[sledge] special case buck-target-patterns Summary: For buck targets that contain at least one of the substrings in `buck-target-pattern` option in config, change the buck target to add `_sledge` suffix. Reviewed By: jberdine Differential Revision: D15920018 fbshipit-source-id: 44c242e99	6 years ago
Nikos Gorogiannis	97c41120ae	[buck/java2] eliminate project root config flag Summary: Passing an absolute project path as buck config flag makes buck caching almost impossible for infer artefacts, since on every host/run that directory can be different. Eliminate that and rely on shell commands to find the project root, executed within the genrule. Reviewed By: jvillard Differential Revision: D15963807 fbshipit-source-id: b6e590029	6 years ago
Josh Berdine	0f5ae186b3	[sledge] Add test for use-after-destroy of a temp Summary: And fix test Makefile to call the C++ compiler on .cpp files. Reviewed By: kren1 Differential Revision: D15972426 fbshipit-source-id: 719de755f	6 years ago
Josh Berdine	a58bc25aa5	[sledge] Strengthen simplification of convert Exps Summary: Simplify all conversions between castable types to the identity. The backend treats castable types as equal, so distinguishing conversions between them is incomplete. Reviewed By: kren1 Differential Revision: D15972427 fbshipit-source-id: fa09859ac	6 years ago
Josh Berdine	cc1f88a747	[sledge] Fix macos build of models Reviewed By: ngorogiannis Differential Revision: D15965940 fbshipit-source-id: a50882a70	6 years ago
Timotej Kapus	6949a5ee68	[sledge] Add a todo for calls with inttoptr Reviewed By: ngorogiannis Differential Revision: D15965374 fbshipit-source-id: bbee029d7	6 years ago
Artem Pianykh	33424c12ac	[infra] Fix deadcode check target Summary: Some functions exposed in ScubaLogging interface were not used outside of ScubaLogging and caused deadcode to fail. Reviewed By: ngorogiannis Differential Revision: D15964204 fbshipit-source-id: d823dbf8b	6 years ago
Josh Berdine	2b5bbcb784	[build] Do not remove Makefile.config in conf-clean Summary: Makefile.config is checked in, don't clean it. Reviewed By: jvillard Differential Revision: D15964240 fbshipit-source-id: bac38317e	6 years ago
Dino Distefano	571ae7774a	Extended check on n-th parameter to cpp method calls Reviewed By: jvillard Differential Revision: D15940525 fbshipit-source-id: 1b4b5d9ed	6 years ago
Josh Berdine	b14580d88b	[sledge] Move locals from blocks to functions Summary: The entry block contains all locals of the entire function, as required by the backend. This makes the manipulation of the locals of each block redundant. This diff moves the locals from the entry block to the function itself, removes the Locals frames of the Control.Stack, and adds a locals field to Return frames. This is part cleanup and part preparation for removing the Control.Stack. Reviewed By: ngorogiannis Differential Revision: D15963503 fbshipit-source-id: 523ebc260	6 years ago
Artem Pianykh	9d9df458b6	[infra] Add Config.execution_id and log it to scuba as a normal Reviewed By: mityal Differential Revision: D15920237 fbshipit-source-id: f0117b5f8	6 years ago
Artem Pianykh	046132b4c5	[infra] Collect low-prio logs during execution and flush them to Scuba at the end in one go Reviewed By: mityal Differential Revision: D15898726 fbshipit-source-id: e8609f10d	6 years ago
Radu Grigore	3de7acada4	[topl] tiny fixes to tracing output Reviewed By: jvillard Differential Revision: D15875300 fbshipit-source-id: 4427946b9	6 years ago
Radu Grigore	10d87eec4e	[topl] Simple error reporting. Reviewed By: jvillard Differential Revision: D15875271 fbshipit-source-id: 148206be9	6 years ago
Mehdi Bouaziz	0efd8960e1	[Tenv] Maximum sharing Summary: Reduces the size of the `tenv` by sharing values as most as possible, in an untyped - but supposedly safe - way, by using black magic on objects. Can be reused for other things later. Reviewed By: ngorogiannis Differential Revision: D15855870 fbshipit-source-id: 169a4b86b	6 years ago
Radu Grigore	384b3c5798	Assert that there is at most one flowgraph per procedure name. Reviewed By: jvillard Differential Revision: D15695839 fbshipit-source-id: 979531edb	6 years ago
Timotej Kapus	86e12cb1a3	[sledge] Add missing llvm passes to frontend.ml Summary: Adds `-mergefunc` and `-dce` passes to `Frontend.translate` to match the `buck link` flow with `opt` Reviewed By: ngorogiannis Differential Revision: D15938641 fbshipit-source-id: 128cb89cd	6 years ago
Mehdi Bouaziz	5f8514a8c2	[sqlite] Normalize blobs used for comparison Summary: Using `Marshal.to_string` to create SQLite values used in comparisons is brittle as there is no guarantee that it will return the same value for structurally equal values. When adding sharing, this will definitely break. From the SQLite queries I found, only `SourceFile` and `Procname` are used in comparisons. I haven't tested performance. It shouldn't change anything for `SourceFile` as there is no possible sharing. It shouldn't change much for `Procname` as they are pretty small anyway. Reviewed By: ngorogiannis Differential Revision: D15923122 fbshipit-source-id: ce4af1fe3	6 years ago
Josh Berdine	330b266d28	[sledge] Rework function return value passing Summary: The current handling of the formal return variable scope is not correct. Since it is passed as an actual argument to the return continuation, it is manipulated as if it was a local variable of the caller. However, its scope is not ended with the caller's locals, leading to clashes. This diff reworks the passing of return values to avoid this problem, mainly by introducing a notion of temporary variables during parameter passing. This essentially has the effect of taking a function spec { P } f(x) { λv. Q } and generating a "temporary" variable v, applying the post λv. Q to it to obtain the pre-state for the call to the return continuation k(v). Being a temporary variable just means that it goes out of scope just after parameter passing. This amounts to a long-winded way of applying the post-state to the formal parameter of the return continuation without violating scopes or SSA. This diff also separates the manipulation of the symbolic states as they proceed from: 1. the pre-state before the return instruction; 2. the exit-state after the return instruction (including the binding of the returned value to the return formal variable); 3. the post-state, where the locals are existentially quantified; and 4. the return-state, which is expressed in terms of actual args instead of formal parameters. Also in support of summarization, formal return and throw parameters are no longer tracked on the analyzer's stack. Note that these changes involve changing the locals of blocks and functions to no longer include the formal parameters. Reviewed By: kren1 Differential Revision: D15912148 fbshipit-source-id: e41dd6e42	6 years ago
Ezgi Çiçek	2db1a3b8e3	[cost,inferBo] Add models for Collections.unmodifiable* getters Reviewed By: ngorogiannis Differential Revision: D15901108 fbshipit-source-id: fa399412a	6 years ago
Jules Villard	f43544598b	[oops] unbreak unit tests Summary: Something unfortunate happened. Reviewed By: mbouaziz Differential Revision: D15898642 fbshipit-source-id: 12bdde37a	6 years ago
Timotej Kapus	01e6c5c558	[sledge] [solver] add handling of trivial equality Summary: The solver couldn't deal with `∃ a,b . a = b` , so this diff adds a special case to deal with it. Reviewed By: ngorogiannis Differential Revision: D15897953 fbshipit-source-id: d841d3557	6 years ago
Jules Villard	04233ee49b	[clang] destroy C++ temporaries Summary: Inject destructor calls to destroy a temporary when its lifetime ends. Reviewed By: mbouaziz Differential Revision: D15674209 fbshipit-source-id: 0f783a906	6 years ago
Jules Villard	0592bac25e	[pulse] explain SIL logical variables in terms of program access paths Summary: Now that HIL doesn't help us anymore we need to reconstruct its mapping "SIL logical var -> program access path". We already have everything we need in pulse: it suffices to walk the current memory graph starting from program variables until we find the value of the temporary we are interested in. This diff also builds some type machinery to make sure all accesses are explained. Reviewed By: mbouaziz Differential Revision: D15824959 fbshipit-source-id: 722c81b39	6 years ago
Jules Villard	c9f4768be7	[pulse] move to SIL Summary: It turns out HIL gets in the way of a precise heap analysis. For instance, instead of: ``` n$0 = &x.f _ = delete(&x) &y = n$0 ``` HIL tries hard to forget about intermediate variables and shows instead ``` _ = delete(&x) &y = &x.f ``` Oops, that's a use-after-delete, whereas the original code was safe. While it's easy to write SIL programs that are completely unsound for HIL, they are not generated very often from the frontends. In fact, the problem became apparent only when making the clang frontend translate C++ temporaries destructors, which produces the situation above routinely. This diff makes the minimal amount of change to make Pulse build and produce equivalent results (minus HIL bugs) starting from SIL instead of HIL. The reporting sucks for now because we need to translate SIL temporaries back into program access paths. This is done in the next diff. Reviewed By: mbouaziz Differential Revision: D15824961 fbshipit-source-id: 8e4e2a3ed	6 years ago
Jules Villard	695b493b56	[pulse] move [PulseTrace] inside [PulseDomain] Summary: Just moving code around. This is needed later to make some types in `PulseTrace` depend on a new that I'll have to define in `PulseDomain`. Also, this gives better names all around I think Reviewed By: mbouaziz Differential Revision: D15881281 fbshipit-source-id: e86c1472e	6 years ago
Mehdi Bouaziz	b03aeb49c2	[eradicate] remove the constant flag only_keep_intersection Reviewed By: jeremydubreil Differential Revision: D9563663 fbshipit-source-id: ad6045fe7	6 years ago
Timotej Kapus	a75a50215b	[sledge] Add LLVM passes that reduce bitcode size Summary: : This patch adds several passes that reduce the amount of bitcode making sledge's job easier, more info: https://llvm.org/docs/Passes.html `-mergefunc` This pass merges functions that do the same thing, this can be because of templating or casts (ie. same functionality but on 32bit and 64bit ints, which is the same in machine code). More details at http://llvm.org/docs/MergeFunctions.html Note that this pass is currently not available through C/OCaml API. `-constmerge` This merges constants that have the same value, this is possible to do when the constants are internalized. `-argpromotion` ``` This pass promotes “by reference” arguments to be “by value” arguments. In practice, this means looking for internal functions that have pointer arguments. If it can prove, through the use of alias analysis, that an argument is only loaded, then it can pass the value into the function instead of the address of the value. This can cause recursive simplification of code and lead to the elimination of allocas (especially in C++ template code like the STL). ``` `-ipsccp` ``` Sparse conditional constant propagation and merging, which can be summarized as: Assumes values are constant unless proven otherwise Assumes BasicBlocks are dead unless proven otherwise Proves values to be constant, and replaces them with constants Proves conditional branches to be unconditional ``` `-deadargelim` Removes dead arguments of internal functions, good to run after other inter-procedural passes. Seems to crash llvm if run directly after `ipsccp`. Note that while this might look like doing full link-time optimisation, we are actually picking relatively cheap optimisations that mostly look at globals and walk their use chains. The main reason link-time optimisations are expensive is due to inlining and then running the full optimisation again from there. Reviewed By: jberdine Differential Revision: D15851408 fbshipit-source-id: be7191683	6 years ago
Jules Villard	512b42ece7	[pulse] move PulseInvalidation inside PulseDomain Summary: Just moving code around. This is needed later to make some types in `PulseInvalidation` depend on a new type that I'll have to define in `PulseDomain`. Reviewed By: mbouaziz Differential Revision: D15824962 fbshipit-source-id: 86cba2bfb	6 years ago
Jules Villard	457b017343	[pulse] more general graph visitor API Summary: Make it possible to re-use the graph visitor to compute all sorts of things with a flexible API where you can pass a function that folds over all addresses reachable from certain stack variables (specified with a filter) and gets passed the access path that leads to each address. This is used in later commits. Reviewed By: mbouaziz Differential Revision: D15824960 fbshipit-source-id: c424a71cb	6 years ago
Ezgi Çiçek	fedb8e5136	[infer] Cleanup preanalysis Summary: Preanalysis is performed at the frontend now. Hence, we don't need to repeatedly check/set when/if it is performed. Reviewed By: mbouaziz Differential Revision: D15863175 fbshipit-source-id: f9c6b7ae1	6 years ago
Nikos Gorogiannis	013d153538	[buck/java2] hashcons the global tenv during merging Summary: One "interesting" feature of the approach of merging the captured targets in Java, is that we union their type environments, as opposed to store partial tenvs together with each source file, which is the case for Clang. This means - the final global type environment is potentially huge because it contains all the types in all targets. - all analysis workers start by loading that tenv in memory, meaning we consume `\|size of tenv\| x #cpus` memory, which can tip the balance towards OOMs This diff attempts to economise on global tenv size. This is done by increasing sharing which is then preserved by marshalling. It's done in a brute force way, with hashtables for each struct component, and is not fully effective due to the recursion amongst types and types names, as well types appearing inside other constructs such as procnames. This is done when calling `Tenv.store` so that - the computation can be parallelised somewhat (capture is parallel, merging is not) - buck caching will benefit from smaller tenvs. This saves about 24% of total memory devoted to the type environment. Reviewed By: mbouaziz Differential Revision: D15840054 fbshipit-source-id: 6f03be1a4	6 years ago
Nikos Gorogiannis	8776a31f7d	[infer][buck capture] kill dead code Reviewed By: jvillard Differential Revision: D15851551 fbshipit-source-id: 0a23d062a	6 years ago
Ezgi Çiçek	898dd104c8	[cost] Invoke Cost issues only once Reviewed By: mbouaziz Differential Revision: D15853454 fbshipit-source-id: 41ec36392	6 years ago
Ezgi Çiçek	0f43930f40	[cost] Refactor cost issue types and enable detecting allocation complexity increase on cold start Summary: - Add allocation costs to `costs-report.json` and enable diffing over allocation costs. - Also, let's be more consistent and modular in naming our cost issues. - introduce a generic issue type `X_TIME_COMPLEXITY_INCREASE` where `X` can be one of the cost kinds. If the function is on the cold start, issue can have the `COLD_START` suffix. Similarly for infinite/zero/expensive calls. - Change `PERFORMANCE_VARIATION` -> `EXECUTION_TIME_COMPLEXITY_INCREASE` - Add new issue type for `ALLOCATION_COMPLEXITY_INCREASE_COLD_START` which will be enabled by default - Refactor cost issues to be more modular and succinct. This also makes addition of a new cost kind very easy by adding the kind into the `enabled_cost_kinds` list in `CostKind.ml` Reviewed By: mbouaziz Differential Revision: D15822681 fbshipit-source-id: cf89ece59	6 years ago
Jules Villard	6f5cb512db	[pulse] add example of FN in const-ref-bound temporary Summary: This one isn't caught because we don't destruct temporaries that are bound to a const reference. According to the C++ standard these should get destroyed when the const reference gets destroyed but instead we just don't destroy them for now. Reviewed By: mbouaziz Differential Revision: D15760209 fbshipit-source-id: 32c935ec0	6 years ago
Jules Villard	e14809baa8	[pulse] fix temporaries test code Summary: A test was claiming to be ok but wasn't. Reviewed By: mbouaziz Differential Revision: D15695944 fbshipit-source-id: 58772a793	6 years ago
Jules Villard	21f66dd197	[pulse] do not model `operator=` as assignment Summary: In a next diff temporaries will get destructed at the end of their lifetimes and that naive model would be causing false positives. The flipside is that we lose all reports on closures for now, will need to model them separately later. Reviewed By: mbouaziz Differential Revision: D15695943 fbshipit-source-id: c2c482c02	6 years ago
Jules Villard	ab427fd3f3	[clang] cache of names of C++ temporaries Summary: Needed for next diff: we'll need to do 2 passes on the AST to collect the temporaries to destroy at the end of an `ExprWithCleanups`, but the SIL names of these temporaries are generated freshly on the fly so they would get different names if we do it naively. This adds a hashmap to the translation context so the temporary corresponding to a given `MaterializeTemporyExpr` is only generated once and then reused. Reviewed By: mbouaziz Differential Revision: D15674212 fbshipit-source-id: 0e16062d9	6 years ago
Jules Villard	a9a7239831	[clang] split `inject_destructors` into two functions Summary: Simple refactor, needed for next diff. Reviewed By: mbouaziz Differential Revision: D15674210 fbshipit-source-id: 4d30104fd	6 years ago
Jules Villard	db800f138b	[clang] rewrite scope computations Summary: This started as an attempt to understand how to modify the frontend to inject destructors for C++ temporaries (see next diffs). This diff rewrites the existing logic for computing the list of variables that should be destroyed at the end of each statement, either because it's the end of their syntactic scope or because control flow branches outside of their syntactic scope. The frontend translates a function from the last instructions to the first, but scope computation needs to be done in the other direction, so it's done in a separate pass before the main translation happens. That first pass creates a map from statements in the AST to the list of variables that should be destroyed at the end of these statements. This is still the case now. Before, that map would be computed in a bit of a weird way: scopes are naturally a stack but instead of that the structure maintained was a flat list + a counter to know where the current scope ended in that list. In this diff, redo the computation maintaining a stack of scopes instead, which is a bit cleaner. Also treat more instructions as introducing a new scope, eg if, for, ... Reviewed By: mbouaziz Differential Revision: D15674208 fbshipit-source-id: c92429e82	6 years ago
Jules Villard	eaa5c32432	[clang] some more debug info Summary: Somewhat trivial: add a string to "Destruction" nodes to indicate why they were created. Rename the main `instruction_aux` function into `instruction_translate` (see next diff for why). Reviewed By: mbouaziz Differential Revision: D15674211 fbshipit-source-id: 8a7eda72c	6 years ago
Jules Villard	c3d55817b1	[pulse] another test for temporaries Summary: I rewrote the test so it doesn't need any C++ headers so that: - it's easier to see what's going on - it's easier to debug: the whole AST is now somewhat readable vs before the headers made it impossibly long Reviewed By: ezgicicek Differential Revision: D15674213 fbshipit-source-id: d98941983	6 years ago
Timotej Kapus	1614f78f6d	[sledge] Add a harness for lionhead fuzzers Summary: This diff introduces a `-lib-fuzz` flag to `buck link`, which links in a simple main that calls the LLVMFuzzerTestOneInput function, which is the entry point of libFuzzer fuzzer. Reviewed By: jberdine, jvillard Differential Revision: D15821512 fbshipit-source-id: cff731ed3	6 years ago
Jules Villard	696731523d	[pname dispatcher] more permissive templated function match Summary: This allows to match `foo<int_&>` and many other horrible names. Reviewed By: mbouaziz Differential Revision: D15825403 fbshipit-source-id: c892033aa	6 years ago
Dino Distefano	472f155a7a	Improved rule on block capturing CXX Reference Reviewed By: jvillard Differential Revision: D15737387 fbshipit-source-id: efe677b3d	6 years ago
Ezgi Çiçek	be85296759	[frontend] Move Preanalysis to frontend so that it is run always Summary: I realized that there was a discrepancy in the # of instructions between whether we run a single analysis or multiple analyses at the same time. It turns out that in biabduction, bufferoverrun and other HIL analyses we did Preanalysis step (which adds scope instructions and invokes liveness etc.) but not in others. This discrepancy results in inconsistent analysis results (e.g. in the new inefficient-keyset-iterator) that rely on instructions. We should be consistent. Hence, we now invoke Preanalysis in the frontend and remove all other uses in the rest of the checkers. Consequently, I had to update the inefficient-keyset-checker to take the CFG resulting from Preanalysis with extra scoping instructions. Reviewed By: mbouaziz, ngorogiannis, jvillard Differential Revision: D15803492 fbshipit-source-id: 4e21eb610	6 years ago
Timotej Kapus	46f5667823	[sledge] Relax call instruction arguments Summary: Previous change to allow bitcasts in call instructions was too strict and did not allow for indirect calls. Reviewed By: jberdine Differential Revision: D15803262 fbshipit-source-id: 40d828b59	6 years ago

... 11 12 13 14 15 ...

6888 Commits (127ba72982da279420fe41b3e70420e533aecfb7) All Branches Search

6888 Commits (127ba72982da279420fe41b3e70420e533aecfb7)

All Branches