infer_clone

Commit Graph

Author	SHA1	Message	Date
Sungkeun Cho	3685cc6fdd	[pulse] Revise trace of uninitialized value check Summary: This diff revises the trace generation of the uninitialized value checker, by introducing a new diagnostics for it. Reviewed By: jvillard Differential Revision: D25433775 fbshipit-source-id: 1279c0de4	4 years ago
Sungkeun Cho	fa29098376	[pulse] Inter-procedural uninit analysis Summary: This diff supports inter-procedural uninit analysis in pulse. * Added `MustBeInitialized` attribute to pre state when an address is read * Remove `Uninitialized` attribute when callee has `WrittenTo` for the same address Reviewed By: jvillard Differential Revision: D25368492 fbshipit-source-id: cbc74d4dc	4 years ago
Sungkeun Cho	0cbe2f9b08	[pulse] Uninitialized value check in pulse Summary: This diff adds uninitialized value check in pulse. For now, it supports only simple cases, - declared variables with a type of integer, float, void, and pointer - malloced pointer variables that points to integer, float, void, and pointer TODOs: I will add more cases in the following diffs. - declared/malloced array - declared/malloced struct - inter-procedural checking Reviewed By: jvillard Differential Revision: D25269073 fbshipit-source-id: 317df9a85	4 years ago
Sungkeun Cho	2d8f404a5f	Revert "[Refactoring] Use pvar instead of mangled in CapturedVar" Reviewed By: jvillard Differential Revision: D25337049 fbshipit-source-id: ba9d85193	4 years ago
Jules Villard	5423bb1699	[pulse] make sure we checked satisfiability on summaries Summary: Made `AbductiveDomain.summary_of_post` return a Sat/Unsat to make sure callers filter unsat summaries. Also made `ExitProgram` take a summary instead of a non-normalized abstract state, which was wrong (mostly could litter the disjuncts with infeasible paths). Reviewed By: skcho Differential Revision: D25277565 fbshipit-source-id: 72dacb944	4 years ago
Jules Villard	98b562c844	[pulse][refactor] extract and reuse a `SatUnsat` module Summary: Use the new module to represent both Sat/Unsat from Pulse formulas, and FeasiblePath/InfeasiblePath from PulseReport. Reviewed By: jberdine Differential Revision: D25277566 fbshipit-source-id: 9f8412ca9	4 years ago
Sungkeun Cho	49bfd206a0	[Refactoring] Use pvar instead of mangled in CapturedVar Summary: This diff uses Pvar.t in CapturedVar.t, so that * it can include additional info in Pvar.t * it can avoid some `Pvar.mk` calls when using the captured variables Reviewed By: jvillard Differential Revision: D25331763 fbshipit-source-id: 4e0c2ab4a	4 years ago
Sungkeun Cho	6972d9cb91	[Refactoring] Use inline record for captured variable type Reviewed By: jvillard Differential Revision: D25331728 fbshipit-source-id: 323e5d08f	4 years ago
Daiva Naudziuniene	4e658903ae	[pulse] Check the validity of the addresses captured by lambda only for captures by reference Summary: To look for captured variable address escape we should only check the validity of the addresses captured by reference. Checking the validity of the address captured by value can cause nullptr dereference false positives. Reviewed By: jvillard Differential Revision: D25219347 fbshipit-source-id: faf6f2b00	4 years ago
Radu Grigore	009f3b651c	[topl] Small steps in Pulse Summary: A Topl "small step" is a call to a method that is of interest to the automaton. When such a call of interest is made, the topl component of PulseAbductiveDomain.t is updated. This means that intra-procedural Topl should now work entirely inside Pulse, without instrumenting Sil. Main TODOs: - add error extraction - implement inter-procedural (PulseTopl.large_step) Reviewed By: jvillard Differential Revision: D25028286 fbshipit-source-id: e31a96d13	4 years ago
Jules Villard	f411c7d131	[pulse] do not stop at the first error in function calls Summary: We deliberately stopped as soon as an error was detected when applying a function call. This is not good as other pre/posts of the function may apply cleanly, which would allow us to cover more behaviours of the code. Went on a bit of a refactoring tangeant while fixing this, to clarify the `Ok None`/`Ok Some _`/`Error _` datatype returned by PulseInterproc. Now we report errors as soon as we find them during function calls but continue accumulating specs afterwards. Reviewed By: da319 Differential Revision: D24888768 fbshipit-source-id: d5f2c29d7	4 years ago
Jules Villard	47e9f8ffec	[pulse][easy] code factorisation Summary: Two bits of code were re-implementing [eval_access]. Reviewed By: skcho Differential Revision: D24832080 fbshipit-source-id: f6b0723c6	4 years ago
Nikos Gorogiannis	8c40ed53d7	[typ] move type constants to own module Summary: Cleanup `Typ` by moving all constant types to `StdTyp`. Also remove `Typ.typ` as it's just `Typ.t` now. Reviewed By: jberdine Differential Revision: D24620397 fbshipit-source-id: 4764f87ef	5 years ago
Jules Villard	2bdc6b892b	[pulse] apply Abort specs too Summary: Before this diff we would just propagate the callee abstract state, which doesn't make sense in the caller. We could just remove the state from AbortProgram altogether as Pulse itself doesn't use it, but for now let's at least make sure it's accurate. Also needed for upcoming hackathon that will start from Pulse error specs to try to produce tests :) Reviewed By: ezgicicek Differential Revision: D24448073 fbshipit-source-id: 9100b3f79	5 years ago
Jules Villard	0b7e2fb7c7	[pulse] more type-safety around summary and latent issues creation Summary: This makes sure we call `AbductiveDomain.summary_of_post` exactly once per post-condition. Notice in particular in the diff: - in Pulse.ml we remove a now-certified-useless "is_unsat_expensive" call - in PulseOperations.ml we add a previously-missing call to `summary_of_post` (it's needed to remove local variables from the symbolic state + normalize) The price to pay is ugly type annotations and down-casting peppered in a few places, in reasonable number. Reviewed By: da319 Differential Revision: D24078564 fbshipit-source-id: 3102cacf0	5 years ago
Jules Villard	7fdb33b710	[pulse] report errors only when the PRUNE nodes along the path are true Summary: Take another page from the Incorrectness Logic book and refrain from reporting issues on paths unless we know for sure that this path will be taken. Previously, we would report on paths that are merely not impossible. This goes very far in the other direction, so it's possible we'll want to go back to some sort of middle ground. Or maybe not. See the changes in the tests to get a sense of what we're missing. Reviewed By: ezgicicek Differential Revision: D24014719 fbshipit-source-id: d451faf02	5 years ago
Daiva Naudziuniene	d0cb245303	[frontend] Fix capture init for cpp lambdas Summary: We were missing assignment to captured variables with initializers. Consider the following example: ``` S* update_inside_lambda_capture_and_init(S* s) { S* object = nullptr; auto f = [& o = object](S* s) { o = s; }; f(s); return object; } ``` which was translated to ``` VARIABLE_DECLARED(o:S&); &o:S&=&object &f =(_fun...lambda..._operator(),([by ref]&o &o:S&)) ``` However, we want to capture `o` (which is an address of `object`), rather `&o` in closure. After the diff ``` VARIABLE_DECLARED(o:S&); &o:S&=&object n$7=&o:S& &f =(_fun...lambda..._operator(),([by ref]n$7 &o:S&)) ``` Reviewed By: jvillard Differential Revision: D23567346 fbshipit-source-id: 20f77acc2	5 years ago
Daiva Naudziuniene	29fd9e13d1	[pulse] Understand captured variables in cpp lambdas Summary: When we evaluate lambdas in pulse, we create a closure object with `fake` fields to store captured variables. However, during the function call we were not linking the captured values from the closure object. We address this missing part here. Reviewed By: jvillard Differential Revision: D23316750 fbshipit-source-id: 14751aa58	5 years ago
Jules Villard	c7305245c5	[istd][minor] no need to name ~fold in fold_of_pervasives_map_fold Summary: It's typically used inside another ~fold argument and it gets too verbose. Reviewed By: da319 Differential Revision: D22846501 fbshipit-source-id: 2fdd4271f	5 years ago
Daiva Naudziuniene	50d659b750	Update type of procdesc and closure expression to contain information about capture variable mode Summary: We update the type of captured variables to include information about capture mode (`ByReference` or `ByValue`) both for procdesc attributes and the closure expression. For lambda: closure expression now contains correct capture mode for capture variables. Procdesc still does not contain information about captured variables which we will address in the next diff. For objc blocks: at the moment all captured variables have mode `ByReference`. Added TODOs to fix this. Reviewed By: jvillard Differential Revision: D22572054 fbshipit-source-id: 4c88678ee	5 years ago
Jules Villard	a89d3db364	[pulse] change recency maps to be backed by lists Summary: This one is observed to be more memory efficient. Intuitively, maps need to be re-allocated more often than lists for balancing. In pulse, we'll often only ever add new values, in increasing order (when they are fresh variables created as we symbolically execute the program), which pushes maps into their worst-case allocation pattern. At least I suspect that's what happens. With lists, this case is handled much better as lists are not re-allocated when adding elements. This is somewhat confirmed by benchmarking and observing GC stats. Reviewed By: skcho Differential Revision: D22140908 fbshipit-source-id: 29815112f	5 years ago
Dulma Churchill	aa6fe7963c	[pulse] Add dealloc calls for ObjC objects that are about to become unreachable Summary: This diff implements part of the memory management for Objective-C classes in ARC, namely that `dealloc` is called when the objects become unreachable. In reality the semantics of ARC says that this happens when their reference count becomes 0, but we are not modelling this yet in Pulse. However, we could in the future. This fixes false positives memory leaks when the memory is freed in dealloc. `dealloc` is often implicit in Objective-C, it also calls the dealloc of instance variables and superclass. None of this is implemented yet, and will be done in a future diff. This will be added in the frontend probably, similarly to how it's done for C++ destructors. This is an important part of modelling Objective-C semantics in Infer, I looked at whether this should be a preanalysis to be used by all analyses but this needs Pulse. So the idea is that any analysis that needs to understand Objective-C memory model well, should have Pulse as a preanalysis. Reviewed By: jvillard Differential Revision: D21762292 fbshipit-source-id: ced014324	5 years ago
Dulma Churchill	f638e741ae	[pulse] Add DynamicType attribute and use it in the model of ObjC alloc Summary: Adding a new attribute for dynamic type. It is set in the models of constructors, currently only in `alloc` in Objective-C. We use it in the following diff to figure out which `dealloc` method to call. However it could be useful for other things, such as dynamic dispatch. #skipdeadcode Reviewed By: jvillard Differential Revision: D21739928 fbshipit-source-id: 9276c0a4d	5 years ago
Sungkeun Cho	719b72cb4f	[pulse] Avoid partitioning abstract values Summary: `partition` always constructs two new maps, which is expensive when there are a lot of entries. Let's avoid it if possible. Reviewed By: jvillard Differential Revision: D21684298 fbshipit-source-id: a8674d358	5 years ago
Josh Berdine	65f369cf35	[ocamlformat] Reformat repo with new version Reviewed By: jvillard Differential Revision: D21583046 fbshipit-source-id: ee4793880	5 years ago
Daiva Naudziuniene	eaf95951f5	[pulse] Modeling std::vector::end() Summary: It is undefined behavior to dereference end iterator. To catch end iterator dereferencing issues we change iterator model: instead of having `internal pointer` storing the current index, we model it as a pointer to a current index. This allows us to model `end()` iterator as having an invalid pointer and there is no need to create an invalidated element in the vector itself. Reviewed By: ezgicicek Differential Revision: D21178441 fbshipit-source-id: fd6a94b0b	5 years ago
Ezgi Çiçek	5ff6fc93a0	[pulse] Brush up Java iterator models Summary: Java's iterator models were wrong. This causes `VECTOR_INVALIDATION` errors in fbandroid projects. This diff aims to fix it by modeling Java iterators with a current pointer and an underlying collection array. Reviewed By: skcho Differential Revision: D21448322 fbshipit-source-id: 7d44354b5	5 years ago
Sungkeun Cho	d373a81b73	[pulse] Keep only one disjunct from blacklisted function Summary: This diff gets only one disjunct from blacklisted callee, in order to avoid OOMing in specific cases. Reviewed By: jvillard Differential Revision: D21406023 fbshipit-source-id: f9214c9c6	5 years ago
Jules Villard	f41575411c	make pulse take an `InterproceduralAnalysis.t` Summary: Needed to make pulse into a dune library. Reviewed By: skcho Differential Revision: D21401820 fbshipit-source-id: d8c758913	5 years ago
Jules Villard	5c453393ff	[pulse] recency model for memory accesses Summary: Add a new data structure and use it for the map of memory accesses to limit the number of destinations reachable from a given address. This avoids remembering details of each index in large arrays, or even each field in large structs. Reviewed By: skcho Differential Revision: D18246091 fbshipit-source-id: 5d3974d9c	5 years ago
Jules Villard	c2ec55fe37	[pulse] remove traces from interval domain Summary: The idea was to keep track of why we know certain facts but actually these traces are never read. Other arithmetic facts (BoItv and the path condition) don't have histories so remove them from concrete intervals too. Reviewed By: dulmarod Differential Revision: D21303353 fbshipit-source-id: eecf07b05	5 years ago
Jules Villard	2d8debc562	[pulse] invalidate vector backing array correctly Summary: We were invalidating "*(vec.__infer_backing_array)" instead of the address of the field itself. Reviewed By: ezgicicek Differential Revision: D21280357 fbshipit-source-id: 48b984800	5 years ago
Dulma Churchill	c76d59853b	[pulse] Model CFBridgingRelease by removing the Allocated attribute Summary: `CFBridgingRelease` and `__bridge_transfer` which I'll model later, transfer the memory model from manual memory ref count to ARC (automatic ref count), so to avoid false positives this needs to be modelled. We can simply remove the Allocated attribute from the state, which means we won't try to track that memory anymore. Reviewed By: skcho Differential Revision: D21088218 fbshipit-source-id: 3520a0d59	5 years ago
Jules Villard	3220804ddb	[pulse] add a cache of constants to equate them Summary: When encountering a constant, pulse creates an abstract value (a variable) to represent it, and remembers that it's equal to it. The problem is that pulse doesn't yet know how to deal with the fact that some variables are going to be equal to each other. This hacks around this issue in the case of constants, within the same procedure, by remembering which constants have been assigned to which place-holder variables, and serving those variables again when the same constant is translated again. Limitation: this doesn't work across procedure calls as the "constant maps" are not saved in summaries. Something to look out for: we don't want to make `if (p == NULL)` create a path where `p` is invalid (we only make null invalid when we see an assignment from 0, i.e. `p = NULL;`). Reviewed By: ezgicicek Differential Revision: D21089961 fbshipit-source-id: 5ebb85d0a	5 years ago
Daiva Naudziuniene	dae7f36339	[pulse] Vector iterator model Summary: Modeling vector iterator with two internal fields: an internal array and an internal pointer. The internal array field points to the internal array field of a vector; the internal pointer field represents the current element of the array. For now `operator++` creates a fresh element inside the array. Reviewed By: ezgicicek Differential Revision: D21043304 fbshipit-source-id: db3be49ce	5 years ago
Jules Villard	7a888170e7	[pudge] it's alive! Summary: Add a path condition to each symbolic state, represented in sledge's arithmetic domain. This gives a precise account of arithmetic constraints. In particular, it is relation and thus is more robust in the face of inter-procedural analysis. This is gated behind a flag for now as there are performance issues with the new arithmetic. Reviewed By: jberdine Differential Revision: D20393947 fbshipit-source-id: b780de22a	5 years ago
Jules Villard	6247437296	[pulse] unified API for arithmetic Summary: Instead of having to remember to update both the inferbo and the concrete intervals domains of pulse, hide these details under a unified API. This should help the transition to adding a third(!) numerical domain later on (pudge!). Reviewed By: ezgicicek Differential Revision: D21022920 fbshipit-source-id: 783157464	5 years ago
Jules Villard	bb9726bbd7	[pulse] enforce short forms for PulseDomainInterface Summary: See previous diff. Reviewed By: ezgicicek Differential Revision: D21022923 fbshipit-source-id: b1cab2fdc	5 years ago
Jules Villard	94e3b06900	[pulse] enforce short forms for PulseBasicInterface Summary: The "interface" modules define short forms for the internals of pulse and also serve as a guide of which modules you are supposed to use at which "level" in the pulse domains (base domain vs abductive domain vs higher-level PulseOperations.ml). Make sure they are used. Reviewed By: skcho Differential Revision: D21022927 fbshipit-source-id: f890df245	5 years ago
Jules Villard	c00de7ad27	[pulse] move interproc call to its own file Summary: PulseAbductiveDomain.ml can be split into two distinct parts: 1. The definition of the "abductive domain" itself. This remains in that file. 2. How to apply a given pre/post pair to the current state (during a function call). This is about the same size as 1. in terms of lines of code(!) and is now in PulseInterproc.ml. Reviewed By: ezgicicek Differential Revision: D21022921 fbshipit-source-id: 431fe061e	5 years ago
Ezgi Çiçek	e1093159b0	[pulse] Distinguish error state at top level Summary: As soon as pulse detects an error, it completely stops the analysis and loses the state where the error occurred. This makes it difficult to debug and understand the state the program failed. Moreover, other analyses that might build on pulse (e.g. impurity), cannot access the error state. This diff aims to restore and display the state at the time of the error in `PulseExecutionState` along with the diagnostic by extending it as follows: ``` type exec_state = \| represents the state at the program point that caused an error ) ``` As a result, since we don't immediately stop the analysis as soon as we find an error, we detect both errors in conditional branches simultaneously (see test result changes for examples). NOTE: We need to extend `PulseOperations.access_result` to keep track of the failed state as follows: ``` type 'a access_result = ('a, Diagnostic.t t [denoting the exit state] ) result ``` Reviewed By: jvillard Differential Revision: D20918920 fbshipit-source-id: 432ac68d6	5 years ago
Ezgi Çiçek	5a2b285fff	[pulse] Distinguish exit state at top level Summary: This diff lifts the `PulseAbductiveDomain.t` in `PulseExecutionState` by tracking whether the program continues the analysis normally or exits unusually (e.g. by calling `exit` or `throw`): ``` type exec_state = \| ContinueProgram of PulseAbductiveDomain.t (** represents the state at the program point ) \| ExitProgram of PulseAbductiveDomain.t (* represents the state originating at exit/divergence. *) ``` Now, Pulse's actual domain is tracked by `PulseExecutionState` and as soon as we try to analyze an instruction at `ExitProgram`, we simply return its state. The aim is to recover the state at the time of the exit, rather than simply ignoring them (i.e. returning empty disjuncts). This allows us to get rid of some FNs that we were not able to detect before. Moreover, it also allows the impurity analysis to be more precise since we will know how the state changed up to exit. TODO: - Impurity analysis needs to be improved to consider functions that simply exit as impure. - The next goal is to handle error state similarly so that when pulse finds an error, we recover the state at the error location (and potentially continue to analyze?). Disclaimer: currently, we handle throw statements like exit (as was the case before). However, this is not correct. Ideally, control flow from throw nodes follows catch nodes rather than exiting the program entirely. Reviewed By: jvillard Differential Revision: D20791747 fbshipit-source-id: df9e5445a	5 years ago
Dulma Churchill	6f2b52fcc7	[pulse] Model Core Graphics create and copy functions Summary: This models all the Create and Copy functions from CoreGraphics, examples in the tests. These functions all allocate memory that needs to be manually released. The modelling of the release functions will happen in a following diff. Until then, we have some false positives in the tests. This check is currently in biabduction, and we aim to move it to Pulse. Reviewed By: jvillard Differential Revision: D20626395 fbshipit-source-id: b39eae2d9	5 years ago
Dulma Churchill	e99295e0e9	[pulse] Memory leak check Summary: First version of a new memory leak check based on Pulse. The idea is to examine unreachable cells in the heap and check that the "Allocated" attribute is available but the "Invalid CFree" isn't. This is done when we remove variables from the state. Currently it only works for malloc, we can extend it to other allocation functions later. Reviewed By: jvillard Differential Revision: D20444097 fbshipit-source-id: 33b6b25a2	5 years ago
Jules Villard	7861752bf3	[pulse] rename "PulseArithmetic" to "PulseCItv" Summary: In preparation for PulseArithmetic to be something else. Reviewed By: ezgicicek Differential Revision: D20393928 fbshipit-source-id: d93131e12	5 years ago
Dulma Churchill	2f90b05c2a	[pulse] Add model for malloc Summary: Adding a model for malloc: we add an attribute "Allocated". This can be used for implementing memory leaks: whenever the variables get out of scope, we can check that if the variable has an attribute Allocated, it also has an attribute Invalid CFree. Possibly we will need more details in the Allocated attribute, to know if it's malloc, or other allocation function, but we can add that later when we know how it should look like. Reviewed By: jvillard Differential Revision: D20364541 fbshipit-source-id: 5e667a8c3	5 years ago
Jules Villard	826fd8a999	[pulse] monad, monads everywhere Summary: Add let*/+ syntax to `result` types to simplify all the applications of `>>=`, `>>\|` that are followed by a binding (eg `>>= fun x -> ...`) in pulse. Reviewed By: skcho Differential Revision: D19940728 fbshipit-source-id: 4df159029	5 years ago
Ezgi Çiçek	040442c93b	[pulse] Don't write through pointer arguments in Java Summary: Pulse has an extra invalidation mechanism (introduced in D18726203) to prevent something invalid (e.g. `null`) to be passed by reference to an initialisation function. Therefore, it havocs formals passed by reference to skipped functions. However, I don't think this makes sense in Java. So, let's turn it off. A nice consequence of this is that in impurity analysis, we do not consider functions that call skipped library calls with object arguments as writing to their formals. Reviewed By: skcho Differential Revision: D19697110 fbshipit-source-id: 6e3a71f2a	5 years ago
Ezgi Çiçek	4677584018	[pulse] Remove map suffix from SkippedCalls Reviewed By: jvillard Differential Revision: D19555827 fbshipit-source-id: 8ebc2f41d	5 years ago
Ezgi Çiçek	a0fd5a0e6a	[pulse] Refactor attributes into domain Summary: Let's move attributes into Pulse's domain. Reviewed By: jvillard Differential Revision: D19533915 fbshipit-source-id: 995fd12da	5 years ago

1 2 3

114 Commits (3685cc6fddbb30c028c6616ec961e34084ce40e3)