infer_clone

Commit Graph

Author	SHA1	Message	Date
Josh Berdine	7e77bad4d2	[sledge] Change: Implement Fol using a solver-independent intermediate type Summary: In order to allow implementations of the single Fol interface using multiple backend first-order logic solvers, add explicit definitions of terms and formulas in the Fol module, and implement Context in terms of them. The Fol interface supports freely mixing Terms and Formulas, in particular there is `Term.ite : cnd:Formula.t -> thn:Term.t -> els:Term.t -> Term.t` which allows Formulas to appear in Terms. The Fol implementation performs enough normalization to enable using an internal representation of terms that is strictly partitioned into "theory terms" and "formulas", which are stratified below "conditional terms" and then below "general terms". This partitioning and stratification enables using backend solvers that do not support mixing formulas in terms. Reviewed By: jvillard Differential Revision: D22170506 fbshipit-source-id: a014ee7d7	5 years ago
Josh Berdine	eca73cf39b	[sledge] Build: Move sledge equality solver to separate lib Reviewed By: ngorogiannis Differential Revision: D22170508 fbshipit-source-id: 1e9cf4a79	5 years ago
Daiva Naudziuniene	35011757dc	[pulse] Add a flag to pass functions that we want to model as returning non-null Summary: To avoid NULLPTR_DEREFERENCE false positives we want to model some functions as returning non-null. A new flag --pulse-model-return-nonnull allows us to provide a list of such functions. Reviewed By: ezgicicek Differential Revision: D22431564 fbshipit-source-id: 9944c7382	5 years ago
Jules Villard	a89d3db364	[pulse] change recency maps to be backed by lists Summary: This one is observed to be more memory efficient. Intuitively, maps need to be re-allocated more often than lists for balancing. In pulse, we'll often only ever add new values, in increasing order (when they are fresh variables created as we symbolically execute the program), which pushes maps into their worst-case allocation pattern. At least I suspect that's what happens. With lists, this case is handled much better as lists are not re-allocated when adding elements. This is somewhat confirmed by benchmarking and observing GC stats. Reviewed By: skcho Differential Revision: D22140908 fbshipit-source-id: 29815112f	5 years ago
Daiva Naudziuniene	0ab3689f1f	[infer] NULLPTR_DEREFERENCE false positive caused by thread_local variable Summary: Keyword `thread_local` in cpp allows us to create a variable with thread storage duration, meaning that the object's lifetime begins when the thread begins and ends when the thread ends. We get `NULLPTR_DEREFERENCE` false positive for `thread_local` variable since we reallocate it in the `VariableLifetimeBegins` metadata instruction and we do not see further updates to the variable. To solve the issue we special case `VariableLifetimeBegins` instruction for global variables. Reviewed By: jvillard Differential Revision: D22284135 fbshipit-source-id: 13c14ef90	5 years ago
Dulma Churchill	85ee958bf9	[pulse] Add model for NSObject.init Summary: This model is very important in the analysis of ObjC classes because the pattern ``` - (instancetype)init { if (self = [super init]) { ... } return self; } ``` is very common, so we need to know that if the super class is `NSObject`, the implementation of `init` is returning `self`, otherwise it's a skip function and we don't get the correct spec for the function. We fix some memory leak FP with this model, see test. Reviewed By: ezgicicek Differential Revision: D22259281 fbshipit-source-id: 3ee48c827	5 years ago
Daiva Naudziuniene	2c48e61031	[pulse] A new issue type OPTIONAL_EMPTY_ACCESS for trying to access folly::Optional when it is folly::none Summary: We need to check if `folly::Optional` is not `folly::none` if we want to retrieve the value, otherwise a runtime exception is thrown: ``` folly::Optional<int> foo{folly::none}; return foo.value(); // bad ``` ``` folly::Optional<int> foo{folly::none}; if (foo) { return foo.value(); // ok } ``` This diff adds a new issue type that reports if we try to access `folly::Optional` value when it is known to be `folly::none`. Reviewed By: ezgicicek Differential Revision: D22053352 fbshipit-source-id: 32cb00a99	5 years ago
Dulma Churchill	2d4b3c9acd	[builtins] Change the name of __free_cf to the more appropriate _objc_bridge_transfer and delete the biabduction implementation Summary: This continues on the previous diff by removing the model for `__bridge_transfer` in biabduction. This also had the name __free_cf which we kept for compatibility with biabduction until now but that we can now change. Reviewed By: ezgicicek Differential Revision: D22207396 fbshipit-source-id: 7a175eca6	5 years ago
Daiva Naudziuniene	412d2777eb	[pulse] Add a flag to pass functions that we want to model as abort Summary: To avoid NULLPTR_DEREFERENCE false positives we want to treat some functions as `abort`. A new flag `--pulse-model-abort` allows us to provide a list of such functions. Reviewed By: ezgicicek Differential Revision: D21962555 fbshipit-source-id: d46b93c99	5 years ago
Ezgi Çiçek	c23e0044fc	[infer] Remove ppx_compare workaround for nonrec types (2) Summary: The past issue with ppx_compare on nonrec types has (at some point) been fixed. Greped for `let compare = compare` and removed the workaround for `nonrec`. Reviewed By: jberdine Differential Revision: D21973087 fbshipit-source-id: 5e2043e20	5 years ago
Josh Berdine	9c8f2e4a5c	[sledge] Build: Move Timer to Nonstdlib Summary: It has no dependencies on the rest of the sledge codebase and might be more generally useful. Reviewed By: jvillard Differential Revision: D21720980 fbshipit-source-id: b4f061e73	5 years ago
Jules Villard	8a1c10f8a1	remove dynamic severity: Reporting.log_{error,warning} -> log_issue Summary: See previous diff: issues are always reported with the same severity so recognise that and just use their default severity in "modern" checkers. Reviewed By: ngorogiannis Differential Revision: D21904591 fbshipit-source-id: fb5387e35	5 years ago
Dulma Churchill	aa6fe7963c	[pulse] Add dealloc calls for ObjC objects that are about to become unreachable Summary: This diff implements part of the memory management for Objective-C classes in ARC, namely that `dealloc` is called when the objects become unreachable. In reality the semantics of ARC says that this happens when their reference count becomes 0, but we are not modelling this yet in Pulse. However, we could in the future. This fixes false positives memory leaks when the memory is freed in dealloc. `dealloc` is often implicit in Objective-C, it also calls the dealloc of instance variables and superclass. None of this is implemented yet, and will be done in a future diff. This will be added in the frontend probably, similarly to how it's done for C++ destructors. This is an important part of modelling Objective-C semantics in Infer, I looked at whether this should be a preanalysis to be used by all analyses but this needs Pulse. So the idea is that any analysis that needs to understand Objective-C memory model well, should have Pulse as a preanalysis. Reviewed By: jvillard Differential Revision: D21762292 fbshipit-source-id: ced014324	5 years ago
Dulma Churchill	f638e741ae	[pulse] Add DynamicType attribute and use it in the model of ObjC alloc Summary: Adding a new attribute for dynamic type. It is set in the models of constructors, currently only in `alloc` in Objective-C. We use it in the following diff to figure out which `dealloc` method to call. However it could be useful for other things, such as dynamic dispatch. #skipdeadcode Reviewed By: jvillard Differential Revision: D21739928 fbshipit-source-id: 9276c0a4d	5 years ago
Ezgi Çiçek	964388f56c	[pulse] Brush up Collection/List add and remove models Summary: The models were too naive before since they invalidated the underlying array completely (copying C++'s push_back model), causing spurious vector invalidation issues in Java. This diff adds more reasonable models. Reviewed By: skcho Differential Revision: D21787543 fbshipit-source-id: a5a59ff69	5 years ago
Daiva Naudziuniene	98092481d4	[pulse] Special case for std::function:operator=( nullptr ) Summary: Assigning `nullptr` to `std::function` was causing `NULLPTR_DEREFERENCE` as our model was expecting to get an object in the right hand side of the assignment (`std::function::operator=`) and was dereferencing that object. Assigning `nullptr` to `std::function` removes callable object from it. We model this special case by creating a fresh value. Reviewed By: skcho Differential Revision: D21685318 fbshipit-source-id: 2d4af1933	5 years ago
Jules Villard	eab7e9aeb7	minor readability improvement in IssueType.ml Summary: - avoid creating issues just to look up their `unique_id` in the set - avoid `let _ =` since it can hide partial applications - delete outdated comment Reviewed By: skcho Differential Revision: D21663959 fbshipit-source-id: e50d02447	5 years ago
Sungkeun Cho	719b72cb4f	[pulse] Avoid partitioning abstract values Summary: `partition` always constructs two new maps, which is expensive when there are a lot of entries. Let's avoid it if possible. Reviewed By: jvillard Differential Revision: D21684298 fbshipit-source-id: a8674d358	5 years ago
Jules Villard	4e28980c8e	[errlog] reporting asserts checker matches issue-type Summary: Add an extra argument everywhere we report about the identity of the checker doing the reporting. This isn't type safe in any way, i.e. a checker can masquerade as another. But, hopefully it's enough to ensure checker writers (and diff reviewers) have a chance to reflect on what issue type they are reporting. Reviewed By: ngorogiannis Differential Revision: D21638823 fbshipit-source-id: b4a4b0c0a	5 years ago
Josh Berdine	61566caddf	[ocamlformat] Set break-sequences = true Summary: Add `break-sequences = true` to .ocamlformat and reformat. Reviewed By: jvillard Differential Revision: D21583901 fbshipit-source-id: eb4ec836c	5 years ago
Josh Berdine	65f369cf35	[ocamlformat] Reformat repo with new version Reviewed By: jvillard Differential Revision: D21583046 fbshipit-source-id: ee4793880	5 years ago
Dulma Churchill	ef7bc324e3	[pulse] Add a flag to model methods for memory ownership transfer Summary: Just like `CFBridgingRelease` we want to be able to model functions that are specific to a given codebase that make a transfer of memory ownership so that developers don't need to worry about releasing that memory anymore, and hence, we don't want to report leaks on that memory. Things get a little more complicated, because some of the functions we want to model are in a specific namespace, so with this flag we take both cases into account, when we are dealing with namespaces or not. Reviewed By: jvillard Differential Revision: D21404409 fbshipit-source-id: c36bd7afc	5 years ago
Daiva Naudziuniene	ca2ec281c7	[pulse] Model for iterator operator-- Summary: Currently we get false positive if we apply `operator--` to the `end()` iterator. To solve this, we model iterator `operator--` not to raise an error for the `EndIterator` invalidation, but to create a fresh element in the underlying array. Reviewed By: ezgicicek Differential Revision: D21476353 fbshipit-source-id: 5c722372e	5 years ago
Daiva Naudziuniene	eaf95951f5	[pulse] Modeling std::vector::end() Summary: It is undefined behavior to dereference end iterator. To catch end iterator dereferencing issues we change iterator model: instead of having `internal pointer` storing the current index, we model it as a pointer to a current index. This allows us to model `end()` iterator as having an invalid pointer and there is no need to create an invalidated element in the vector itself. Reviewed By: ezgicicek Differential Revision: D21178441 fbshipit-source-id: fd6a94b0b	5 years ago
Ezgi Çiçek	faceece120	[pulse] Brush up List.set() model Summary: We mistakenly invalidated the set element which causes spurious vector invalidation errors. Instead, we should modify it without any invalidation. Reviewed By: jvillard Differential Revision: D21521943 fbshipit-source-id: 67963967e	5 years ago
Ezgi Çiçek	5ff6fc93a0	[pulse] Brush up Java iterator models Summary: Java's iterator models were wrong. This causes `VECTOR_INVALIDATION` errors in fbandroid projects. This diff aims to fix it by modeling Java iterators with a current pointer and an underlying collection array. Reviewed By: skcho Differential Revision: D21448322 fbshipit-source-id: 7d44354b5	5 years ago
Jules Villard	041ecc5b43	rename most libraries to be more consistent Summary: - Capitalise names - Remove Infer prefixes ``` git ls-files \| grep /dune \| xargs sed -i -e 's/absint/Absint/g' -e 's/InferIR/IR/g' -e 's/InferStdlib/IStdlib/g' -e 's/InferGenerated/ATDGenerated/g' -e 's/InferBase/IBase/g' -e 's/biabduction/Biabduction/g' -e 's/nullsafe/Nullsafe/g' -e 's/\bbo\b/BO/g' -e 's/\bBo\b/BO/g' -e 's/checkers/Checkers/g' -e 's/costlib/Costlib/g' -e 's/quandary/Quandary/g' -e 's/concurrency/Concurrency/g' -e 's/pulse/Pulse/g' -e 's/labs/Labs/g' -e 's/\bjava\b/JavaFrontend/g' -e 's/\bJava\b/JavaFrontend/' -e 's/JavaStubs/JavaFrontendStubs/' -e 's/integration/Integration/g' -e 's/InferCStubs/CStubs/g' ``` Reviewed By: ngorogiannis Differential Revision: D21440820 fbshipit-source-id: 1c5d10dd4	5 years ago
Dulma Churchill	40143ab01c	[pulse] Model CFRelease as removing the Allocated attribute rather than as free Summary: Because in the real semantics CFRelease can be used more than once, and also the variables can be used after CFRelease in general, modelling this as `free` causes many `USE_AFTER_FREE` errors. Now we change the model to not add the `Invalid CFree` attribute, but to just remove the `Allocated` attribute. So we can model memory leaks in the simple case of `Create` and not `CFRelease` before going out of scope, but we avoid the `USE_AFTER_FREE`. Since the model for CFRelease now diverges from free, changed the command line option for modelling to `pulse-model-release-pattern`. Reviewed By: jvillard Differential Revision: D21324895 fbshipit-source-id: ab323d981	5 years ago
Jules Villard	e06487868b	make Reporting take a Procdesc instead of attributes Summary: This is simpler for almost all call sites. Reviewed By: ezgicicek Differential Revision: D21425591 fbshipit-source-id: 60b8d0e16	5 years ago
Sungkeun Cho	d373a81b73	[pulse] Keep only one disjunct from blacklisted function Summary: This diff gets only one disjunct from blacklisted callee, in order to avoid OOMing in specific cases. Reviewed By: jvillard Differential Revision: D21406023 fbshipit-source-id: f9214c9c6	5 years ago
Jules Villard	7e5dba718a	pulse/dune Summary: An easy one. One subtlety: I needed to name the library "pulselib" instead of "pulse" because dune got confused by the Pulse.ml module. Reviewed By: skcho Differential Revision: D21401815 fbshipit-source-id: 05e75b1fa	5 years ago
Jules Villard	a34e1a8759	bufferoverrun/dune Summary: Main change: needed to cut the dependency of inferbo on pulse, since pulse will need to depend on inferbo. Achieved by changing the ad-hoc "PulseValue" into a little less ad-hoc "ForeignVariable" variant. Reviewed By: skcho Differential Revision: D21401816 fbshipit-source-id: bb341b9ff	5 years ago
Jules Villard	f41575411c	make pulse take an `InterproceduralAnalysis.t` Summary: Needed to make pulse into a dune library. Reviewed By: skcho Differential Revision: D21401820 fbshipit-source-id: d8c758913	5 years ago
Jules Villard	d14ff99f45	[pudge] try harder to prove false Summary: This gives more precision in tests. Reviewed By: jberdine Differential Revision: D21332072 fbshipit-source-id: df20daff3	5 years ago
Jules Villard	2da04b835d	[pulse] require ptr>0 in free() Summary: Resolves a false positive. Reviewed By: skcho Differential Revision: D21332074 fbshipit-source-id: a0c962b91	5 years ago
Jules Villard	385b6fa914	[pulse] revamp arithmetic, put everything in the path condition Summary: List of things happening in this unreviewable diff: - moved PulsePathCondition to PulseSledge - renamed --pulse-path-conditions to --pudge - PulsePathCondition now contains all the arithmetic of pulse (inferbo+concrete intervals+pudge). In particular, moved arithmetic attributes into PulsePathCondition.t. PulsePathCondition plays the role of PulseArithmetic (combining all domains). - added tests for a false positive involving free() - PulseArithmetic is now just a thin wrapper around PulsePathCondition to operate on states directly (instead of on path conditions). - The rest is mostly moving code into PulsePathCondition (eg, from PulseInterproc) and adjusting it. Reviewed By: jberdine Differential Revision: D21332073 fbshipit-source-id: 184c8e0a9	5 years ago
Jules Villard	5c453393ff	[pulse] recency model for memory accesses Summary: Add a new data structure and use it for the map of memory accesses to limit the number of destinations reachable from a given address. This avoids remembering details of each index in large arrays, or even each field in large structs. Reviewed By: skcho Differential Revision: D18246091 fbshipit-source-id: 5d3974d9c	5 years ago
Jules Villard	c2ec55fe37	[pulse] remove traces from interval domain Summary: The idea was to keep track of why we know certain facts but actually these traces are never read. Other arithmetic facts (BoItv and the path condition) don't have histories so remove them from concrete intervals too. Reviewed By: dulmarod Differential Revision: D21303353 fbshipit-source-id: eecf07b05	5 years ago
Dulma Churchill	6c044ba2d4	[pulse] Model Core Foundation create and copy functions Reviewed By: jvillard Differential Revision: D21301068 fbshipit-source-id: 76a997eb2	5 years ago
Jules Villard	2d8debc562	[pulse] invalidate vector backing array correctly Summary: We were invalidating "*(vec.__infer_backing_array)" instead of the address of the field itself. Reviewed By: ezgicicek Differential Revision: D21280357 fbshipit-source-id: 48b984800	5 years ago
Jules Villard	0859f61695	make AbstractInterpreter agnostic in ProcData Summary: `ProcData.t` contains a `Summary.t`. Eventually we want to fix this too so that checkers don't depend on backend/, i.e. on all the other checkers via Summary.ml. But in order to migrate progressively we can first migrate absint/ and one step on the way is for it to not know what kind of analysis data it is passing around. This extra flexibility only costs us passing an extra `Procdesc.t` in a couple more functions so it's actually not a bad change in itself. Reviewed By: ngorogiannis Differential Revision: D21257466 fbshipit-source-id: a91f7b191	5 years ago
Jules Villard	a144c8e4df	split reporting.ml for dependencies Summary: This is a step in disentangling the various analyses: that file used to make every checker on biabduction because of a few of its functions that use biabduction datatypes. Split reporting.ml into: - Reporting.ml: the functions all checkers need to report errors. This is put in absint/ with the other files that are needed by all checkers. - SummaryReporting.ml: functions that need to depend on Summary.ml (useful for later). This is put in backend/ where Summary.ml lives. - BiabductionReporting.ml: for the biabduction analysis The rest of the changes are renames to use the appropriate module amongst the above. Reviewed By: ngorogiannis Differential Revision: D21257468 fbshipit-source-id: fa28cefbc	5 years ago
Dulma Churchill	f28d75c910	[pulse] Add model for malloc_no_fail Summary: We model `malloc` in Objective-C as `malloc_not_fail` I think because the null case is not normally handled in iOS apps because the OS will just killed the app after giving some memory warnings. So adding `malloc_not_fail` model to Pulse. Reviewed By: jvillard Differential Revision: D21278527 fbshipit-source-id: 17a5008fe	5 years ago
Dulma Churchill	fa13577695	[pulse] Model __bridge_transfer Summary: This translates the construct `ObjCBridgedCastExpr` when the cast_kind is `OBC_BridgeTransfer`, or in syntax, the cast (`__bridge_transfer`). This cast means that the object is passed from manual memory management to ARC, so one doesn't need to call `release` manually. It is important to model this to avoid false positives. It translates it as a builtin that we then model in Pulse, the same way we modelled `CFBridgingRelease` which does the same thing. The name of the builtin is `__free_cf` which is not ideal but I left it like that for compatibility with biabduction. We can change it once we remove this check from biabduction. update-submodule: facebook-clang-plugins Reviewed By: jvillard Differential Revision: D21176337 fbshipit-source-id: 736ceeb9b	5 years ago
Daiva Naudziuniene	247ecb813d	[pulse] Fix traces for iterator invalidation errors Summary: Iterator invalidation traces were based on vector rather than iterator itself. Reviewed By: ezgicicek Differential Revision: D21202047 fbshipit-source-id: 62ce8a488	5 years ago
Ezgi Çiçek	269cdb80d9	[pulse] Model `StdVector` allocator Summary: We ignored allocator models for vectors, and were not able to initialize vectors properly. This diff fixes this issue. It also adds a test which was a FN before. Reviewed By: skcho, jvillard Differential Revision: D21089492 fbshipit-source-id: 6906cd1d1	5 years ago
Dulma Churchill	c76d59853b	[pulse] Model CFBridgingRelease by removing the Allocated attribute Summary: `CFBridgingRelease` and `__bridge_transfer` which I'll model later, transfer the memory model from manual memory ref count to ARC (automatic ref count), so to avoid false positives this needs to be modelled. We can simply remove the Allocated attribute from the state, which means we won't try to track that memory anymore. Reviewed By: skcho Differential Revision: D21088218 fbshipit-source-id: 3520a0d59	5 years ago
Jules Villard	3332dc1a42	[AI] improve disjunctive domain Summary: Replace horrible hack with ok hack. The main difficulty in implementing the disjunctive domain is to avoid the quadratic time complexity of executing the same disjuncts over and over again when going around loops: First time around a loop, assuming for example a single disjunct `d`: ``` [d] loop body [d1' \/ d2'] ``` Second time around the same loop: the new pre will be the join of the posts of predecessor nodes, so `old_pre \/ post(loop,old_pre)`, i.e. `d \/ d1' \/ d2'`. Now we need to execute `loop body` again without running the symbolic execution of `d` again (and the time after that we'll want to not execute `d`, `d1'`, or `d2'`). Horrible hack (before): Disjuncts have a boolean "visited" attached that does its best to keep track of whether a given disjunct is old or new. When executing a single instruction look at the flag and skip the state if it's old. Of course we have no way to know for sure so it turns out it was often wrongly re-executing old disjuncts. This was also producing the wrong results over even simple loops: only the last iteration would make it outside the loop for some reason. Overall, the semantics were pretty untractable and shady at best. New hack (this diff): only run instructions of a given node on disjuncts that are not physically equal to the "pre" ones already in the invariant map for the current node. This gives the correct result over simple loops and a nice performance improvement in general (probably the old heuristic was hitting the quadratic bad case more often). Reviewed By: skcho Differential Revision: D21154063 fbshipit-source-id: 5ee38c68c	5 years ago
Jules Villard	edba795825	[AI] move disjunctive scheduling to AbstractInterpreter Summary: This is a preparatory diff to make the actual change more readable. This just moves the code around, trying to change it as little as possible. Reviewed By: skcho Differential Revision: D21154065 fbshipit-source-id: e086318c1	5 years ago
Jules Villard	50feb5481c	[pudge] only ask unsat when reporting Summary: Computing sledge's equality relation and normalising terms is costly. We can avoid doing that most of the time by keeping the sledge path condition lazily evaluated and only forcing it down to a value at two critical points in the analysis: 1. Summary creation, to avoid storing unsatisfiable pre/posts that will have to be needlessly executed by callers. This also saves us from having to serialise the closures involved in the uncomputed form of lazy values inside the pulse summaries. 2. Before reporting errors we check in the state is in fact satisfiable. If not we just prune it away at that point. This yields ~4x speedup on some targets. Reviewed By: ezgicicek Differential Revision: D21129759 fbshipit-source-id: a75fdd3bc	5 years ago
Jules Villard	822a78c576	[pudge] lazily compute sledge stuff Summary: This is mostly just a type change for now, more changes to come. This doesn't make thing much faster yet because we force computations pretty often to check for unsatisfiability (each function call and PRUNE node). Next diff will build on that. Reviewed By: skcho Differential Revision: D21129758 fbshipit-source-id: 72200e2b1	5 years ago
Jules Villard	3220804ddb	[pulse] add a cache of constants to equate them Summary: When encountering a constant, pulse creates an abstract value (a variable) to represent it, and remembers that it's equal to it. The problem is that pulse doesn't yet know how to deal with the fact that some variables are going to be equal to each other. This hacks around this issue in the case of constants, within the same procedure, by remembering which constants have been assigned to which place-holder variables, and serving those variables again when the same constant is translated again. Limitation: this doesn't work across procedure calls as the "constant maps" are not saved in summaries. Something to look out for: we don't want to make `if (p == NULL)` create a path where `p` is invalid (we only make null invalid when we see an assignment from 0, i.e. `p = NULL;`). Reviewed By: ezgicicek Differential Revision: D21089961 fbshipit-source-id: 5ebb85d0a	5 years ago
Daiva Naudziuniene	dae7f36339	[pulse] Vector iterator model Summary: Modeling vector iterator with two internal fields: an internal array and an internal pointer. The internal array field points to the internal array field of a vector; the internal pointer field represents the current element of the array. For now `operator++` creates a fresh element inside the array. Reviewed By: ezgicicek Differential Revision: D21043304 fbshipit-source-id: db3be49ce	5 years ago
Jules Villard	36f44f030d	[pudge] spit out sledge replay tests Summary: Also add an infer option to enable sledge timers. Reviewed By: jberdine Differential Revision: D20871159 fbshipit-source-id: d4ea0e9f2	5 years ago
Jules Villard	7a888170e7	[pudge] it's alive! Summary: Add a path condition to each symbolic state, represented in sledge's arithmetic domain. This gives a precise account of arithmetic constraints. In particular, it is relation and thus is more robust in the face of inter-procedural analysis. This is gated behind a flag for now as there are performance issues with the new arithmetic. Reviewed By: jberdine Differential Revision: D20393947 fbshipit-source-id: b780de22a	5 years ago
Dulma Churchill	2d168f75a6	[pulse] Add options for modelling alloc models and free models from user-defined regexes. Reviewed By: jvillard Differential Revision: D21039304 fbshipit-source-id: a43b17235	5 years ago
Jules Villard	6247437296	[pulse] unified API for arithmetic Summary: Instead of having to remember to update both the inferbo and the concrete intervals domains of pulse, hide these details under a unified API. This should help the transition to adding a third(!) numerical domain later on (pudge!). Reviewed By: ezgicicek Differential Revision: D21022920 fbshipit-source-id: 783157464	5 years ago
Jules Villard	0a8ad85596	[pulse][minor] rename AbductiveDomain.Domain -> AbductiveDomain.PostDomain Summary: To be more explicit and symmetric with PreDomain. Reviewed By: ezgicicek Differential Revision: D21022925 fbshipit-source-id: 51885a291	5 years ago
Jules Villard	af2aaf2a14	[pulse][minor] remove skipped_calls getter Summary: Now that the shape of the record type of AbductiveDomain.t is known, we don't need this getter anymore. Keep `get_pre` and `get_post` as they perform useful casting to `BaseDomain.t`. Reviewed By: ezgicicek Differential Revision: D21022924 fbshipit-source-id: 340f4edf8	5 years ago
Jules Villard	bb9726bbd7	[pulse] enforce short forms for PulseDomainInterface Summary: See previous diff. Reviewed By: ezgicicek Differential Revision: D21022923 fbshipit-source-id: b1cab2fdc	5 years ago
Jules Villard	94e3b06900	[pulse] enforce short forms for PulseBasicInterface Summary: The "interface" modules define short forms for the internals of pulse and also serve as a guide of which modules you are supposed to use at which "level" in the pulse domains (base domain vs abductive domain vs higher-level PulseOperations.ml). Make sure they are used. Reviewed By: skcho Differential Revision: D21022927 fbshipit-source-id: f890df245	5 years ago
Jules Villard	a0d1fee1dc	[pulse] move SkippedCalls to its own file Summary: Seems logical. Reviewed By: ezgicicek Differential Revision: D21022922 fbshipit-source-id: 1b8546332	5 years ago
Jules Villard	c00de7ad27	[pulse] move interproc call to its own file Summary: PulseAbductiveDomain.ml can be split into two distinct parts: 1. The definition of the "abductive domain" itself. This remains in that file. 2. How to apply a given pre/post pair to the current state (during a function call). This is about the same size as 1. in terms of lines of code(!) and is now in PulseInterproc.ml. Reviewed By: ezgicicek Differential Revision: D21022921 fbshipit-source-id: 431fe061e	5 years ago
Jules Villard	9ed10d435b	[pulse][minor] simplify rewriting of callee post attributes Summary: I'm moving this code in the next diff and need this refactor. It should be the same as before. Reviewed By: ezgicicek Differential Revision: D21022926 fbshipit-source-id: ebe644ef9	5 years ago
Dulma Churchill	2382e3d613	[pulse] Model Core Graphics Create and Copy just like malloc Summary: Unify the models of malloc and for the Create and Copy functions for Core Graphics. This add the null case from the malloc model to the Core Graphics models. Reviewed By: jvillard Differential Revision: D20890956 fbshipit-source-id: 278ac9d2f	5 years ago
Dulma Churchill	59ea968de8	[pulse] Model the correct CFAutorelease Reviewed By: ezgicicek Differential Revision: D20941777 fbshipit-source-id: 150924949	5 years ago
Ezgi Çiçek	e1093159b0	[pulse] Distinguish error state at top level Summary: As soon as pulse detects an error, it completely stops the analysis and loses the state where the error occurred. This makes it difficult to debug and understand the state the program failed. Moreover, other analyses that might build on pulse (e.g. impurity), cannot access the error state. This diff aims to restore and display the state at the time of the error in `PulseExecutionState` along with the diagnostic by extending it as follows: ``` type exec_state = \| represents the state at the program point that caused an error ) ``` As a result, since we don't immediately stop the analysis as soon as we find an error, we detect both errors in conditional branches simultaneously (see test result changes for examples). NOTE: We need to extend `PulseOperations.access_result` to keep track of the failed state as follows: ``` type 'a access_result = ('a, Diagnostic.t t [denoting the exit state] ) result ``` Reviewed By: jvillard Differential Revision: D20918920 fbshipit-source-id: 432ac68d6	5 years ago
Dulma Churchill	b29d1a2f5f	[pulse] Adding new value history for allocations Reviewed By: jvillard Differential Revision: D20914622 fbshipit-source-id: f32836a95	5 years ago
Ezgi Çiçek	5a2b285fff	[pulse] Distinguish exit state at top level Summary: This diff lifts the `PulseAbductiveDomain.t` in `PulseExecutionState` by tracking whether the program continues the analysis normally or exits unusually (e.g. by calling `exit` or `throw`): ``` type exec_state = \| ContinueProgram of PulseAbductiveDomain.t (** represents the state at the program point ) \| ExitProgram of PulseAbductiveDomain.t (* represents the state originating at exit/divergence. *) ``` Now, Pulse's actual domain is tracked by `PulseExecutionState` and as soon as we try to analyze an instruction at `ExitProgram`, we simply return its state. The aim is to recover the state at the time of the exit, rather than simply ignoring them (i.e. returning empty disjuncts). This allows us to get rid of some FNs that we were not able to detect before. Moreover, it also allows the impurity analysis to be more precise since we will know how the state changed up to exit. TODO: - Impurity analysis needs to be improved to consider functions that simply exit as impure. - The next goal is to handle error state similarly so that when pulse finds an error, we recover the state at the error location (and potentially continue to analyze?). Disclaimer: currently, we handle throw statements like exit (as was the case before). However, this is not correct. Ideally, control flow from throw nodes follows catch nodes rather than exiting the program entirely. Reviewed By: jvillard Differential Revision: D20791747 fbshipit-source-id: df9e5445a	5 years ago
Dulma Churchill	dba4140a7b	[pulse] Adding null case to malloc's model Summary: Malloc returns either an allocated object or a null pointer if there is no memory available. Modelling that. This has always been a bit contentious because this leads to NPEs that people often ignores because they don't care. But if we don't model this, then we have FPs when people do take this into account when freeing the memory. Reviewed By: jvillard Differential Revision: D20791692 fbshipit-source-id: 6fd259f12	5 years ago
Dulma Churchill	271946a178	[pulse] Model release functions from Core Graphics and Core Foundation Summary: Modelling `CG.*Release ` and `CFRelease` as `free`. This is what we were doing in biabduction. Reviewed By: skcho Differential Revision: D20767174 fbshipit-source-id: c77c1cdc6	5 years ago
Dulma Churchill	6f2b52fcc7	[pulse] Model Core Graphics create and copy functions Summary: This models all the Create and Copy functions from CoreGraphics, examples in the tests. These functions all allocate memory that needs to be manually released. The modelling of the release functions will happen in a following diff. Until then, we have some false positives in the tests. This check is currently in biabduction, and we aim to move it to Pulse. Reviewed By: jvillard Differential Revision: D20626395 fbshipit-source-id: b39eae2d9	5 years ago
Jules Villard	6dc0894eef	[pulse][models] add the proc name being matched to the context Summary: This will be needed in a future diff. Reviewed By: dulmarod Differential Revision: D20772937 fbshipit-source-id: ce836cd07	5 years ago
Dulma Churchill	902514dccd	[pulse] Add unreachable point to the trace of memory leaks Summary: When looking at some reports I realised that adding the place where the memory becomes unreachable to the trace makes it more readable. Reviewed By: skcho Differential Revision: D20790277 fbshipit-source-id: d5df69e68	5 years ago
Ezgi Çiçek	d97e1c8fdb	[pulse][impurity] Add model for System.exit() Summary: - Model `System.exit()` as early_exit and add a test - Tweak message of methods that are impure due to having no pulse summary (and add a test) Reviewed By: skcho Differential Revision: D20668979 fbshipit-source-id: 6b5589aae	5 years ago
Ezgi Çiçek	f7baf845fd	[pulse] Fix printing order in contradiction for CItv and add tests Summary: - the order of call state was wrong when printing contradiction for CItv - add a test for impurity Reviewed By: jvillard Differential Revision: D20646181 fbshipit-source-id: 1c86fd0a4	5 years ago
Dulma Churchill	e99295e0e9	[pulse] Memory leak check Summary: First version of a new memory leak check based on Pulse. The idea is to examine unreachable cells in the heap and check that the "Allocated" attribute is available but the "Invalid CFree" isn't. This is done when we remove variables from the state. Currently it only works for malloc, we can extend it to other allocation functions later. Reviewed By: jvillard Differential Revision: D20444097 fbshipit-source-id: 33b6b25a2	5 years ago
Ezgi Çiçek	7ca2fcc948	[pulse][purity] Add more naive models for Java Summary: - Add more naive pulse models for: - `System.arraycopy` - `StringBuilder.setLength` - `StringBuilder.delete` - Model the following as pure - `SparseArrayCompat.valueAt` - `File.get...` - Add a nice test Reviewed By: jvillard Differential Revision: D20513397 fbshipit-source-id: 6d412d13a	5 years ago
Ezgi Çiçek	25c058f706	[deadcode] Fix deadcode Summary: `make deadcode` is failing on master but our CI jobs didn't catch it :( Let's fix existing deadcode for now. Reviewed By: martintrojer Differential Revision: D20510062 fbshipit-source-id: 4a5e5f849	5 years ago
Ezgi Çiçek	cc815f5d20	[pulse] Only propagate existing WrittenTo attributes at function calls Summary: Previously, at each function call, we added a `WrittenTo` attribute for applying the address of the actuals. However, this results in mistakenly considering each function application that inspects its argument as impure. Instead, we should only propagate `WrittenTo` if the actuals have already `WrittenTo` attributes. For instance, for the following functions ``` public static boolean is_null(Byte a) { return a == null; } public static boolean call_is_null(Byte a) { return is_null(a); } ``` We used to get the following pulse summary for `call_is_null` (showing only one of the disjuncts): ``` #0: PRE: { roots={ &a=v1 }; mem ={ v1 -> { * -> v2 } }; attrs={ v1 -> { MustBeValid }, v2 -> { Arith =null, BoItv ([max(0, v2), min(0, v2)]) } };} POST: { roots={ &a=v1, &return=v8 }; mem ={ v1 -> { * -> v2 }, v8 -> { * -> v4 } }; attrs={ v2 -> { Arith =null, BoItv ([max(0, v2), min(0, v2)]), WrittenTo-----------WRONG }, v4 -> { Arith =1, BoItv (1), Invalid ConstantDereference(is the constant 1), WrittenTo-----------WRONG }, v8 -> { WrittenTo } };} SKIPPED_CALLS: { } ``` where we mistakenly recorded a `WrittenTo` for `v2` (what `a` points to). As a result, we considered `call_is_null` as impure :( This diff fixes that since the callee `is_null` doesn't have any `WrittenTo` attributes for its parameter `a`. So, we don't propagate `WrittenTo` and get the following summary ``` #0: PRE: { roots={ &a=v1 }; mem ={ v1 -> { * -> v2 } }; attrs={ v1 -> { MustBeValid }, v2 -> { Arith =null, BoItv ([max(0, v2), min(0, v2)]) } };} POST: { roots={ &a=v1, &return=v8 }; mem ={ v1 -> { * -> v2 }, v8 -> { * -> v4 } }; attrs={ v2 -> { Arith =null, BoItv ([max(0, v2), min(0, v2)]) }, v4 -> { Arith =1, BoItv (1), Invalid ConstantDereference(is the constant 1) }, v8 -> { WrittenTo } };} SKIPPED_CALLS: { } ``` Reviewed By: skcho Differential Revision: D20490102 fbshipit-source-id: 253d8ef64	5 years ago
Ezgi Çiçek	b372befee4	[pulse] Add more naive Java models Summary: This diff naively models the following as `StdVector.push_back`: - `StringBuilder.append` - `String.replace` - `Queue.poll` It also adds a FN test for `Iterator.next`. Reviewed By: skcho Differential Revision: D20469786 fbshipit-source-id: 2d8e8d117	5 years ago
Ezgi Çiçek	a65176de22	[pulse] Print SkippedCalls Summary: Let's also print skipped calls in `pp` to ease debugging both for summary and intermediate steps. Reviewed By: jvillard Differential Revision: D20417852 fbshipit-source-id: 7da03ae81	5 years ago
Dulma Churchill	d1923dcd71	[pulse] Changed the name of BaseDomain signature to avoid a name clash Summary: There is a module and a module type in the file PulseAbductiveDomain.ml with the same name. This is confusing and it's better to keep separate names. Reviewed By: jvillard Differential Revision: D20388769 fbshipit-source-id: bcfed436e	5 years ago
Jules Villard	3ba91fd596	[pulse] refactor of PrePost.t vs AbductiveDomain.t Summary: Be a bit more careful about the difference between PrePost.t and AbductiveDomain.t. It's needed in another diff where the types will be different. Reviewed By: ezgicicek Differential Revision: D20393927 fbshipit-source-id: beaf80c90	5 years ago
Jules Villard	7861752bf3	[pulse] rename "PulseArithmetic" to "PulseCItv" Summary: In preparation for PulseArithmetic to be something else. Reviewed By: ezgicicek Differential Revision: D20393928 fbshipit-source-id: d93131e12	5 years ago
Ezgi Çiçek	e3c89b1f10	[impurity] Fix include_value_history Summary: D20362149 missed - to pass the optional argument `include_value_history` to the recursive call in `PulseTrace.add_to_errlog`. - to set `include_value_history=false` for skipped calls. This diff fixes these issues. Reviewed By: skcho Differential Revision: D20385604 fbshipit-source-id: 176e4d010	5 years ago
Dulma Churchill	2f90b05c2a	[pulse] Add model for malloc Summary: Adding a model for malloc: we add an attribute "Allocated". This can be used for implementing memory leaks: whenever the variables get out of scope, we can check that if the variable has an attribute Allocated, it also has an attribute Invalid CFree. Possibly we will need more details in the Allocated attribute, to know if it's malloc, or other allocation function, but we can add that later when we know how it should look like. Reviewed By: jvillard Differential Revision: D20364541 fbshipit-source-id: 5e667a8c3	5 years ago
Ezgi Çiçek	b90d7c42d3	[impurity] Do not add value history in impurity traces Summary: Impurity traces are quite big due to recording values histories. Let's simplify the traces by removing pulse's value histories. Reviewed By: skcho Differential Revision: D20362149 fbshipit-source-id: 8a2a6115e	5 years ago
Ezgi Çiçek	c6237f5f9f	[pulse] Add model for Object.clone() Summary: This diff adds a model for Java's `Object.clone()` method (similar to existing shallow_copy). Reviewed By: jvillard Differential Revision: D20341073 fbshipit-source-id: 30ae40fe7	5 years ago
Ezgi Çiçek	c144761a26	[pulse] Pull skipped calls into AbductiveDomain Summary: We don't need skipped calls for pre and post. Let's pull them out to `PulseAbductiveDomain`, next to pre and post. Reviewed By: jvillard Differential Revision: D20283589 fbshipit-source-id: 5cf970292	5 years ago
Ezgi Çiçek	5f8e6233bb	[pulse] Take into account skipped calls for state comparison Summary: We forgot to take skipped calls into account for state comparison. This diff fixes that. Reviewed By: skcho Differential Revision: D20282739 fbshipit-source-id: 7b4d84bb0	5 years ago
Ezgi Çiçek	562a43621c	[pulse] Remove NoJoin sig from PulseBaseDomain Summary: `PulseBaseDomain.leq` is never called but was there to satisfy the signature of `NoJoin` which itself was not needed. This diff removes `include NoJoin` and instead just adds signature for `pp` in `PulseBaseDomain`. Reviewed By: jvillard Differential Revision: D20280104 fbshipit-source-id: 8e3659280	5 years ago
Jules Villard	826fd8a999	[pulse] monad, monads everywhere Summary: Add let*/+ syntax to `result` types to simplify all the applications of `>>=`, `>>\|` that are followed by a binding (eg `>>= fun x -> ...`) in pulse. Reviewed By: skcho Differential Revision: D19940728 fbshipit-source-id: 4df159029	5 years ago
Jules Villard	72f560036d	[pulse] formal/actual length mismatch is a contradiction Summary: We can already tell that a summary cannot be applied by raising `Contradiction`, so use this mechanism to stop applying a summary if the number of formals doesn't match the number of actuals provided. Previously we would return an option type and `None` in case of mismatch, on top of the `raise Contradiction` mechanism (used for aliasing and arithmetic contradictions). This changes the behaviour of pulse in this case: before we would skip over the function call, but now we stop the analysis. Reviewed By: dulmarod Differential Revision: D19940729 fbshipit-source-id: 6def40cd6	5 years ago
Ezgi Çiçek	239a5302f6	[pulse] Add more models for Java Summary: Adding naive models. Reviewed By: skcho Differential Revision: D19743521 fbshipit-source-id: a5a080a85	5 years ago
Ezgi Çiçek	040442c93b	[pulse] Don't write through pointer arguments in Java Summary: Pulse has an extra invalidation mechanism (introduced in D18726203) to prevent something invalid (e.g. `null`) to be passed by reference to an initialisation function. Therefore, it havocs formals passed by reference to skipped functions. However, I don't think this makes sense in Java. So, let's turn it off. A nice consequence of this is that in impurity analysis, we do not consider functions that call skipped library calls with object arguments as writing to their formals. Reviewed By: skcho Differential Revision: D19697110 fbshipit-source-id: 6e3a71f2a	5 years ago
Ezgi Çiçek	4677584018	[pulse] Remove map suffix from SkippedCalls Reviewed By: jvillard Differential Revision: D19555827 fbshipit-source-id: 8ebc2f41d	5 years ago
Ezgi Çiçek	a0fd5a0e6a	[pulse] Refactor attributes into domain Summary: Let's move attributes into Pulse's domain. Reviewed By: jvillard Differential Revision: D19533915 fbshipit-source-id: 995fd12da	5 years ago
Jules Villard	a8b2c58bfb	[pulse] new option to turn pulse back into an intra-procedural analysis Summary: To run experiments. Reviewed By: jberdine Differential Revision: D19411491 fbshipit-source-id: 9e9490d5e	5 years ago
Ezgi Çiçek	426b7dfe51	[pulse] Track skipped functions Summary: Let's collect the list of all skipped functions with a `proc_name` but no summary in Pulse's memory. This will be useful for the impurity analysis later (next diff). Concretely, we extend Pulse's domain with a map from skipped calls to their respective traces. For efficiency, we only keep a single trace per skipped call. For impurity analysis, tracking skipped calls in Pulse allows us to rely on Pulse's strong memory model to get rid of infeasible paths as opposed to creating an independent checker which wouldn't be able to do that. Reviewed By: jvillard Differential Revision: D19428426 fbshipit-source-id: 3c5e482c5	5 years ago

1 2 3 4 5 ...

288 Commits (b3c74c4152357dda224a4e5c110aaf953d8ba3fa)