infer_clone

Commit Graph

Author	SHA1	Message	Date
Josh Berdine	661db9db76	[sledge] Implement Map.find_and_remove more directly Summary: No need for exceptions. Reviewed By: jvillard Differential Revision: D18736382 fbshipit-source-id: 684bf896c	5 years ago
Josh Berdine	30aa8aa3b9	[sledge] Basic definitions for monadic binding operators Reviewed By: jvillard Differential Revision: D18736378 fbshipit-source-id: b9a2a6c0c	5 years ago
Josh Berdine	cfbbacf9f1	[sledge] Improve using extended open Summary: OCaml 4.08 supports `open` of arbitrary module expressions. This enables some code simplifications. Reviewed By: ngorogiannis Differential Revision: D18736381 fbshipit-source-id: c729dcfbd	5 years ago
Josh Berdine	7ed8a6a260	[sledge] Simplify and improve using local subst in sigs Summary: OCaml 4.08 supports a form of signature-local bindings, to that a type can be defined in order to be used in other definitions, without being part of the signature itself. Reviewed By: ngorogiannis, jvillard Differential Revision: D18736380 fbshipit-source-id: 0bb043de6	5 years ago
Josh Berdine	b22d8b4151	[sledge] Simplify using shadowing of modules from includes Summary: OCaml 4.08 supports shadowing modules from includes, which enables some simplification. Reviewed By: ngorogiannis Differential Revision: D18736379 fbshipit-source-id: 646e2c07c	5 years ago
Josh Berdine	b1a6928a50	[sledge] Avoid wildcard exception handler Reviewed By: ngorogiannis Differential Revision: D18736383 fbshipit-source-id: 7c87307ff	5 years ago
Josh Berdine	f2be1cbed0	[sledge] Hashtbl.Key has been deprecated in favor of Hashtbl.Key.S Reviewed By: jvillard Differential Revision: D18708465 fbshipit-source-id: b7ae04fe9	5 years ago
Josh Berdine	b5915db605	[sledge] Clear terminal between builds in watch mode Summary: Otherwise it is difficult to tell the difference between compilation errors from previous versus current builds. Reviewed By: ngorogiannis Differential Revision: D18736376 fbshipit-source-id: 2e583f4ba	5 years ago
Josh Berdine	48fd99d48f	[sledge] Avoid matching on Not_found Summary: base 0.13 breaks it (sometimes). Reviewed By: jvillard Differential Revision: D18717807 fbshipit-source-id: e27a4d6da	5 years ago
Josh Berdine	e201e517c9	[sledge][NFC] Refactor to avoid an unused open warning Summary: OCaml 4.08 has a new warning (66) on unused `open!` statements. This has a suboptimal interaction with `ppx_let`'s `let%map_open` which leads to triggering the warning if any of a group of such let bindings does not need the open. In this case, the refactor is easy. But, warning 66 is very dubious, so also just switch it off. Reviewed By: jvillard Differential Revision: D18708466 fbshipit-source-id: 77618ab6e	5 years ago
Josh Berdine	6c5d9d4acb	[sledge] Remove dependency on ppx_import Summary: It seems to be effectively unmaintained, as it still doesn't support 4.08. Reviewed By: jvillard Differential Revision: D18708467 fbshipit-source-id: dcb3361fc	5 years ago
Josh Berdine	9d7580b5cd	[sledge] Remove ocamlformat from dev-tools.opam Reviewed By: jvillard Differential Revision: D18571069 fbshipit-source-id: ec82e4661	5 years ago
Josh Berdine	e3734d3d2c	[sledge] Fix bug in Term.solve Summary: Term.solve makes the assumption that all distinct normalized constants denote distinct values. This is fragile at best, and it is better to enumerate the cases where solve discovers inconsistency. Reviewed By: jvillard Differential Revision: D18459619 fbshipit-source-id: 71f52557c	5 years ago
Josh Berdine	28e4c74426	[sledge] Fix bug in Equality.or_ Summary: Equality.or_ assumed a simpler representation of equality relations, and was incomplete as a result. Reviewed By: jvillard Differential Revision: D18298138 fbshipit-source-id: cf91229f6	5 years ago
Josh Berdine	8d20e4d64d	[ocamlformat] Upgrade ocamlformat version Reviewed By: jvillard Differential Revision: D18162727 fbshipit-source-id: ffb9f7541	5 years ago
Josh Berdine	52380b017c	[sledge][NFC] Simplify Term rec module Summary: Remove one duplicate of auxilliary type definitions. Reviewed By: ngorogiannis Differential Revision: D18298141 fbshipit-source-id: cfc5076c3	5 years ago
Josh Berdine	1f64634093	[sledge] Simplify type conversions Summary: The treatment of type conversions is too complicated, non-uniform, etc. This diff attempts to simplify things by separating integer to integer conversions, which are interpreted, from others, which are essentially just uninterpreted functions. Integer conversions are now handled using two expression and term forms: Signed and Unsigned. These each interpret their argument as either a signed or unsigned number of a given bitwidth: ``` \| Signed of {bits: int} (** [Ap1 (Signed {bits= n}, dst, arg)] is [arg] interpreted as an [n]-bit signed integer and injected into the [dst] type. That is, it two's-complement--decodes the low [n] bits of the infinite two's-complement encoding of [arg]. The injection into [dst] is a no-op, so [dst] must be an integer type with bitwidth at least [n]. ) \| Unsigned of {bits: int} (* [Ap1 (Unsigned {bits= n}, dst, arg)] is [arg] interpreted as an [n]-bit unsigned integer and injected into the [dst] type. That is, it unsigned-binary--decodes the low [n] bits of the infinite two's-complement encoding of [arg]. The injection into [dst] is a no-op, so [dst] must be an integer type with bitwidth greater than [n]. ) \| Convert of {src: Typ.t} (* [Ap1 (Convert {src}, dst, arg)] is [arg] converted from type [src] to type [dst], possibly with loss of information. The [src] and [dst] types must be [Typ.convertible] and must not both be [Integer] types. *) ``` Reviewed By: ngorogiannis Differential Revision: D18298140 fbshipit-source-id: 690f065b4	5 years ago
Josh Berdine	e6d93dcf94	[sledge][NFC] Simplify term tests Summary: Added `open Term` and simplified. Reviewed By: ngorogiannis Differential Revision: D18298139 fbshipit-source-id: 3a5fed25b	5 years ago
Scott Owens	1bd290634b	[sledge sem] Update integer conversions to new LLAIR Summary: LLAIR changed how it represents integer-to-integer conversions, and this updates the semantics and proofs to show that the new way is correct. Reviewed By: jberdine Differential Revision: D18448616 fbshipit-source-id: b657fcd20	5 years ago
Scott Owens	f68258ca73	[sledge sem] Update sanity proof for LLAIR convert Summary: The old version of simp_convert in LLAIR had a bug, but the sanity theorem didn't catch it because it didn't enforce that the result fit into the size it should have. This updates to newer version of simp_convert and adds a theorem that the result fits. Reviewed By: jberdine Differential Revision: D18346833 fbshipit-source-id: 533c836bf	5 years ago
Benno Stein	beb99932c3	[sledge] Handle more LLAIR expressions in APRON interval analysis Summary: Extend the APRON-backed interval analysis to handle a wider range of LLAIR expressions. Reviewed By: jvillard Differential Revision: D17858072 fbshipit-source-id: c50f5bf20	5 years ago
Josh Berdine	752b8ab56a	[sledge] Fix normalization of Convert terms Summary: In some cases the result of an integer conversion needs to be truncated by a bit. Differential Revision: D18271179 fbshipit-source-id: e80740045	5 years ago
Scott Owens	5caa19990b	[sledge sem] Improve a comment Reviewed By: jberdine Differential Revision: D18269122 fbshipit-source-id: 9e9fba662	5 years ago
Scott Owens	a4f0d6dbb7	[sledge sem] Complete (nearly) proof for phi instrs Summary: Improve the invariants to show that phi instructions are correctly translated. It remains to show that the invariants can be established when jumping to the start of a block Reviewed By: jberdine Differential Revision: D18228272 fbshipit-source-id: 4330b4781	5 years ago
Benno Stein	50b60bc049	[sledge] Add APRON-backed Interval abstract domain Summary: Add a new interval abstract domain. This domain uses the APRON numerical analysis library to keep track of the range of values held by llair variables where possible. This works by translating LLAIR expressions into APRON tree expressions, so only handles the subset of the LLAIR expression language that can be embedded. Note also that function summarization is not yet implemented. Future commits will add summarization and improve coverage of LLAIR's expression language. Reviewed By: jberdine Differential Revision: D17763517 fbshipit-source-id: 826ce4cc5	5 years ago
Scott Owens	9f0fdd3bfe	[sledge sem] Add proof of bit cast implementation Summary: Add some theorems establishing the correspondence between the implementation of the Convert operation in OCaml and the definition of Convert in the semantics. Essentially, the OCaml version is in terms of extracting certain ranges of bits, whereas the semantics is in terms of integer arithmetic (addition, modulus, and exponentiation) Reviewed By: jberdine Differential Revision: D18113878 fbshipit-source-id: c318596d0	5 years ago
Scott Owens	e9296d31b6	[sledge sem] Implement and verify cast expressions Summary: This commit adds truncation, sign extension and zero extension to LLVM and the Convert instruction to LLAIR. The LLVM instructions use HOL's build-in word/int and word/num conversions. Sanity-checking theorems prove that zero-extending leaves the value of the word unchanged when considered as an unsigned value, and that sign-extending leaves the value unchanged when considered as a signed value. The llair semantics for Convert uses the truncate_2comp function which converts an integer to another integer as though they were represented in 2's complement. e.g. truncate_2comp 255 16 = 255, truncate_2comp 255 8 = -1, truncate_2comp -3 2 = 1 Reviewed By: jberdine Differential Revision: D18058833 fbshipit-source-id: df9de480c	5 years ago
Scott Owens	86024892e1	[sledge sem] Refactor inductive definitions a bit Summary: Add universal quantification to suppress warning messages Reviewed By: jberdine Differential Revision: D18058834 fbshipit-source-id: c96230ff7	5 years ago
Scott Owens	573f0d8aed	[sledge sem] Make proof progress on phi instructions Summary: The old syntactic invariant in prog_ok was in the wrong direction, saying that all labels in a phi instruction have to exist, rather than saying that when we jump to a new block, the label of the block we came from must be in all of the phi instructions. Reviewed By: jberdine Differential Revision: D18058832 fbshipit-source-id: d2ad33b04	5 years ago
Scott Owens	0a35b1da35	[sledge sem] Prove the Load and Store cases (mostly) Summary: This required some minor tweaks to how the semantics encode values into and out of byte lists. The remaining problems have to do with how LLVM globals are translated into llair. At the moment, llair semantic's state keeps a mapping for globals to their addresses, following the LLVM semantics. However, it is not used because the translation (following the code in frontend.ml) translates LLVM globals into llair locals, which the llair semantics isn't set up to handle. Reviewed By: jberdine Differential Revision: D17930787 fbshipit-source-id: 06c6368e0	5 years ago
Josh Berdine	c0c96b5235	[sledge] Refactor Used globals analysis results type and query Summary: The Used globals (pre-)analysis produces results queried by Control. This diff adds a type definition for these and moves the query into the Used_globals module. Reviewed By: bennostein Differential Revision: D17856879 fbshipit-source-id: 0211b82d7	5 years ago
Josh Berdine	429fbddeda	[sledge] Refine inlining heuristic to allow casts Summary: To avoid code explosion, the frontend emits move instructions for expressions with more than one use. This diff relaxes this slightly by allowing duplication of casts. Reviewed By: bennostein Differential Revision: D17856384 fbshipit-source-id: 6f6c496ef	5 years ago
Josh Berdine	d6d65a785a	[sledge] Remove left-over SSA assertion Reviewed By: bennostein Differential Revision: D17856383 fbshipit-source-id: 34c347f3e	5 years ago
Josh Berdine	7105d85281	[sledge][NFC] Minor code cleanup Reviewed By: bennostein Differential Revision: D17844328 fbshipit-source-id: 32aac2f3f	5 years ago
Josh Berdine	081455278d	[sledge] Do not explore exceptional control flow by default Summary: The frontend translation of exceptional control flow is untrusted enough that it makes sense to disable it by default. Reviewed By: bennostein Differential Revision: D16061018 fbshipit-source-id: 65dca36ae	5 years ago
Josh Berdine	9acfb65ba0	[sledge][NFC] Update TODO Reviewed By: bennostein Differential Revision: D17821844 fbshipit-source-id: 36616b389	5 years ago
Josh Berdine	bc858fad2e	[sledge][NFC] Rename Term.call's func arg to callee to match type Summary: Just for consistency Reviewed By: bennostein Differential Revision: D17821846 fbshipit-source-id: 778c61a56	5 years ago
Josh Berdine	6399c59861	[sledge] Do not represent function CFGs explicitly Summary: The CFG of a function is implicit in the blocks themselves, so it is possible to remove the explicit represention as a vector of blocks. The only uses are fold or iter, and since the cycles are detected during construction, these can be simple depth-first traversals. Reviewed By: bennostein Differential Revision: D17821845 fbshipit-source-id: fc7a02151	5 years ago
Josh Berdine	2331e8d68a	[sledge] Fix frontend bug in trampoline creation Reviewed By: ngorogiannis Differential Revision: D17844327 fbshipit-source-id: 18a1c4fbe	5 years ago
Josh Berdine	cf5097a8b4	[sledge] Add report-summary test make target Summary: Produce a summary of the test results report. Reviewed By: ngorogiannis Differential Revision: D17915840 fbshipit-source-id: 91445cc7b	5 years ago
Josh Berdine	995de071ed	[sledge] Revise Sh_domain handling of function call and return Summary: Fix a bug where the actual return variable was not scoped correctly in cases where its name clashed with a local or formal of the callee. Also comment and simplify to attempt to make more understandable. Reviewed By: bennostein Differential Revision: D17801944 fbshipit-source-id: 286739241	5 years ago
Josh Berdine	df26b9b1a5	[sledge][NFC] Minor code simplification Reviewed By: bennostein Differential Revision: D17801942 fbshipit-source-id: 45b881877	5 years ago
Josh Berdine	65e963a162	[sledge] Add Sh.subst implemented ito and and exists Reviewed By: bennostein Differential Revision: D17801943 fbshipit-source-id: 192863296	5 years ago
Josh Berdine	1595fb7c60	[sledge] Fix potential name clash in Sh.rename Reviewed By: bennostein, ngorogiannis Differential Revision: D17801952 fbshipit-source-id: 6c66ebb9b	5 years ago
Josh Berdine	799b21761f	[sledge] Translate ExtractElement and InsertElement despite being vector Summary: Some code that is otherwise benignly scalar still uses the ExtractElement and InsertElement vector operations, so translate them as if they were array operations. Reviewed By: ngorogiannis Differential Revision: D17801949 fbshipit-source-id: 89f3666bd	5 years ago
Scott Owens	3080fba8fa	[sledge sem] Update LLVM and LLAIR sem for consistent stuckness Summary: Previously, the LLVM semantics could be stuck where the LLAIR semantics was not yet stuck, but would become stuck (at the same place) after taking a step. This was due to LLVM using the traditional definition of stuck states: any state from which there are no transitions. However, LLAIR cannot do that because it might get stuck in the middle of a block that contains several visible stores. We don't want to consider the whole block stuck, nor can we finish it. Thus, the LLAIR definition of stuckness is when the state has the stuck flag set which happens when stopping in the middle of a block after encountering a stuck instruction. Now LLVM takes the same approach. Reviewed By: jberdine Differential Revision: D17855085 fbshipit-source-id: a094d25d5	5 years ago
Scott Owens	14a8ae34b9	[sledge sem] Improve and unify treatment of Exit Summary: Add an argument to the Exit instruction. Update the LLVM semantics to execute the Exit instruction and store the result in an "exited" component of the state. (Previously it just noticed that it was stuck about to do an Exit.) With exiting treated uniformly, now in the proof that for every LLVM trace, there is a llair trace that simulates it, all of the cheats except for 1 are just cases that I haven't got to yet. However, the last cheat is for the situation where the LLVM program gets stuck and the llair program doesn't. For example, the following two line LLVM program gets stuck because r2 is not assigned (ignoring for the moment the static restriction that LLVM is in SSA form). r1 := r2 Exit(0) The compilation to llair omits the assignment and so we get a llair program that doesn't get stuck: Exit(0) The key question is whether the static restrictions are sufficient to ensure that no expression that might be omitted can get stuck. Reviewed By: jberdine Differential Revision: D17737589 fbshipit-source-id: bc6c01a1b	5 years ago
Scott Owens	5312b3d10c	[sledge sem] Fix trans. invariant for llair expressions Summary: If the LLVM to llair translation keeps a mapping from register r to expression e, then for each register r' mentioned in e, there must be an assignment to r' that dominates the entire live range of r. Thus, where ever r might be replaced by e, the value of r' will be the same as it was when the initial assignment to r occurred. Maintaining this invariant relies on the LLVM being in SSA form. Reviewed By: jberdine Differential Revision: D17710288 fbshipit-source-id: fd3eaa57d	5 years ago
Scott Owens	9f2f14b34c	[sledge sem] Sketch out translation correctness Summary: This is work in progress; many of the cheats aren't true. In particular, the definition of stuck/complete/partial traces in LLVM and llair don't quite match up and need some modification. Also, the state relation isn't strong enough; it will need to include information about registers used in the expressions of the LLVM register to llair expression mapping. But the overall shape of the proof is ok and so it can be used to poke at various local aspects of the translation, such as individual instructions. Reviewed By: jberdine Differential Revision: D17631604 fbshipit-source-id: 743b5d64d	5 years ago
Jules Villard	42470d8809	[hmm] sexp_{option,list} -> {option,list} Summary: By some unfortunate logic, OCaml often decides to use `sexp_list`/`sexp_option` instead of just `list`/`option`. Sometimes these get copy/pasted in interface files. It would be good to tell OCaml not to do that in the first place but in the meantime: this diff. Reviewed By: ngorogiannis Differential Revision: D17907938 fbshipit-source-id: 7546834a2	5 years ago
Josh Berdine	ef78ba83cf	[sledge] Report the number of alarms Summary: For test scripting purposes, when the analysis finishes successfully, report the number of alarms. Reviewed By: ngorogiannis Differential Revision: D17801947 fbshipit-source-id: 1660866df	5 years ago
Josh Berdine	ec52c05c30	[sledge][NFC] Minor simplification for singleton sets Reviewed By: ngorogiannis Differential Revision: D17801948 fbshipit-source-id: 86d2e6ec9	5 years ago
Josh Berdine	239d906ab6	[sledge] Improve tracing and debugging support Reviewed By: ngorogiannis Differential Revision: D17801930 fbshipit-source-id: 8cfac2eaf	5 years ago
Josh Berdine	3f5adecdcf	[sledge] Exec.exec_specs missed vocabulary extension Summary: In a spec, it currently may be that foot.us does not contain xs. So exec_specs needs to extend the vocabulary of foot before existentially quantifying out xs. Reviewed By: ngorogiannis Differential Revision: D17801933 fbshipit-source-id: 7b4b9262a	5 years ago
Josh Berdine	9ac854c970	[sledge] Exec.kill should preserve vocabulary Reviewed By: ngorogiannis Differential Revision: D17801935 fbshipit-source-id: 81fe4b067	5 years ago
Josh Berdine	8097f1a6df	[sledge] Adjust tests to match harnesses Reviewed By: ngorogiannis Differential Revision: D17801945 fbshipit-source-id: 0f984e013	5 years ago
Josh Berdine	b2f90a3994	[sledge] Treat freturn directly in Dom.call Summary: Previously it was added to the locals before calling Dom.call, but this results in the scope of freturn ending too early. Reviewed By: ngorogiannis Differential Revision: D17801939 fbshipit-source-id: 739ec8981	5 years ago
Josh Berdine	fbf0fe2f1a	[sledge][NFC] Rename args to actuals Summary: For consistency Reviewed By: ngorogiannis Differential Revision: D17801953 fbshipit-source-id: a797d2446	5 years ago
Josh Berdine	d3d0c4b36e	[sledge][NFC] Rename params to formals Summary: For consistency Reviewed By: ngorogiannis Differential Revision: D17801926 fbshipit-source-id: 012b13561	5 years ago
Josh Berdine	69c29ab3d8	[sledge][NFC] Label args of Domain.call Summary: Just for legibility. Reviewed By: ngorogiannis Differential Revision: D17801937 fbshipit-source-id: ee1bd95d2	5 years ago
Josh Berdine	47766a0e6e	[sledge] Drop globals with appending linkage and size 0 Summary: Some globals have 'appending' linkage, where linking modules results in appending the arrays from each module. These can appear even when empty, leading to useless and somewhat troublesome 0-length arrays. So drop them. Reviewed By: ngorogiannis Differential Revision: D17801927 fbshipit-source-id: d2dc180d7	5 years ago
Josh Berdine	1efd0df035	[sledge] Avoid potential name clash between trampolines Summary: Trampoline blocks introduced when eliminating SSA could clash. Reviewed By: ngorogiannis Differential Revision: D17801936 fbshipit-source-id: c1fdf2fc6	5 years ago
Josh Berdine	ebee451f1c	[sledge] Improve test scripts Summary: Better failure messages and reports Reviewed By: ngorogiannis Differential Revision: D17801940 fbshipit-source-id: db3d13eaf	5 years ago
Josh Berdine	38cab376f6	[sledge] Keep BitCasts and similar in expressions Summary: While BitCasts are the identity function on the bitwise representation, they are not necessarily so in the semantics or the logical representation. So be more conservative about eliding them in the Exp language. Those that are actually semantic identities are still omitted in the Term language. Reviewed By: ngorogiannis Differential Revision: D17801950 fbshipit-source-id: bf9ae57b5	5 years ago
Josh Berdine	b632d4f283	[sledge] Check the input datalayout agrees with assumptions Summary: The analyzer (currently) hard-codes some assumptions about sizes of basic types such as Typ.bool, Typ.siz, etc. Check that these assumptions are satisfied by the input llvm datalayout, and give reasonable error messages otherwise. Reviewed By: ngorogiannis Differential Revision: D17801941 fbshipit-source-id: 4fe484ee0	5 years ago
Josh Berdine	6328a6ce40	[sledge] Do not store size of globals separately Summary: Now that expression types and type sizes can be computed, it is not necessary to store the sizes of globals separately. Reviewed By: ngorogiannis Differential Revision: D17801932 fbshipit-source-id: f746e506b	5 years ago
Josh Berdine	ca95fc098f	[sledge] Keep size in both bits and bytes for each type Summary: - The `Llvm_target.DataLayout.size_in_bits` needs to be used for checking casts e.g. it is ok to `bitcast <16 x i1> to i16`: they both have 16 bits, but they have sizes 16 vs 2 bytes - The `Llvm_target.DataLayout.abi_size` needs to be used for the size of memory blocks containing values e.g. for the size of memory segments containing the initial values of globals - The example above shows that we can't compute the byte size from the bit size without knowing the target specific datalayout - So we need both in each sized type - Also add checks that Convert exps and terms are not no-ops - Simplifications of size manipulating code Reviewed By: ngorogiannis Differential Revision: D17801928 fbshipit-source-id: 8c8ce6128	5 years ago
Josh Berdine	d3bad1ce44	[sledge] Add sizes to types Summary: In order to type-check casts, it is necessary to have the size of each sized type. This size information is also useful in a few other places. Reviewed By: bennostein Differential Revision: D17801931 fbshipit-source-id: f8ef53276	5 years ago
Josh Berdine	6120b7d098	[sledge] Use the configured margin when formatting failure messages Reviewed By: bennostein Differential Revision: D17801934 fbshipit-source-id: af7acec9b	5 years ago
Josh Berdine	a386b36616	[sledge] Re-add Splat expression for zero-initialized aggregates Summary: This is needed since expressions distinguish between the integer or pointer zero value and zero-initialized array/tuple/struct aggregates based on type, and the backend distinguishes them semantically. Reviewed By: bennostein Differential Revision: D17801938 fbshipit-source-id: ac8665e65	5 years ago
Josh Berdine	727385d853	[sledge] Relax Typ.is_sized to allow opaque types Summary: Linking can lead to opaque types becoming identified with a known types. Assertions in various places that types should be sized can be triggered by such opaque types. Until there is a distinction between processing fully-linked versus incomplete code, these checks need to be relaxed to permit opaque types where sized ones are expected. Reviewed By: bennostein Differential Revision: D17801929 fbshipit-source-id: c5e62f7c8	5 years ago
Josh Berdine	f804220cd2	[sledge] Revise order of Term constructors for polynomial normalization Summary: Integer terms need to compare higher than any monomial. Reviewed By: bennostein Differential Revision: D17725607 fbshipit-source-id: c64fd52d5	5 years ago
Josh Berdine	1ef390ffca	[sledge] Relax Exp type-checking to be modulo-casting Summary: Also weaken definition of Typ.castable to permit casting between floats and ints of the same size. Reviewed By: bennostein Differential Revision: D17725611 fbshipit-source-id: 5e8114e26	5 years ago
Josh Berdine	fb184a6a1d	[sledge] Introduce the notion of types having the same semantics Summary: Typ.equivalent is currently defined the same as Typ.castable, but conceptually they are different and castable needs to be weakened. They are different since for example it is possible to cast from an i64 to a f64, but those types denote different sets of values in the semantics, and the bitcast is modeled using a conversion function. Reviewed By: bennostein Differential Revision: D17725615 fbshipit-source-id: 973574f2a	5 years ago
Josh Berdine	917cc62e28	[sledge] Fix type of functions called using a cast Summary: For function calls where the callee is a cast expression, previous the wrong type would be used for the callee. This could lead to crashes in llvm, or asserting in sledge. Reviewed By: bennostein Differential Revision: D17725610 fbshipit-source-id: 938b49a49	5 years ago
Josh Berdine	ce3252c348	[sledge] Allow global variables as function names Summary: Some called functions are represented in llvm as a global variable with e.g. external linkage, and so they do not appear as 'functions'. It is still valid to call such functions, though the analyzer does not know their definitions. Reviewed By: bennostein Differential Revision: D17725609 fbshipit-source-id: 333d19c0d	5 years ago
Josh Berdine	785928c77e	[sledge] Error reporting improvements Summary: Improve Trace.fail to log the error and raise informative exceptions. Eliminate the confusion between Import.fail and Trace.fail by removing Import.fail. Reviewed By: bennostein Differential Revision: D17725608 fbshipit-source-id: 79fdfbd86	5 years ago
Josh Berdine	ffeef16aae	[sledge] Add a flag to disable internalization Summary: By default all functions except those specified as entry points in the config file are "internalized". Internal functions are removed if they are not called. It is sometimes necessary to disable internalization, e.g. to analyze the llvm tests. Reviewed By: bennostein Differential Revision: D17725614 fbshipit-source-id: 4b13501f5	5 years ago
Josh Berdine	6ca09b14fd	[sledge] Add flag to disable linking in the models Summary: Sometimes the models for the C/C++ runtime and standard libraries are not needed. Furthermore, sometimes, e.g. when analyzing llvm tests, trying to link them fails. Reviewed By: bennostein Differential Revision: D17725616 fbshipit-source-id: 76a4bcf90	5 years ago
Josh Berdine	f699c9b9a8	[sledge] Simplify ¬¬e to e Reviewed By: bennostein Differential Revision: D17725617 fbshipit-source-id: 7467fad3e	5 years ago
Josh Berdine	06f2863dd8	[sledge] Simplify `e xor e` to `0` Reviewed By: bennostein Differential Revision: D17665226 fbshipit-source-id: 655ddf6a8	5 years ago
Josh Berdine	6f84787b19	[sledge] Change exec_inst to return an option instead of a result Summary: The `(t, unit) result` type is no more informative than `t option` and less convenient. Reviewed By: bennostein Differential Revision: D17665244 fbshipit-source-id: fa969d8b7	5 years ago
Josh Berdine	2840eb4781	[sledge] Refactor dispatch on instruction from Exec to Sh_domain Summary: This puts the mediation between Exp and Term together in Sh_domain rather than being spread across the two. Reviewed By: bennostein Differential Revision: D17665235 fbshipit-source-id: edf277d45	5 years ago
Josh Berdine	c6d7886fd8	[sledge] Make type of exec_move consistent with move instruction Summary: The move instruction takes a vector of assignments to perform in parallel, so generalize exec_move from one to a vector. Reviewed By: bennostein Differential Revision: D17665248 fbshipit-source-id: 52aae5ff9	5 years ago
Josh Berdine	162f027249	[sledge] Make type argument of Exp constructors optional where computable Reviewed By: bennostein Differential Revision: D17665251 fbshipit-source-id: 4d8bccfe8	5 years ago
Josh Berdine	ad5d5dd89e	[sledge] Add Exp.true_ and Exp.false_ Summary: Convenience wrappers for Exp.integer. Reviewed By: bennostein Differential Revision: D17665234 fbshipit-source-id: 0cf440861	5 years ago
Josh Berdine	37d1904bd3	[sledge] Move check for whether a variable is global from Reg to Var Summary: Extend the encoding using `id` from 0 indicating a program variable to also -1 indicating a global program variable. Reviewed By: bennostein Differential Revision: D17665229 fbshipit-source-id: 848b8a31e	5 years ago
Josh Berdine	3003a8e646	[sledge] NFC minor cleanups Reviewed By: jvillard Differential Revision: D17665255 fbshipit-source-id: 0f18e5777	5 years ago
Josh Berdine	8ee0c67d1f	[sledge] Precompute the Term form of each Exp, and add it to Exp.t Reviewed By: bennostein Differential Revision: D17665261 fbshipit-source-id: 25f2e656f	5 years ago
Josh Berdine	9ddfae4e89	[sledge] Change Term.rename to preserve sharing in cyclic records Reviewed By: bennostein Differential Revision: D17665265 fbshipit-source-id: 50844096a	5 years ago
Josh Berdine	7ecd091ff3	[sledge] Change Struct_rec to a generic n-ary recursive application Reviewed By: bennostein Differential Revision: D17665266 fbshipit-source-id: dd938ac31	5 years ago
Josh Berdine	356b4f0b4e	[sledge] Uncurry Record term constructor Reviewed By: bennostein Differential Revision: D17665260 fbshipit-source-id: 080f47739	5 years ago
Josh Berdine	99b60d191a	[sledge] Fix sorting of heap block subformulas when printing Summary: The sorting of heap blocks when printing formulas was broken by the change to the direct representation of polynomials. Reviewed By: bennostein Differential Revision: D17665246 fbshipit-source-id: 4ebea9f20	5 years ago
Josh Berdine	1228c8e31b	[sledge] Uncurry Update term constructor, and specialize index to int Reviewed By: bennostein Differential Revision: D17665245 fbshipit-source-id: d4716a220	5 years ago
Josh Berdine	09daac754c	[sledge] Uncurry Select term constructor, and specialize index to int Reviewed By: ngorogiannis Differential Revision: D17665264 fbshipit-source-id: c716a3eeb	5 years ago
Josh Berdine	5eaae07043	[sledge] Change Concat term contructor to a generic n-ary application Reviewed By: ngorogiannis Differential Revision: D17665238 fbshipit-source-id: 713b333e8	5 years ago
Josh Berdine	6cd82475f1	[sledge] Use generic binary application for Splat and Memory term constructors Reviewed By: bennostein Differential Revision: D17665256 fbshipit-source-id: 9c08338de	5 years ago
Josh Berdine	6805da9557	[sledge] Uncurry ternary term constructors Reviewed By: bennostein Differential Revision: D17665227 fbshipit-source-id: 56240d374	5 years ago
Josh Berdine	167e489e24	[sledge] Uncurry binary term constructors Reviewed By: bennostein Differential Revision: D17665243 fbshipit-source-id: 2d68e40b5	5 years ago
Josh Berdine	8b9d4ba066	[sledge] Uncurry unary term constructors Reviewed By: bennostein Differential Revision: D17665258 fbshipit-source-id: 456f7c58d	5 years ago
Josh Berdine	e87a0533be	[sledge] Minor simplification of polynomial representation Reviewed By: bennostein Differential Revision: D17665237 fbshipit-source-id: f9a082d26	5 years ago
Josh Berdine	3bbb05216f	[sledge] Remove the redundancy of both < and >= terms Summary: It is not necessary to have both < and >=, and similarly for <= and >. Reviewed By: bennostein Differential Revision: D17665232 fbshipit-source-id: 01b3511f5	5 years ago
Josh Berdine	a3506f995c	[sledge] Simplify arithmetic terms due to not needing type Summary: Now that terms operate over unbounded, signed, integers rather than bounded integers, and Boolean operations are treated uniformly with bitwise operations, it is not necessary to propagate types throughout arithmetic term manipulation. Reviewed By: bennostein Differential Revision: D17665257 fbshipit-source-id: 5236b101d	5 years ago
Josh Berdine	471d296266	[sledge] Fix check for range of representable integers Summary: Z.numbits ignores the sign, which allows 2^(N - 1) as representable within N bits, while it is not. So check explicitly. Reviewed By: bennostein Differential Revision: D17665231 fbshipit-source-id: 0d3940517	5 years ago
Josh Berdine	c440c4fc28	[sledge] Remove unsigned Term operations except Extract Summary: Instead of having separate signed and unsigned operations, use the signed operations applied to explicit conversion of the arguments using an unsigned integer interpretation. Reviewed By: bennostein Differential Revision: D17665267 fbshipit-source-id: 0b3271e71	5 years ago
Josh Berdine	e84f3fcf0f	[sledge] Add Extract term Summary: Add an Extract term form to interpret an integer with given signedness and bitwidth. Reviewed By: bennostein Differential Revision: D17665263 fbshipit-source-id: 1d8917f3c	5 years ago
Josh Berdine	5753f9b26a	[sledge] Rename clamp to extract Reviewed By: bennostein Differential Revision: D17665239 fbshipit-source-id: bab1175e1	5 years ago
Josh Berdine	d7ef03cf02	[sledge] Revise and fix unsigned conversions Summary: Be more explicit about semantics of unsigned vs. signed conversions, and fix a few related corner cases. Reviewed By: bennostein Differential Revision: D17665268 fbshipit-source-id: 67fecdf34	5 years ago
Josh Berdine	7f2165484b	[sledge] Do not special case boolean vs bitwise operations Summary: With terms using unbounded two's complement arithmetic, it is not necessary to special-case 1-bit integers as Booleans. Reviewed By: ngorogiannis Differential Revision: D17665228 fbshipit-source-id: a2f280fc3	5 years ago
Josh Berdine	8abfcfb504	[sledge] Simplify normalization of shift operations Summary: Remove the guards that prevent normalizing in some cases where the corresponding instruction in LLVM would produce a poison value. Usefully tracking poison values will be more involved. Reviewed By: ngorogiannis Differential Revision: D17665230 fbshipit-source-id: 59fb25042	5 years ago
Josh Berdine	e3f0ba8c54	[sledge] Revise program expressions Summary: Revise program expressions based on the changed constraints now that Term is separate from Exp. In particular: - Add types to all application, indicating how the operation interprets its arguments - Change to a simpler uncurried form - Remove now-unneeded normalizations Reviewed By: bennostein Differential Revision: D17665236 fbshipit-source-id: 1bcf2efd6	5 years ago
Josh Berdine	00639e15bb	[sledge] Delay normalization of xor to equality Summary: Boolean and bitwise negation of `e` is represented using `-1 xor e`. Since Equality can only maintain and propagate equality constraints, Boolean negation `-1 xor b` is normalized to `b = false`. This diff delays this normalization from being part of expression construction to part of symbolic heap formula construction. This makes the normalization done as part of expression construction independent of the distinction between bitwise and boolean operations. Reviewed By: bennostein Differential Revision: D17665254 fbshipit-source-id: 0a0722865	5 years ago
Josh Berdine	0e4110fc5c	[sledge] Normalize xor and equality based on type instead of bitwidth Reviewed By: bennostein Differential Revision: D17665233 fbshipit-source-id: dc2821943	5 years ago
Josh Berdine	0903355a0e	[sledge] Remove unused Exp constructors for memory exps Summary: Splat, Memory, and Concat expressions are never used. Only the term forms are needed. Reviewed By: bennostein Differential Revision: D17665259 fbshipit-source-id: cbfd7650d	5 years ago
Josh Berdine	3b03022b5e	[sledge] Remove redundant Reg.id Summary: It is always 0. Reviewed By: bennostein Differential Revision: D17665247 fbshipit-source-id: c146c9dc8	5 years ago
Josh Berdine	310d00f380	[sledge] Remove dead code in Exp and Term Reviewed By: bennostein Differential Revision: D17665249 fbshipit-source-id: c242634f1	5 years ago
Josh Berdine	442c8e92f4	[sledge] Distinguish program expressions and formula terms Summary: There are a number if issues with using the same type for expressions in code and in formulas. One is that the type systems of the two should be different. Another is that conflating the two compromises the ability of Llair to correctly express aspects such as integer overflow, floating point rounding, etc. Also, it could be beneficial to have more source locations for program expressions than makes sense for terms. This diff simply unshares Exp, leading to a copy named Term. Likewise, Reg is now a copy of Var. Simplifications to come. Reviewed By: bennostein Differential Revision: D17665250 fbshipit-source-id: 4359a80d5	5 years ago
Josh Berdine	13c06e4dd3	[sledge] Move generation of formal return and throw parameters to frontend Summary: The generation of names for the function formal return and throw parameters is not central to LLAIR, but a detail of the frontend, since they are generated only because LLVM does not already have such names. Reviewed By: ngorogiannis Differential Revision: D17665240 fbshipit-source-id: 684cbae92	5 years ago
Josh Berdine	0c04ecc9aa	[sledge] Change Llair representation of functions to a String map Summary: Using a type of keys richer than strings, which are the unique symbol names at the C/LLVM level, is unnecessary. Reviewed By: ngorogiannis Differential Revision: D17665262 fbshipit-source-id: 6b8c31146	5 years ago
Josh Berdine	6aaeaba104	[sledge] Move ops on signed 1-bit Z integers to import Summary: The convenience wrappers for operations on signed 1-bit integers represented by Z.t are not specific to Exp. Reviewed By: ngorogiannis Differential Revision: D17665252 fbshipit-source-id: d4b58e2a6	5 years ago
Josh Berdine	ed733f0247	[sledge] Add missing import of trace into symbheap Reviewed By: ngorogiannis Differential Revision: D17665241 fbshipit-source-id: 6f70e2925	5 years ago
Josh Berdine	1fdc76d163	[sledge] Rename State_domain back to Sh_domain Summary: Now that the relation domain construction is factored out and generalized. Reviewed By: ngorogiannis Differential Revision: D17665253 fbshipit-source-id: eb156ce6b	5 years ago
Josh Berdine	c6b8b4688b	[sledge] Move llvm build and install dirs out of llvm source tree Summary: Since version 2, none of the `opam pin` modes work reasonably well for the standard llvm build procedure. As a workaround to prevent opam from making several copies of the build directory when pinning, adjust to move the llvm build and install directories out of the llvm source tree. Reviewed By: bennostein Differential Revision: D17665242 fbshipit-source-id: ac84a4b0b	5 years ago
Scott Owens	5b7931e71a	[sledge sem] Add a rudimentary theory of SSA Summary: Since the correcteness of the mapping from LLVM to llair depends on LLVM being SSA, we need to formalise what that means. We also prove that the domination relation is a strict partial order, which will probably be helpful when reasoning about the translation. Reviewed By: jberdine Differential Revision: D17631456 fbshipit-source-id: a00eb3f87	5 years ago
Scott Owens	71aa4816d6	[sledge sem] Fix the semantics and trans. of If Summary: The LLVM semantics and translation was not consistently treating the 1-bit word value condition as signed or unsigned. Reviewed By: jberdine Differential Revision: D17605766 fbshipit-source-id: 77edf63b7	5 years ago
Scott Owens	ab7233c5b8	[sledge sem] Refactor the way LLVM sem. does phis Summary: Previously the LLVM semantics did the phi instructions at the head of a block as part of executing the branch into that block. This looked a bit weird, but had the advantage that the semantics knew which block was being jumped from, which is necessary to run the phi instructions. However, it meant that the rules for doing phi instructions would need to show up with each branching construct. It was also annoying for the LLVM->llair proof, since the phis are removed and their effect happens as a distinct step from the branch. Here we add a distinct Phi_ip instruction pointer to indicate that the phi instructions at the start of the block should execute next, and then be incremented to the usual numeric instruction pointer that points to the non-phi instructions. The Phi_ip contains the identity of the previous block. Reviewed By: jberdine Differential Revision: D17452416 fbshipit-source-id: 78fef7cca	5 years ago
Scott Owens	17b3c7a49f	[sledge sem] Add top-level llair semantics Summary: Give the llair semantics observable side effects (writes to global variables) and a semantic function mirroring the LLVM semantics. Start sketching out the LLVM/llair translation equivalence proof in a top-down way from the obvious statement of equality of the semantics. Reviewed By: jberdine Differential Revision: D17399654 fbshipit-source-id: 2170678a8	5 years ago
Scott Owens	30c301a3e8	[sledge sem] Add a more llair-like LLVM semantics Summary: The simple LLVM semantics steps one instruction at a time, but the generated llair does whole blocks at a time, since many individual LLVM instructions can become a single llair expression. We add a bigger-step LLVM semantics that does whole blocks at a time (except that it also stops at function calls, since those end blocks in llair). The steps in this bigger-step semantics should be at the same granularity as the llair steps, making it easier to verify the translation. We add a notion of observation to the LLVM semantics (right now, just global variable writes) and use that to define two top-level semantic functions, which we prove to be equivalent. Reviewed By: jberdine Differential Revision: D17396016 fbshipit-source-id: ee632fb92	5 years ago
Benno Stein	7ec2830d92	[sledge] Only merge worklist states that share a calling context Summary: This diff allows domains to specify which abstract states can or can't be merged together by the worklist. In particular, this is needed for relational domains to ensure that Hoare triples are joined only when they share a precondition. Reviewed By: jberdine Differential Revision: D17571148 fbshipit-source-id: d9345fdc9	5 years ago
Benno Stein	e44827b892	[sledge] Add option to apply used-globals as pre-analysis Summary: This diff adds a "-prenalyze-globals" flag to all analyze targets which, when set, computes used-globals sets for all reachable functions and then uses that information to track only relevant global variables at calls in the main analysis. Reviewed By: jberdine, jvillard Differential Revision: D17526746 fbshipit-source-id: 1a114285c	5 years ago
Benno Stein	1ab8359bc0	[sledge] fix bug spuriously marking a register as global variable Summary: Fixes a bug in Llair.Frontend.xlate_value where the l-val register of LLVM instruction calls was being marked as global. Reviewed By: jberdine Differential Revision: D17570458 fbshipit-source-id: e1b5924e2	5 years ago
Benno Stein	637fff5247	[sledge] Check for intrinsic calls in used-globals analysis Summary: Fixes a bug where are all calls are treated as intrinsics in used globals analysis, since exec_intrinsic is invoked at _all_ calls to determine which are intrinsic, not only at call sites known to target intrinsics. Reviewed By: jberdine Differential Revision: D17499406 fbshipit-source-id: 41f7621f2	5 years ago
Benno Stein	6592eb609f	[sledge] Add option to skip recursive calls at depth bound Summary: While the symbolic heap analysis ends its search upon hitting the bound on recursion depth, the used-globals analysis should instead simply skip recursive calls beyond the depth. Note that this is unsound for arbitrary abstract domains, however, and the flag controlling this feature should be used with caution. Note that procedure calls are still not handled correctly, since Used_globals.exec_intrinsic does not properly check whether callees are intrinsic. A forthcoming commit will fix that, as well. Reviewed By: jberdine Differential Revision: D17479753 fbshipit-source-id: aa92e0ef3	5 years ago
Benno Stein	00a5d3dd64	[sledge] Account for callees in used-globals analysis Summary: Include global variables used in function callees in used globals analysis. Also adds support for arbitrary changes to symbolic state while resolving callees in other analyses. Reviewed By: jberdine Differential Revision: D17479352 fbshipit-source-id: e3cd9f179	5 years ago
Josh Berdine	c131e2e669	[sledge] Use dune's Build_info for version reporting Summary: Replace custom version reporting support using a shell script with code using dune's Build_info API. Note that after this diff, the executables under _build/<context> are not version-stamped, but those under _build/_install are. The symlinks in bin point to the latter, stamped, exes. Reviewed By: bennostein Differential Revision: D16985446 fbshipit-source-id: 7afac87be	5 years ago
Benno Stein	47f314c00e	[sledge] Add used-globals abstract domain and transfer functions Summary: Adds an abstract domain to track global variable usages, as well as supporting changes to the frontend, IR and CLI. This analysis will support optimizations to the main symbolic-heap analysis, but for now can be invoked independently through the `-domain` flag on `analyze` targets of the Sledge executable. Reviewed By: jberdine Differential Revision: D17422212 fbshipit-source-id: 74bed0a76	5 years ago
Benno Stein	3dc0c5938f	[sledge] Extract relational logic from Sh_domain, create "domain" module Summary: Generalize the lifting from State_domain (i.e. symbolic heaps) to Sh_domain (i.e. relations over symbolic heaps). Also, extract abstract-domain-related code into its own module/directory. Reviewed By: jberdine Differential Revision: D17319007 fbshipit-source-id: cefbd1393	5 years ago
Benno Stein	2acb1c3dee	[sledge] Functorize worklist, separate out domain-specific logic Summary: Add support for future development of new abstract domains by eliminating hard-wired dependencies from the worklist into the symbolic heap domain. Also includes an implementation of a trivial unit domain and a CLI flag to enable its use, for debugging purposes. Reviewed By: jberdine Differential Revision: D17281681 fbshipit-source-id: 5858fd420	5 years ago
Scott Owens	f298d728c5	[sledge sem] Start sketching translation correctness Summary: This includes a few changes and corrections to the semantics, to support the translation. This initial attempt to reason about LLVM -> llair showed three things that needed repair in the semantics, in addition to various bugs. We address them as follows. Refactor llair semantics to have only a single kind of flat value: integers that fit into specified bit widths. Operations on size values (e.g., offsets, indices and the like) can just take an integer and ignore its number of bits. Pointers can just be considered integers that fit into a certain size given by the constant pointer_size. Later on we can consider making this a parameter to the model. Change the generic memory model interface to use numbers rather than words as the generic encoding of a large value. This makes it more useful for llair where words are not used. Pay more careful attention to signed/unsigned issues. Neither LLVM nor llair have a concept of signed vs unsigned value. Instead individual operations interpret bit patterns in various ways, some of which are ambiguous in the LLVM manual. For example, since getelementpointer's indices are explicitly said to be interpreted as signed 2's complement, we should probably do the same for insertvalue and extractvalue. However it is not clear how the argument to alloca is to be interpreted. For now we assume signed. Reviewed By: jberdine Differential Revision: D17164133 fbshipit-source-id: 31a8af635	5 years ago
Josh Berdine	72946c3be3	[sledge] Update dependencies Reviewed By: jvillard Differential Revision: D17132472 fbshipit-source-id: 9f4c9421e	5 years ago
Scott Owens	d864fb2c89	[sledge semantics] Add a rough draft llair semantics Summary: Not everything is here yet, and there is some confusion on what to do about the size values. However, the semantics has the right general shape and will be a nice starting point for thinking about the details. Reviewed By: jberdine Differential Revision: D17111041 fbshipit-source-id: cc75651c6	5 years ago
Scott Owens	32983e129b	[sledge semantics] Update expr transl. for cross-block Summary: The translation from LLVM to llair now builds expressions up across blocks, following the implementation. This is easy to do because of the dominance restrictions in SSA, but might be difficult to reason about. Reviewed By: jberdine Differential Revision: D17111040 fbshipit-source-id: a8e99147d	5 years ago
Scott Owens	9f44bbc264	[sledge semantics] Refactor the memory model Summary: LLVM and llair have similar memory models, and we don't want to duplicate any definitions or theorems. This adds a new memory model theory which should be understandable in its own right. A heap is a mapping from addresses to bytes, alongside a set of valid addresses, and intervals that have been allocated already. Primitives are defined for allocating and de-allocating as well as reading and writing chuncks of bytes. There is also a generic type of structured values, and functions for converting them to/from byte arrays. Reviewed By: jberdine Differential Revision: D17074470 fbshipit-source-id: bdab6089f	5 years ago
Josh Berdine	13fb57ec62	[sledge] Revise llvm to llair translation to avoid code duplication Summary: In some cases inlining pure expressions into their use sites causes code blowup. This diff changes the frontend to inline expressions only if there is a single use, and otherwise adds a move instruction. Reviewed By: ngorogiannis Differential Revision: D17071770 fbshipit-source-id: d866a0622	5 years ago
Josh Berdine	ed4aac4f66	[sledge] Update stale comment Summary: This has been out of date since arithmetic was changed from a purely uninterpreted treatment to having a solver. Reviewed By: jvillard Differential Revision: D16985159 fbshipit-source-id: 39e42069c	5 years ago
Josh Berdine	0667edf418	[sledge] Remove unused Llair.ignore_result Summary: No longer needed due to blocks not taking parameters. Reviewed By: jvillard Differential Revision: D16914858 fbshipit-source-id: 24b1106ac	5 years ago
Josh Berdine	3f8d5ace6e	[sledge] Eliminate SSA Summary: While SSA can be useful for code transformation purposes, it offers little for semantic static analyses. Essentially, such analyses explore the dynamic semantics of code, and the static single assignment property does not buy much. For example, once an execution visits a loop body that assigns a variable, there are multiple assignments that the analysis must deal with. This leads to the need to treat blocks as if they assign all their local variables, renaming to avoid name clashes a la Floyd's assignment axiom. That is fine, but it makes it much more involved to implement a version that is economical with respect to renaming only when necessary. Additionally the scoping constraints of SSA are cumbersome and significantly complicate interprocedural analysis (where there is a long history of incorrect proof rules for procedures, and SSA pushes the interprocedural analysis away from being able to use known-good ones). So this diff changes Llair from a functional SSA form to a traditional imperative language. Reviewed By: jvillard Differential Revision: D16905898 fbshipit-source-id: 0fd835220	5 years ago
Josh Berdine	b6eab89504	[sledge] Remove dead from_call.actuals_to_formals field Reviewed By: jvillard Differential Revision: D16905894 fbshipit-source-id: ab4b34ba0	5 years ago
Josh Berdine	8d9b8962c7	[sledge] Add Move instruction Reviewed By: jvillard Differential Revision: D16905896 fbshipit-source-id: 3d8b9a88a	5 years ago
Josh Berdine	2c9fce0bf2	[sledge] Add Vector.unzip Reviewed By: jvillard Differential Revision: D16905895 fbshipit-source-id: 98891d4b0	5 years ago
Josh Berdine	0790a64763	[sledge] Change symbolic execution of instructions to not rely on SSA Summary: Before this diff symbolic execution of instructions assumed that assigned variables were unconstrained in the precondition. This is ensured by symbolic execution of control flow, which renames all local variables of a block when it is entered. This diff changes symbolic execution of instructions to rename modified variables that appear in the precondition when necessary, and accounts for the modified variable occurrence condition on the frame rule. This will enable more economically renaming variables, as most of the time it is not needed. Reviewed By: jvillard Differential Revision: D16905893 fbshipit-source-id: 3a53525d7	5 years ago
Scott Owens	808a61623f	Add types to the variable syntax in llair Summary: Each variable now contains its type, alongside its name. This is more uniform than in LLVM, where the name is usually paired with a type, but not always, for example, the register type of the result of an extractvalue is left implicit. Reviewed By: jberdine Differential Revision: D16984630 fbshipit-source-id: 1c3bc4985	5 years ago
Scott Owens	85243ada62	Update for improved HOL syntax for Datatypes Summary: HOL now lets us omit quotations on Datatypes and make them look more like the other new-style HOL definitions. Reviewed By: jberdine Differential Revision: D16983934 fbshipit-source-id: f8ef3abb5	5 years ago
Scott Owens	84883127af	Add a skeleton of an approach to llvm->llair Summary: This sketches out how translation can be approached. It is partially based on the Sledge code. For basic blocks, isn't based on the Sledge code, but just my own thoughts as a starting point. Essentially, we are trying to build up larger expressions, and so not assigning to temporary registers that don't live past the end of the block. This does remove sharing, so a fancier approach could check for multiple uses of end-of-block dead registers, or look at the sizes of expressions. The approach should be flexible enough to accommodate such changes. Fix icmp syntax Using finite maps is elegant in the semantics, but awkward for writing the translation function. Refactor the mappings from labels to functions and from labels to blocks to use association lists instead. To remove phi nodes, the translation takes every edge in the control flow graph and makes a new basic block that contains a single parallel move instruction that corresponds to the action of the phi node of the target block. Reviewed By: jberdine Differential Revision: D16831051 fbshipit-source-id: 005663e26	5 years ago
Scott Owens	6eab69d0d1	Definie a prelim. AST for llair's semantics Summary: The AST is not complete on expressions, but it should have most of the important features. The representation is in some ways very different from the OCaml implementation, because the OCaml code uses mutation to build the CFG as an actual pointer graph in memory, and also because the expression representation is optimised for the backend. For the former, it should be easy to see that the AST here is isomorphic, representing the CFG with finite maps from block labels. The correspondence is less clear in the latter case, but the point here is not to model or verify implementation optimisations, but to give a semantics to llair as a language. Reviewed By: jberdine Differential Revision: D16807132 fbshipit-source-id: b0f64b3ec	5 years ago
Josh Berdine	7efc9285cb	[sledge] Fix type of Exp.rename Reviewed By: ngorogiannis Differential Revision: D16905897 fbshipit-source-id: 2f6740b52	5 years ago
Josh Berdine	0895246e4f	[sledge] Remove label on ~opts args in Control Reviewed By: ngorogiannis Differential Revision: D16905899 fbshipit-source-id: 205df2489	5 years ago
Scott Owens	742ab9089d	Change a type name Summary: Change loc_var (for local variable) to reg (for register) because loc_var looks too much like a location tagged variable. Reviewed By: jberdine Differential Revision: D16827920 fbshipit-source-id: 5b11f1065	5 years ago
Scott Owens	a635aff1bc	Finish proving sanity checking property Summary: There could very well still be bugs in the semantics, since the invariant here doesn't say all that much, and it completely ignores local registers. But most trivial things and typos are probably fixed. Reviewed By: jberdine Differential Revision: D16803281 fbshipit-source-id: 48ba2523b	5 years ago
Scott Owens	89c3da4510	Prove that Ret preserves the invariant Summary: Made progress on the sanity checking lemma (that the step relation preserves some simple invariants on the state). Proved the Ret instruction case of the state invariant lemma. To do this, I fixed a few bugs in the definition, and strengthened the invariants. Reviewed By: jberdine Differential Revision: D16786900 fbshipit-source-id: 6fa8cb170	5 years ago
Scott Owens	df5f20956f	Define a simple initial state that inits the globals Summary: Global variables need allocating and initialising before the machine can start. The definition here shouldn't constrain how and where they are allocated. For example, they don't all need to have separate allocations. We also tag allocated blocks so that the allocation for a global can never be deallocated. Start working on a sanity checking invariant on states. Reviewed By: jberdine Differential Revision: D16735068 fbshipit-source-id: 0d5e60e7a	5 years ago
Scott Owens	97eb280cb5	Add initial mini-LLVM semantics written in HOL4 Summary: Start working on a simple model of LLVM with the ultimate goal of handling relevant and/or tricky aspects of LLVM and LLAIR and then formalising the translation from LLVM to LLAIR. This is a complete initial model of everything that we are interested in except for exceptions, which should be tricky. Also no thought has gone into the treatment of poison and the undefined value, so the treatment is naive, which is at least partially justified because we are interested in the semantics of LLVM IR after the optimisation passes have run. Include some sanity checking theorems. Reviewed By: jberdine Differential Revision: D16731885 fbshipit-source-id: fd53949fe	5 years ago
Timotej Kapus	afb6a4fd11	[sledge] Fix internalization Summary: Currently bitcode produced with `sledge buck link` can have missing symbols that are clearly defined in the source. For example consider a symbol `awesome_function` that is defined in the libraries linked in but not in the produced binary (despite being reachable from main). `llvm-nm` of the bitcode produced by `llvm-link` might look like: ``` U awesome_function t awesome_function.1892 ``` Some our `awesome_function` is undefined and its definition is called `awsome_function.1892` for some reason and is local. I think this is because symbol get internalized too early and then they get renamed and somehow lost. Not sure why `llvm-link` behaves this way sometimes. This patch removes internalization from `llvm-link` and puts it into `opt`, where it doesn't cause problems. Reviewed By: jvillard Differential Revision: D16494153 fbshipit-source-id: aad9053a4	5 years ago
Timotej Kapus	c8d1da1e0d	[sledge] Fix __llair_alloc Summary: `__llair_alloc` is meant to be a drop-in non-failing replacement for `mallco`. Currently `__llair_alloc(1)` allocates 8 bytes instead of 1 as `malloc(1)` would. This is because handling of `__llair_alloc` was merged with handling of `new`. This patch reverts changes to handling of `new` in D15778817 and adds a new case for `__llair_alloc`. Reviewed By: jvillard Differential Revision: D16356865 fbshipit-source-id: 3878d87c3	5 years ago
Timotej Kapus	6c9e4e52c6	[sledge][summaries] Fix unsoundes due to missing frame Summary: When using summaries we first garbage collect the precondition and then ask the solver to infer the frame of the precondition with respect to grabage-collected footprint. Currently if the solver fails to show the frame, we just give it an empty frame. This is bad, because if grabage collection removed some segments, they don't get added back on. This patch throws an exception instead to be very explicit when the solver cannot show the frame in this case. Reviewed By: ngorogiannis Differential Revision: D16339587 fbshipit-source-id: b88d0689c	5 years ago
Josh Berdine	7f423f7fa1	[sledge] Model `folly::usingJEMalloc()` Summary: The actual implementation of folly::usingJEMalloc() tests if malloc is jemalloc using internal knowledge of the jemalloc implemenation of malloc. This internal behavior is not reflected in the analyzer's spec, so the detection fails. Additionally, folly::usingJEMalloc is implemented using mallctl to query internal state of jemalloc. Depending on the key string passed to mallctl, it might return a pointer to jemalloc internal state, or a scalar, which means that the spec needs to essentially allocate that state in those cases. Since the jemalloc detection fails, and the analyzer is not always able to reason precisely about string equality, this diff models folly::usingJEMalloc directly (as nondet). Reviewed By: kren1 Differential Revision: D16059776 fbshipit-source-id: 7e7156d7d	5 years ago
Josh Berdine	4bbe05698e	[sledge] Remove `.<int>` suffix when looking up modeled function names Summary: It seems that functions internalized by llvm no longer have valid mangled names, and instead have a `.<int>` suffix. This diff removes these unpredictable suffixes when checking if a called function is a specified/modeled intrinsic. Reviewed By: kren1 Differential Revision: D16059781 fbshipit-source-id: a4b9f6c73	5 years ago
Josh Berdine	0126b64d16	[sledge] Explicate output flag of disassemble command Summary: This one was overlooked before Reviewed By: kren1 Differential Revision: D16269729 fbshipit-source-id: 0aa86ca9a	5 years ago
Josh Berdine	9865bc0f74	[sledge] [solver] Strengthen handling of existential subtrahends Summary: A frame inference query `Minuend ⊢ ∃xs. Subtrahend` returns a `∃zs. Remainder` formula such that `Minuend ⊢ ∃xs. Subtrahend * ∃zs. Remainder` when successful. Currently if the subtrahend is itself existentially quantified, its existentials are treated trivially: they must witness themselves. This diff allows the solver to find witnesses as the `xs`. They are still existentially quantified in the remainder, so clients that need to constrain them should still name them before calling the solver. Reviewed By: kren1 Differential Revision: D16269630 fbshipit-source-id: 65136edd1	5 years ago
Timotej Kapus	b5dea36c5e	[sledge] Add global merge pass Summary: Add a global merge pass that merges globals into a single big global. It replaces the uses of globals merged, with offsets into the big global. Function summarisationis a big benefactor of this as it greatly reduces the number of implicit formals (ie. globals). Reviewed By: jvillard Differential Revision: D16260098 fbshipit-source-id: 1b936f02f	5 years ago
Timotej Kapus	5882c49d7d	[sledge] Disable creating of summaries when summaries disabled Summary: Fix a bug where summaries would be created even if summarisation option is disabled. Reviewed By: jvillard Differential Revision: D16259761 fbshipit-source-id: f7319ef03	5 years ago
Timotej Kapus	ba6e6bf369	[sledge] Actually use function summaries Summary: If function summaries are enabled calling a function first tries to apply a summary, if succesful, it directly jumps to the return site of the call. Otherwise it proceeds as before. Reviewed By: jvillard Differential Revision: D16201251 fbshipit-source-id: cec52e0e5	5 years ago
Timotej Kapus	c0c6d65d45	[sledge] Generate and apply summaries Summary: Define a new function summary type and compute it on function return. As an intermediary step also apply the just computed summary to function pre so it can be compared to what was actually computed. Reviewed By: jvillard Differential Revision: D16149833 fbshipit-source-id: b826c17e8	5 years ago
Timotej Kapus	8173eedf1f	[sledge] Fix solver crash Summary: Fix a crash that occurs when subtrahend has an existential variable that was renamed as in the test. The crash is due to an assertion in `Sh.exists` that says only variables in the vocabulary can be existentialy quantifed out. The problem was `Sh.exists` call in Solver.ml:611. Where `ws` (existentials of the subthrehend) are not present in the vocabulary of the remainder. This is because remainder "inheirts" the vocabulary of the minued. This fix simply extends the vocabulary of minued with `ws`, which means the remaainder has the correct vocabulary. This should have no externally visible effect as `ws` are then existentialed out. Another option would be to try to change all the `excise_seg` functions, to keep the vocabulary, but that looked annoying to implement. Reviewed By: jvillard Differential Revision: D16201423 fbshipit-source-id: b88c3abc4	5 years ago
Timotej Kapus	b5b8259ea7	[sledge] Add printing of some variables in bold Summary: Add a `-color` option to sledge, that prints variable that are existentially bound as bold. Reviewed By: ngorogiannis Differential Revision: D16088750 fbshipit-source-id: bd21cb8a0	5 years ago
Timotej Kapus	c5f261e977	[sledge] [summaries] Fix variable naming bugs Summary: This fixes two bugs: * All local variables would get existentially quantified out, that means the the local variables of the caller couldn't be restored properly * Frame was added back on after the formals were killed. Which meant that if frame talked about formals (in pure for example), those formals would remain to be free variables. Reviewed By: ngorogiannis Differential Revision: D16091157 fbshipit-source-id: dfe12ed82	5 years ago
Timotej Kapus	b25f735c6e	[sledge] Fix Exp.map and garbage_collect Summary: : * Fix non termination of garbage collection * Fix implementation of Exp.map to handle recursive Exps (vtables) Reviewed By: jberdine Differential Revision: D16089676 fbshipit-source-id: 337c19f18	5 years ago
Timotej Kapus	38e66d6f91	[sledge] [summaries] Fix issues with multiple calls Summary: This fixes two issues with function summarization when calling a function multiple times. * Previously on return, the actuals wouldn't get added back in, so their name would be "lost" (that is existentially quantified out), this patch adds the formals to actuals equalities back on return, before quantifying the formals out. * Previously the entry state of the function would be lost if there were multiple calls to other functions. Reviewed By: jberdine Differential Revision: D16071656 fbshipit-source-id: 9df7b1d4b	5 years ago
Josh Berdine	1908077aa9	[sledge] Include alarms in debug trace Summary: Currently alarms are reported to stdout while the debug trace is written to stderr. This makes synchronizing the two difficult. With this diff, the alarm reports can also be included in the debug trace, and analysis can be stopped when an alarm is encountered by tracing the `Stop` module, e.g.: ``` sledge -trace Report+Stop.on_invalid_access ``` Reviewed By: kren1 Differential Revision: D16072611 fbshipit-source-id: 32c3639a2	5 years ago
Josh Berdine	e27af1f184	[sledge] Build models without threads support Summary: There are many assumptions on the behavior of mutexes, condition variables, etc. in the implementation of the cxxabi with threads support. So compile with `_LIBCXXABI_HAS_NO_THREADS` defined to select the much simpler code paths for the single-threaded case. Reviewed By: kren1 Differential Revision: D16069454 fbshipit-source-id: 9f975e0e6	5 years ago
Josh Berdine	b8065e9b62	[sledge] Model __cxa_allocate_exception as unreachable with -skip-throw Summary: Each call to __cxa_allocate_exception, in practice, is shortly followed by raising an exception. With -skip-throw, execution will not proceed past the throw. Since the concrete implementation of __cxa_allocate_exception and the following initialization of the exception object is very low-level code that plays tricks, the analyzer has trouble with it. So model __cxa_allocate_exception as unreachable to avoid (needlessly) executing that code and potentially failing spuriously. Reviewed By: kren1 Differential Revision: D16069451 fbshipit-source-id: bea1dae09	5 years ago
Josh Berdine	bcc6e1ecc9	[sledge] Support intrinsics which do not return Summary: Allow intrinsics to return an inconsistent state, to specify that they do not return. Reviewed By: kren1 Differential Revision: D16069453 fbshipit-source-id: deb5d2a22	5 years ago
Josh Berdine	8f765bf742	[sledge] Add -margin flag for debug tracing output Reviewed By: kren1 Differential Revision: D16069455 fbshipit-source-id: 0be9404b6	5 years ago
Josh Berdine	d42908a5ff	[sledge] Add dbg-opt build mode Summary: This adds an optimized debug build mode, which is compiled with optimizations, and without assertions, but still has tracing enabled. Reviewed By: kren1 Differential Revision: D16069452 fbshipit-source-id: 445cfa329	5 years ago
Josh Berdine	ddc1a028c4	[sledge] Manually set exception backtrace recording Summary: Base, ridiculously, enables backtrace recording by default. So manually disable it unless in debug mode. Reviewed By: kren1 Differential Revision: D16069450 fbshipit-source-id: 34cded329	5 years ago
Josh Berdine	8be5dbec0b	[sledge] Revise Report printing Summary: The report output got disturbed by the change from predicate to relational Domain, and the tricky control of printing simplified states. After this diff by default states are printed in full, and in simplified form with `-t State_domain.pp_simp`. Also includes some minor output improvements. Reviewed By: kren1 Differential Revision: D16059780 fbshipit-source-id: b33289887	5 years ago
Josh Berdine	4c6ea0c887	[sledge] Use standard "libFuzzer" name Summary: Trivial renamings to use the standard "libFuzzer" name instead of "lib fuzzer". Reviewed By: kren1 Differential Revision: D16067881 fbshipit-source-id: 3ff2a8f86	5 years ago
Timotej Kapus	e15a1d36a5	[sledge] Add data structure to hold summaries Summary: On function return add the computed summary (pre/post) condition to a hashtable. Reviewed By: jberdine Differential Revision: D16052136 fbshipit-source-id: 0c5c9bafb	5 years ago
Josh Berdine	03e338b2b9	[sledge] Give more specific names to `-output` flags Summary: This also improves the cli parser's prefix-based disambiguation. Reviewed By: kren1 Differential Revision: D16060635 fbshipit-source-id: 626f93641	5 years ago
Josh Berdine	39fe848146	[sledge] Define `sledge buck link` in terms of `sledge buck bitcode` Summary: Defining link by composition inherits the flags of bitcode. Reviewed By: kren1 Differential Revision: D16059777 fbshipit-source-id: c8f6b1d73	5 years ago
Josh Berdine	26a34bc33c	[sledge] Do not always output list of bitcode inputs Summary: Outputting the list of bitcode inputs when no output flag is ok for `sledge buck bitcode` but does not make sense when it is composed as part of other commands. So only output to stdout if `-` is given as the output file name. Reviewed By: kren1 Differential Revision: D16059782 fbshipit-source-id: abac9c36f	5 years ago
Josh Berdine	b8bd639ad8	[sledge] Generate and commit cli help Summary: To easily monitor and track changes to the help generated by the command line interface, generate it in full and add it to the repo. Reviewed By: kren1 Differential Revision: D16059783 fbshipit-source-id: be15f9943	5 years ago
Timotej Kapus	fc6aee2d06	[sledge] Function summarisation: maybe summaries Summary: This diff enhances `-function-summaries` to remember the frame computed by the solver and actually execute the function using the summary. Upon return the frame is added back on the computed post condition. Reviewed By: ngorogiannis Differential Revision: D15900318 fbshipit-source-id: 8bb56b771	5 years ago
Timotej Kapus	5df12c7725	[sledge] Add lib-fuzzer to buck analyze Summary: Adds `-lib-fuzzer` flag to `buck analyze` for better usability Reviewed By: ngorogiannis Differential Revision: D16032095 fbshipit-source-id: cc528dd5d	5 years ago
Timotej Kapus	0ab1223d3d	[sledge] Function summarization: solver can show pre Summary: This diff is preparation for function summarization and focuses on function calls and function summary precondition computation. It introduces `-function-summaries` flag behind most of functionality is hidden, when enabled on each call * A function summary is computed by quantifying all the non-formal/global variables and removing all the segments that are not reachable from them * `pre` and `foot` are computed from function summary and the calling context by replacing formals with actuals again. * A solver is asked if `pre` entails `foot` and a frame is printed if it does Currently this only works for formulas without disjunctions, so when function summaries are enabled, that state is first moved to dnf and then the call is done for each disjunct. Reviewed By: ngorogiannis Differential Revision: D15898928 fbshipit-source-id: 49d32504c	5 years ago
Timotej Kapus	4ac252120b	[sledge] special case buck-target-patterns Summary: For buck targets that contain at least one of the substrings in `buck-target-pattern` option in config, change the buck target to add `_sledge` suffix. Reviewed By: jberdine Differential Revision: D15920018 fbshipit-source-id: 44c242e99	5 years ago
Josh Berdine	0f5ae186b3	[sledge] Add test for use-after-destroy of a temp Summary: And fix test Makefile to call the C++ compiler on .cpp files. Reviewed By: kren1 Differential Revision: D15972426 fbshipit-source-id: 719de755f	6 years ago
Josh Berdine	a58bc25aa5	[sledge] Strengthen simplification of convert Exps Summary: Simplify all conversions between castable types to the identity. The backend treats castable types as equal, so distinguishing conversions between them is incomplete. Reviewed By: kren1 Differential Revision: D15972427 fbshipit-source-id: fa09859ac	6 years ago
Josh Berdine	cc1f88a747	[sledge] Fix macos build of models Reviewed By: ngorogiannis Differential Revision: D15965940 fbshipit-source-id: a50882a70	6 years ago
Timotej Kapus	6949a5ee68	[sledge] Add a todo for calls with inttoptr Reviewed By: ngorogiannis Differential Revision: D15965374 fbshipit-source-id: bbee029d7	6 years ago
Josh Berdine	b14580d88b	[sledge] Move locals from blocks to functions Summary: The entry block contains all locals of the entire function, as required by the backend. This makes the manipulation of the locals of each block redundant. This diff moves the locals from the entry block to the function itself, removes the Locals frames of the Control.Stack, and adds a locals field to Return frames. This is part cleanup and part preparation for removing the Control.Stack. Reviewed By: ngorogiannis Differential Revision: D15963503 fbshipit-source-id: 523ebc260	6 years ago
Timotej Kapus	86e12cb1a3	[sledge] Add missing llvm passes to frontend.ml Summary: Adds `-mergefunc` and `-dce` passes to `Frontend.translate` to match the `buck link` flow with `opt` Reviewed By: ngorogiannis Differential Revision: D15938641 fbshipit-source-id: 128cb89cd	6 years ago
Josh Berdine	330b266d28	[sledge] Rework function return value passing Summary: The current handling of the formal return variable scope is not correct. Since it is passed as an actual argument to the return continuation, it is manipulated as if it was a local variable of the caller. However, its scope is not ended with the caller's locals, leading to clashes. This diff reworks the passing of return values to avoid this problem, mainly by introducing a notion of temporary variables during parameter passing. This essentially has the effect of taking a function spec { P } f(x) { λv. Q } and generating a "temporary" variable v, applying the post λv. Q to it to obtain the pre-state for the call to the return continuation k(v). Being a temporary variable just means that it goes out of scope just after parameter passing. This amounts to a long-winded way of applying the post-state to the formal parameter of the return continuation without violating scopes or SSA. This diff also separates the manipulation of the symbolic states as they proceed from: 1. the pre-state before the return instruction; 2. the exit-state after the return instruction (including the binding of the returned value to the return formal variable); 3. the post-state, where the locals are existentially quantified; and 4. the return-state, which is expressed in terms of actual args instead of formal parameters. Also in support of summarization, formal return and throw parameters are no longer tracked on the analyzer's stack. Note that these changes involve changing the locals of blocks and functions to no longer include the formal parameters. Reviewed By: kren1 Differential Revision: D15912148 fbshipit-source-id: e41dd6e42	6 years ago
Timotej Kapus	01e6c5c558	[sledge] [solver] add handling of trivial equality Summary: The solver couldn't deal with `∃ a,b . a = b` , so this diff adds a special case to deal with it. Reviewed By: ngorogiannis Differential Revision: D15897953 fbshipit-source-id: d841d3557	6 years ago
Timotej Kapus	a75a50215b	[sledge] Add LLVM passes that reduce bitcode size Summary: : This patch adds several passes that reduce the amount of bitcode making sledge's job easier, more info: https://llvm.org/docs/Passes.html `-mergefunc` This pass merges functions that do the same thing, this can be because of templating or casts (ie. same functionality but on 32bit and 64bit ints, which is the same in machine code). More details at http://llvm.org/docs/MergeFunctions.html Note that this pass is currently not available through C/OCaml API. `-constmerge` This merges constants that have the same value, this is possible to do when the constants are internalized. `-argpromotion` ``` This pass promotes “by reference” arguments to be “by value” arguments. In practice, this means looking for internal functions that have pointer arguments. If it can prove, through the use of alias analysis, that an argument is only loaded, then it can pass the value into the function instead of the address of the value. This can cause recursive simplification of code and lead to the elimination of allocas (especially in C++ template code like the STL). ``` `-ipsccp` ``` Sparse conditional constant propagation and merging, which can be summarized as: Assumes values are constant unless proven otherwise Assumes BasicBlocks are dead unless proven otherwise Proves values to be constant, and replaces them with constants Proves conditional branches to be unconditional ``` `-deadargelim` Removes dead arguments of internal functions, good to run after other inter-procedural passes. Seems to crash llvm if run directly after `ipsccp`. Note that while this might look like doing full link-time optimisation, we are actually picking relatively cheap optimisations that mostly look at globals and walk their use chains. The main reason link-time optimisations are expensive is due to inlining and then running the full optimisation again from there. Reviewed By: jberdine Differential Revision: D15851408 fbshipit-source-id: be7191683	6 years ago
Timotej Kapus	1614f78f6d	[sledge] Add a harness for lionhead fuzzers Summary: This diff introduces a `-lib-fuzz` flag to `buck link`, which links in a simple main that calls the LLVMFuzzerTestOneInput function, which is the entry point of libFuzzer fuzzer. Reviewed By: jberdine, jvillard Differential Revision: D15821512 fbshipit-source-id: cff731ed3	6 years ago
Timotej Kapus	46f5667823	[sledge] Relax call instruction arguments Summary: Previous change to allow bitcasts in call instructions was too strict and did not allow for indirect calls. Reviewed By: jberdine Differential Revision: D15803262 fbshipit-source-id: 40d828b59	6 years ago
Timotej Kapus	551a03c4c9	[sledge] Simplify the printed symbolic heaps Summary: Currently printing symbolic heaps is unreadable, because there are too many quantified variables, that are mostly just equal to other variables. This diff tries to replace all variables in an equivalence class with a single variable and remove the unneccesary variables. It also introduces two modes for printing state domains: `-t +State_domain.pp_full` prints the state domain as is `-t +State_domain.pp` uses the simplification before printing. Reviewed By: jberdine Differential Revision: D15738748 fbshipit-source-id: 7c85b580e	6 years ago
Josh Berdine	cfc1c8be36	[copyright] Remove years Reviewed By: jvillard Differential Revision: D15771884 fbshipit-source-id: e2997e3a3	6 years ago
Timotej Kapus	5a92171b26	[sledge] Print pre/post on function return Summary: Print pre- and post- conditions (aka, summaries) when analyzer hits a function return - plumbing the precondition through the analyzer so that it is available when return is hit Reviewed By: jberdine Differential Revision: D15713725 fbshipit-source-id: b10b6206f	6 years ago
Timotej Kapus	0f61a97feb	[sledge] Add non-failling alloc intrinsic Summary: This diff adds a `__llair_alloc` intrinsic which is modeled as a non-failing malloc. Using it instead of `malloc` increases the readbility of symbolic heaps, because it removes all the cases where malloc failed. Note that `assert(malloc())` does not have the desired effect. Reviewed By: ngorogiannis Differential Revision: D15778817 fbshipit-source-id: d02784077	6 years ago
Timotej Kapus	d2ee43e818	[sledge] Remove --auto-promote from CI builds Reviewed By: jberdine Differential Revision: D15779200 fbshipit-source-id: 5c2ab24b5	6 years ago
Timotej Kapus	ad035a4cc7	[sledge] Fix handling of bitcasts in call instr Summary: Some call instructions in LLVM bitcast the function, for example `%call = call i32 (i64, ...) bitcast (i32 (...)* @__llair_alloc to i32 (i64, ...)*)(i64 %conv)` This would cause sledge to crash in LLVM when build with assertions. Reviewed By: jberdine Differential Revision: D15779003 fbshipit-source-id: c273f92db	6 years ago
Timotej Kapus	8e31b136d0	[sledge] CI install script Reviewed By: jvillard Differential Revision: D15603242 fbshipit-source-id: d6aff4aad	6 years ago
Josh Berdine	12bab4b16b	[sledge] Add formal parameters to functions for return values Summary: This diff adds a formal parameter to each non-void-returning function to name the return value, and similarly a formal parameter for the thrown exception value. These are interpreted as call-by-reference parameters, so that they can be constrained in formulas to e.g. be equal to the return value, and are still in scope when the function returns, and so can be passed to the return block. Prior to summarizing functions, this means that these formals need to be tracked on the analyzer's control stack. This will be needed to express function specs/summaries in terms of formals, and fixes a bug where in some cases return values were not tracked correctly. Reviewed By: kren1 Differential Revision: D15738026 fbshipit-source-id: fff2d107c	6 years ago
Josh Berdine	2440ee69ae	[sledge] Preserve sharing of Func.parent Summary: Previously the locals of a function were computed after backpatching the blocks in its cfg. This resulted in loss of sharing, and incorrect locals if queried through the parent of a block. Reviewed By: kren1 Differential Revision: D15738027 fbshipit-source-id: d7e70530a	6 years ago
Timotej Kapus	2d69e17d51	[sledge] Add CL option to disable exceptions Summary: Disable exceptional control flow - treat throw as unreachable - confidence in the correctness of the frontend's treatment of exception handling is very low, and making summaries that are expressive enough to talk about exceptions is a complication that isn't needed for the first iteration To facilitate, start on a struct that holds all the CL options. Reviewed By: jberdine, jvillard Differential Revision: D15713601 fbshipit-source-id: ee92dfbd8	6 years ago
Josh Berdine	caef28f49e	[sledge] Revise test scripts Summary: This diff adapts the test scripts to the new sledge CLI, and reworks them to enable checking changes with respect to a baseline. In particular, now ``` make -C test ``` has exit code 0 if the current test results match the expected ones, and otherwise prints the diff. Also, ``` make -C test promote ``` promotes the current test results to the new baseline. Reviewed By: kren1 Differential Revision: D15706573 fbshipit-source-id: 0cbf3231e	6 years ago
Josh Berdine	f119154a41	[sledge] Add cxa_default_handlers to models Summary: Include cxa_default_handlers.cpp to bring in definitions for __cxa_terminate_handler and __cxa_unexpected_handler. Reviewed By: kren1 Differential Revision: D15712980 fbshipit-source-id: f536930a8	6 years ago
Josh Berdine	a0949495c1	[sledge] Translate `invoke abort` to `abort` Summary: Sometimes calls to the `abort` C stdlib function appear as `invoke` instructions in LLVM. They should be translated to the LLAIR abort instruction just like the non-raising `call abort` case. Reviewed By: kren1 Differential Revision: D15706574 fbshipit-source-id: 1509ed0e3	6 years ago
Josh Berdine	f3bee3f513	[sledge] Print locations of globals in textual LLAIR Reviewed By: kren1 Differential Revision: D15706577 fbshipit-source-id: 7ed4d37c2	6 years ago
Josh Berdine	d104f5e518	[sledge] Extend Exp.typ to binary and ternary ops Summary: Most binary and ternary operations have the same type as their arguments, so try to compute the type of arguments in these cases. Reviewed By: kren1 Differential Revision: D15706576 fbshipit-source-id: 4749d6e32	6 years ago
Timotej Kapus	65f3b10c99	[sledge] Fix crashing frontned Summary: When LLVM is built with assertions, it crash `add_sym` if you try to get the global scope of a non global value. This patch special cases add_sym, to just do nothing when `llv` is an `UndefValue`. Also enhances debuging printout of transalte to include the number of functions and globals. Reviewed By: jvillard Differential Revision: D15669447 fbshipit-source-id: 4b5483810	6 years ago
Timotej Kapus	9ef992394c	[sledge] Put all the entry points in the config Summary: The entry point functions are used in a couple of places, this puts them in a single source of truth in the config file. Reviewed By: jvillard Differential Revision: D15651976 fbshipit-source-id: a572e8d4d	6 years ago
Timotej Kapus	b9ba97a2fd	[sledge] Add globalopt pass to remove globals Summary: This adds a globalopt optimization pass to sledge. Consider code like: ``` const char a_string = "I'm a string"; int an_int = 0; int c() { return an_int; } int main() { char c1 = a_string; return c(); } ``` When compiled there are 2 levels of indirection. For example `return an_int` Get's compiled as ``` %0 = load i32, i32* an_int1 ret i32 %0 ``` Global opt reduces this (if `an_int` is internal) to just ` ret i32 0`. Similarly and more importantly `c1 = a_string;` get's compiled into ``` @.str = private unnamed_addr constant [13 x i8] c"I'm a string\00" a_string = dso_local global i8* getelementptr inbounds ([13 x i8], [13 x i8]* @.str, i32 0, i32 0) %c1 = alloca i8, align 8 %0 = load i8, i8** a_string, align 8, !dbg !25 store i8* %0, i8** %c1, align 8, !dbg !24 ``` So there is a level of indirection between `c1` and `.str` where the string is stored. With global opt, this gets reduced to: ``` @.str = private unnamed_addr constant [13 x i8] c"I'm a string\00" %c1 = alloca i8, align 8 store i8 getelementptr inbounds ([13 x i8], [13 x i8]* @.str, i64 0, i64 0), i8** %c1, align 8, !dbg !23 ``` and `a_string` variable gets deleted. On sledge this has the effect of reducing the complexity of the symbolic heap significantly. Without this optimisation, running `sledge.dbg llvm analyze -trace Domain.call global_vars.bc` Gives prints the following segments: ``` ∧ %.str -[)-> ⟨13,{}⟩ * %a_string -[)-> ⟨8,%.str⟩ * %an_int -[)-> ⟨4,0⟩ * %c1 -[)-> ⟨8,%.str⟩ * %retval -[)-> ⟨4,0⟩ ``` So there are `an_int` and `a_string` segments, which are redundant. with the optimisation, the heap looks like: `∧ %.str -[)-> ⟨13,{}⟩ * %c1 -[)-> ⟨8,%.str⟩ * %retval -[)-> ⟨4,0⟩`, Where we only have the `.str` segment and the `c1` segment, which are the two we need. Reviewed By: ngorogiannis Differential Revision: D15649195 fbshipit-source-id: 5f71e56e8	6 years ago
Josh Berdine	4ea2cf9814	[sledge] Improve uncaught exceptions Summary: Do not implicitly open `Trace`, which shadows `Import.fail`, and degrades uncaught exceptions. Opening `Trace` was a mistake. Reviewed By: kren1 Differential Revision: D15653730 fbshipit-source-id: d65277af5	6 years ago
Josh Berdine	6a2da2acc4	[sledge] Rework command line interface Summary: Change command line interface to include buck and llvm integration as separate subcommands. Reviewed By: kren1 Differential Revision: D15614567 fbshipit-source-id: b7618571b	6 years ago
Josh Berdine	9c277e9732	[sledge] Simplify Llair.pp Summary: NFC refactor Reviewed By: kren1 Differential Revision: D15614568 fbshipit-source-id: 0abaa9afd	6 years ago
Timotej Kapus	c8b063fb50	[sledge] Fix ~predicate label Reviewed By: jberdine Differential Revision: D15603866 fbshipit-source-id: cfa5f771b	6 years ago
Timotej Kapus	cdd444b901	[sledge] Update internalize to handle other mains Summary: Use the new LLVM bindings to handle internalization. We would like to use global dead code elimination (gdce) to remove all the code not reachable from an entry point. However in normal compilation most functions aren't globally dead (they can be linked to). At the point of running sledge we won't be linking anymore, therefore we can internalize (make invisible outside of the compilation unit) all the symbols. In that case the whole program is dead with respect to gdce, therefore we need to preserve the entry point as external. Previously we could only preserve `main` as the entry point (through a boolean flag). This patch uses a newer API that lets us preserve functions that satisfy a given predicate function. This enables us to have arbitrary entry points (not just main). Currently only the 3 entry points from `src/control.ml` are used, but this patch makes it easy to change. Reviewed By: ngorogiannis Differential Revision: D15561461 fbshipit-source-id: 88e054411	6 years ago
Timotej Kapus	e45a05a574	[sledge] fix LLVM assertion failure in xlate_global Summary: LLVM.global_initializer casues a cast assertion failure if the value passed to it is not a GlobalVariable. So we first check if it is a GlobalVariable and only then ask for an initiliazer. This is hidden if LLVM is built without assertions. Reviewed By: jberdine Differential Revision: D15601632 fbshipit-source-id: e9db23a12	6 years ago
Timotej Kapus	881a4d10af	[sledge] Fix bound not bounding recursion Summary: Sledge does not terminate on programs with recursion, because functions get "infinitely inlined" and therefore recursion is not treated as retreating edge. This patch bounds the number of times the same function can "inlined" to respect the bound (`-b` option). On each call we check the number of occurances of the called function in the call stack. If that is higher than the bound, we skip it. Reviewed By: jvillard Differential Revision: D15577134 fbshipit-source-id: 4cd3b62c6	6 years ago
Josh Berdine	1e7b13bdcd	[sledge] Add printers for some LLVM enums Reviewed By: kren1 Differential Revision: D15577754 fbshipit-source-id: 8a8f39a87	6 years ago
Josh Berdine	babe25fd29	[sledge] Fix translation of global initializers Summary: This diff changes the translation of global variables to translate the initializer whenever it exists in LLVM, rather than relying on linkage. Previously code such as ``` char mutable_string = "hahaha"; ``` would lead to LLVM code ``` nutritious_string = global i8 getelementptr inbounds ([7 x i8], [7 x i8]* @.str, i32 0, i32 0), align 8, !dbg !0 ; [#uses=2 type=i8**] ``` in which `mutable_string` had `External` linkage according to `Llvm.linkage` even though it has an initializer. This could cause sledge to drop the initializer. Reviewed By: kren1 Differential Revision: D15577755 fbshipit-source-id: 50aa06c5e	6 years ago
Josh Berdine	00c5e1b9fe	[sledge] Fix size in translation of global variables Summary: Global variables have pointer type. The size needed by the backend is of the element type, not of the pointer itself. This diff corrects this. Reviewed By: kren1 Differential Revision: D15577756 fbshipit-source-id: 948ecf3cd	6 years ago
Josh Berdine	62a3187f5d	[sledge] Don't call Llvm.dispose_context as it leads to crashes in GC Summary: The root cause is not clear, but it seems that not calling Llvm.dispose_context avoids segfaults in the GC. Reviewed By: kren1 Differential Revision: D15535434 fbshipit-source-id: 280e44d0b	6 years ago
Josh Berdine	14a15931f7	[sledge] Combine name and loc tables into one Summary: The name and loc tables are added-to almost exactly in sync, so combine them to amortize the overhead. Reviewed By: kren1 Differential Revision: D15535435 fbshipit-source-id: 801da75bb	6 years ago
Josh Berdine	ccd2a92ba5	[sledge] Avoid Format in non-debug code Summary: Format is slow. Especially Format.sprintf, which has to allocate and initialize a buffer every time. Reviewed By: kren1 Differential Revision: D15535437 fbshipit-source-id: ea43f44e1	6 years ago
Josh Berdine	d5c2468007	[sledge] Combine scan_locs and scan_names into a single pass Summary: No need to traverse the entire IR twice in the same way. Reviewed By: kren1 Differential Revision: D15535436 fbshipit-source-id: 08b988e0a	6 years ago
Josh Berdine	4d5970f693	[sledge] Only call Llvm_analysis.verify_module in debug mode Summary: This isn't free and is expected to hold of bitcode produced by clang/llvm. There are tests that fail verification, so keep it in debug mode. Reviewed By: kren1 Differential Revision: D15535438 fbshipit-source-id: 9390a8363	6 years ago
Josh Berdine	da097679bd	[sledge] Fix crash when trying to warn Reviewed By: mbouaziz Differential Revision: D15518828 fbshipit-source-id: 069ff4e9c	6 years ago
Josh Berdine	611fb57d3a	[sledge] Treat .bc or .ll input files as pre-linked bitcode Summary: If the input file has a .bc or .ll suffix, treat it as a pre-linked bitcode file. Otherwise, treat it as before, as a file containing a list of bitcode files to be linked. Also, perform global dead-code elimination only when linking multiple files. Reviewed By: kren1 Differential Revision: D15513345 fbshipit-source-id: 4c80ff9c3	6 years ago
Josh Berdine	7ac04fa46a	[sledge] Optimize finding functions by name Summary: Replace the naive linear scan with a map lookup. Reviewed By: ngorogiannis Differential Revision: D15512746 fbshipit-source-id: d103ffdc7	6 years ago
Timotej Kapus	d37374dd8c	[sledge] change input format Summary: Change the sledge input format from a bitcode file to a newline separated list of paths to LLVM bitcode files. Reviewed By: jberdine Differential Revision: D15470082 fbshipit-source-id: 8860f947c	6 years ago
Josh Berdine	139a3d3e00	[sledge] Avoid calling Llvm.string_of_llvalue on instructions Summary: Llvm.string_of_llvalue, which just calls llvm::Value::print, is extremely slow when called on instructions or functions. In these cases, it initializes metadata slots for everything in the parent module of the instruction or function being printed, on every call. This is ridiculously slow, don't do it. Reviewed By: kren1 Differential Revision: D15511376 fbshipit-source-id: 658eeccab	6 years ago
Josh Berdine	a3e7107969	[sledge] Optimize variable renaming in symbolic heaps Summary: Add shortcut code paths to return early in some cases guaranteed to be the identity function. Reviewed By: ngorogiannis Differential Revision: D15468704 fbshipit-source-id: f137049c6	6 years ago
Josh Berdine	e391a8a9b2	[sledge] Simplify Equality.map_exps Summary: Remove left-over complexity from previous versions. Reviewed By: ngorogiannis Differential Revision: D15468705 fbshipit-source-id: 316cda51b	6 years ago
Josh Berdine	c4707621ea	[sledge] Make execution bound part of the work queue Summary: No need for it to be global mutable state Reviewed By: ngorogiannis Differential Revision: D15468706 fbshipit-source-id: 840fa8c83	6 years ago
Josh Berdine	dda922b6ad	[sledge] Add command line option for execution bound Summary: Instead of a compile-time constant. Reviewed By: ngorogiannis Differential Revision: D15468707 fbshipit-source-id: 0a2668a18	6 years ago
Josh Berdine	3a87a0e2f3	[sledge] Unignore model/cxxabi.bc Summary: It is no longer generated outside the _build dir. Reviewed By: kren1 Differential Revision: D15432940 fbshipit-source-id: 2d51e49ff	6 years ago

... 3 4 5 6 7 ...

611 Commits (53822697f9392323fa4c4e09a5b853066d96ad58)