infer_clone

Commit Graph

Author	SHA1	Message	Date
Josh Berdine	799b21761f	[sledge] Translate ExtractElement and InsertElement despite being vector Summary: Some code that is otherwise benignly scalar still uses the ExtractElement and InsertElement vector operations, so translate them as if they were array operations. Reviewed By: ngorogiannis Differential Revision: D17801949 fbshipit-source-id: 89f3666bd	5 years ago
Scott Owens	3080fba8fa	[sledge sem] Update LLVM and LLAIR sem for consistent stuckness Summary: Previously, the LLVM semantics could be stuck where the LLAIR semantics was not yet stuck, but would become stuck (at the same place) after taking a step. This was due to LLVM using the traditional definition of stuck states: any state from which there are no transitions. However, LLAIR cannot do that because it might get stuck in the middle of a block that contains several visible stores. We don't want to consider the whole block stuck, nor can we finish it. Thus, the LLAIR definition of stuckness is when the state has the stuck flag set which happens when stopping in the middle of a block after encountering a stuck instruction. Now LLVM takes the same approach. Reviewed By: jberdine Differential Revision: D17855085 fbshipit-source-id: a094d25d5	5 years ago
Scott Owens	14a8ae34b9	[sledge sem] Improve and unify treatment of Exit Summary: Add an argument to the Exit instruction. Update the LLVM semantics to execute the Exit instruction and store the result in an "exited" component of the state. (Previously it just noticed that it was stuck about to do an Exit.) With exiting treated uniformly, now in the proof that for every LLVM trace, there is a llair trace that simulates it, all of the cheats except for 1 are just cases that I haven't got to yet. However, the last cheat is for the situation where the LLVM program gets stuck and the llair program doesn't. For example, the following two line LLVM program gets stuck because r2 is not assigned (ignoring for the moment the static restriction that LLVM is in SSA form). r1 := r2 Exit(0) The compilation to llair omits the assignment and so we get a llair program that doesn't get stuck: Exit(0) The key question is whether the static restrictions are sufficient to ensure that no expression that might be omitted can get stuck. Reviewed By: jberdine Differential Revision: D17737589 fbshipit-source-id: bc6c01a1b	5 years ago
Scott Owens	5312b3d10c	[sledge sem] Fix trans. invariant for llair expressions Summary: If the LLVM to llair translation keeps a mapping from register r to expression e, then for each register r' mentioned in e, there must be an assignment to r' that dominates the entire live range of r. Thus, where ever r might be replaced by e, the value of r' will be the same as it was when the initial assignment to r occurred. Maintaining this invariant relies on the LLVM being in SSA form. Reviewed By: jberdine Differential Revision: D17710288 fbshipit-source-id: fd3eaa57d	5 years ago
Scott Owens	9f2f14b34c	[sledge sem] Sketch out translation correctness Summary: This is work in progress; many of the cheats aren't true. In particular, the definition of stuck/complete/partial traces in LLVM and llair don't quite match up and need some modification. Also, the state relation isn't strong enough; it will need to include information about registers used in the expressions of the LLVM register to llair expression mapping. But the overall shape of the proof is ok and so it can be used to poke at various local aspects of the translation, such as individual instructions. Reviewed By: jberdine Differential Revision: D17631604 fbshipit-source-id: 743b5d64d	5 years ago
Jules Villard	42470d8809	[hmm] sexp_{option,list} -> {option,list} Summary: By some unfortunate logic, OCaml often decides to use `sexp_list`/`sexp_option` instead of just `list`/`option`. Sometimes these get copy/pasted in interface files. It would be good to tell OCaml not to do that in the first place but in the meantime: this diff. Reviewed By: ngorogiannis Differential Revision: D17907938 fbshipit-source-id: 7546834a2	5 years ago
Josh Berdine	ef78ba83cf	[sledge] Report the number of alarms Summary: For test scripting purposes, when the analysis finishes successfully, report the number of alarms. Reviewed By: ngorogiannis Differential Revision: D17801947 fbshipit-source-id: 1660866df	5 years ago
Josh Berdine	ec52c05c30	[sledge][NFC] Minor simplification for singleton sets Reviewed By: ngorogiannis Differential Revision: D17801948 fbshipit-source-id: 86d2e6ec9	5 years ago
Josh Berdine	239d906ab6	[sledge] Improve tracing and debugging support Reviewed By: ngorogiannis Differential Revision: D17801930 fbshipit-source-id: 8cfac2eaf	5 years ago
Josh Berdine	3f5adecdcf	[sledge] Exec.exec_specs missed vocabulary extension Summary: In a spec, it currently may be that foot.us does not contain xs. So exec_specs needs to extend the vocabulary of foot before existentially quantifying out xs. Reviewed By: ngorogiannis Differential Revision: D17801933 fbshipit-source-id: 7b4b9262a	5 years ago
Josh Berdine	9ac854c970	[sledge] Exec.kill should preserve vocabulary Reviewed By: ngorogiannis Differential Revision: D17801935 fbshipit-source-id: 81fe4b067	5 years ago
Josh Berdine	8097f1a6df	[sledge] Adjust tests to match harnesses Reviewed By: ngorogiannis Differential Revision: D17801945 fbshipit-source-id: 0f984e013	5 years ago
Josh Berdine	b2f90a3994	[sledge] Treat freturn directly in Dom.call Summary: Previously it was added to the locals before calling Dom.call, but this results in the scope of freturn ending too early. Reviewed By: ngorogiannis Differential Revision: D17801939 fbshipit-source-id: 739ec8981	5 years ago
Josh Berdine	fbf0fe2f1a	[sledge][NFC] Rename args to actuals Summary: For consistency Reviewed By: ngorogiannis Differential Revision: D17801953 fbshipit-source-id: a797d2446	5 years ago
Josh Berdine	d3d0c4b36e	[sledge][NFC] Rename params to formals Summary: For consistency Reviewed By: ngorogiannis Differential Revision: D17801926 fbshipit-source-id: 012b13561	5 years ago
Josh Berdine	69c29ab3d8	[sledge][NFC] Label args of Domain.call Summary: Just for legibility. Reviewed By: ngorogiannis Differential Revision: D17801937 fbshipit-source-id: ee1bd95d2	5 years ago
Josh Berdine	47766a0e6e	[sledge] Drop globals with appending linkage and size 0 Summary: Some globals have 'appending' linkage, where linking modules results in appending the arrays from each module. These can appear even when empty, leading to useless and somewhat troublesome 0-length arrays. So drop them. Reviewed By: ngorogiannis Differential Revision: D17801927 fbshipit-source-id: d2dc180d7	5 years ago
Josh Berdine	1efd0df035	[sledge] Avoid potential name clash between trampolines Summary: Trampoline blocks introduced when eliminating SSA could clash. Reviewed By: ngorogiannis Differential Revision: D17801936 fbshipit-source-id: c1fdf2fc6	5 years ago
Josh Berdine	ebee451f1c	[sledge] Improve test scripts Summary: Better failure messages and reports Reviewed By: ngorogiannis Differential Revision: D17801940 fbshipit-source-id: db3d13eaf	5 years ago
Josh Berdine	38cab376f6	[sledge] Keep BitCasts and similar in expressions Summary: While BitCasts are the identity function on the bitwise representation, they are not necessarily so in the semantics or the logical representation. So be more conservative about eliding them in the Exp language. Those that are actually semantic identities are still omitted in the Term language. Reviewed By: ngorogiannis Differential Revision: D17801950 fbshipit-source-id: bf9ae57b5	5 years ago
Josh Berdine	b632d4f283	[sledge] Check the input datalayout agrees with assumptions Summary: The analyzer (currently) hard-codes some assumptions about sizes of basic types such as Typ.bool, Typ.siz, etc. Check that these assumptions are satisfied by the input llvm datalayout, and give reasonable error messages otherwise. Reviewed By: ngorogiannis Differential Revision: D17801941 fbshipit-source-id: 4fe484ee0	5 years ago
Josh Berdine	6328a6ce40	[sledge] Do not store size of globals separately Summary: Now that expression types and type sizes can be computed, it is not necessary to store the sizes of globals separately. Reviewed By: ngorogiannis Differential Revision: D17801932 fbshipit-source-id: f746e506b	5 years ago
Josh Berdine	ca95fc098f	[sledge] Keep size in both bits and bytes for each type Summary: - The `Llvm_target.DataLayout.size_in_bits` needs to be used for checking casts e.g. it is ok to `bitcast <16 x i1> to i16`: they both have 16 bits, but they have sizes 16 vs 2 bytes - The `Llvm_target.DataLayout.abi_size` needs to be used for the size of memory blocks containing values e.g. for the size of memory segments containing the initial values of globals - The example above shows that we can't compute the byte size from the bit size without knowing the target specific datalayout - So we need both in each sized type - Also add checks that Convert exps and terms are not no-ops - Simplifications of size manipulating code Reviewed By: ngorogiannis Differential Revision: D17801928 fbshipit-source-id: 8c8ce6128	5 years ago
Josh Berdine	d3bad1ce44	[sledge] Add sizes to types Summary: In order to type-check casts, it is necessary to have the size of each sized type. This size information is also useful in a few other places. Reviewed By: bennostein Differential Revision: D17801931 fbshipit-source-id: f8ef53276	5 years ago
Josh Berdine	6120b7d098	[sledge] Use the configured margin when formatting failure messages Reviewed By: bennostein Differential Revision: D17801934 fbshipit-source-id: af7acec9b	5 years ago
Josh Berdine	a386b36616	[sledge] Re-add Splat expression for zero-initialized aggregates Summary: This is needed since expressions distinguish between the integer or pointer zero value and zero-initialized array/tuple/struct aggregates based on type, and the backend distinguishes them semantically. Reviewed By: bennostein Differential Revision: D17801938 fbshipit-source-id: ac8665e65	5 years ago
Josh Berdine	727385d853	[sledge] Relax Typ.is_sized to allow opaque types Summary: Linking can lead to opaque types becoming identified with a known types. Assertions in various places that types should be sized can be triggered by such opaque types. Until there is a distinction between processing fully-linked versus incomplete code, these checks need to be relaxed to permit opaque types where sized ones are expected. Reviewed By: bennostein Differential Revision: D17801929 fbshipit-source-id: c5e62f7c8	5 years ago
Josh Berdine	f804220cd2	[sledge] Revise order of Term constructors for polynomial normalization Summary: Integer terms need to compare higher than any monomial. Reviewed By: bennostein Differential Revision: D17725607 fbshipit-source-id: c64fd52d5	5 years ago
Josh Berdine	1ef390ffca	[sledge] Relax Exp type-checking to be modulo-casting Summary: Also weaken definition of Typ.castable to permit casting between floats and ints of the same size. Reviewed By: bennostein Differential Revision: D17725611 fbshipit-source-id: 5e8114e26	5 years ago
Josh Berdine	fb184a6a1d	[sledge] Introduce the notion of types having the same semantics Summary: Typ.equivalent is currently defined the same as Typ.castable, but conceptually they are different and castable needs to be weakened. They are different since for example it is possible to cast from an i64 to a f64, but those types denote different sets of values in the semantics, and the bitcast is modeled using a conversion function. Reviewed By: bennostein Differential Revision: D17725615 fbshipit-source-id: 973574f2a	5 years ago
Josh Berdine	917cc62e28	[sledge] Fix type of functions called using a cast Summary: For function calls where the callee is a cast expression, previous the wrong type would be used for the callee. This could lead to crashes in llvm, or asserting in sledge. Reviewed By: bennostein Differential Revision: D17725610 fbshipit-source-id: 938b49a49	5 years ago
Josh Berdine	ce3252c348	[sledge] Allow global variables as function names Summary: Some called functions are represented in llvm as a global variable with e.g. external linkage, and so they do not appear as 'functions'. It is still valid to call such functions, though the analyzer does not know their definitions. Reviewed By: bennostein Differential Revision: D17725609 fbshipit-source-id: 333d19c0d	5 years ago
Josh Berdine	785928c77e	[sledge] Error reporting improvements Summary: Improve Trace.fail to log the error and raise informative exceptions. Eliminate the confusion between Import.fail and Trace.fail by removing Import.fail. Reviewed By: bennostein Differential Revision: D17725608 fbshipit-source-id: 79fdfbd86	5 years ago
Josh Berdine	ffeef16aae	[sledge] Add a flag to disable internalization Summary: By default all functions except those specified as entry points in the config file are "internalized". Internal functions are removed if they are not called. It is sometimes necessary to disable internalization, e.g. to analyze the llvm tests. Reviewed By: bennostein Differential Revision: D17725614 fbshipit-source-id: 4b13501f5	5 years ago
Josh Berdine	6ca09b14fd	[sledge] Add flag to disable linking in the models Summary: Sometimes the models for the C/C++ runtime and standard libraries are not needed. Furthermore, sometimes, e.g. when analyzing llvm tests, trying to link them fails. Reviewed By: bennostein Differential Revision: D17725616 fbshipit-source-id: 76a4bcf90	5 years ago
Josh Berdine	f699c9b9a8	[sledge] Simplify ¬¬e to e Reviewed By: bennostein Differential Revision: D17725617 fbshipit-source-id: 7467fad3e	5 years ago
Josh Berdine	06f2863dd8	[sledge] Simplify `e xor e` to `0` Reviewed By: bennostein Differential Revision: D17665226 fbshipit-source-id: 655ddf6a8	5 years ago
Josh Berdine	6f84787b19	[sledge] Change exec_inst to return an option instead of a result Summary: The `(t, unit) result` type is no more informative than `t option` and less convenient. Reviewed By: bennostein Differential Revision: D17665244 fbshipit-source-id: fa969d8b7	5 years ago
Josh Berdine	2840eb4781	[sledge] Refactor dispatch on instruction from Exec to Sh_domain Summary: This puts the mediation between Exp and Term together in Sh_domain rather than being spread across the two. Reviewed By: bennostein Differential Revision: D17665235 fbshipit-source-id: edf277d45	5 years ago
Josh Berdine	c6d7886fd8	[sledge] Make type of exec_move consistent with move instruction Summary: The move instruction takes a vector of assignments to perform in parallel, so generalize exec_move from one to a vector. Reviewed By: bennostein Differential Revision: D17665248 fbshipit-source-id: 52aae5ff9	5 years ago
Josh Berdine	162f027249	[sledge] Make type argument of Exp constructors optional where computable Reviewed By: bennostein Differential Revision: D17665251 fbshipit-source-id: 4d8bccfe8	5 years ago
Josh Berdine	ad5d5dd89e	[sledge] Add Exp.true_ and Exp.false_ Summary: Convenience wrappers for Exp.integer. Reviewed By: bennostein Differential Revision: D17665234 fbshipit-source-id: 0cf440861	5 years ago
Josh Berdine	37d1904bd3	[sledge] Move check for whether a variable is global from Reg to Var Summary: Extend the encoding using `id` from 0 indicating a program variable to also -1 indicating a global program variable. Reviewed By: bennostein Differential Revision: D17665229 fbshipit-source-id: 848b8a31e	5 years ago
Josh Berdine	3003a8e646	[sledge] NFC minor cleanups Reviewed By: jvillard Differential Revision: D17665255 fbshipit-source-id: 0f18e5777	5 years ago
Josh Berdine	8ee0c67d1f	[sledge] Precompute the Term form of each Exp, and add it to Exp.t Reviewed By: bennostein Differential Revision: D17665261 fbshipit-source-id: 25f2e656f	5 years ago
Josh Berdine	9ddfae4e89	[sledge] Change Term.rename to preserve sharing in cyclic records Reviewed By: bennostein Differential Revision: D17665265 fbshipit-source-id: 50844096a	5 years ago
Josh Berdine	7ecd091ff3	[sledge] Change Struct_rec to a generic n-ary recursive application Reviewed By: bennostein Differential Revision: D17665266 fbshipit-source-id: dd938ac31	5 years ago
Josh Berdine	356b4f0b4e	[sledge] Uncurry Record term constructor Reviewed By: bennostein Differential Revision: D17665260 fbshipit-source-id: 080f47739	5 years ago
Josh Berdine	99b60d191a	[sledge] Fix sorting of heap block subformulas when printing Summary: The sorting of heap blocks when printing formulas was broken by the change to the direct representation of polynomials. Reviewed By: bennostein Differential Revision: D17665246 fbshipit-source-id: 4ebea9f20	5 years ago
Josh Berdine	1228c8e31b	[sledge] Uncurry Update term constructor, and specialize index to int Reviewed By: bennostein Differential Revision: D17665245 fbshipit-source-id: d4716a220	5 years ago
Josh Berdine	09daac754c	[sledge] Uncurry Select term constructor, and specialize index to int Reviewed By: ngorogiannis Differential Revision: D17665264 fbshipit-source-id: c716a3eeb	5 years ago
Josh Berdine	5eaae07043	[sledge] Change Concat term contructor to a generic n-ary application Reviewed By: ngorogiannis Differential Revision: D17665238 fbshipit-source-id: 713b333e8	5 years ago
Josh Berdine	6cd82475f1	[sledge] Use generic binary application for Splat and Memory term constructors Reviewed By: bennostein Differential Revision: D17665256 fbshipit-source-id: 9c08338de	5 years ago
Josh Berdine	6805da9557	[sledge] Uncurry ternary term constructors Reviewed By: bennostein Differential Revision: D17665227 fbshipit-source-id: 56240d374	5 years ago
Josh Berdine	167e489e24	[sledge] Uncurry binary term constructors Reviewed By: bennostein Differential Revision: D17665243 fbshipit-source-id: 2d68e40b5	5 years ago
Josh Berdine	8b9d4ba066	[sledge] Uncurry unary term constructors Reviewed By: bennostein Differential Revision: D17665258 fbshipit-source-id: 456f7c58d	5 years ago
Josh Berdine	e87a0533be	[sledge] Minor simplification of polynomial representation Reviewed By: bennostein Differential Revision: D17665237 fbshipit-source-id: f9a082d26	5 years ago
Josh Berdine	3bbb05216f	[sledge] Remove the redundancy of both < and >= terms Summary: It is not necessary to have both < and >=, and similarly for <= and >. Reviewed By: bennostein Differential Revision: D17665232 fbshipit-source-id: 01b3511f5	5 years ago
Josh Berdine	a3506f995c	[sledge] Simplify arithmetic terms due to not needing type Summary: Now that terms operate over unbounded, signed, integers rather than bounded integers, and Boolean operations are treated uniformly with bitwise operations, it is not necessary to propagate types throughout arithmetic term manipulation. Reviewed By: bennostein Differential Revision: D17665257 fbshipit-source-id: 5236b101d	5 years ago
Josh Berdine	471d296266	[sledge] Fix check for range of representable integers Summary: Z.numbits ignores the sign, which allows 2^(N - 1) as representable within N bits, while it is not. So check explicitly. Reviewed By: bennostein Differential Revision: D17665231 fbshipit-source-id: 0d3940517	5 years ago
Josh Berdine	c440c4fc28	[sledge] Remove unsigned Term operations except Extract Summary: Instead of having separate signed and unsigned operations, use the signed operations applied to explicit conversion of the arguments using an unsigned integer interpretation. Reviewed By: bennostein Differential Revision: D17665267 fbshipit-source-id: 0b3271e71	5 years ago
Josh Berdine	e84f3fcf0f	[sledge] Add Extract term Summary: Add an Extract term form to interpret an integer with given signedness and bitwidth. Reviewed By: bennostein Differential Revision: D17665263 fbshipit-source-id: 1d8917f3c	5 years ago
Josh Berdine	5753f9b26a	[sledge] Rename clamp to extract Reviewed By: bennostein Differential Revision: D17665239 fbshipit-source-id: bab1175e1	5 years ago
Josh Berdine	d7ef03cf02	[sledge] Revise and fix unsigned conversions Summary: Be more explicit about semantics of unsigned vs. signed conversions, and fix a few related corner cases. Reviewed By: bennostein Differential Revision: D17665268 fbshipit-source-id: 67fecdf34	5 years ago
Josh Berdine	7f2165484b	[sledge] Do not special case boolean vs bitwise operations Summary: With terms using unbounded two's complement arithmetic, it is not necessary to special-case 1-bit integers as Booleans. Reviewed By: ngorogiannis Differential Revision: D17665228 fbshipit-source-id: a2f280fc3	5 years ago
Josh Berdine	8abfcfb504	[sledge] Simplify normalization of shift operations Summary: Remove the guards that prevent normalizing in some cases where the corresponding instruction in LLVM would produce a poison value. Usefully tracking poison values will be more involved. Reviewed By: ngorogiannis Differential Revision: D17665230 fbshipit-source-id: 59fb25042	5 years ago
Josh Berdine	e3f0ba8c54	[sledge] Revise program expressions Summary: Revise program expressions based on the changed constraints now that Term is separate from Exp. In particular: - Add types to all application, indicating how the operation interprets its arguments - Change to a simpler uncurried form - Remove now-unneeded normalizations Reviewed By: bennostein Differential Revision: D17665236 fbshipit-source-id: 1bcf2efd6	5 years ago
Josh Berdine	00639e15bb	[sledge] Delay normalization of xor to equality Summary: Boolean and bitwise negation of `e` is represented using `-1 xor e`. Since Equality can only maintain and propagate equality constraints, Boolean negation `-1 xor b` is normalized to `b = false`. This diff delays this normalization from being part of expression construction to part of symbolic heap formula construction. This makes the normalization done as part of expression construction independent of the distinction between bitwise and boolean operations. Reviewed By: bennostein Differential Revision: D17665254 fbshipit-source-id: 0a0722865	5 years ago
Josh Berdine	0e4110fc5c	[sledge] Normalize xor and equality based on type instead of bitwidth Reviewed By: bennostein Differential Revision: D17665233 fbshipit-source-id: dc2821943	5 years ago
Josh Berdine	0903355a0e	[sledge] Remove unused Exp constructors for memory exps Summary: Splat, Memory, and Concat expressions are never used. Only the term forms are needed. Reviewed By: bennostein Differential Revision: D17665259 fbshipit-source-id: cbfd7650d	5 years ago
Josh Berdine	3b03022b5e	[sledge] Remove redundant Reg.id Summary: It is always 0. Reviewed By: bennostein Differential Revision: D17665247 fbshipit-source-id: c146c9dc8	5 years ago
Josh Berdine	310d00f380	[sledge] Remove dead code in Exp and Term Reviewed By: bennostein Differential Revision: D17665249 fbshipit-source-id: c242634f1	5 years ago
Josh Berdine	442c8e92f4	[sledge] Distinguish program expressions and formula terms Summary: There are a number if issues with using the same type for expressions in code and in formulas. One is that the type systems of the two should be different. Another is that conflating the two compromises the ability of Llair to correctly express aspects such as integer overflow, floating point rounding, etc. Also, it could be beneficial to have more source locations for program expressions than makes sense for terms. This diff simply unshares Exp, leading to a copy named Term. Likewise, Reg is now a copy of Var. Simplifications to come. Reviewed By: bennostein Differential Revision: D17665250 fbshipit-source-id: 4359a80d5	5 years ago
Josh Berdine	13c06e4dd3	[sledge] Move generation of formal return and throw parameters to frontend Summary: The generation of names for the function formal return and throw parameters is not central to LLAIR, but a detail of the frontend, since they are generated only because LLVM does not already have such names. Reviewed By: ngorogiannis Differential Revision: D17665240 fbshipit-source-id: 684cbae92	5 years ago
Josh Berdine	0c04ecc9aa	[sledge] Change Llair representation of functions to a String map Summary: Using a type of keys richer than strings, which are the unique symbol names at the C/LLVM level, is unnecessary. Reviewed By: ngorogiannis Differential Revision: D17665262 fbshipit-source-id: 6b8c31146	5 years ago
Josh Berdine	6aaeaba104	[sledge] Move ops on signed 1-bit Z integers to import Summary: The convenience wrappers for operations on signed 1-bit integers represented by Z.t are not specific to Exp. Reviewed By: ngorogiannis Differential Revision: D17665252 fbshipit-source-id: d4b58e2a6	5 years ago
Josh Berdine	ed733f0247	[sledge] Add missing import of trace into symbheap Reviewed By: ngorogiannis Differential Revision: D17665241 fbshipit-source-id: 6f70e2925	5 years ago
Josh Berdine	1fdc76d163	[sledge] Rename State_domain back to Sh_domain Summary: Now that the relation domain construction is factored out and generalized. Reviewed By: ngorogiannis Differential Revision: D17665253 fbshipit-source-id: eb156ce6b	5 years ago
Josh Berdine	c6b8b4688b	[sledge] Move llvm build and install dirs out of llvm source tree Summary: Since version 2, none of the `opam pin` modes work reasonably well for the standard llvm build procedure. As a workaround to prevent opam from making several copies of the build directory when pinning, adjust to move the llvm build and install directories out of the llvm source tree. Reviewed By: bennostein Differential Revision: D17665242 fbshipit-source-id: ac84a4b0b	5 years ago
Scott Owens	5b7931e71a	[sledge sem] Add a rudimentary theory of SSA Summary: Since the correcteness of the mapping from LLVM to llair depends on LLVM being SSA, we need to formalise what that means. We also prove that the domination relation is a strict partial order, which will probably be helpful when reasoning about the translation. Reviewed By: jberdine Differential Revision: D17631456 fbshipit-source-id: a00eb3f87	5 years ago
Scott Owens	71aa4816d6	[sledge sem] Fix the semantics and trans. of If Summary: The LLVM semantics and translation was not consistently treating the 1-bit word value condition as signed or unsigned. Reviewed By: jberdine Differential Revision: D17605766 fbshipit-source-id: 77edf63b7	5 years ago
Scott Owens	ab7233c5b8	[sledge sem] Refactor the way LLVM sem. does phis Summary: Previously the LLVM semantics did the phi instructions at the head of a block as part of executing the branch into that block. This looked a bit weird, but had the advantage that the semantics knew which block was being jumped from, which is necessary to run the phi instructions. However, it meant that the rules for doing phi instructions would need to show up with each branching construct. It was also annoying for the LLVM->llair proof, since the phis are removed and their effect happens as a distinct step from the branch. Here we add a distinct Phi_ip instruction pointer to indicate that the phi instructions at the start of the block should execute next, and then be incremented to the usual numeric instruction pointer that points to the non-phi instructions. The Phi_ip contains the identity of the previous block. Reviewed By: jberdine Differential Revision: D17452416 fbshipit-source-id: 78fef7cca	5 years ago
Scott Owens	17b3c7a49f	[sledge sem] Add top-level llair semantics Summary: Give the llair semantics observable side effects (writes to global variables) and a semantic function mirroring the LLVM semantics. Start sketching out the LLVM/llair translation equivalence proof in a top-down way from the obvious statement of equality of the semantics. Reviewed By: jberdine Differential Revision: D17399654 fbshipit-source-id: 2170678a8	5 years ago
Scott Owens	30c301a3e8	[sledge sem] Add a more llair-like LLVM semantics Summary: The simple LLVM semantics steps one instruction at a time, but the generated llair does whole blocks at a time, since many individual LLVM instructions can become a single llair expression. We add a bigger-step LLVM semantics that does whole blocks at a time (except that it also stops at function calls, since those end blocks in llair). The steps in this bigger-step semantics should be at the same granularity as the llair steps, making it easier to verify the translation. We add a notion of observation to the LLVM semantics (right now, just global variable writes) and use that to define two top-level semantic functions, which we prove to be equivalent. Reviewed By: jberdine Differential Revision: D17396016 fbshipit-source-id: ee632fb92	5 years ago
Benno Stein	7ec2830d92	[sledge] Only merge worklist states that share a calling context Summary: This diff allows domains to specify which abstract states can or can't be merged together by the worklist. In particular, this is needed for relational domains to ensure that Hoare triples are joined only when they share a precondition. Reviewed By: jberdine Differential Revision: D17571148 fbshipit-source-id: d9345fdc9	5 years ago
Benno Stein	e44827b892	[sledge] Add option to apply used-globals as pre-analysis Summary: This diff adds a "-prenalyze-globals" flag to all analyze targets which, when set, computes used-globals sets for all reachable functions and then uses that information to track only relevant global variables at calls in the main analysis. Reviewed By: jberdine, jvillard Differential Revision: D17526746 fbshipit-source-id: 1a114285c	5 years ago
Benno Stein	1ab8359bc0	[sledge] fix bug spuriously marking a register as global variable Summary: Fixes a bug in Llair.Frontend.xlate_value where the l-val register of LLVM instruction calls was being marked as global. Reviewed By: jberdine Differential Revision: D17570458 fbshipit-source-id: e1b5924e2	5 years ago
Benno Stein	637fff5247	[sledge] Check for intrinsic calls in used-globals analysis Summary: Fixes a bug where are all calls are treated as intrinsics in used globals analysis, since exec_intrinsic is invoked at _all_ calls to determine which are intrinsic, not only at call sites known to target intrinsics. Reviewed By: jberdine Differential Revision: D17499406 fbshipit-source-id: 41f7621f2	5 years ago
Benno Stein	6592eb609f	[sledge] Add option to skip recursive calls at depth bound Summary: While the symbolic heap analysis ends its search upon hitting the bound on recursion depth, the used-globals analysis should instead simply skip recursive calls beyond the depth. Note that this is unsound for arbitrary abstract domains, however, and the flag controlling this feature should be used with caution. Note that procedure calls are still not handled correctly, since Used_globals.exec_intrinsic does not properly check whether callees are intrinsic. A forthcoming commit will fix that, as well. Reviewed By: jberdine Differential Revision: D17479753 fbshipit-source-id: aa92e0ef3	5 years ago
Benno Stein	00a5d3dd64	[sledge] Account for callees in used-globals analysis Summary: Include global variables used in function callees in used globals analysis. Also adds support for arbitrary changes to symbolic state while resolving callees in other analyses. Reviewed By: jberdine Differential Revision: D17479352 fbshipit-source-id: e3cd9f179	5 years ago
Josh Berdine	c131e2e669	[sledge] Use dune's Build_info for version reporting Summary: Replace custom version reporting support using a shell script with code using dune's Build_info API. Note that after this diff, the executables under _build/<context> are not version-stamped, but those under _build/_install are. The symlinks in bin point to the latter, stamped, exes. Reviewed By: bennostein Differential Revision: D16985446 fbshipit-source-id: 7afac87be	5 years ago
Benno Stein	47f314c00e	[sledge] Add used-globals abstract domain and transfer functions Summary: Adds an abstract domain to track global variable usages, as well as supporting changes to the frontend, IR and CLI. This analysis will support optimizations to the main symbolic-heap analysis, but for now can be invoked independently through the `-domain` flag on `analyze` targets of the Sledge executable. Reviewed By: jberdine Differential Revision: D17422212 fbshipit-source-id: 74bed0a76	5 years ago
Benno Stein	3dc0c5938f	[sledge] Extract relational logic from Sh_domain, create "domain" module Summary: Generalize the lifting from State_domain (i.e. symbolic heaps) to Sh_domain (i.e. relations over symbolic heaps). Also, extract abstract-domain-related code into its own module/directory. Reviewed By: jberdine Differential Revision: D17319007 fbshipit-source-id: cefbd1393	5 years ago
Benno Stein	2acb1c3dee	[sledge] Functorize worklist, separate out domain-specific logic Summary: Add support for future development of new abstract domains by eliminating hard-wired dependencies from the worklist into the symbolic heap domain. Also includes an implementation of a trivial unit domain and a CLI flag to enable its use, for debugging purposes. Reviewed By: jberdine Differential Revision: D17281681 fbshipit-source-id: 5858fd420	5 years ago
Scott Owens	f298d728c5	[sledge sem] Start sketching translation correctness Summary: This includes a few changes and corrections to the semantics, to support the translation. This initial attempt to reason about LLVM -> llair showed three things that needed repair in the semantics, in addition to various bugs. We address them as follows. Refactor llair semantics to have only a single kind of flat value: integers that fit into specified bit widths. Operations on size values (e.g., offsets, indices and the like) can just take an integer and ignore its number of bits. Pointers can just be considered integers that fit into a certain size given by the constant pointer_size. Later on we can consider making this a parameter to the model. Change the generic memory model interface to use numbers rather than words as the generic encoding of a large value. This makes it more useful for llair where words are not used. Pay more careful attention to signed/unsigned issues. Neither LLVM nor llair have a concept of signed vs unsigned value. Instead individual operations interpret bit patterns in various ways, some of which are ambiguous in the LLVM manual. For example, since getelementpointer's indices are explicitly said to be interpreted as signed 2's complement, we should probably do the same for insertvalue and extractvalue. However it is not clear how the argument to alloca is to be interpreted. For now we assume signed. Reviewed By: jberdine Differential Revision: D17164133 fbshipit-source-id: 31a8af635	5 years ago
Josh Berdine	72946c3be3	[sledge] Update dependencies Reviewed By: jvillard Differential Revision: D17132472 fbshipit-source-id: 9f4c9421e	5 years ago
Scott Owens	d864fb2c89	[sledge semantics] Add a rough draft llair semantics Summary: Not everything is here yet, and there is some confusion on what to do about the size values. However, the semantics has the right general shape and will be a nice starting point for thinking about the details. Reviewed By: jberdine Differential Revision: D17111041 fbshipit-source-id: cc75651c6	5 years ago
Scott Owens	32983e129b	[sledge semantics] Update expr transl. for cross-block Summary: The translation from LLVM to llair now builds expressions up across blocks, following the implementation. This is easy to do because of the dominance restrictions in SSA, but might be difficult to reason about. Reviewed By: jberdine Differential Revision: D17111040 fbshipit-source-id: a8e99147d	5 years ago
Scott Owens	9f44bbc264	[sledge semantics] Refactor the memory model Summary: LLVM and llair have similar memory models, and we don't want to duplicate any definitions or theorems. This adds a new memory model theory which should be understandable in its own right. A heap is a mapping from addresses to bytes, alongside a set of valid addresses, and intervals that have been allocated already. Primitives are defined for allocating and de-allocating as well as reading and writing chuncks of bytes. There is also a generic type of structured values, and functions for converting them to/from byte arrays. Reviewed By: jberdine Differential Revision: D17074470 fbshipit-source-id: bdab6089f	5 years ago
Josh Berdine	13fb57ec62	[sledge] Revise llvm to llair translation to avoid code duplication Summary: In some cases inlining pure expressions into their use sites causes code blowup. This diff changes the frontend to inline expressions only if there is a single use, and otherwise adds a move instruction. Reviewed By: ngorogiannis Differential Revision: D17071770 fbshipit-source-id: d866a0622	5 years ago

1 2 3 4 5 ...

417 Commits (ce39017611675e487b796bd3c2287cdeb73c452e)