codeql

mirror of https://github.com/github/codeql.git synced 2025-12-24 04:36:35 +01:00

Author	SHA1	Message	Date
Dave Bartolomeo	1428811f75	C++: IR translation for binary conditional operator IR generation was not handling the special two-operand flavor of the `?:` operator that GCC supports as an extension. The extractor doesn't quite give us enough information to do this correctly (see github/codeql-c-extractor-team#67), but we can get pretty close. About half of the code could be shared between the two-operand and three-operand flavors. The main differences for the two-operand flavor are: 1. The "then" operand isn't a child of the `ConditionalExpr`. Instead, we just reuse the original value of the "condition" operand, skipping any implicit cast to `bool` (see comment for rationale). 2. For the three-operand flavor, we generate the condition as control flow rather than the computation of a `bool` value, to avoid creating unnecessarily complicated branching. For the two-operand version, we just compute the value, since we have to reuse that value in the "then" branch anyway. I've added IR tests for these new cases. I've also updated the expectations for `SignAnalysis.ql` based on the fix. @rdmarsh2, can you please double-check that these diffs look correct? I believe they do, but you're the range/sign analysis expert.	2020-04-21 02:05:21 -04:00
Dave Bartolomeo	c705797a2d	C++: IR construction for `_Imaginary` types Includes a fairly exhaustive test case for arithmetic operations involving `_Complex` and/or `_Imaginary` types. Thanks to these new tests, I discovered that the extractor treats certain arithmetic operations on `_Imaginary` types as separate expression kinds, so I added support for those kinds in IR construction.	2020-04-16 11:58:09 -04:00
Dave Bartolomeo	3dd3b5304e	Merge remote-tracking branch 'upstream/master' into dbartol/floats	2020-04-16 08:44:41 -04:00
Chris Gavin	4e981d8e70	Merge rc/1.24 into master.	2020-04-14 21:30:29 +01:00
Robert Marsh	146bfca2ad	Merge pull request #3254 from dbartol/dbartol/ImplicitReturnValue2 C++: Treat implicit end of body of non-`void` function as `Unreached`	2020-04-14 12:18:50 -07:00
Dave Bartolomeo	125a09ce6e	C++: Fix IR generation of `return` of `void` expression	2020-04-14 11:40:33 -04:00
Dave Bartolomeo	1bde11706e	C++: Connect `InitializeIndirection` to `UnmodeledDefinition` The IR generation for `InitializeIndirection` currently connects its load operand to the result of the corresponding `InitializeParameter` instruction. This isn't exactly wrong, but it doesn't fit the IR invariant of "All unmodeled uses consume `UnmodeledDefinition`". Our current code doesn't care, because we just throw away all of the existing def-use information, modeled or otherwise, when we build unaliased SSA. However, some upcoming SSA changes don't work correctly if this invariant is broken. I've added the trivial IR generation change, along with a new sanity query.	2020-04-13 18:37:47 -04:00
Dave Bartolomeo	603a3af19b	C++: Treat implicit end of body of non-`void` function as `Unreached` When the extractor can't prove that control flow will never reach the end of a non-`void`-returning function without reaching an explicit `return` statement, it inserts an implicit `return` without an operand. If control actually reaches this point, the behavior is undefined. We were previously generating invalid IR for these implicit `return` statements, because the lack of an operand meant that there was no definition of the return value variable along that path. Instead, I've changed the IR generation to emit an `Unreached` instruction for the implicit `return`. This ensures that we don't create a control flow edge from the end of the body to the function epilogue. The change to the range analysis test avoids having that test depend on the previous bad IR behavior, while still preserving the original spirit of the test.	2020-04-13 18:09:44 -04:00
Robert Marsh	8779177839	C++: accept minor test change	2020-04-13 14:03:02 -07:00
Robert Marsh	a5e7db73b2	Merge branch 'rc/1.24' into rdmarsh/cpp/ir-flow-through-outparams For submodule consistency	2020-04-13 12:02:59 -07:00
Robert Marsh	c38ccaaab6	Merge branch 'master' into rdmarsh/cpp/ir-flow-through-outparams	2020-04-08 12:32:35 -07:00
Mathias Vorreiter Pedersen	7298b68674	C++: Exclude 'this' params and read/write side effects from static member function calls through qualifiers, and accept tests	2020-04-08 13:35:26 +02:00
Mathias Vorreiter Pedersen	e4e0d3b573	C++: Add test cases showing that static member function calls get 'this' pointers and side effects for 'this' when accessed through qualifiers	2020-04-08 13:00:51 +02:00
Robert Marsh	a8e191248e	Merge branch 'master' into rdmarsh/cpp/ir-flow-through-outparams Merge IR SSA test additions	2020-04-02 15:30:20 -07:00
Mathias Vorreiter Pedersen	a273917e51	Merge branch 'master' into init-dynamic-alloc-newexpr	2020-04-02 14:11:03 +02:00
Robert Marsh	b579e6aabe	C++: accept consistency test output	2020-03-31 12:56:52 -07:00
Robert Marsh	25f3f67c4a	Merge branch 'master' into rdmarsh/cpp/ir-flow-through-outparams Fixes test conflicts and reveals a bug in parameter handling	2020-03-31 12:54:00 -07:00
Mathias Vorreiter Pedersen	97061716f9	C++: Accept test output	2020-03-31 17:10:33 +02:00
Mathias Vorreiter Pedersen	94f5468504	C++: Accept tests	2020-03-31 13:56:47 +02:00
Mathias Vorreiter Pedersen	688464a00f	C++: Add testcases with new and accept output	2020-03-31 12:22:07 +02:00
Geoffrey White	6d6ad4a0ae	Merge branch 'master' into sideeffect	2020-03-30 14:16:23 +01:00
Dave Bartolomeo	39dd9b7099	C++/C#: Fix formatting	2020-03-27 19:46:53 -04:00
Dave Bartolomeo	c3a6ca0d9a	C++: Better support for complex numbers in IR and AST This PR adds better support for differentiating complex and imaginary floating-point types from real floating-point types, in both the AST and in the IR type system. AST Changes - Introduces the new class `TypeDomain`, which can be either `RealDomain`, `ImaginaryDomain` or `ComplexDomain`. "type domain" is the term used for this concept in the C standard, and I couldn't think of a better one. - Introduces `FloatingPointType.getDomain()`, to get the type domain of the type. - Introduces `FloatingPointType.getBase()`, to get the numeric base of the type (either 2 or 10). - Introduces three new subtypes of `FloatingPointType`: `RealNumberType`, `ComplexNumberType`, and `ImaginaryNumberType`, which differentiate between the types based on their type domain. Note that the decimal types (e.g., `_Decimal32`) are included in `RealNumberType`. - Introduces two new subtypes of `FloatingPointType`: `BinaryFloatingPointType` and `DecimalFloatingPointType`, which differentiate between the types based on their numeric base, independent of type domain. IR Changes - `IRFloatingPointType` now has two additional parameters: the base and the type domain. - New test that ensures that C++ types get mapped to the correct IR types. - New IR test that verifies the IR for some basic usage of complex FP types.	2020-03-27 18:08:14 -04:00
Jonas Jensen	2b2667aef7	Merge remote-tracking branch 'upstream/master' into detect-conflated-memory Conflicts: cpp/ql/src/semmle/code/cpp/ir/implementation/aliased_ssa/IRSanity.qll cpp/ql/src/semmle/code/cpp/ir/implementation/raw/IRSanity.qll cpp/ql/src/semmle/code/cpp/ir/implementation/unaliased_ssa/IRSanity.qll cpp/ql/test/library-tests/ir/ir/aliased_ssa_sanity.expected cpp/ql/test/library-tests/ir/ir/aliased_ssa_sanity_unsound.expected cpp/ql/test/library-tests/ir/ir/raw_sanity.expected cpp/ql/test/library-tests/ir/ir/unaliased_ssa_sanity.expected cpp/ql/test/library-tests/ir/ir/unaliased_ssa_sanity_unsound.expected cpp/ql/test/library-tests/ir/ssa/aliased_ssa_sanity.expected cpp/ql/test/library-tests/ir/ssa/aliased_ssa_sanity_unsound.expected cpp/ql/test/library-tests/ir/ssa/unaliased_ssa_sanity.expected cpp/ql/test/library-tests/ir/ssa/unaliased_ssa_sanity_unsound.expected cpp/ql/test/library-tests/syntax-zoo/aliased_ssa_sanity.expected cpp/ql/test/library-tests/syntax-zoo/raw_sanity.expected cpp/ql/test/library-tests/syntax-zoo/unaliased_ssa_sanity.expected csharp/ql/src/semmle/code/csharp/ir/implementation/raw/IRSanity.qll csharp/ql/src/semmle/code/csharp/ir/implementation/unaliased_ssa/IRSanity.qll csharp/ql/test/library-tests/ir/ir/raw_ir_sanity.expected csharp/ql/test/library-tests/ir/ir/unaliased_ssa_sanity.expected	2020-03-25 11:55:39 +01:00
Jonas Jensen	27832148a9	C++: Phi-node conflation is not about vvars	2020-03-24 13:56:42 +01:00
Jonas Jensen	95a8dcb3fd	C++: failing test for non-conflated escaped var	2020-03-24 13:31:08 +01:00
Dave Bartolomeo	c5ac357bfc	C++/C#: Fix bad overlap sanity failures `Instruction.getDefinitionOverlap()` depends on `SSAConstruction::getMemoryOperandDefinition()`, which in turn depends on `SSAConstruction::hasMemoryOperandDefinition()`. When the definition in question came from a `Chi` instruction, `hasMemoryOperandDefinition()` incorrectly bound `overlap` to the overlap relationship between the original (non-`Chi`) instruction and the use. The fix is to make use of the `actualDefLocation` parameter to `getDefinitionOrChiInstruction()`, which specifies the location for the result of the `Chi` in that case.	2020-03-23 14:57:41 -04:00
Jonas Jensen	b0d3c9ee6b	C++: Fix getExtentOverlap for entire allocation	2020-03-23 19:49:39 +01:00
Dave Bartolomeo	a2741da8e2	C++/C#: Add sanity test for invalid overlap from `getDefinitionOverlap()` The result of `getDefinitionOverlap()` should never be `MayPartiallyOverlap`, because if that were the case, we should have inserted a `Chi` instruction and hooked the definition up to that instead. There are quite a few existing failures.	2020-03-23 14:37:06 -04:00
Robert Marsh	d529fedbad	C++: accept extractor changes to IR	2020-03-20 11:00:54 -07:00
Jonas Jensen	f6f9afe462	C++: Implement Instruction.isResultConflated This predicate replaces `isChiForAllAliasedMemory`, which was always intended to be temporary. A test is added to `IRSanity.qll` to verify that the new predicate corresponds exactly with (a fixed version of) the old one. The implementation of the new predicate, `Cached::hasConflatedMemoryResult` in `SSAConstruction.qll`, is faster to compute than the old `isChiForAllAliasedMemory` because it uses information that's readily available during SSA construction.	2020-03-20 17:57:18 +01:00
Dave Bartolomeo	82e2816915	C++: Fix handling of `std::va_list` that is used as a function parameter In the Unix ABI, `std::va_list` is defined as `typedef struct __va_list_tag { ... } va_list[1];`, which means that any `std::va_list` used as a function parameter decays to `struct __va_list_tag *`. Handling this actually made the QL code slightly cleaner. The only tricky bit is that we have to determine what type to use as the actual `va_list` type when loading, storing, or modifying a `std::va_list`. To do this, we look at the type of the argument to the `va_*` macro. A QLDoc comment explains this in detail. I added a test case for passing a `va_list` as an argument, and then manipulating that `va_list` in the callee.	2020-03-20 12:53:09 -04:00
Dave Bartolomeo	bf284514fc	C++: Better IR for varargs This PR changes the IR we generate for functions that accept a variable argument list. Rather than simply using `BuiltInOperationInstruction` to model the various `va_*` macros as mysterious function-like operations, we now model them in more detail. The intent is to enable better alias analysis and taint flow through varargs. The `va_start` macro now generates a unary `VarArgsStart` instruction that takes the address of the ellipsis pseudo-parameter as its operand, and returns a value of type `std::va_list`. This value is then stored into the actual `std::va_list` variable via a regular `Store`. The `va_arg` macro now loads the `std::va_list` argument, then emits a `VarArg` instruction on the result. This returns the address of the vararg argument to be loaded. That address is later used as the address operand of a regular `Load` to return the value of the argument. To model the side effect of moving to the next argument, we emit a `NextVarArg` instruction that takes the previous `std::va_list` value and returns an updated one, which is then stored back into the `std::va_list` variable. The `va_end` macro just emits a `VarArgsEnd` unary instruction that takes the address of the `std::va_list` argument and does nothing, since `va_end` doesn't really do anything on most compiler implementations anyway. The `va_copy` macro is just modeled as a plain copy.	2020-03-19 19:23:33 -04:00
Robert Marsh	59a81d8445	C++: merge from master and accept test changes	2020-03-18 13:47:01 -07:00
Geoffrey White	2e4e491ce8	Merge branch 'master' into sideeffect	2020-03-18 17:12:14 +00:00
Geoffrey White	6cc1c2341c	C++: Add some SideEffect models.	2020-03-18 16:42:33 +00:00
Geoffrey White	935b8d96f8	C++: Offset .expected for cleaner diff.	2020-03-18 16:42:33 +00:00
Geoffrey White	0c9466640a	C++: Add IR test for strcpy/strcat.	2020-03-18 16:42:25 +00:00
Dave Bartolomeo	26ea93af58	Merge remote-tracking branch 'upstream/master' into dbartol/VarArgIR	2020-03-18 09:52:21 -04:00
Dave Bartolomeo	9cc3cda58e	C++: Model varargs in IR, Part I This change introduces a new synthesized `IRVariable` in every varargs function. This variable represents the entire set of arguments passed to the ellipsis by the caller. We give it an opaque type big enough to hold all of the arguments passed by the largest vararg call in the database. It is treated just like any other parameter. It is initialized the same, it has indirect buffers, etc. I had to introduce a couple of new APIs to `Call` and `Function`. The QLDoc comments should explain these. I added tests for these new APIs as well. The next step will be to change the IR generation for the `va_*` macros to manipulate the ellipsis parameter.	2020-03-17 11:11:48 -04:00
Dave Bartolomeo	9922958bf1	C++: Fix failed tests Added a new `StaticLocalVariable` class, which made several other pieces of the original change a bit cleaner. Fixed test failures due to a mistake in the original `CFG.qll` change. Added a test case for static local variables with constructors. Removed the `Uninitialized` instruction from the initialization of a static local, because all objects with static storage duration are zero-initialized at startup. Fixed expectations for `SignAnalysis.ql` to reflect that a bad result is now fixed.	2020-03-13 06:46:07 -04:00
Dave Bartolomeo	1526400a81	C++: Model dynamic initialization of static local variables in IR Previously, the IR for the initialization of a static local variable ran the initialization unconditionally, every time the declaration was reached during execution. This means that we don't model the possibility that an access to the static variable fetches a value that was set on a previous execution of the function. I've added some simple modelling of the correct behavior to the IR. For each static local variable that has a dynamic initializer, we synthesize a (static) `bool` variable to hold whether the initializer for the original variable has executed. When executing a declaration, we check the value of the synthesized variable, and skip the initialization code if it is `true`. If it is `false`, we execute the initialization code as before, and then set the flag to `true`. This doesn't capture the thread-safe nature of static initialization, but I think it's more than enough to handle anything we're likely to care about for the foreseeable future. In `TranslatedDeclarationEntry.qll`, I split the translation of a static local variable declaration into two `TranslatedElement`s: one for the declaration itself, and one for the initialization. The declaration part handles the checking and setting of the flag; the initialization just does the initialization as before. I've added an IR test case that has static variables with constant, zero, and dynamic initialization. I've also verified the new IR generated for @jbj's previous test cases for constant initialization. I inverted the sense of the `hasConstantInitialization()` predicate to be `hasDynamicInitialization()`. Mostly this just made more sense to me, but I think it also fixed a potential bug where `hasConstantInitialization()` would not hold for a zero-initialized variable. Technically, constant initialization isn't the same as zero initialization, but I believe that most code really cares about the distinction between dynamic initialization and static initialization, where static initialization includes both constant and zero initialization. I've fixed up the C# side of IR generation to continue working, but it doesn't use any of the dynamic initialization stuff. In theory, it could use something similar to model the initialization of static fields.	2020-03-12 18:29:16 -04:00
Robert Marsh	9f1833af76	Merge branch 'master' into rdmarsh/cpp/ir-flow-through-outparams	2020-03-12 15:09:58 -07:00
Robert Marsh	a0823a2582	C++: add argv chi chain example to ssa test	2020-03-12 15:06:17 -07:00
Robert Marsh	ba8ebe9f3a	C++: accept test changes	2020-03-10 11:41:59 -07:00
Robert Marsh	bba6b23019	Merge branch 'master' into rdmarsh/cpp/ir-flow-through-outparams	2020-03-10 11:12:19 -07:00
Jonas Jensen	cc38abd228	C++: Ignore constant static initializers	2020-03-06 15:05:28 +01:00
Jonas Jensen	02f0b89a0d	C++: Test for constant static initializer	2020-03-06 15:05:28 +01:00
Robert Marsh	95a762c987	Merge master for submodule update	2020-02-26 13:44:26 -08:00
Robert Marsh	4333fe7905	Merge branch 'master' into rdmarsh/cpp/ir-flow-through-outparams	2020-02-26 13:15:27 -08:00
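
Note on commit 1526400a81: the guard-flag model described there can be pictured at the source level. The sketch below is a hand-written C++ approximation, not the IR that the library actually generates. The names `computeInitialValue`, `nextIdOriginal`, `nextIdModeled`, `counter_initialized`, and `g_initCalls` are hypothetical and chosen only for illustration. As in the commit message, the sketch ignores the thread-safety of real static initialization.

```cpp
#include <cassert>

// Counts how many times the dynamic initializer runs (illustration only).
static int g_initCalls = 0;

// Hypothetical initializer that cannot be evaluated at compile time.
int computeInitialValue() {
    ++g_initCalls;
    return 42;
}

// What the programmer writes: a static local with a dynamic initializer.
int nextIdOriginal() {
    static int counter = computeInitialValue();
    return counter++;
}

// Approximate source-level picture of the modeled behavior:
// a synthesized bool records whether the initializer has already run.
int nextIdModeled() {
    static bool counter_initialized = false;  // hypothetical synthesized flag
    static int counter;                       // zero-initialized at startup
    if (!counter_initialized) {
        counter = computeInitialValue();      // runs only on the first call
        counter_initialized = true;
    }
    return counter++;                         // later calls see the old value
}
```

Calling either function twice yields 42 and then 43, and the initializer runs once per function. Running the initializer unconditionally, as the earlier IR did, would reset `counter` on every call.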


280 Commits
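
Note on commits bf284514fc and 82e2816915: for readers unfamiliar with the `va_*` macros those messages refer to, here is a small standard C++ example that uses all four macros and passes a `va_list` to a callee. The function names `sumFromList` and `sumTwice` are hypothetical. The comments relate each macro to the instruction named in the commit message, as a reading aid only. They do not show actual IR output.

```cpp
#include <cassert>
#include <cstdarg>

// Callee that receives a va_list as a parameter and consumes `count` ints.
// On ABIs where va_list is an array type, this parameter is a pointer.
int sumFromList(int count, va_list args) {
    int total = 0;
    for (int i = 0; i < count; ++i) {
        // va_arg: described in the commit as a VarArg address computation,
        // a Load of the argument, and a NextVarArg update of the list.
        total += va_arg(args, int);
    }
    return total;
}

// Sums the `count` int arguments twice, once through each list.
int sumTwice(int count, ...) {
    va_list args;
    va_start(args, count);   // va_start: VarArgsStart, then a Store
    va_list copy;
    va_copy(copy, args);     // va_copy: modeled as a plain copy
    int first = sumFromList(count, args);
    int second = sumFromList(count, copy);
    va_end(copy);            // va_end: VarArgsEnd, no real effect
    va_end(args);
    return first + second;
}
```

For example, `sumTwice(3, 1, 2, 3)` walks the three arguments once through `args` and once through `copy`, giving 12.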