This works by adding data-flow edges to skip over array expressions when
reading from arrays. On the post-update side, there was already code to
skip over array expressions when storing to arrays. That happens in
`valueToUpdate` in `AddressFlow.qll`, which needed just a small tweak to
support assignments with non-field expressions at the top-level LHS,
like `*a = ...` or `a[0] = ...`.
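As a rough sketch of the intended effect (the `source` and `sink` functions below are placeholders, not code from this change), a value stored through one array access should be able to reach a later read through the same array:

```cpp
int source();
void sink(int);

void test() {
  int a[2];
  a[0] = source(); // storing: the post-update side already skipped the array expression
  sink(a[1]);      // reading: the new edges skip the array expression here
}
```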
The new code in `AddressFlow.qll` is copy-pasted from `EscapesTree.qll`,
and there is already a note in these files saying that they share a lot
of code and must be maintained in sync.
Flow from a definition by reference of a field into its object was
working inconsistently and in a very syntax-dependent way. For a
function `f` receiving a reference, `f(a->x)` could propagate data back
to `a` via the _reverse read_ mechanism in the shared data-flow library,
but for a function `g` receiving a pointer, `g(&a->x)` would not work.
And `f((*a).x)` would not work either.
In all cases, the issue was that the shared data-flow library propagates
data backwards between `PostUpdateNode`s only, but there is no
`PostUpdateNode` for `a->x` in `g(&a->x)`. This pull request inserts
such post-update nodes where appropriate and links them to their
neighbors. In this example, flow back from the output parameter of `g`
passes first to the `PostUpdateNode` of `&`, then to the (new)
`PostUpdateNode` of `a->x`, and finally, as a _reverse read_ with the
appropriate field projection, to `a`.
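A minimal sketch of the three call shapes discussed above (`S`, `f`, `g`, and `test` are hypothetical placeholders):

```cpp
struct S { int x; };

void f(int &out); // assumed to write through its reference parameter
void g(int *out); // assumed to write through its pointer parameter

void test(S *a) {
  f(a->x);   // flow back into `a` already worked via the reverse-read mechanism
  g(&a->x);  // previously stuck: there was no PostUpdateNode for `a->x`
  f((*a).x); // previously stuck as well (same underlying issue)
}
```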
This case was added in dccc0f4db. The surrounding code has changed a lot
since then, and the case no longer seems to have an effect except to
create some dead ends and possibly cycles in the local flow graph.
The data flow library conflates pointers and their objects in some
places but not others. For example, a member function call `x.f()` will
cause flow from `x` of type `T` to `this` of type `T*` inside `f`. It
might be ideal to avoid that conflation, but that's not realistic
without using the IR.
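For instance (a minimal sketch; `T`, `source`, and `sink` are placeholders):

```cpp
int source();
void sink(int);

struct T {
  int field;
  void f() { sink(field); } // inside f, the object is reached through `this`, of type T*
};

void test() {
  T x;
  x.field = source();
  x.f(); // flow from the object `x` (type T) to the pointer `this` (type T*) inside f
}
```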
We've had good experience in the taint tracking library with conflating
pointers and objects, and it improves results for field flow, so perhaps
it's time to try it out for all data flow.
g++ doesn't support this code:

```cpp
twoIntFields sSwapped = { .m2 = source(), .m1 = 0 };
```

It fails with

```
sorry, unimplemented: non-trivial designated initializers not supported
```

so we need to build it in clang mode.
This removes a lot of flow steps, but it all seems to be flow that was
present twice: once exiting a `PartialDefNode` and once exiting a
`DefinitionByReferenceNode`. All `DefinitionByReferenceNode`s are now
`PartialDefNode`s.
This commit changes how data flow works in the following code.

```cpp
MyType x = source();
defineByReference(&x);
sink(x);
```
The question here is whether there should be flow from `source` to
`sink`. Such flow is desirable if `defineByReference` doesn't write to
all of `x`, but it's undesirable if `defineByReference` is a typical C
init function that writes to every field, or if `defineByReference` is
`memcpy` or `memset` on the full range.
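A hypothetical pair of callees illustrating the two cases, giving the `MyType` from the snippet above a concrete shape (the field and function names are placeholders):

```cpp
typedef struct { int a; int b; } MyType;

// Writes every field of *x: flow past this call is undesirable, because
// any tainted value previously stored in x is completely overwritten.
void initAll(MyType *x) { x->a = 0; x->b = 0; }

// Writes only part of *x: flow past this call is desirable, because a
// tainted value may survive in x->b.
void initPartial(MyType *x) { x->a = 0; }
```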
Before 1.20.0, there would be flow from `source` to `sink` when `x`
happened to be modelled with `BlockVar` but not when it happened to be
modelled with SSA. The choice of modelling depends on an analysis of
how `x` is used elsewhere in the function, and it's supposed to be an
internal implementation detail that there are two ways to model
variables. In 1.20.0, I changed the `BlockVar` behavior so it worked the
same as SSA, never allowing that flow. It turns out that this change
broke a customer's query.
This commit reverts `BlockVar` to its old behavior of letting flow
propagate past the `defineByReference` call and then regains consistency
by changing all variables that are ever defined by reference to be
modelled with `BlockVar` instead of SSA. This means we now get too much
flow in certain cases, but that appears to be better overall than
getting too little flow. See also the discussion in CPP-336.