codeql

mirror of https://github.com/github/codeql.git synced 2026-05-27 17:41:24 +02:00

Author	SHA1	Message	Date
yoff	4ce6131ca3	Python: treat augmented-assignment targets as both load and store The legacy CFG emitted two ControlFlowNodes for `x[i] += 42` (one load, one store, with `load.strictlyDominates(store)`). The new CFG collapses them to a single canonical node, mirroring Java's single-`VarAccess` model where `isVarRead`/`isVarWrite` are non-disjoint on the same expression. Reconcile two legacy two-node behaviours with the merged single-node world: 1. `Cfg::ControlFlowNode.isLoad()` no longer excludes augmented targets — both `isLoad` and `isStore` hold on the merged canonical node, matching Java. `NameNode.defines` drops the now-redundant `not isLoad` guard; `Py::Name.defines` already filters by `isDefinition` (Store/Param/AugAssign-target ctx). 2. `LocalFlow::definitionFlowStep` is restricted to NameNode targets, matching legacy ESSA's `assignment_definition` which required `defn.(NameNode).defines(v)`. Subscript and attribute writes (`x[i] = 42`, `obj.attr = 42`) no longer emit a local-flow step into the LHS expression — that flow is handled by the AttrWrite and content-flow machinery. This is essential for keeping augmented Subscript/Attribute targets classifiable as `LocalSourceNode` on the read side, which the API graph requires for emitting Use edges. `StoreLoadTest.ql` is updated to filter `isAugLoad` out of the regular `load` tag, mirroring the pre-existing `not isAugStore` filter on the `store` tag so augmented-assignment expectations remain `augload=n augstore=n` (not also `load=n store=n`). Closes the three remaining ApiGraphs library-test failures (`getSubscript.ql` semantically, plus cosmetic toString updates in `ModuleImportWithDots.ql` and `test_crosstalk.ql`). Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-26 16:32:44 +00:00
yoff	f5bf8ae8dd	Python: fix augstore for the new CFG and add store/load test In the legacy CFG the same Python 'Name' that is the target of an augmented assignment has two distinct CFG nodes — a load node (context 3) earlier in the basic block and a store node (context 5) later. 'augstore(load, store)' relates the pair via dominance. The new (shared) CFG canonicalises each AST expression to a single CFG node, so 'load' and 'store' collapse to one. The dominance-based 'augstore' from the legacy implementation no longer holds (it would require 'load.strictlyDominates(load)'), so 'isAugLoad' / 'isAugStore' never fired and 'isStore' missed the AugAssign target entirely. Redefines 'augstore' as reflexive on the AugAssign target's canonical CFG node. With this change: * isAugLoad / isAugStore both fire on the single canonical node. * isStore fires (via 'or augstore(_, this)') — matching the legacy classification that an augmented-assignment target is a store. * isLoad does not fire (excluded by 'not augstore(_, this)'). Adds 'python/ql/test/library-tests/ControlFlow/store-load/' covering plain load/store/delete, parameters, augmented assignment, tuple unpacking, attribute and subscript stores. The test asserts the classification directly on the new-CFG facade. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-26 16:32:43 +00:00
yoff	3c21bbfbf5	Python: test dead bindings under no-raise CFG abstraction Adds 'dead_under_no_raise.py' to the bindings test suite, capturing the three CPython patterns where bindings legitimately have no CFG node because the surrounding code is unreachable under the 'no expressions raise' abstraction: 1. Statements after a 'try: return X; except: pass' block. 2. The 'else:' clause of a try whose body always raises. 3. Cache-lookup pattern 'try: return cache[k]; except: pass' followed by computation and store. These bindings intentionally carry no 'cfgdefines=' annotations. If raise modelling is later added to the CFG, the BindingsTest will surface the new CFG nodes as unexpected results and this file will need to be revisited. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-26 16:32:43 +00:00
yoff	01c6b2b262	Python: wire PEP 695 type parameters into the shared CFG (green) Adds CFG coverage for the binding 'Name's introduced by PEP 695 type-parameter syntax on functions, classes, and 'type' aliases: def func[T](...): ... class Box[T]: ... def multi[T: int, Ts, *P](...): ... type Alias[T] = ... For each parametrised AST node, the type-parameter names (and, for 'type' aliases, the alias name itself) are added as children of the enclosing CFG node so that 'Name.defines(v)' has a corresponding position. Bounds and defaults are intentionally not wired (they have no SSA-relevant semantics for our purposes). Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-26 16:32:42 +00:00
Copilot	f12307278a	Python: wire match-pattern bindings into the shared CFG (green) Adds concrete `Pattern` subclasses in `AstNodeImpl.qll` for every `MatchPattern` AST kind, with `getChild` overrides that expose sub-patterns and bound Names. Specifically: - MatchCapturePattern (`case x:`) -> getVariable() - MatchAsPattern (`case … as v:`) -> getPattern(), getAlias() - MatchStarPattern (`case [*rest]:`) -> getTarget() - MatchSequencePattern (`case [a, b]:`) -> getPattern(i) - MatchClassPattern (`case Cls(p, q, k=v)`) -> getClass(), positional, keyword - MatchMappingPattern (`case {k: v}:`) -> getMapping(i) - MatchKeyValuePattern, MatchKeywordPattern, MatchDoubleStarPattern - MatchOrPattern, MatchLiteralPattern, MatchValuePattern Without these, every Name bound by a match pattern lacked a CFG node. Removes the corresponding MISSING: annotations from match_pattern.py (all 11 cases). Verified: all 24 ControlFlow/evaluation-order tests still pass. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-26 16:32:42 +00:00
Copilot	ba9dc9f5f1	Python: wire import-statement bindings into the shared CFG (green) Adds `ImportStmt` and `ImportStarStmt` wrappers in `AstNodeImpl.qll`. For each `Alias` in an import statement, both the value (module/member expression) and the bound `asname` Name become children of the CFG node for the import statement, in evaluation order. Without this, every `Name` introduced by `import` / `from .. import ..` lacked a CFG node, even though `Name.defines(v)` returns true for it on the AST side. This was the highest-volume gap: 20,332 missing import aliases across CPython. Removes the corresponding MISSING: annotations from imports.py. Verified: all 24 ControlFlow/evaluation-order tests still pass. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-26 16:32:42 +00:00
Copilot	768ebc1e2d	Python: wire parameters into the shared CFG (C# pattern) Implements `AstSig::Parameter` and `callableGetParameter(c, i)` in `AstNodeImpl.qll`, following the C# template (`csharp/.../ControlFlowGraph.qll:147-156`) rather than Java's `Parameter() { none() }`. Each Python parameter (positional, args, keyword-only, *kwargs) now becomes a CFG node at a stable position in the enclosing callable's entry sequence. Defaults still evaluate at function-definition time via `FunctionDefExpr.getDefault` / `LambdaExpr.getDefault`, so `Parameter::getDefaultValue()` returns `none()` (the shared CFG library calls this to model the missing-argument fallback, which Python does not surface at the CFG level). The bindings test now exercises parameters (the `py_expr_contexts(_, 4, ...)` exclusion has been removed). A new `parameters.py` test case covers positional, defaulted, vararg, kwarg, keyword-only, kitchen-sink, method (self/cls), lambda, and PEP 570 positional-only parameters. Several other test files were updated to annotate parameters that the test had previously hidden (synthetic `.0` comprehension parameter, method `self`, decorator `f`, etc.). Verified: - All 24 ControlFlow/evaluation-order tests still pass. - CFG consistency query (`python/ql/consistency-queries/CfgConsistency.ql`) shows zero violations on CPython. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-26 16:32:42 +00:00
Copilot	5d60a0d7c1	Python: wire AnnAssign into the shared CFG (green) Adds an `AnnAssignStmt` wrapper in `AstNodeImpl.qll` so that PEP 526 annotated assignments (`x: int = 1`, `x: int`) participate in the control flow graph. Evaluation order follows CPython: annotation, optional value, target binding. Without this, `x: int = 1` had no CFG node for `x` even though `Name.defines(v)` returns true for it on the AST side. SSA built on the new CFG would therefore miss every annotated-assignment write. Removes the corresponding MISSING: annotations from the CFG-binding gap test: - annassign.py — all four cases now green. - match_pattern.py — class-body annotated fields (`x: int`, `y: int`). - type_params.py — `item: T` inside class. Verified: all 24 ControlFlow/evaluation-order tests still pass. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-26 16:32:42 +00:00
Copilot	336c7a44a8	Python: add CFG-binding gap tests (red) Adds inline-expectation tests for the new shared CFG implementation in python/ql/lib/semmle/python/controlflow/internal/AstNodeImpl.qll, covering every Python binding construct that introduces a variable. The test files use MISSING: annotations to record bindings whose defining Name AST node is not currently reachable from the new CFG. These are the 'red' half of red-green commit pairs: subsequent commits will extend AstNodeImpl to cover each construct and remove the corresponding MISSING: marker. Confirmed-broken categories: - Import aliases (from x import a) - Annotated assignment (x: int = 1) - Exception handler (except E as e) - Match patterns (case x, case [a,b], case ... as v) - PEP 695 type params (def f[T], class C[T]) Confirmed-working (no MISSING:): - Compound targets, with-as, comprehensions, decorated def/class, walrus, starred. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-26 16:32:42 +00:00
Copilot	577cf4a630	Shared CFG: support for-else and while-else loops Add two default predicates to AstSig: default AstNode getWhileElse(WhileStmt loop) { none() } default AstNode getForeachElse(ForeachStmt loop) { none() } When defined, the explicit-step rules for While/Do and Foreach route the loop's normal-completion exits through the else block before reaching the after-loop node: - WhileStmt: after-false condition -> before-else -> after-while (instead of directly after-while). - ForeachStmt: after-collection [empty] and the LoopHeader exit are both routed through before-else -> after-foreach. Python's Ast module overrides the predicates to return the synthetic BlockStmt for the orelse slot, replacing the previous customisations in Input::step. This eliminates parallel direct successors emitted by the previous Python-side step additions (verified: multipleSuccessors on a CPython database goes from 1340 to 0). Java and C# CFG tests are unaffected. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-26 16:32:39 +00:00
Taus	28567870ac	WIP2	2026-05-26 16:32:38 +00:00
Taus	f5629a5583	WIP	2026-05-26 16:32:38 +00:00
Taus	75a3168c09	Python: Ignore synthetic CFG nodes We can only annotate the ones that correspond directly to AST nodes anyway. Co-authored-by: yoff <yoff@github.com>	2026-05-26 16:32:37 +00:00
Taus	49c38dddb7	Python: Instantiate CFG tests with new CFG library Co-authored-by: yoff <yoff@github.com>	2026-05-26 16:32:36 +00:00
Taus	166b3226ac	Python: Make CFG tests parameterised Currently we only instantiate them with the old CFG library, but in the future we'll want to do this with the new library as well. Co-authored-by: yoff <yoff@github.com>	2026-05-26 16:32:36 +00:00
Taus	66bdd22a14	Python: Add ConsecutiveTimestamps test This one is potentially a bit iffy -- it checks for a very powerful propetry (that implies many of the other queries), but as the test results show, it can produce false positives when there is in fact no problem. We may want to get rid of it entirely, if it becomes too noisy.	2026-05-26 16:32:36 +00:00
Taus	e21b6b9b2e	Python: Add NeverReachable test This looks for nodes annotated with `t.never` in the test that are reachable in the CFG. This should not happen (it messes with various queries, e.g. the "mixed returns" query), but the test shows that in a few particular cases (involving the `match` statement where all cases contain `return`s), we _do_ have reachable nodes that shouldn't be.	2026-05-26 16:32:36 +00:00
Taus	500dec3f67	Python: Add BasicBlockOrdering test This one demonstrates a bug in the current CFG. In a dictionary comprehension `{k: v for k, v in d.items()}`, we evaluate the value before the key, which is incorrect. (A fix for this bug has been implemented in a separate PR.)	2026-05-26 16:32:36 +00:00
Taus	29ce07c204	Python: Add some CFG-validation queries These use the annotated, self-verifying test files to check various consistency requirements. Some of these may be expressing the same thing in different ways, but it's fairly cheap to keep them around, so I have not attempted to produce a minimal set of queries for this.	2026-05-26 16:32:36 +00:00
Taus	6e77a45fb3	Python: Add self-validating CFG tests These tests consist of various Python constructions (hopefully a somewhat comprehensive set) with specific timestamp annotations scattered throughout. When the tests are run using the Python 3 interpreter, these annotations are checked and compared to the "current timestamp" to see that they are in agreement. This is what makes the tests "self-validating". There are a few different kinds of annotations: the basic `t[4]` style (meaning this is executed at timestamp 4), the `t.dead[4]` variant (meaning this _would_ happen at timestamp 4, but it is in a dead branch), and `t.never` (meaning this is never executed at all). In addition to this, there is a query, MissingAnnotations, which checks whether we have applied these annotations maximally. Many expression nodes are not actually annotatable, so there is a sizeable list of excluded nodes for that query.	2026-05-26 16:32:35 +00:00
Taus	e8de8433f4	Python: Update all metrics-dependant queries The ones that no longer require points-to no longer import `LegacyPointsTo`. The ones that do use the specific `...MetricsWithPointsTo` classes that are applicable.	2026-02-19 12:32:27 +00:00
Taus	24a29f46be	Python: Fix all metrics-related compilation failures In hindsight, having a `.getMetrics()` method that just returns `this` is somewhat weird. It's possible that it predates the existence of the inline cast, however.	2025-11-26 21:28:51 +00:00
Taus	cd1619b43e	Python: Fix queries and tests	2025-11-26 17:06:55 +00:00
Taus	58eaddf627	Python: Update all `.expected` files I'm beginning to realise why I didn't do the `toString` overriding way back when. Thankfully, now that all of our tests are in the same place, this is actually not a terrible ordeal.	2024-04-22 12:00:09 +00:00
Rasmus Wriedt Larsen	69453aa144	Python: Fix missing newline in `.expected`	2023-11-15 10:10:23 +01:00
Taus Brock-Nannestad	f07a7bf8cf	Python: Autoformat everything using `qlformat`. Will need subsequent PRs fixing up test failures (due to deprecated methods moving around), but other than that everything should be straight-forward.	2020-07-07 15:43:52 +02:00
Taus Brock-Nannestad	48e3e9c0b4	Python: Do all the renames.	2020-06-19 17:02:47 +02:00
Taus Brock-Nannestad	87a9f51c78	Python: Autoformat all `.ql` files.	2020-03-30 11:59:10 +02:00
Mark Shannon	f11d0638cf	Python: Remove tests for pruning in QL.	2019-08-25 17:00:08 +01:00
Mark Shannon	e77ae09a86	Python tests: Update test results to account for better handling of branches in finally blocks.	2019-08-21 14:47:57 +01:00
Mark Shannon	714fecbf5e	Python: Revert tests removed in #1767 .	2019-08-21 14:39:53 +01:00
Mark Shannon	edb50c129d	Python tests: TEMPORARILY remove 5 tests to allow modification of extractor CFG pass.	2019-08-19 16:00:28 +01:00
Mark Shannon	d96a009975	Python tests: Temporarily comment out loops prior to implementing loop-unrolling in extractor.	2019-08-12 14:10:44 +01:00
Mark Shannon	05e498dfdc	Python: Clarify pruning code.	2019-07-24 14:47:46 +01:00
Mark Shannon	2bdf42388c	Python: Fix up pruning in QL to better handle constraints from constants.	2019-07-23 16:28:13 +01:00
Taus	76f8da8986	Merge pull request #1484 from markshannon/python-aggressive-pruning Python: Use aggressive dead-code elimination when pruning.	2019-06-25 19:17:44 +02:00
Mark Shannon	a5f741e504	Python: Use aggressive dead-code elimination when pruning.	2019-06-21 13:03:36 +01:00
Mark Shannon	26f870bc7f	Merge branch 'rc/1.21' into master	2019-06-21 09:52:44 +01:00
Mark Shannon	1d269b0cd5	Python: Add extra test for pruning.	2019-06-14 09:59:28 +01:00
Mark Shannon	00fa80346b	Python (pruning): Fix up handling of integer inequality.	2019-06-13 18:20:15 +01:00
Mark Shannon	0b7d73456e	Remove incorrect metadata from test query.	2019-06-12 14:31:21 +01:00
Mark Shannon	bbf7ff9a3f	Python: do pruning in QL.	2019-05-09 17:58:36 +01:00
Arthur Baars	ba7fdddafb	Change @kind to 'table' for test and sanity checks queries that don't select problems	2019-04-01 11:20:12 +02:00
Mark Shannon	0ea2d56945	Python tests: Move helper predicate into TestUtils.qll	2019-01-23 11:20:04 +00:00
Mark Shannon	5dd0123f0d	Python tests: Remove useless duplicate test.	2019-01-21 14:31:15 +00:00
Mark Shannon	7b8468d81b	Python: Add bulk regression test for CFG successors.	2019-01-16 11:48:58 +00:00
Mark Shannon	e506bf85e8	Python: Add some regression tests for CFG of comparisons.	2019-01-16 11:45:20 +00:00
Mark Shannon	bc4301f7b1	Python: Add an extra test for import parsing.	2019-01-16 11:32:59 +00:00
Mark Shannon	05b69a1c0f	QL tests for Python queries and libraries.	2018-11-19 15:15:54 +00:00

49 Commits