codeql

mirror of https://github.com/github/codeql.git synced 2026-06-19 03:41:07 +02:00

Author	SHA1	Message	Date
Sotiris Dragonas	8e5f214041	Fix OpenRouter Python API and expand model coverage Verified all prompt-injection framework models against the real Python SDK sources: - OpenRouter: the official openrouter SDK uses client.chat.send(messages=) (not chat.completions.create), client.embeddings.generate(input=) (not embeddings.create), and client.responses.send(input=, instructions=). Corrected the framework qll and model, and fixed the test files that used the wrong API. - Anthropic: added the managed-agents system prompt sink (beta.agents.create/update Argument[system:]). - Google GenAI: added models.edit_image Argument[prompt:] as user content. OpenAI, agents and LangChain models were confirmed correct against their SDK sources. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-06-18 16:53:37 +03:00
Sotiris Dragonas	72bc52b2fd	Python: promote prompt injection queries from experimental to production Mirror the JavaScript layout from PR #21953: - Move SystemPromptInjection.ql / UserPromptInjection.ql to src/Security/CWE-1427 - Move customizations, query and framework libs to python/ql/lib - Move the AIPrompt concept to the production Concepts.qll - Drop the experimental tag; py/system-prompt-injection (high precision) now joins the code-scanning, security-extended and security-and-quality suites, while py/user-prompt-injection (low precision) stays out of the default suites - Move query tests to python/ql/test/query-tests/Security Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-06-18 16:30:29 +03:00
Sotiris Dragonas	db493ef30a	Python: port prompt injection queries (system + user) from JS PR #21953 Replace the experimental py/prompt-injection query with two queries mirroring the JavaScript split: - py/system-prompt-injection (system prompt / tool description / developer prompt) - py/user-prompt-injection (user-role prompt) Supports OpenAI (+Agents), Anthropic, Google GenAI, LangChain and OpenRouter via MaD models plus role-filtered framework sinks that MaD cannot express. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-06-18 13:52:51 +03:00
Owen Mansel-Chan	1f9899d7db	Extend added type tracking step to related types	2026-06-17 15:04:53 +01:00
Owen Mansel-Chan	dd61dd2d74	Fix FP for py/modification-of-locals	2026-06-17 14:24:18 +01:00
Owen Mansel-Chan	47c2c9e763	Add test for FP for py/modification-of-locals	2026-06-17 14:22:42 +01:00
Owen Mansel-Chan	415857cacb	Fix FP for py/should-use-with	2026-06-17 13:01:36 +01:00
Owen Mansel-Chan	d72144646a	Add test for FP for py/should-use-with	2026-06-17 12:55:17 +01:00
Owen Mansel-Chan	199fd864ad	Fix FP for py/file-not-closed	2026-06-17 12:36:04 +01:00
Owen Mansel-Chan	890969433f	Add test for FP for py/file-not-closed	2026-06-17 12:19:03 +01:00
Owen Mansel-Chan	9c65082189	Fix MISSING alert	2026-06-15 00:14:52 +01:00
Owen Mansel-Chan	434a99447e	Add thorough tests, including one MISSING alert	2026-06-12 13:45:02 +01:00
Owen Mansel-Chan	d389ea4039	Convert sql-injection test to inline expectations	2026-06-12 13:44:56 +01:00
Owen Mansel-Chan	befb557bfd	Accept fixed MISSING tests	2026-06-11 15:44:20 +02:00
copilot-swe-agent[bot]	73bc2d70ae	Model instance-attribute type flow Use a field level step like JS and Ruby.	2026-06-11 14:48:55 +02:00
copilot-swe-agent[bot]	a4585d8d94	Add test documenting missing PEP249 alerts for connection stored in self attribute	2026-06-11 05:48:40 +00:00
Owen Mansel-Chan	1f91f915c7	Merge pull request #21888 from owen-mc/py/remove-imprecise-container-steps Python: Remove imprecise container steps #2	2026-06-04 22:16:24 +01:00
Owen Mansel-Chan	b27d08ee32	Update edges in expected test output	2026-06-02 18:29:56 +01:00
Owen Mansel-Chan	20ce679d61	Accept changed edges in test output No changes to alerts	2026-06-02 16:15:08 +01:00
Owen Mansel-Chan	f62ebef9e0	Adjust expected test output	2026-06-02 16:15:06 +01:00
Taus	6165623cbf	Merge pull request #21724 from github/tausbn/python-add-self-validating-cfg-tests	2026-05-28 22:07:55 +02:00
Taus	35faec3db1	Python: Address review comments - Get rid of unnecessary parentheses - Use call syntax in the relevant test - Get rid of `dead(2)` annotation	2026-05-27 15:27:19 +00:00
Owen Mansel-Chan	ec13e1bcd3	Add wildcard `ContentSet`s to avoid performance problems	2026-05-27 15:28:07 +01:00
Owen Mansel-Chan	e8779295ee	Update test results	2026-05-22 11:43:18 +01:00
Rasmus Lerchedahl Petersen	fa758d6bf5	python: fix test	2026-05-21 16:59:19 +01:00
Rasmus Lerchedahl Petersen	fa9426c749	Python: extra tests for comprehension	2026-05-21 16:59:18 +01:00
Rasmus Lerchedahl Petersen	f669a4f3bf	Python: Make sure all imprecise taint bubbles up	2026-05-21 16:59:14 +01:00
Rasmus Lerchedahl Petersen	3275c814bd	Python: reset test expectations	2026-05-21 16:59:11 +01:00
Rasmus Lerchedahl Petersen	9a180036a5	Python: conversion step for `format_map` and adjust collection test	2026-05-21 16:59:08 +01:00
Rasmus Lerchedahl Petersen	93e7ab52b7	Python: adjust test expectations We now find an alert on this line as we hope to It is not an alert for _full_ SSRF, though, since that configuration cannot handle multiple substitutions.	2026-05-21 16:58:51 +01:00
Rasmus Lerchedahl Petersen	facb3b681d	Python: recover taint for % format strings	2026-05-21 16:57:50 +01:00
Rasmus Lerchedahl Petersen	b67694b2ab	Python: Remove imprecise container steps - remove `tupleStoreStep` and `dictStoreStep` from `containerStep` These are imprecise compared to the content being precise. - add implicit reads to recover taint at sinks - add implicit read steps for decoders to supplement the `AdditionalTaintStep` that now only covers when the full container is tainted.	2026-05-21 16:57:44 +01:00
Taus	1ef557c972	Python: Address Copilot's comments	2026-05-12 15:27:14 +00:00
Taus	f5c3b63a4a	Python: Add ConsecutiveTimestamps test This one is potentially a bit iffy -- it checks for a very powerful property (that implies many of the other queries), but as the test results show, it can produce false positives when there is in fact no problem. We may want to get rid of it entirely, if it becomes too noisy.	2026-05-12 12:54:26 +00:00
Taus	c30d6ae3aa	Python: Add NeverReachable test This looks for nodes annotated with `t[never]` in the test that are reachable in the CFG. This should not happen (it messes with various queries, e.g. the "mixed returns" query), but the test shows that in a few particular cases (involving the `match` statement where all cases contain `return`s), we _do_ have reachable nodes that shouldn't be.	2026-05-12 12:54:26 +00:00
Taus	fc2bc26f36	Python: Add BasicBlockOrdering test This one demonstrates a bug in the current CFG. In a dictionary comprehension `{k: v for k, v in d.items()}`, we evaluate the value before the key, which is incorrect. (A fix for this bug has been implemented in a separate PR.)	2026-05-12 12:54:25 +00:00
Taus	3a979ac2f8	Python: Add some CFG-validation queries These use the annotated, self-verifying test files to check various consistency requirements. Some of these may be expressing the same thing in different ways, but it's fairly cheap to keep them around, so I have not attempted to produce a minimal set of queries for this.	2026-05-12 12:54:25 +00:00
Taus	71cd5be513	Python: Add self-validating CFG tests These tests consist of various Python constructions (hopefully a somewhat comprehensive set) with specific timestamp annotations scattered throughout. When the tests are run using the Python 3 interpreter, these annotations are checked and compared to the "current timestamp" to see that they are in agreement. This is what makes the tests "self-validating". There are a few different kinds of annotations: the basic `t[4]` style (meaning this is executed at timestamp 4), the `t[dead(4)]` variant (meaning this _would_ happen at timestamp 4, but it is in a dead branch), and `t[never]` (meaning this is never executed at all). In addition to this, there is a query, MissingAnnotations, which checks whether we have applied these annotations maximally. Many expression nodes are not actually annotatable, so there is a sizeable list of excluded nodes for that query.	2026-05-12 12:42:29 +00:00
Geoffrey White	1c704a0912	Python: Accept test changes (improvement).	2026-05-07 10:28:19 +01:00
Josef Svenningsson	68be006a29	Merge pull request #21641 from github/josefs/promptInjectionImprovements Improve prompt inject for Python	2026-04-29 11:23:52 +01:00
Josef Svenningsson	25a8aa97b2	Fix openai prompt injection tests	2026-04-28 18:24:26 +01:00
Josef Svenningsson	a05e191518	Add tests for anthropic prompt injection models	2026-04-28 18:24:22 +01:00
Josef Svenningsson	e069c9c2ee	Fix tests	2026-04-28 18:24:19 +01:00
Taus	ac23e16786	Python: Move Python 3.15 data-flow tests to a separate file We won't be able to run these tests until Python 3.15 is actually out (and our CI is using it), so it seemed easiest to just put them in their own test directory.	2026-04-17 13:16:46 +00:00
Taus	dc36609743	Python: Add data-flow tests Alas, all these demonstrate is that we already don't fully support the desugared `yield from` form.	2026-04-17 12:15:04 +00:00
Taus	8b1ecf05c9	Python: Update test output This change reflects the `(value, key)` to `(key, value)` fix in an earlier commit.	2026-04-14 13:27:31 +02:00
Taus	de900fc3b5	Python: Add QL test for comprehensions with unpacking	2026-04-14 13:27:31 +02:00
Taus	c748fdf8ee	Merge pull request #21694 from github/tausbn/python-add-support-for-pep-810 Python: Add support for PEP 810	2026-04-14 13:27:08 +02:00
Taus	2eeb31b472	Python: Add tests for `lazy from ... import *` as well	2026-04-13 11:49:06 +00:00
Taus	6b7d47ee7d	Python: Add QL test for the new syntax	2026-04-10 14:39:13 +00:00

1 2 3 4 5 ...

4182 Commits