codeql

mirror of https://github.com/github/codeql.git synced 2025-12-19 18:33:16 +01:00

Author	SHA1	Message	Date
Rasmus Wriedt Larsen	ce4b192caa	Python: Improve usefulness of RemoteFlowSourcesReach meta query Before, results from `dca` would look something like ## + py/meta/alerts/remote-flow-sources-reach - django/django@c2250cf_cb8f: tests/messages_tests/urls.py:38:16:38:48 reachable with taint-tracking from RemoteFlowSource - django/django@c2250cf_cb8f: tests/messages_tests/urls.py:38:9:38:12 reachable with taint-tracking from RemoteFlowSource now it should make it easier to spot _what_ it is that actually changed, since we pretty-print the node.	2021-07-21 16:35:09 +02:00
yoff	6a77b890af	Merge pull request #6155 from RasmusWL/port-cleartext-queries Python: Port cleartext queries	2021-06-30 15:52:34 +02:00
Rasmus Wriedt Larsen	36c9ceb13b	Python: Add `Logging` concept	2021-06-25 14:26:35 +02:00
Rasmus Wriedt Larsen	e1af1f11ee	Python: Add `HTTP::Server::CookieWrite` concept along with tests, but no implementations (to ease reviewing). --- I've put quite some thinking into what to call our concept for this. [JS has `CookieDefinition`](`581f4ed757/javascript/ql/src/semmle/javascript/frameworks/HTTP.qll (L148-L187)`), but I couldn't find a matching concept in any other languages. We used to call this [`CookieSet`](`f07a7bf8cf/python/ql/src/semmle/python/web/Http.qll (L76)`) (and had a corresponding `CookieGet`). But for headers, [Go calls this `HeaderWrite`](`cd1e14ed09/ql/src/semmle/go/concepts/HTTP.qll (L97-L131)`) and [JS calls this `HeaderDefinition`](`581f4ed757/javascript/ql/src/semmle/javascript/frameworks/HTTP.qll (L23-L46)`) I think it would be really cool if we have a naming scheme that means the name for getting the value of a header on a incoming request is obvious. I think `HeaderWrite`/`HeaderRead` fulfils this best. We could go with `HeaderSet`/`HeaderGet`, but they feel a bit too vague to me. For me, I'm so used to talking about def-use, that I would immediately go for `HeaderDefinition` and `HeaderUse`, which could work, but is kinda strange. So in the end that means I went with `CookieWrite`, since that allows using a consistent naming scheme for the future :)	2021-06-24 17:34:43 +02:00
Rasmus Wriedt Larsen	39ec8701ca	Python: Add FileSystemWriteAccess concept I made `FileSystemWriteAccess` be a subclass of `FileSystemAccess` (like in [JS](`64001cc02c/javascript/ql/src/semmle/javascript/Concepts.qll (L68-L74)`)), but then I started wondering about how I could give a good result for `getAPathArgument`, and what would a good result even be? The argument to the `open` call, or the object that the `write` method is called on? I can't see how doing either of these enables us to do anything useful... So I looked closer at how JS uses `FileSystemWriteAccess`: 1. as sink for zip-slip: `7c51dff0f7/javascript/ql/src/semmle/javascript/security/dataflow/ZipSlipCustomizations.qll (L121)` 2. as sink for downloading unsafe files (identified through their extension) through non-secure connections: `89ef6ea4eb/javascript/ql/src/semmle/javascript/security/dataflow/InsecureDownloadCustomizations.qll (L134-L150)` 3. as sink for writing untrusted data to a local file `93b1e59d62/javascript/ql/src/semmle/javascript/security/dataflow/HttpToFileAccessCustomizations.qll (L43-L46)` for the 2 first sinks, it's important that `getAPathArgument` has a proper result... so that solves the problem, and highlights that it _can_ be important to give proper results for `getAPathArgument` (if possible). So I'm trying to do best effort for `f = open(...); f.write(...)`, but with this current code we won't always be able to give a result (as highlighted by the tests). It will also be the case that there are multiple `FileSystemAccess` with the same path-argument, which could be a little strange. overall, I'm not super confident about the way this new concept and implementation turned out, but it also seems like the best I could come up with right now... The obvious alternative solution is to NOT make `FileSystemWriteAccess` a subclass of `FileSystemAccess`, but I'm not very tempted to go down this path, given the examples of this being useful above, and just the general notion that we should be able to model writes as being a specialized kind of `FileSystemAccess`.	2021-06-23 10:50:04 +02:00
Rasmus Wriedt Larsen	bcef8d19e6	Python: Add Escaping concept	2021-06-16 19:09:00 +02:00
Rasmus Wriedt Larsen	dd457f9641	Python: Fix tests	2021-06-10 15:58:56 +02:00
yoff	716627c1dd	Merge pull request #5878 from RasmusWL/fix-concept-tests-pretty-print Python: Fix concept tests pretty print	2021-06-10 11:21:08 +02:00
Rasmus Wriedt Larsen	61ad5d0673	Python: Allow printing PostUpdateNode in ConceptsTest.qll See how this works in `test_json.py`	2021-05-19 17:10:33 +02:00
Rasmus Wriedt Larsen	51a25e45fe	Python: Use shared prettyExpr in ConceptsTest.qll This required quite some changes in the expected output. I think it's much more clear what the selected nodes are now 👍 (but it was a bit boring work to fix this up)	2021-05-19 17:10:33 +02:00
Rasmus Wriedt Larsen	f66dccafda	Python: Rename prettyExp => prettyExpr So we're consistenly using `expr` and not leaving our the `r`.	2021-05-19 17:10:33 +02:00
Rasmus Wriedt Larsen	65c8d9605e	Python: Add CryptographicOperation Concept I considered using `getInput` like in JS, but things like signature verification has multiple inputs (message and signature). Using getAnInput also aligns better with Decoding/Encoding.	2021-04-22 14:51:14 +02:00
Rasmus Wriedt Larsen	d607c13ab6	Python: Taint tests: include elment for forgotten MISSING	2021-04-19 15:01:42 +02:00
Rasmus Wriedt Larsen	9585390941	Python: Taint tests, report error location first To better match the standard output from inline expectation tests	2021-04-19 14:59:47 +02:00
Rasmus Wriedt Larsen	b2cb284ff2	Python: Add more examples of what is ok with new taint tests	2021-04-19 14:56:20 +02:00
Rasmus Wriedt Larsen	3e7dc12246	Python: Port taint tests to use inline expectations The meat of this PR is described in the new python/ql/test/experimental/meta/InlineTaintTest.qll file: > Defines a InlineExpectationsTest for checking whether any arguments in > `ensure_tainted` and `ensure_not_tainted` calls are tainted. > > Also defines query predicates to ensure that: > - if any arguments to `ensure_not_tainted` are tainted, their annotation is marked with `SPURIOUS`. > - if any arguments to `ensure_tainted` are not tainted, their annotation is marked with `MISSING`. > > The functionality of this module is tested in `ql/test/experimental/meta/inline-taint-test-demo`.	2021-04-15 18:00:33 +02:00
Rasmus Wriedt Larsen	11cd0dbbc0	Python: Add concepts for public-key generation I did spend some time to figure out how to best write `minimumSecureKeySize` predicate. I wanted to write once and for all the recommended sizes for each cryptosystem. I considered making the predicate such as ```codeql int minimumSecureKeySize() { this.getName() = "RSA" and result = 2048 or this.getName() = "DSA" and result = 2048 or this.getName() = "ECC" and result = 244 } ``` but then it would be impossible to add a new model without also being able to modify the body of this predicate -- which seems like a bad way to start off a brand new way of modeling things. So I considered if we could add it to the non-range class, such as ```codeql class RSAKeyGeneration extends KeyGeneration { RSAKeyGeneration() { this.getName() = "RSA" } override int minimumSecureKeySize() { result = 2048 } } ``` This has the major problem that when you're writing the models for a new API (and therefore extending KeyGeneration::Range), there is no way for you to see that you need to take this extra step :\| (also problem about how we should define `minimumSecureKeySize` on `KeyGeneration` class then, since if we make it abstract, we effectively disable the ability to refine `KeyGeneration` since any subclass must provide an implementation.) So, therefore I ended up with this solution ;)	2021-02-19 13:59:16 +01:00
Rasmus Wriedt Larsen	501e510622	Python: Add redirect modeling tests (flask/django)	2021-01-19 14:43:25 +01:00
Rasmus Wriedt Larsen	3094aedf14	Python: Fix regression in ConceptTests I accidentially deleted that line :D	2020-12-22 14:42:53 +01:00
Rasmus Wriedt Larsen	dc0d940331	Python: Ensure all concept tests ignore irrelevant results Since this was causing a CI error. also changed things a bit so we do it in a consistent way :)	2020-12-22 11:32:42 +01:00
Rasmus Wriedt Larsen	bc4a0bcbeb	Python: Split request handler / route setup concept tests Not doing so earlier was just a mistake.	2020-12-22 11:31:20 +01:00
Rasmus Wriedt Larsen	004ff38e22	Python: Add separate RequestHandler concept Since I really want to use our existing infrastructure to model that we can recognize something as a request handler without it having a route, we need this as a separate concept. All tests have been adjusted. The early modeling was based on flask, where all request-handling is based on handling requests from a specific route. But with the standard library handling and handlers without routes, the naming had to change.	2020-12-21 17:31:58 +01:00
Jonas Jensen	5680b2df13	Merge remote-tracking branch 'upstream/main' into better-syntax-for-false-positives-and-negatives-inline-expectation Required fixing up semantic conflicts in tests. Conflicts: python/ql/test/experimental/library-tests/frameworks/stdlib/Decoding.py	2020-11-03 09:47:26 +01:00
Taus Brock-Nannestad	5dadb0f476	Python: Fix imports in tests	2020-11-02 23:02:29 +01:00
Rasmus Wriedt Larsen	eff244db71	Python: Add Encoding concept I wasn't able to find a good opposite of "parsing", so left that out of the list of intended purposes.	2020-11-02 14:19:20 +01:00
Mathias Vorreiter Pedersen	6d0783a3bd	Python: Make sure that expected values with tag mimetype is wrapped in quotes if the value contains a space.	2020-10-31 18:13:12 +01:00
Rasmus Lerchedahl Petersen	80360450de	Merge branch 'main' of github.com:github/codeql into RasmusWL-python-port-reflected-xss	2020-10-30 17:56:36 +01:00
Rasmus Lerchedahl Petersen	96e79a2702	Python: restrict to python files	2020-10-29 15:00:47 +01:00
Rasmus Lerchedahl Petersen	8350d64763	Python: Add concept test definitions	2020-10-27 08:00:53 +01:00
Rasmus Wriedt Larsen	082e35c2c7	Python: Model mimetype instead of content-type for HTTP Response Since that's really what we're after (at least for now)	2020-10-23 14:31:33 +02:00
Rasmus Wriedt Larsen	35334cf630	Python: Remove status code modeling I'm not even trying to model it properly right now, and don't have a specific use-case for it RIGHT NOW. I think we could want this in the future, but I think it's probably better to model it when we know what we want to use it for.	2020-10-23 14:31:31 +02:00
Rasmus Wriedt Larsen	e38ac18e46	Python: Add (only) basic $HttpResponse tag to other tests files This seems really nice to me, but you might disagree	2020-10-23 14:31:30 +02:00
Rasmus Wriedt Larsen	df6fd53a7e	Python: Add HttpResponse concept We might need to rework this a bit when we also start to handle redirects. I could see a world where we simply allow http redirects to be subclasses of http responses, and need to manually exclude them from queries (or create HttpContentResponse to model the HttpResponses that will contain a body). Let us see where the wind will take us. I looked through JS and Go libraries, but I didn't feel their modeling would map very well to Python.	2020-10-23 14:31:25 +02:00
Rasmus Lerchedahl Petersen	3a416bce2d	Python: Move test annotation	2020-10-21 14:18:16 +02:00
Rasmus Lerchedahl Petersen	646ced2a1d	Python: Add concept test scaffold	2020-10-19 10:58:57 +02:00
Rasmus Lerchedahl Petersen	ef32488596	Merge branch 'main' of github.com:github/codeql into python-port-unsafe-deserialization	2020-10-15 15:45:35 +02:00
Rasmus Lerchedahl Petersen	172e058438	Python: `unsafe` -> `mayExecuteInput`	2020-10-15 12:56:29 +02:00
Rasmus Lerchedahl Petersen	b8cba381cf	Merge branch 'main' of github.com:github/codeql into python-port-unsafe-deserialization	2020-10-14 15:01:30 +02:00
Rasmus Wriedt Larsen	5db4f906d0	Merge branch 'main' into python-port-code-injection	2020-10-14 14:22:02 +02:00
Rasmus Lerchedahl Petersen	b0ebb5b6d1	Python: Adjust tag format	2020-10-14 09:51:24 +02:00
Rasmus Lerchedahl Petersen	93383747bd	Python: Use more common name for concept	2020-10-14 09:28:58 +02:00
Rasmus Lerchedahl Petersen	a76d276b48	Python: Adjust `getARelevantTag`	2020-10-14 08:44:04 +02:00
Rasmus Lerchedahl Petersen	4685f2d5f2	Python: Address many review comments still need to move concept tests	2020-10-13 12:03:23 +02:00
Rasmus Lerchedahl Petersen	0d8bd01e10	Python: Port query and add test	2020-10-09 16:11:37 +02:00
Rasmus Wriedt Larsen	5f6e4d47ca	Python: Add CodeExecution concept	2020-10-07 18:22:45 +02:00
Rasmus Wriedt Larsen	d27e6955b4	Python: Add test setup for HTTP::Server::RouteSetup	2020-10-06 03:03:06 +02:00
Rasmus Wriedt Larsen	e5b9ac8d9c	Python: Use getCommand as tag in ConceptsTest	2020-10-02 14:12:41 +02:00
Rasmus Wriedt Larsen	62dc0dd263	Python: Model os.exec* os.spawn* and os.posix_spawn* I also had to exclude the inline expectation tests from files outside the test repo.	2020-09-28 11:05:33 +02:00
Rasmus Wriedt Larsen	060720aae7	Python: Add tests for all SystemCommandExecution from stdlib Overall idea is that `test/experimental/meta/ConceptsTest.qll` will set up inline expectation tests for all the classes defined in `Concepts.qll`, so any time you model a new instance of Concepts, you simply just import that file. That makes the tests a little verbose, but allows us to share test-setup between all the different frameworks we model. Note that since the definitions of SystemCommandExecution subclasses are scattered across multieple framework modeling qll files, it think it makes the most sense to have the tests for each framework in one location. I'm not 100% convinced about if this is the right choice or not (especially when we want to write tests for sanitizers), but for now I'm going to try it out at least.	2020-09-28 11:05:32 +02:00
Rasmus Lerchedahl Petersen	e46ae9b98d	Python: Move some query predicates to debug	2020-09-15 21:45:47 +02:00

50 Commits