codeql

mirror of https://github.com/github/codeql.git synced 2025-12-20 02:44:30 +01:00

Author	SHA1	Message	Date
yoff	0509a12790	Merge pull request #5770 from tausbn/python-small-api-graph-fix Python: Use only `TApiNode` in `API::Impl`	2021-04-27 14:06:09 +02:00
Chris Smowton	64a2320be7	Merge pull request #5757 from smowton/smowton/admin/fix-dead-qhelp-links Fix all dead qhelp links	2021-04-27 12:17:08 +01:00
Rasmus Wriedt Larsen	37db21d269	Merge pull request #5284 from yoff/python-port-insecure-protocol Python: port py/insecure-protocol	2021-04-27 09:30:18 +02:00
thank_you	62f3e8d64a	Add sanitizer for ObjectId ObjectId is a sanitizer used to sanitize strings into valid MongoDB ids. During research we've found that this method is used. ObjectId returns a string representing an id. If at any time ObjectId can't parse it's input (like when a tainted dict in passed in), then ObjectId will throw an error preventing the query from running.	2021-04-26 15:35:42 -04:00
Taus	3889c8afec	Python: Use only `TApiNode` in `API::Impl` This ensures that changes to `API::Node` does not invalidate the cached `module Impl`. At present, I don't expect this to have any effect (as the `Node` class is also fairly static, though not explicitly cached), but I can imagine us making some of the `Node` methods have user-extensible behaviour, in which case we definitely do not want this to result in reevaluation of `API::Impl`.	2021-04-26 13:10:15 +00:00
Rasmus Lerchedahl Petersen	7cc97836a9	Python: More cleanup from reviewer suggestions	2021-04-23 20:26:13 +02:00
Chris Smowton	455b840712	Fix all dead qhelp links For those documents with no obvious new home I've pointed the links to the Internet Archive.	2021-04-23 15:20:21 +01:00
Rasmus Wriedt Larsen	deb3db3f95	Python: Add non-alert data for extractor diagnostics This is basically just a port of the C++/JS queries added in: - https://github.com/github/codeql/pull/5414 (C++) - https://github.com/github/codeql/pull/5656 (JS) SyntaxError should capture all errors we have information about. At least in `python/ql/src/semmlecode.python.dbscheme` the only match for `error` is `py_syntax_error_versioned` (which `SyntaxError` is based on).	2021-04-23 13:29:44 +02:00
Rasmus Wriedt Larsen	354dee1b09	Python: Add non-alert data for lines of code `py/summary/lines-of-code` is just a port of the C++/JS queries added in: - https://github.com/github/codeql/pull/5271 (C++) - https://github.com/github/codeql/pull/5304 (JS) We are the first to implement the `lines-of-user-code` query, so nothing to compare with in other languages -- but it makes a lot of sense to do for Python 👍	2021-04-23 13:22:18 +02:00
yoff	1954c0ba84	Apply suggestions from code review Co-authored-by: Taus <tausbn@github.com>	2021-04-23 10:20:18 +02:00
Rasmus Wriedt Larsen	f9383a31bf	Python: Fix BrokenCryptoAlgorithm.qhelp	2021-04-22 15:58:28 +02:00
Rasmus Wriedt Larsen	222c087e8c	Python: Remove type-tracking performance workaround Since we shouldn't need it anymore (yay)	2021-04-22 15:31:49 +02:00
Rasmus Wriedt Larsen	fc1a6d0e32	Python: Say salting is not part of py/weak-sensitive-data-hashing	2021-04-22 15:23:41 +02:00
Rasmus Wriedt Larsen	ac83c695ad	Python: Add py/weak-sensitive-data-hashing query	2021-04-22 15:23:41 +02:00
Rasmus Wriedt Larsen	794a86a6b0	Python: Add SensitiveDataSource	2021-04-22 15:23:39 +02:00
Rasmus Wriedt Larsen	56c409737d	Python: Port py/weak-cryptographic-algorithm The other query (py/weak-sensitive-data-hashing) is added in future commit	2021-04-22 15:23:38 +02:00
Rasmus Wriedt Larsen	1616975e06	Python: Model `hashlib` from standard library	2021-04-22 15:23:37 +02:00
Rasmus Lerchedahl Petersen	5a4e661e60	Merge branch 'main' of github.com:github/codeql into python-support-pathlib	2021-04-22 15:04:21 +02:00
Rasmus Wriedt Larsen	fa88f22453	Python: Model hashing operations in `cryptography` package	2021-04-22 14:51:20 +02:00
Rasmus Wriedt Larsen	c5f826580b	Python: Model encrypt/decrypt in cryptography package I introduced a InternalTypeTracking module, since the type-tracking code got so verbose, that it was impossible to get an overview of the relevant predicates. (this means the "first" type-tracking predicate that is usually private, cannot be marked private anymore, since it needs to be exposed in the private module.	2021-04-22 14:51:19 +02:00
Rasmus Wriedt Larsen	23140dfb76	Python: Add CryptographicOperation modeling for Cryptodome	2021-04-22 14:51:17 +02:00
Rasmus Wriedt Larsen	a8de2aba3b	Python: Move CryptoAlgorithms implementation	2021-04-22 14:51:15 +02:00
Rasmus Wriedt Larsen	65c8d9605e	Python: Add CryptographicOperation Concept I considered using `getInput` like in JS, but things like signature verification has multiple inputs (message and signature). Using getAnInput also aligns better with Decoding/Encoding.	2021-04-22 14:51:14 +02:00
Rasmus Lerchedahl Petersen	b724e51cab	Python: Improvements from review suggestions	2021-04-22 10:40:42 +02:00
Rasmus Wriedt Larsen	5a9e27c6fc	Merge branch 'main' into django-3.2	2021-04-21 17:15:47 +02:00
Taus	71780228ae	Python: Rename `TypeTrackerPrivate.qll`	2021-04-21 13:08:26 +00:00
Rasmus Wriedt Larsen	2302c8d5fa	Python: Model new `alias` method on django QuerySets	2021-04-21 13:52:38 +02:00
yoff	a19373ab54	Merge pull request #5727 from tausbn/python-use-localsource-in-stepsummary Python: Use `LocalSourceNode` in `StepSummary::step`	2021-04-21 13:50:31 +02:00
Taus	489e1e94e4	Python: Prevent bad joins Adds a few unbinds to prevent bad joins from occurring. Firstly, we never want to join `StepSummary::step` with `TypeTracker::append` on `summary` as the first join, as the resulting relation is absolutely massive. So we decouple the two occurrences of `summary` by unbinding each of them. Secondly, in some cases the node we're stepping to (`nodeTo` for type trackers, `nodeFrom` for type backtrackers) will get joined eagerly with the typetracker one is defining, and again this produces an uncomfortably large intermediate join. A bit of unbinding prevents this as well.	2021-04-21 11:44:34 +00:00
Taus	9e95f6e7c1	Python: Remove `typePreservingStep` This requires a bit of explanation, so strap in. Firstly, because we use `LocalSourceNode`s as the start and end points of our `StepSummary::step` relation, there's no need to include `simpleLocalFlowStep` (via `typePreservingStep`) in `smallstep`. Indeed, since the successor node for a `step` is a `LocalSourceNode`, and local sources never have incoming flow, this is entirely futile -- we can find values for `mid` and `nodeTo` that satisfy the body of `step`, but `nodeTo` will never be a `LocalSourceNode`. With this in mind, we can simplify `smallstep` to only refer to `jumpStep`. This then brings the other uses of `typePreservingStep` into question. The only other place we use this predicate is in the `TypeTracker` and `TypeBackTracker` `smallstep` predicates. Note, however, that here we no longer need `jumpStep` to be part of `typeTrackingStep` (as it is already accounted for in `StepSummary::smallstep`) so we can simplify to `simpleLocalFlowStep`. At this point, `typePreservingStep` is unused. Finally, because of the way `smallstep` is used in `step` (inside `StepSummary`), `nodeTo` must always be a `LocalSourceNode`, so I have propagated this restriction to `smallstep` as well. We can always lift this restriction later, but for now it seems like it's likely to cause fewer surprises to have made this explicit.	2021-04-21 11:12:06 +00:00
Rasmus Wriedt Larsen	775ed41592	Python: Update SensitiveDataHeuristics with newer JS version which also prompted me to rewrite the QLDoc for `nameIndicatesSensitiveData`	2021-04-21 11:34:01 +02:00
Rasmus Wriedt Larsen	16b62486e9	Python: Extract SensitiveDataHeuristics to be shared with JS Initially I had called `nameIndicatesSensitiveData` for `maybeSensitiveName`, which made the relationship with `maybeSensitive` and `notSensitive` quite strange -- and therefore I added the more informative `maybeSensitiveRegexp` and `notSensitiveRegexp`. Although I'm no longer using `maybeSensitiveName`, and I no longer have a strong argument for making this name change, I still like it. If someone thinks this is a terrible idea, I'm happy to change it though 👍	2021-04-21 11:31:28 +02:00
yoff	0c4181178d	Update python/ql/src/semmle/python/frameworks/Stdlib.qll Co-authored-by: Taus <tausbn@github.com>	2021-04-20 22:15:09 +02:00
yoff	ef0ea247c4	Merge pull request #5679 from tausbn/python-fix-bad-points-to-joins Python: Fix bad points-to joins	2021-04-20 21:19:32 +02:00
Rasmus Lerchedahl Petersen	6408ee2eaf	Python: Fix bad join	2021-04-20 20:03:06 +02:00
Rasmus Lerchedahl Petersen	fc2c62350e	Python: Fix bad join Also fixed up the QLDoc	2021-04-20 18:54:03 +02:00
Taus	890f96d9b5	Python: Prevent bad joins in `TypeBackTracker` Perhaps unsurprisingly, the join orderer was eager and willing to find the wrong join order in this predicate as well. Applying a similar fix to the one used in `TypeTracker::step` fixes the problem.	2021-04-20 15:01:04 +00:00
Taus	c0569da65c	Python: Move `track`/`backtrack` to `LocalSourceNode` This is merely making explicit what was implicitly enforced. The move to change the return type of `step` already meant that `this` and `result` had to be `LocalSourceNode`. By moving these methods to their rightful place, we should hopefully avoid a bit of suprising behaviour.	2021-04-20 14:39:56 +00:00
Taus	2a07441c19	Python: `ModuleVariableNode`s are not API uses This caused some suprising test changes, where suddenly we had flow from a `ModuleVariableNode` (as a `RemoteFlowSource`) to a sink. This of course makes little sense, so instead we simply exclude these nodes as uses in the first place.	2021-04-20 14:33:42 +00:00
Rasmus Lerchedahl Petersen	9c893cb0f4	Merge branch 'main' of github.com:github/codeql into python-port-insecure-protocol	2021-04-20 16:33:03 +02:00
Taus	7581cbade6	Python: Fix forgotten type tracker This was the last remaining type tracker that did not use `LocalSourceNode`.	2021-04-20 14:32:56 +00:00
Taus	38548c9acd	Python: Simplify charpred for `LocalSourceNode` The somewhat convoluted `comes_from_cfgnode` was originally introduced in order to have local sources for instances of global variables. This was needed because global variables have an implicit "scope entry" SSA definition that flows to the first actual use of the variable (and so would not fit the strict "has no incoming flow" definition of a local source node). However, a subsequent change means that we include all global variable reads anyway, and so the old definition is no longer needed. (See commit `3fafb47b16` for further context.)	2021-04-20 13:19:36 +00:00
Taus	a55b43b67e	Python: Use `LocalSourceNode` throughout `step` This commit does a lot of stuff all at once, so here are the main highlights: In `TypeTracker.qll`, we change `StepSummary::step` to step only between source nodes. Because reads and writes of global variables happen in two different (jump) steps, this requires the intermediate `ModuleVariableNode` to _also_ be a `LocalSourceNode`, and we therefore modify the charpred for that class accordingly. (This also means changing a few of the tests to account for these new source nodes.) In addition, we change `TypeTracker::step` to likewise step between local source nodes. Next, to enable the use of the `track` convenience method on nodes, we add some pragmas to `TypeTracker::step` that prevent bad joins from occurring. With this, we can eliminate all of the manual type tracker join predicates. Next, we observe that because `StepSummary::step` now uses `flowsTo`, it automatically encapsulates all local-flow steps. In particular this means we do not have to use `typePreservingStep` in `smallstep`, but can use `jumpStep` directly. A similar observation applies to `TypeTracker::smallstep`. Having done this, we no longer need `typePreservingStep`, so we get rid of it.	2021-04-20 12:59:33 +00:00
Taus	31bd701bd5	Python: Final `LocalSourceNode` fixes	2021-04-20 12:59:33 +00:00
Rasmus Wriedt Larsen	897105de02	Merge pull request #5717 from tausbn/python-use-api-graphs-in-django Python: Use API graphs in Django model	2021-04-20 14:57:55 +02:00
thank_you	7773c53124	Replace any(string) with _ wildcard	2021-04-20 08:49:08 -04:00
thank_you	bbd3552392	Rename predicate to getQuery	2021-04-20 08:47:37 -04:00
thank_you	c5fbbc0551	Refactor SqlAlchemy model - Replaced classes that look for SqlAlchemy instances with predicates - General clean-up of code	2021-04-19 18:56:00 -04:00
Taus	bc6685aa3f	Python: Fix typo Co-authored-by: Rasmus Wriedt Larsen <rasmuswriedtlarsen@gmail.com>	2021-04-19 19:57:35 +02:00
Taus	9acc71a7cb	Python: Get rid of all `_attr` methods in `Django.qll`	2021-04-19 11:54:10 +00:00

... 13 14 15 16 17 ...

3152 Commits