codeql

mirror of https://github.com/github/codeql.git synced 2025-12-18 09:43:15 +01:00

Author	SHA1	Message	Date
Rasmus Lerchedahl Petersen	c2cd58edc4	python: rewrite to separate configurations source nodes get duplicated, so perhaps flow states are actually better for performance?	2022-02-01 14:36:11 +01:00
Rasmus Lerchedahl Petersen	c587084758	python: use standard `InstanceSource` construction	2022-02-01 13:31:16 +01:00
Rasmus Wriedt Larsen	f7a0b17ed6	Merge pull request #7687 from yoff/python/PathInjection-FlowState python: Rewrite path injection query to use flow state	2022-02-01 11:33:37 +01:00
Rasmus Lerchedahl Petersen	119a7e4f34	python: provide links for Flask	2022-02-01 10:55:45 +01:00
Rasmus Lerchedahl Petersen	7511b33512	python: "command" -> "log"	2022-02-01 10:23:16 +01:00
yoff	45f0bfd8f0	Apply suggestions from code review Co-authored-by: Rasmus Wriedt Larsen <rasmuswriedtlarsen@gmail.com>	2022-02-01 10:06:37 +01:00
yoff	c03f89d712	Apply suggestions from code review Co-authored-by: Rasmus Wriedt Larsen <rasmuswriedtlarsen@gmail.com>	2022-02-01 10:04:26 +01:00
Taus	4a29095e3b	Python: Fix bad join order in `TPythonTuple` TL;DR: Something introduced the following bad join order: ``` (227s) Tuple counts for dom#TObject::TPythonTuple#ff/2@i2#8f58670w after 3m46s: 25000 ~0% {2} r1 = SCAN PointsToContext::PointsToContext::appliesToScope_dispred#ff#prev_delta OUTPUT In.1, In.0 'context' 24000 ~1% {2} r2 = JOIN r1 WITH @py_scope#f ON FIRST 1 OUTPUT Lhs.1 'context', Lhs.0 1076876712 ~6% {3} r3 = JOIN r2 WITH Flow::TupleNode#class#f CARTESIAN PRODUCT OUTPUT Rhs.0, Lhs.0 'context', Lhs.1 870129666 ~0% {3} r4 = JOIN r3 WITH Flow::ControlFlowNode::isLoad_dispred#f ON FIRST 1 OUTPUT Lhs.1 'context', Lhs.2, Lhs.0 'origin' 870129000 ~0% {3} r5 = r4 AND NOT dom#TObject::TPythonTuple#ff#prev(Lhs.2 'origin', Lhs.0 'context') 870129000 ~1% {3} r6 = SCAN r5 OUTPUT In.2 'origin', In.1, In.0 'context' 9000 ~0% {2} r7 = JOIN r6 WITH Flow::ControlFlowNode::getScope_dispred#ff ON FIRST 2 OUTPUT Lhs.0 'origin', Lhs.2 'context' return r7 ``` (...the above being the tuple counts _at the point when I cancelled the query_!) Rewriting the code to force a join between `TupleNode#class` and `getScope` results in the following join orders: ``` (0s) Tuple counts for TObject::scope_loads_tuplenode#ff/2@b3cf0bo5 after 13ms: 37369 ~3% {1} r1 = JOIN Flow::TupleNode#class#f WITH Flow::ControlFlowNode::isLoad_dispred#f ON FIRST 1 OUTPUT Lhs.0 'origin' 37369 ~3% {2} r2 = JOIN r1 WITH Flow::ControlFlowNode::getScope_dispred#ff ON FIRST 1 OUTPUT Rhs.1 's', Lhs.0 'origin' return r2 ``` and ``` (78s) Tuple counts for dom#TObject::TPythonTuple#ff/2@i53#121c440w after 6ms: 34736 ~3% {2} r1 = SCAN PointsToContext::PointsToContext::appliesToScope_dispred#ff#prev_delta OUTPUT In.1, In.0 'context' 7370 ~5% {2} r2 = JOIN r1 WITH TObject::scope_loads_tuplenode#ff ON FIRST 1 OUTPUT Lhs.1 'context', Rhs.1 'origin' 7370 ~5% {2} r3 = r2 AND NOT dom#TObject::TPythonTuple#ff#prev(Lhs.1 'origin', Lhs.0 'context') 7370 ~1% {2} r4 = SCAN r3 OUTPUT In.1 'origin', In.0 'context' return r4 ``` the latter being the largest iteration of `dom#TPythonTuple` throughout the log. No other major performance issues were observed.	2022-01-31 16:59:50 +00:00
Tom Hvitved	f2352d8272	Data flow: Inline `local(Expr\|Instruction)?(Flow\|Taint)` Computing a full transitive closure is often bad; by inlining all calls we are providing more context to the QL optimizer.	2022-01-31 14:33:41 +01:00
Rasmus Lerchedahl Petersen	8b5114d10e	python: Add standard customization setup - modernize the sanitizer, but do not make it less specific	2022-01-31 11:27:55 +01:00
Rasmus Lerchedahl Petersen	20d54543fd	python: move log injection out of experimental - move from custom concept `LogOutput` to standard concept `Logging` - remove `Log.qll` from experimental frameworks - fold models into standard models (naively for now) - stdlib: - make Logger module public - broaden definition of instance - add `extra` keyword as possible source - flak: add app.logger as logger instance - django: `add django.utils.log.request_logger` as logger instance (should we add the rest?) - remove LogOutput from experimental concepts	2022-01-31 11:27:55 +01:00
Rasmus Lerchedahl Petersen	211345c010	python: remove more annotations	2022-01-31 11:20:59 +01:00
Rasmus Lerchedahl Petersen	cac3862659	python: remove library annotation to clean up QL warnings. Should put these in a private module instead?	2022-01-31 08:50:37 +01:00
Rasmus Lerchedahl Petersen	0c3bce1415	python: deprecation I am slightly concerned that the test now generates many more intermediate results. I suppose that maes the analysis heavy. Should the new library get a new name instead, so the old code does not get evaluated?	2022-01-31 08:32:24 +01:00
Rasmus Wriedt Larsen	3e71d7f9bb	Python: Add note about `/` for Django upload_to I did a test locally, something like import requests req = requests.Request( "POST", "http://127.0.0.1:8000/app/upload-test/", data={"name": "foo"}, files={"upload" : ("wat/haha\|!#$%^&", open("foo.txt", "rb"))}, ) # print(req.prepare().body.decode('ascii')) requests.session().send(req.prepare()) and the `wat/` part was stripped from the filename	2022-01-28 12:17:46 +01:00
yoff	74d57bbb1a	Update python/ql/lib/semmle/python/dataflow/new/internal/DataFlowPrivate.qll Co-authored-by: Taus <tausbn@github.com>	2022-01-28 11:38:29 +01:00
Rasmus Lerchedahl Petersen	a026120c52	Python: Move configuration over and refine it The original configuration did not match sinks with sanitizers. Here it is resolved using flow state, it could also be done by using two configurations.	2022-01-28 09:00:40 +01:00
Rasmus Lerchedahl Petersen	d539920661	Python: Update list of frameworks	2022-01-28 08:58:30 +01:00
Rasmus Wriedt Larsen	4338c06b0d	Python: Support Django FileField.upload_to	2022-01-27 17:20:16 +01:00
Rasmus Lerchedahl Petersen	b93c04bb79	python: Add reverse flow in some patterns Particularly in value and literal patterns. This is getting a little bit into the guards aspect of matching. We could similarly add reverse flow in terms of sub-patterns storing to a sequence pattern, a flow step from alternatives to an-or-pattern, etc.. It does not seem too likely that sources are embedded in patterns to begin with, but for secrets perhaps? It is illustrated by the literal test. The value test still fails. I believe we miss flow in general from the static attribute.	2022-01-27 15:20:23 +01:00
github-actions[bot]	634134f283	Release preparation for version 2.8.0	2022-01-27 10:40:20 +00:00
Rasmus Lerchedahl Petersen	cb52ab669e	python: address review comments The comment about `py_scopes` was simply removed	2022-01-27 11:17:00 +01:00
yoff	e28669e487	Apply suggestions from code review Co-authored-by: Taus <tausbn@github.com>	2022-01-27 10:31:43 +01:00
Rasmus Lerchedahl Petersen	163c888781	python: port concepts and implementations	2022-01-26 19:05:37 +01:00
Rasmus Lerchedahl Petersen	47af3a69a5	Merge branch 'main' of github.com:github/codeql into python/support-match	2022-01-26 11:39:46 +01:00
Edoardo Pirovano	1b539eb4dc	Merge branch `rc/3.4` into `main`	2022-01-25 16:22:01 +00:00
Rasmus Wriedt Larsen	301318020f	Merge pull request #7455 from haby0/py/add-shutil-module-path-injection-sinks Python: Add shutil module sinks for path injection query	2022-01-24 20:06:36 +01:00
Rasmus Lerchedahl Petersen	41908cbf9f	python: add missing qldoc	2022-01-21 13:55:08 +01:00
Rasmus Lerchedahl Petersen	49d4b1480d	python: Do not remove `ChainedConfigs12.qll` since it was clearly already used. Add deprecation message instead.	2022-01-21 12:27:29 +01:00
Erik Krogh Kristensen	a235f8f023	remove redundant inline type casts	2022-01-21 11:46:33 +01:00
Erik Krogh Kristensen	ddfc3bc00f	use set literals instead of big disjunctions	2022-01-21 11:46:33 +01:00
yoff	5b9ae9cede	Merge pull request #7659 from RasmusWL/move-regex-injection-files Python: Move regex injection configuration files	2022-01-21 11:42:06 +01:00
yoff	4fd0ada9a8	Merge pull request #7652 from RasmusWL/cleartext-remove-fps Python: Remove usernames as sensitive source for cleartext queries	2022-01-21 11:30:40 +01:00
Rasmus Lerchedahl Petersen	a5bc5373d0	python: Rewrite path injection to use flow state This removes the FP cause by chaining This PR also removes `ChainedConfigs12.qll`, as we hope to solve future problems via flow states.	2022-01-21 09:26:48 +01:00
github-actions[bot]	ab218421da	Post-release preparation for codeql-cli-2.7.6	2022-01-20 12:59:20 +00:00
Erik Krogh Kristensen	4e8e3a7420	simplify expressions that could be type-casts	2022-01-20 10:41:35 +01:00
github-actions[bot]	4ce8ccc52b	Release preparation for version 2.7.6	2022-01-20 08:21:18 +00:00
Rasmus Lerchedahl Petersen	32cbeae05f	python: missing start tag for `relation`	2022-01-20 08:56:12 +01:00
Rasmus Lerchedahl Petersen	d10ad3bdd4	python: update stats for tables	2022-01-20 08:42:32 +01:00
Rasmus Lerchedahl Petersen	7e9a9e3d9a	python: remove compiler warnings	2022-01-19 18:01:58 +01:00
Rasmus Wriedt Larsen	b9ee2960e2	Python: Add change-note	2022-01-19 17:24:53 +01:00
Rasmus Wriedt Larsen	aa10ad6a8a	Python: Fix RegexInjection query, add old deprecated versions	2022-01-19 17:22:44 +01:00
Rasmus Wriedt Larsen	e82ea7ad17	Python: move regex injection configuration files I did not notice that these went to the wrong location in https://github.com/github/codeql/pull/6693. They should be in the dataflow folder with the rest of the data-flow configurations files, the injection folder is for old points-to based modeling.	2022-01-19 17:21:46 +01:00
Rasmus Lerchedahl Petersen	a0e79c1d7a	update stats for types - should still update stats for tables	2022-01-19 16:38:19 +01:00
Rasmus Wriedt Larsen	e82e648ca1	Python: Remove usernames as sensitive source for cleartext queries Closes #6363, #6927, #6726, #7497, #7116	2022-01-19 15:25:21 +01:00
Rasmus Lerchedahl Petersen	db253e8939	python: upgrade and downgrade scripts	2022-01-19 15:22:57 +01:00
Rasmus Lerchedahl Petersen	36e18d5d80	python: dataflow for match - also update `validTest.py`, but commented out for now otherwise CI will fail until we force it to run with Python 3.10 - added debug utility for dataflow (`dataflowTestPaths.ql`)	2022-01-19 14:29:58 +01:00
Rasmus Lerchedahl Petersen	bb210f4172	pythos: SSA for match - new SSA definition `PatternCaptureDefinition` - new SSA definition `PatternAliasDefinition` - implement `hasDefiningNode`	2022-01-19 14:29:58 +01:00
Rasmus Lerchedahl Petersen	de8ecb214f	python: Wrappers for database classes - new syntactic category `Pattern` (in `Patterns.qll`) - subpatterns available on statments - new statements `MatchStmt` and `Case` (`Match` would conflict with the shared ReDoS library) - new expression `Guard` - support for pattern lists	2022-01-19 14:29:58 +01:00
Rasmus Lerchedahl Petersen	b17f844f35	python: New generated files	2022-01-19 13:36:32 +01:00

... 47 48 49 50 51 ...

2826 Commits