codeql

mirror of https://github.com/github/codeql.git synced 2025-12-20 02:44:30 +01:00

Author	SHA1	Message	Date
Rasmus Lerchedahl Petersen	f8dba85e0f	Python: `PathNormalization` `os.path.normpath`	2020-10-21 22:21:40 +02:00
Rasmus Lerchedahl Petersen	17273dd27e	Python: Add `FileSystemAccess` `open`	2020-10-21 22:01:49 +02:00
Rasmus Lerchedahl Petersen	4570c29a11	Python: port query	2020-10-21 21:40:42 +02:00
Rasmus Lerchedahl Petersen	eb5ed23354	Python: Add TaintTracking2	2020-10-21 21:39:50 +02:00
Rasmus Lerchedahl Petersen	da77cbb3d0	Python: concepts PathCheck and PathNormalization Should they be in a module?	2020-10-21 21:37:43 +02:00
Rasmus Wriedt Larsen	278c071fe6	Python: Modernise werkzeug FileStorage modeling	2020-10-21 20:36:40 +02:00
Rasmus Wriedt Larsen	d0fdb542e5	Python: Modernise werkzeug MultiDict modeling	2020-10-21 20:30:20 +02:00
Rasmus Wriedt Larsen	b6bd70a5da	Python: Modernise flask library modeling Two interesting things happened while doing this: 1. I found out that you can't use the same name to define a submodule as any parent module. So we need give unique names to the top-level module, and the module for modeling the `flask.Flask` class. I randomly choose a new name for the top-level module to get things moving (and not be stuck in bikeshedding forever). 2. With this new setup, I wanted to expose the `route` and `add_url_rule` methods on instances of `flask.Flask`. It wasn't quite obvious how to do so. I simply lumped them next to `classRef()` and `instance()`, without too much care. I did consider putting them inside a `instance` module, which would allow you to access them by `flask::Flask::instance::route()`, but I wasn't quite sure, and just did something easy to get moving.	2020-10-21 20:30:14 +02:00
Rasmus Wriedt Larsen	62d665ecb3	Python: Fix shared QLDoc for InstanceSource	2020-10-21 19:55:40 +02:00
Rasmus Wriedt Larsen	047a326183	Python: Remove reference to old PR These have been added to internal tracking issue instead	2020-10-21 19:53:25 +02:00
Rasmus Lerchedahl Petersen	2e8cbbd866	Python: Add concept `FileSystemAccess`	2020-10-21 17:35:21 +02:00
Rasmus Wriedt Larsen	a6abee9b3a	Merge pull request #4476 from yoff/python-port-sql-injection Python: Port SqlInjection	2020-10-21 15:55:19 +02:00
yoff	ea4ea6b3e6	Merge pull request #4529 from tausbn/python-remove-cartesian-product-in-tkwoverflownode Python: Remove bad join in `getCallableScope`	2020-10-21 15:36:38 +02:00
Rasmus Lerchedahl Petersen	060481053a	Python: Add note about incompleteness I was going to do this in an issue, but it makes sense to have it in the code. We could still add an issue as well.	2020-10-21 15:15:19 +02:00
Rasmus Lerchedahl Petersen	c57c798bfa	Python: Add TODO	2020-10-21 15:10:40 +02:00
yoff	ee5221abb4	Apply suggestions from code review Co-authored-by: Rasmus Wriedt Larsen <rasmuswriedtlarsen@gmail.com>	2020-10-21 15:08:16 +02:00
Taus Brock-Nannestad	31c169daac	Python: Remove bad join in `getCallableScope` Clause timing report had this suspicious entry ``` CommandInjection.ql-12:DataFlowPublic::Node::getCallableScope#bbf .................. 7.2s (4 evaluations with max 6.4s in DataFlowPublic::Node::getCallableScope#bbf/3@i3#119d7b) ``` which indeed was a bad join: ``` Tuple counts for DataFlowPublic::Node::getCallableScope#bbf: 293509 ~2% {3} r1 = JOIN DataFlowPublic::Node::getCallableScope#bbf#prev_delta AS L WITH DataFlowPublic::TNode#f AS R ON FIRST 1 OUTPUT L.<1>, L.<0>, L.<2> 22337162 ~0% {3} r2 = JOIN r1 WITH Scope::Scope::getEnclosingScope_dispred#ff_10#join_rhs AS R ON FIRST 1 OUTPUT r1.<1>, r1.<2>, R.<1> 22337162 ~0% {3} r3 = r2 AND NOT DataFlowPublic::Node::getCallableScope#bbf#prev AS R(r2.<0>, r2.<2>, r2.<1>) 22337162 ~0% {3} r4 = SCAN r3 OUTPUT r3.<0>, r3.<2>, r3.<1> 722 ~1% {3} r5 = JOIN r4 WITH m#DataFlowPublic::Node::getCallableScope#bbf AS R ON FIRST 2 OUTPUT r4.<0>, r4.<1>, r4.<2> 722 ~1% {3} r6 = JOIN r5 WITH m#DataFlowPublic::Node::getCallableScope#bbf AS R ON FIRST 2 OUTPUT r5.<0>, r5.<2>, r5.<1> 722 ~1% {3} r7 = r6 AND NOT project#DataFlowPrivate::DataFlowCallable::getScope_dispred#ff AS R(r6.<2>) 722 ~1% {3} r8 = SCAN r7 OUTPUT r7.<0>, r7.<2>, r7.<1> return r8 ``` In this case, the join went away by simply moving the helper predicate out of the class it was situated in (and since it doesn't mention `this`, it didn't really belong there in the first place). Result: ``` DataFlowPublic.qll-8:DataFlowPublic::getCallableScope#ff ........................... 26ms (4 evaluations with max 15ms in DataFlowPublic::getCallableScope#ff/2@i3#709a9e) ```	2020-10-21 14:45:05 +02:00
yoff	9bd808c924	Merge pull request #4528 from tausbn/python-remove-cartesian-product-in-tkwoverflownode Python: Prevent early join on `argName` in `getArg`	2020-10-21 14:43:39 +02:00
Rasmus Lerchedahl Petersen	53ff1a32c1	Merge branch 'main' of github.com:github/codeql into python-port-sql-injection	2020-10-21 14:38:02 +02:00
Rasmus Lerchedahl Petersen	77d4cbc0df	Python: Only allow unsafe positional args to extra	2020-10-21 14:21:36 +02:00
Rasmus Lerchedahl Petersen	4571b3188c	Python: Fix false negative	2020-10-21 14:16:35 +02:00
yoff	75357727c4	Merge pull request #4490 from RasmusWL/python-model-django-sources Python: model Django HttpRequest as RemoteFlowSource	2020-10-21 13:46:51 +02:00
Rasmus Lerchedahl Petersen	7087522ed9	Python: Move doc reference	2020-10-21 13:30:38 +02:00
Taus Brock-Nannestad	eb3333c0ce	Python: Prevent early join on `argName` in `getArg`	2020-10-21 13:23:09 +02:00
Rasmus Lerchedahl Petersen	e51543ea79	Python: allow any positional argument in annotate	2020-10-21 13:17:20 +02:00
Rasmus Lerchedahl Petersen	2c855c739b	Python: Ignore relatively safe arguments of extra	2020-10-21 13:15:46 +02:00
Rasmus Lerchedahl Petersen	6805fb63cc	Python: Use modern pattern for RawSQL class	2020-10-21 13:14:32 +02:00
Rasmus Lerchedahl Petersen	e44247bb00	Python: Add links to function docs	2020-10-21 13:03:14 +02:00
yoff	8e7e107365	Apply suggestions from code review Co-authored-by: Rasmus Wriedt Larsen <rasmuswriedtlarsen@gmail.com>	2020-10-21 12:55:38 +02:00
CodeQL CI	eaed93fa7d	Merge pull request #4513 from RasmusWL/python-model-fabric Approved by yoff	2020-10-21 01:58:19 -07:00
Erik Krogh Kristensen	e18cf08d99	documentation changes based on review	2020-10-21 09:45:16 +02:00
Erik Krogh Kristensen	c1dba2ee9f	add a few `shouldPrint` calls to improve performance	2020-10-21 09:37:53 +02:00
Rasmus Lerchedahl Petersen	383d846396	Python: address review - smooth out future merge - keyword argument for execute	2020-10-21 00:15:05 +02:00
Rasmus Lerchedahl Petersen	e1dfbc0486	Python: address review	2020-10-20 23:59:44 +02:00
Erik Krogh Kristensen	3306b59a14	Update python/ql/src/semmle/python/PrintAst.qll Co-authored-by: yoff <lerchedahl@gmail.com>	2020-10-20 23:19:47 +02:00
yoff	01845d1278	Update python/ql/src/experimental/semmle/python/frameworks/Django.qll Co-authored-by: Rasmus Wriedt Larsen <rasmuswriedtlarsen@gmail.com>	2020-10-20 21:43:15 +02:00
Rasmus Wriedt Larsen	6053157b09	Python: Fix QLDoc to not mention sudo method on fabric Group	2020-10-20 17:30:09 +02:00
Taus Brock-Nannestad	0e2ec0dfb4	Python: Remove cartesian product in `TKwOverflowNode` With CP: (0s) Tuple counts for dom#DataFlowPublic::TKwOverflowNode#ff: 1209 ~0% {2} r1 = JOIN project#AstGenerated::Function_::getKwarg_dispred#ff AS L WITH ObjectAPI::CallableValue::getScope_dispred#ff_10#join_rhs AS R ON FIRST 1 OUTPUT R.<1>, L.<0> 4329 ~0% {3} r2 = JOIN r1 WITH DataFlowPrivate::ArgumentPassing::connects#bb_10#join_rhs AS R ON FIRST 1 OUTPUT R.<1>, r1.<1>, r1.<0> 7819 ~2% {4} r3 = JOIN r2 WITH Flow::CallNode::getArgByName_dispred#fff AS R ON FIRST 1 OUTPUT r2.<1>, r2.<2>, r2.<0>, R.<1> 7114 ~1% {4} r4 = r3 AND NOT Function::Function::getArgByName_dispred#fff_01#antijoin_rhs AS R(r3.<0>, r3.<3>) 7114 ~76% {2} r5 = SCAN r4 OUTPUT r4.<2>, r4.<1> 1123 ~0% {1} r6 = JOIN project#Exprs::Call::getKwargs_dispred#ff AS L WITH py_flow_bb_node_10#join_rhs AS R ON FIRST 1 OUTPUT R.<1> 1123 ~0% {1} r7 = JOIN r6 WITH Flow::CallNode#class#f AS R ON FIRST 1 OUTPUT r6.<0> 1357707 ~0% {2} r8 = JOIN r7 WITH project#AstGenerated::Function_::getKwarg_dispred#ff AS R CARTESIAN PRODUCT OUTPUT R.<0>, r7.<0> 1357707 ~0% {2} r9 = JOIN r8 WITH ObjectAPI::CallableValue::getScope_dispred#ff_10#join_rhs AS R ON FIRST 1 OUTPUT r8.<1>, R.<1> 1364821 ~0% {2} r10 = r5 \/ r9 return r10 Without CP: (13s) Tuple counts for dom#DataFlowPublic::TKwOverflowNode#ff: 1209 ~0% {2} r1 = JOIN project#AstGenerated::Function_::getKwarg_dispred#ff AS L WITH ObjectAPI::CallableValue::getScope_dispred#ff_10#join_rhs AS R ON FIRST 1 OUTPUT R.<1>, L.<0> 19175 ~4% {3} r2 = JOIN r1 WITH DataFlowPrivate::ArgumentPassing::connects#ff_10#join_rhs AS R ON FIRST 1 OUTPUT R.<1>, r1.<1>, r1.<0> 7819 ~2% {4} r3 = JOIN r2 WITH Flow::CallNode::getArgByName_dispred#fff AS R ON FIRST 1 OUTPUT r2.<1>, r2.<2>, r2.<0>, R.<1> 7114 ~1% {4} r4 = r3 AND NOT Function::Function::getArgByName_dispred#fff_01#antijoin_rhs AS R(r3.<0>, r3.<3>) 7114 ~76% {2} r5 = SCAN r4 OUTPUT r4.<2>, r4.<1> 1123 ~0% {1} r6 = JOIN project#Exprs::Call::getKwargs_dispred#ff AS L WITH py_flow_bb_node_10#join_rhs AS R ON FIRST 1 OUTPUT R.<1> 574 ~0% {2} r7 = JOIN r6 WITH DataFlowPrivate::ArgumentPassing::connects#ff AS R ON FIRST 1 OUTPUT R.<1>, r6.<0> 524 ~1% {3} r8 = JOIN r7 WITH ObjectAPI::CallableValue::getScope_dispred#ff AS R ON FIRST 1 OUTPUT R.<1>, r7.<1>, r7.<0> 291 ~0% {2} r9 = JOIN r8 WITH project#AstGenerated::Function_::getKwarg_dispred#ff AS R ON FIRST 1 OUTPUT r8.<1>, r8.<2> 7405 ~72% {2} r10 = r5 \/ r9 return r10	2020-10-20 17:29:26 +02:00
Rasmus Wriedt Larsen	cd116fc5f2	Python: Fix grammar in QLDoc Co-authored-by: yoff <lerchedahl@gmail.com>	2020-10-20 17:26:33 +02:00
Rasmus Wriedt Larsen	c8441dc4fb	Merge branch 'main' into python-model-django-sources	2020-10-20 15:38:20 +02:00
yoff	17155b64f5	Merge pull request #4514 from tausbn/python-add-module-boundary-flow-steps Python: Add module boundary flow steps	2020-10-20 14:36:10 +02:00
Taus Brock-Nannestad	a21c29507c	Python: Fix false negative I'm slightly suspicious of this fix -- it seems to work, but it makes me wonder if we're potentially missing other kinds of flow, by not handling other kinds of definitions. Also, I feel like this should really be attached to an appropriate post-update node of the given argument. As it is written now, the flow will go from the argument _before_ the call, which obviously misses a step if the argument is modified by the call. In practice, I would expect this to be rather rare.	2020-10-20 13:16:54 +02:00
Rasmus Lerchedahl Petersen	5990241c8f	Python: Support django models (with some caveats)	2020-10-20 03:20:00 +02:00
Taus	f5ec548e68	Python: Fix typo in QLDoc Co-authored-by: yoff <lerchedahl@gmail.com>	2020-10-19 23:51:38 +02:00
Rasmus Lerchedahl Petersen	d7308bddf2	Python: Add django sink with concept test	2020-10-19 21:34:55 +02:00
Taus Brock-Nannestad	7755993dd3	Python: Add jump steps for module attribute reads. This is the quick-and-dirty solution, as discussed. An even quicker-and-dirtier solution would have used `ModuleValue::attr` and take the `getOrigin` of that as the source of the jump step. However, this turns out to be a bad choice, since `attr` might fail to have a value for the given attribute (for a variety of reasons). Thus, we instead appeal to a helper predicate that keeps track of which names are defined by which right-hand-sides in a given module. (Observe that type tracking works correctly for `x` in `mymodule.py`, even though `x` is never assigned a value in the eyes of the Value API.) This means that points-to is only used to actually figure out if the object we're looking an attribute up on is a module or not. This is the next thing to replace in order to eliminate the dependence on points-to, but this will require some care to ensure that all module lookups are handled correctly. Only two test files needed to be changed for the tests to pass. The first was the fixed false negative in the type tracker, and the other was a bunch of missing flow in the regression test. I have manually removed the `# Flow not found` annotations to make them consistent with the output. Pay particular attention to the annotation on line 117 -- I believe it was misplaced and should have been on line 106 instead (where, indeed, we now have flow where none appeared before).	2020-10-19 19:13:32 +02:00
Rasmus Wriedt Larsen	98691fe8ec	Python: Model fabric Group execution (version 2.x) This required some thought for how to model that we're interested in subclasses of `fabric.group.Group`, and not so much that class itself. Some thoughts: --- After initially using this in `module Group` /** A reference to a subclass of `fabric.group.Group` */ abstract class SubclassRef extends DataFlow::Node { } private class SubclassInstantiation extends SubclassInstanceSource, DataFlow::CfgNode { override CallNode node; SubclassInstantiation() { node.getFunction() = any(SubclassRef ref).asCfgNode() } } with this in `module SerialGroup` and `module ThreadingGroup`: class ClassRef extends DataFlow::Node, fabric::group::Group::SubclassRef { ClassRef() { this = classRef(DataFlow::TypeTracker::end()) } } I wasn't too much of fan of that approach. Since we probably need the `SubclassInstanceSource` anyway, and don't really have a specific use for `SubclassRef`, I just went with concrete (QL) subclasses of `SubclassInstanceSource` in each of the modules for the Python subclasses. I really don't know what the best approach is, so I'm very open to suggestions. I think we'll really have to flesh this out for handling Django responses, since we're interested in the fact that some subclasses provide default values for the content-type, and keeping track of that is important for XSS (since there is no XSS if response is `text/plain`)	2020-10-19 18:09:11 +02:00
Rasmus Wriedt Larsen	f10456e35f	Python: Model fabric task decorator (version 2.x)	2020-10-19 18:03:03 +02:00
Rasmus Wriedt Larsen	c671017252	Python: Model fabric Connection (version 2.x)	2020-10-19 18:03:02 +02:00
Rasmus Wriedt Larsen	f7502386e7	Python: Model fabric package (version 1.x)	2020-10-19 18:03:01 +02:00

... 63 64 65 66 67 ...

4863 Commits