codeql

mirror of https://github.com/github/codeql.git synced 2025-12-20 18:56:32 +01:00

Author	SHA1	Message	Date
Anders Schack-Mulligen	a2bbacf58d	Java/C++/C#: Fix performance issue in partial paths exploration.	2020-03-09 11:30:59 +01:00
Anders Schack-Mulligen	f491fcd5ae	Java/C++/C#: Sync.	2020-03-09 11:05:13 +01:00
Jonas Jensen	a13f355a85	C++: autoformat fixup	2020-03-06 08:29:46 +01:00
Jonas Jensen	e29f517af2	C++: Move InstructionSanity out of Instruction.qll Having that module in `Instruction.qll` slowed down the parsing of that file both humans and the compiler. This commit moves the `InstructionSanity` module to `IRSanity.qll` without making any changes to its contents apart from adding some imports.	2020-03-05 12:11:50 +01:00
Jonas Jensen	6b2fd17f03	C++: IR: faster definitionReachesRank On Wireshark with 6GB RAM, I've observed `definitionReachesRank` to be the slowest predicate in the IR. It seems that the implementation was slow because the optimizer failed to eliminate the common `reachesRank - 1` subexpression. This led to context being pushed into the `not`, which got implemented as `MATERIALIZE`. That wouldn't normally be a disaster, but this is one of the largest predicates in the IR SSA construction, and iteration 2 was very slow. Before: (1505s) Starting to evaluate predicate SSAConstruction::DefUse::definitionReachesRank#ffff#cur_delta/4[1]@93f592 (iteration 1) (1535s) Tuple counts for SSAConstruction::DefUse::definitionReachesRank#ffff#cur_delta: 130670697 ~0% {4} r1 = SCAN project#SSAConstruction::DefUse::hasDefinitionAtRank#fffff AS I OUTPUT I.<0>, I.<1>, I.<2>, (I.<2> + 1) 130670697 ~6% {5} r2 = JOIN r1 WITH SSAConstruction::DefUse::exitRank#fff AS R ON FIRST 2 OUTPUT r1.<0>, r1.<1>, r1.<2>, r1.<3>, R.<2> 130670697 ~6% {5} r3 = SELECT r2 ON r2.<3> <= r2.<4> 130670697 ~0% {4} r4 = SCAN r3 OUTPUT r3.<0>, r3.<1>, r3.<2>, r3.<3> return r4 (1535s) - SSAConstruction::DefUse::definitionReachesRank#ffff_delta has 130670697 rows (order for disjuncts: delta=<standard>). (1535s) Starting to evaluate predicate SSAConstruction::DefUse::definitionReachesRank#ffff#cur_delta/4[2]@866c14 (iteration 2) (1626s) Tuple counts for SSAConstruction::DefUse::definitionReachesRank#ffff#cur_delta: 261341394 ~107% {4} r1 = JOIN SSAConstruction::DefUse::definitionReachesRank#ffff#prev_delta AS L WITH SSAConstruction::DefUse::definitionReachesRank#ffff#join_rhs AS R ON FIRST 3 OUTPUT R.<0>, R.<1>, R.<2>, (1 + L.<3>) 261341394 ~107% {4} r2 = r1 AND NOT SSAConstruction::DefUse::definitionReachesRank#ffff#prev AS R(r1.<0>, r1.<1>, r1.<2>, r1.<3>) 130670697 ~0% {5} r3 = SCAN r2 OUTPUT r2.<0>, r2.<1>, (r2.<3> - 1), r2.<2>, r2.<3> 106034590 ~1% {4} r4 = JOIN r3 WITH project#SSAConstruction::DefUse::hasDefinitionAtRank#fffff AS R ON FIRST 3 OUTPUT r3.<0>, r3.<1>, r3.<3>, r3.<4> 106034590 {4} r5 = MATERIALIZE r4 AS antijoin_rhs 24636107 ~3% {4} r6 = r2 AND NOT r5(r2.<0>, r2.<1>, r2.<2>, r2.<3>) 24636107 ~0% {5} r7 = JOIN r6 WITH SSAConstruction::DefUse::exitRank#fff AS R ON FIRST 2 OUTPUT r6.<0>, r6.<1>, r6.<2>, r6.<3>, R.<2> 2749441 ~0% {5} r8 = SELECT r7 ON r7.<3> <= r7.<4> 2749441 ~4% {4} r9 = SCAN r8 OUTPUT r8.<0>, r8.<1>, r8.<2>, r8.<3> return r9 (1626s) - SSAConstruction::DefUse::definitionReachesRank#ffff_delta has 2749441 rows (order for disjuncts: delta=<standard>). After: (12s) Tuple counts for SSAConstruction::DefUse::definitionReachesRank#ffff#cur_delta: 130670697 ~0% {4} r1 = SCAN project#SSAConstruction::DefUse::hasDefinitionAtRank#fffff AS I OUTPUT I.<0>, I.<1>, I.<2>, (I.<2> + 1) return r1 (12s) - SSAConstruction::DefUse::definitionReachesRank#ffff_delta has 130670697 rows (order for disjuncts: delta=<standard>). (12s) Starting to evaluate predicate SSAConstruction::DefUse::definitionReachesRank#ffff#cur_delta/4[2]@fff64c (iteration 2) (34s) Tuple counts for SSAConstruction::DefUse::definitionReachesRank#ffff#cur_delta: 108784031 ~0% {4} r1 = SSAConstruction::DefUse::definitionReachesRank#ffff#prev_delta AS L AND NOT SSAConstruction::DefUse::exitRank#fff AS R(L.<0>, L.<1>, L.<3>) 2749441 ~5% {4} r2 = r1 AND NOT project#SSAConstruction::DefUse::hasDefinitionAtRank#fffff AS R(r1.<0>, r1.<1>, r1.<3>) 2749441 ~4% {4} r3 = SCAN r2 OUTPUT r2.<0>, r2.<1>, r2.<2>, (r2.<3> + 1) 2749441 ~4% {4} r4 = r3 AND NOT SSAConstruction::DefUse::definitionReachesRank#ffff#prev AS R(r3.<0>, r3.<1>, r3.<2>, r3.<3>) return r4 (34s) - SSAConstruction::DefUse::definitionReachesRank#ffff_delta has 2749441 rows (order for disjuncts: delta=<standard>). Note that the row counts are exactly the same before and after.	2020-03-04 15:00:47 +01:00
Robert Marsh	1e3419fd60	C++/C#: generate IR for funcs excluded in PrintIR Previously, functions excluded from PrintIR would not have IR generated. This sometimes affected escacpe analysis of functions that were printed.	2020-03-03 14:34:08 -08:00
Mathias Vorreiter Pedersen	20529b4436	C++/C#: Sync identical files	2020-03-02 12:15:54 +01:00
Jonas Jensen	dab6691eb0	Merge pull request #2900 from dbartol/dbartol/void-buffer C++: Better fix for `void` type on buffer access	2020-03-02 09:00:15 +01:00
Jonas Jensen	ec85f9f1a1	Merge pull request #2797 from rdmarsh2/rdmarsh/cpp/malloc-alias-locations C++: Support dynamic memory allocations in IR alias analysis	2020-03-02 08:49:59 +01:00
Robert Marsh	28ee756c6a	Merge pull request #2934 from geoffw0/add_tests C++: Test and typos.	2020-02-28 15:12:32 -08:00
Geoffrey White	c6b0d4bbda	C#: Sync identical files.	2020-02-28 17:55:59 +00:00
semmle-qlci	ec90627a64	Merge pull request #2909 from yo-h/experimental Approved by aschackmull, jbj, max-schaefer, tausbn	2020-02-28 03:15:58 +00:00
Dave Bartolomeo	b0fb16c068	C++/C#: Fix formatting	2020-02-27 13:44:02 -05:00
Anders Schack-Mulligen	67d386b5ba	C++/C#: Add synchronization.	2020-02-27 14:10:16 +01:00
Robert Marsh	95a762c987	Merge master for submodule update	2020-02-26 13:44:26 -08:00
Robert Marsh	4333fe7905	Merge branch 'master' into rdmarsh/cpp/ir-flow-through-outparams	2020-02-26 13:15:27 -08:00
yo-h	43bcd5b26c	Add guidelines for experimental CodeQL queries and libraries	2020-02-24 15:08:31 -05:00
Jonas Jensen	2d9df70abc	Merge pull request #2887 from MathiasVP/fix-ir-gen-switch C++: Fix IR generation for switch statements	2020-02-24 13:29:27 +01:00
Mathias Vorreiter Pedersen	ed430ce855	C++/C#: Bind parameter in new case.	2020-02-24 09:12:14 +01:00
Mathias Vorreiter Pedersen	af364e66fc	C++/C#: Move sanity check inside InstructionSanity module and accept tests	2020-02-23 20:53:49 +01:00
Dave Bartolomeo	170331b105	C++: Better fix for `void` type on buffer access Fixes issue https://github.com/github/codeql-c-analysis-team/issues/20 This change undoes the workaround in https://github.com/Semmle/ql/pull/2736, and replaces it with a fix for the underlying cause. The problem was that the IR construction code for side effects incorrectly assumed that `BufferAccessOpcode` included `SizedBufferAccessOpcode`. I think that was actually a perfectly reasonable assumption to make, so I changed the `Opcode` hierarchy to make it true.	2020-02-21 18:46:32 -07:00
Mathias Vorreiter Pedersen	d9753b0ca5	C++/C#: Accept test output after adding sanity check to Instruction.qll	2020-02-21 15:09:53 +01:00
Mathias Vorreiter Pedersen	da41cbca06	C#: Add similar fix to translation of switch statements in C#	2020-02-21 13:33:54 +01:00
Anders Schack-Mulligen	771cb754c2	Merge pull request #2822 from hvitved/dataflow/node-cand-simple-call-context Data flow: Track simple call contexts in `nodeCand[Fwd]1`	2020-02-21 10:02:06 +01:00
Tom Hvitved	0cc3218115	Merge pull request #2872 from aschackmull/dataflow/pathstep-localflow-join Java/C++/C#: Improve join-order in pathStep predicate	2020-02-21 09:39:17 +01:00
Mathias Vorreiter Pedersen	780010d8f9	C++/C#: Sync identical files	2020-02-20 22:15:06 +01:00
Tom Hvitved	a772b82fea	Address review comments	2020-02-20 19:48:49 +01:00
Robert Marsh	bed6d2b225	Merge branch 'master' into rdmarsh/cpp/malloc-alias-locations	2020-02-19 16:44:13 -08:00
Anders Schack-Mulligen	91166431d2	Java/C++/C#: s/Callable/DataFlowCallable/	2020-02-19 17:23:01 +01:00
Anders Schack-Mulligen	c6016bb08c	Java/C++/C#: Improve join-order in pathStep predicate	2020-02-19 14:47:39 +01:00
Mathias Vorreiter Pedersen	59a19679ea	C++/C#: Sync identical files after merge	2020-02-19 11:06:00 +01:00
Mathias Vorreiter Pedersen	bbcc1e1c37	Merge branch 'master' into sync-ir-valuenumbering-internals	2020-02-19 10:43:48 +01:00
Robert Marsh	adfe5f30a1	Merge branch 'master' into rdmarsh/cpp/ir-flow-through-outparams	2020-02-18 08:42:27 -08:00
Tom Hvitved	a695b567ec	Data flow: Sync files	2020-02-17 19:39:52 +01:00
Tom Hvitved	3a4f52315c	Data flow: Track simple call contexts in `nodeCand[Fwd]1`	2020-02-17 19:37:35 +01:00
semmle-qlci	ecad925101	Merge pull request #2631 from hvitved/dataflow/generalize-flow-summaries Approved by aschackmull	2020-02-17 18:22:46 +00:00
Tom Hvitved	0e7838aca5	Data flow: Sync files	2020-02-17 15:08:26 +01:00
Tom Hvitved	c7aa31d90a	Address review comments	2020-02-17 15:07:53 +01:00
Tom Hvitved	dcdb5299f0	C#: Update expected test output	2020-02-17 10:52:02 +01:00
Tom Hvitved	7eae5f913c	C#: Update data-flow test	2020-02-17 10:45:44 +01:00
Tom Hvitved	28307399f8	Data flow: Sync files	2020-02-17 10:45:35 +01:00
Tom Hvitved	bc6c4744b1	Data flow: Follow-up changes to `FlowExploration` module	2020-02-17 10:43:26 +01:00
Tom Hvitved	307ac7f0b3	Data flow: Remove `UntypedAccessPath` again	2020-02-17 10:32:27 +01:00
Mathias Vorreiter Pedersen	8b8a8cae5b	C++/C#: Sync identical files	2020-02-14 16:11:57 +01:00
Mathias Vorreiter Pedersen	98db6d8fd7	C#: Fix format and sync files	2020-02-14 08:22:25 +01:00
Mathias Vorreiter Pedersen	d4c6f487bc	C++/C#: Fix sync config file for value numbering sharing	2020-02-13 22:32:52 +01:00
Tom Hvitved	09b1e8b161	C#: Update expected test output	2020-02-13 20:08:11 +01:00
Tom Hvitved	b5b0c2b8cf	Data flow: Sync files	2020-02-13 16:34:06 +01:00
Tom Hvitved	6be2bf8374	Address review comments	2020-02-13 16:34:06 +01:00
Jonas Jensen	033a4c30ea	C++: Fix perf of IR value numbering On some snapshots, notably ffmpeg, the IR `ValueNumbering` recursion would generate billions of tuples and eventually run out of space. It turns out it was fairly common for an `Instruction` to get more than one `ValueNumber` in the base cases for `VariableAddressInstruction` and `InitializeParameterInstruction`, and it could also happen in an instruction with more than one operand of the same `OperandTag`. When a binary operation was applied to an instruction with `m` value numbers and another instruction with `n` value numbers, the result would get `m * n` value numbers. This led to doubly-exponential growth in the number of value numbers in rare cases. The underlying reason why a `VariableAddressInstruction` could get multiple value numbers is that it was keyed on the associated `IRVariable`, and the `IRVariable` is defined in part by the type of its underlying `Variable` (or other AST element). If the extractor defines a variable to have multiple types because of linker ambiguity, this leads to the creation of multiple `IRVariable`s. That should ideally be solved in `TIRVariable.qll`, but for now I've put a workaround in `ValueNumberingInternal.qll` instead. To remove the problem with instructions having multiple operands, the construction in `Operand.qll` will now filter out any such operand. It wasn't enough to apply that filter to the `raw` stage, so I've applied it to all three stages.	2020-02-12 14:38:41 +01:00

... 19 20 21 22 23 ...

2355 Commits