Basic query for Python code¶
Learn to write and run a simple CodeQL query using Visual Studio Code with the CodeQL extension.
For information about installing the CodeQL extension for Visual Studio code, see “Installing CodeQL for Visual Studio Code.”
About the query¶
The query we’re going to run performs a basic search of the code for if statements that are redundant, in the sense that they only include a pass statement. For example, code such as:
if error: pass
Finding a CodeQL database to experiment with¶
Before you start writing queries for Python code, you need a CodeQL database to run them against. The simplest way to do this is to download a database for a repository that uses Python directly from GitHub.com.
In Visual Studio Code, click the QL icon
in the left sidebar to display the CodeQL extension.Click From GitHub or the GitHub logo
at the top of the CodeQL extension to open an entry field.Copy the URL for the repository into the field and press the keyboard Enter key. For example, https://github.com/saltstack/salt.
Optionally, if the repository has more than one CodeQL database available, select
pythonto download the database created from the Python code.
Information about the download progress for the database is shown in the bottom right corner of Visual Studio Code. When the download is complete, the database is shown with a check mark in the Databases section of the CodeQL extension (see screenshot below).
Running a quick query¶
The CodeQL extension for Visual Studio Code adds several CodeQL: commands to the command palette including Quick Query, which you can use to run a query without any set up.
From the command palette in Visual Studio Code, select CodeQL: Quick Query.
After a moment, a new tab quick-query.ql is opened, ready for you to write a query for your currently selected CodeQL database (here a
pythondatabase). If you are prompted to reload your workspace as a multi-folder workspace to allow Quick queries, accept or create a new workspace using the starter workflow.
In the quick query tab, delete
select ""and paste the following query beneath the import statementimport python.from If ifstmt, Stmt pass where pass = ifstmt.getStmt(0) and pass instanceof Pass select ifstmt, "This 'if' statement is redundant."
Save the query in its default location (a temporary “Quick Queries” directory under the workspace for
GitHub.vscode-codeql/quick-queries).Right-click in the query tab and select CodeQL: Run Query on Selected Database. (Alternatively, run the command from the Command Palette.)
The query will take a few moments to return results. When the query completes, the results are displayed in a CodeQL Query Results view, next to the main editor view.
The query results are listed in two columns, corresponding to the expressions in the
selectclause of the query. The first column corresponds to the expressionifstmtand is linked to the location in the source code of the project whereifstmtoccurs. The second column is the alert message.
If any matching code is found, click a link in the ifstmt column to open the file and highlight the matching if statement.
Note
If you want to move your experimental query somewhere more permanent, you need to move the whole
Quick Queriesdirectory. The directory is a CodeQL pack with aqlpack.ymlfile that defines the content as queries for Python CodeQL databases. For more information about CodeQL packs, see “Managing CodeQL query packs and library packs.”
About the query structure¶
After the initial import statement, this simple query comprises three parts that serve similar purposes to the FROM, WHERE, and SELECT parts of an SQL query.
Query part |
Purpose |
Details |
|---|---|---|
|
Imports the standard CodeQL libraries for Python. |
Every query begins with one or more |
|
Defines the variables for the query.
Declarations are of the form:
|
We use:
|
|
Defines a condition on the variables. |
In other words, the first statement contained in the |
|
Defines what to report for each match.
|
Reports the resulting |
Extend the query¶
Query writing is an inherently iterative process. You write a simple query and then, when you run it, you discover examples that you had not previously considered, or opportunities for improvement.
Remove false positive results¶
Browsing the results of our basic query shows that it could be improved. Among the results you are likely to find examples of if statements with an else branch, where a pass statement does serve a purpose. For example:
if cond():
pass
else:
do_something()
In this case, identifying the if statement with the pass statement as redundant is a false positive. One solution to this is to modify the query to ignore pass statements if the if statement has an else branch.
To exclude if statements that have an else branch:
Extend the
whereclause to include the following extra condition:and not exists(ifstmt.getOrelse())
The
whereclause is now:where pass = ifstmt.getStmt(0) and pass instanceof Pass and not exists(ifstmt.getOrelse())
Re-run the query.
There are now fewer results because
ifstatements with anelsebranch are no longer included.