Order of SQL Execution
The
FROM
clause, and subsequent JOIN
s are first executed to determine the total working set of data that is being queried. This includes sub-queries in this clause, and can cause temporary tables to be created under the hood containing all the columns and rows of the tables being joined.Once we have the total working set of data, the first-pass
WHERE
constraints are applied to the individual rows, and rows that do not satisfy the constraint are discarded. Each of the constraints can only access columns directly from the tables requested in the FROM
clause. Aliases in the SELECT
part of the query are not accessible in most databases since they may include expressions dependent on parts of the query that have not yet executed.The remaining rows after the
WHERE
constraints are applied are then grouped based on common values in the column specified in the GROUP BY
clause. As a result of the grouping, there will only be as many rows as there are unique values in that column. Implicitly, this means that you should only need to use this when you have aggregate functions in your query.If the query has a
GROUP BY
clause, then the constraints in the HAVING
clause are then applied to the grouped rows, discard the grouped rows that don't satisfy the constraint. Like the WHERE
clause, aliases are also not accessible from this step in most databases.Any expressions in the
SELECT
part of the query are finally computed.Of the remaining rows, rows with duplicate values in the column marked as
DISTINCT
will be discarded.If an order is specified by the
ORDER BY
clause, the rows are then sorted by the specified data in either ascending or descending order. Since all the expressions in the SELECT
part of the query have been computed, you can reference aliases in this clause.Finally, the rows that fall outside the range specified by the
LIMIT
and OFFSET
are discarded, leaving the final set of rows to be returned from the query.Last modified 1yr ago