Why You Should (Sometimes) Avoid Expressions in SQL Predicates

I’ve recently discovered a rather significant performance issue on a productive Oracle 11g customer database. And I’m sure you have this issue too, which is why I’m documenting it here. This is a simplified representation of the setup at the customer site:

        ID PAYMENT_DATE TEXT                               
---------- ------------ -----------------------------------
     33803 21.05.16     DcTNBOrkQIgMtbietUWOsSFNMIqGLlDw...
     29505 09.03.16     VIuPaOAQqzCMlFBYPQtvqUSbWYPDndJD...
     10738 25.07.16     TUkxGpZPrGKaHzDRxczrktkFWvGmiwjR...

Let’s produce the above table with the following statement:


CREATE TABLE payment (
  id            NOT NULL PRIMARY KEY,
  payment_date  NOT NULL,
  text
) AS
SELECT
  level,
  SYSDATE - dbms_random.value(1, 500),
  dbms_random.string('a', 500)
FROM dual
CONNECT BY level &lt;= 50000
ORDER BY dbms_random.value;

CREATE INDEX i_payment_date ON payment(payment_date);

EXEC dbms_stats.gather_table_stats('TEST', 'PAYMENT');

What we’re doing here: There’s a payment table with an ID primary key column, an indexed payment_date column and some “text” info. The real table is, of course, much bigger than this. Now, this table needs infrequent house keeping. In nightly batch jobs, the customer ran through the table and removed all payments that were older than some fixed amount of days, e.g.:


DELETE
FROM payment
WHERE payment_date &lt; SYSDATE - 470 // Older than 470 days

This might look fine at first, because:

We have an index on payment_date, and it can be used for the deletion
SYSDATE is constant per query, so the expression SYSDATE - 470 should also be constant per query

We can prove both statements easily:

1. Index usage

Let’s run:


EXPLAIN PLAN FOR
DELETE
FROM payment
WHERE payment_date &lt; SYSDATE - 470;

SELECT * FROM TABLE(dbms_xplan.display);

And we’re getting:

----------------------------------------------------------------------------
| Id  | Operation         | Name           | Rows  | Cost (%CPU)| Time     |
----------------------------------------------------------------------------
|   0 | DELETE STATEMENT  |                |  3008 |    10   (0)| 00:00:01 |
|   1 |  DELETE           | PAYMENT        |       |            |          |
|*  2 |   INDEX RANGE SCAN| I_PAYMENT_DATE |  3008 |    10   (0)| 00:00:01 |
----------------------------------------------------------------------------
                                                                            
Predicate Information (identified by operation id):                         
---------------------------------------------------
                                                                            
   2 - access("PAYMENT_DATE"<SYSDATE@!-470)

So, we get a relatively cheap access predicate on the I_PAYMENT_DATE index, and there’s this weird SYSDATE@!-470 constant in the plan.

2. SYSDATE being constant

There’s a lot of confusion about what the actual value of these non-deterministic, non-pure date/time functions is in each database. Sometimes, the function is evaluated on each row individually and produces a new value each time. In my opinion, that’s the worst, as such functions are completely unpredictable. Sometimes, there’s a guarantee that these timestamps stay the same for the entire duration of a transaction. That’s a bit weird as a default, but OK why not. In Oracle, it seems that SYSDATE (and SYSTIMESTAMP) are evaluated only a single time on a per-statement level. This would explain that weird SYSDATE@! notation in the execution plan, and it can be “proven” by doing something like this:


CREATE OR REPLACE FUNCTION sleep(i NUMBER) RETURN NUMBER 
AS
BEGIN
  dbms_lock.sleep(i);
  RETURN i;
END;
/

SELECT 
  min(CAST(sysdate AS TIMESTAMP)), 
  max(CAST(sysdate AS TIMESTAMP))
FROM dual
CONNECT BY level &lt;= 5 AND sleep(1) &gt; 0;

DROP FUNCTION sleep;
/

The result of this statement (which unsurprisingly takes around 4 seconds to execute) is:

MIN(CAST(SYSDATEASTIMESTAMP)) MAX(CAST(SYSDATEASTIMESTAMP))
----------------------------- -----------------------------
01.11.16 11:15:20.000000000   01.11.16 11:15:20.000000000

Some background info on this OTN thread here. Unfortunately, the SYSDATE documentation fails to clearly specify this behaviour…

So, what’s the problem?

Even if we’re probably relying on an undocumented feature, it looks like everything is fine, right? SYSDATE - 470 is more or less a constant, so we’re fine putting it in that WHERE clause. Wrong! Let’s run a benchmark, comparing 3 times an equivalent query running each query 100 times (and for repeatability, we use SELECT, not DELETE):


SET SERVEROUTPUT ON
DECLARE
  v_ts TIMESTAMP WITH TIME ZONE;
  v_repeat CONSTANT NUMBER := 100;
  v_range CONSTANT NUMBER := 470;
  v_date CONSTANT DATE := SYSDATE - v_range;
BEGIN
  v_ts := SYSTIMESTAMP;
  
  -- Original query with inline expression
  FOR i IN 1..v_repeat LOOP
    FOR rec IN (
      SELECT *
      FROM payment
      WHERE payment_date &lt; SYSDATE - v_range
    ) LOOP
      NULL;
    END LOOP;
  END LOOP;
    
  dbms_output.put_line('Statement 1 : ' || (SYSTIMESTAMP - v_ts));
  v_ts := SYSTIMESTAMP;
  
  -- Pre-calculated PL/SQL local variable
  FOR i IN 1..v_repeat LOOP
    FOR rec IN (
      SELECT *
      FROM payment
      WHERE payment_date &lt; v_date
    ) LOOP
      NULL;
    END LOOP;
  END LOOP;
    
  dbms_output.put_line('Statement 2 : ' || (SYSTIMESTAMP - v_ts));
  v_ts := SYSTIMESTAMP;
  
  -- Magical 11g scalar subquery caching
  FOR i IN 1..v_repeat LOOP
    FOR rec IN (
      SELECT *
      FROM payment
      WHERE payment_date &lt; (SELECT SYSDATE - v_range FROM dual)
    ) LOOP
      NULL;
    END LOOP;
  END LOOP;
    
  dbms_output.put_line('Statement 3 : ' || (SYSTIMESTAMP - v_ts));
END;
/

The above benchmark uses 3 techniques that all produce the same result but have different timing characteristics:

is using an inline expression in the predicate
is using a bind variable
is using a scalar subquery

When we check out the estimated execution plans, nothing seems to indicate that any solution might be better than the other:


EXPLAIN PLAN FOR
SELECT *
FROM payment
WHERE payment_date &lt; SYSDATE - :v_range;

SELECT * FROM TABLE(dbms_xplan.display);

EXPLAIN PLAN FOR
SELECT *
FROM payment
WHERE payment_date &lt; :v_date;

SELECT * FROM TABLE(dbms_xplan.display);

-- CAST is there because of a &quot;bug&quot; in the EXPLAIN PLAN parser
EXPLAIN PLAN FOR
SELECT *
FROM payment
WHERE payment_date &lt; (
  SELECT CAST(SYSDATE - :v_range AS DATE) FROM dual
);

SELECT * FROM TABLE(dbms_xplan.display);

The output is:

--------------------------------------------------------------------------------------
| Id  | Operation                   | Name           | Rows  | Cost (%CPU)| Time     |
--------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT            |                |  2500 |   453   (0)| 00:00:06 |
|   1 |  TABLE ACCESS BY INDEX ROWID| PAYMENT        |  2500 |   453   (0)| 00:00:06 |
|*  2 |   INDEX RANGE SCAN          | I_PAYMENT_DATE |   450 |     3   (0)| 00:00:01 |
--------------------------------------------------------------------------------------
                                                                                      
Predicate Information (identified by operation id):                                   
---------------------------------------------------                                   
                                                                                      
   2 - access("PAYMENT_DATE"<SYSDATE@!-:V_RANGE)                                      


--------------------------------------------------------------------------------------
| Id  | Operation                   | Name           | Rows  | Cost (%CPU)| Time     |
--------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT            |                |  2500 |   453   (0)| 00:00:06 |
|   1 |  TABLE ACCESS BY INDEX ROWID| PAYMENT        |  2500 |   453   (0)| 00:00:06 |
|*  2 |   INDEX RANGE SCAN          | I_PAYMENT_DATE |   450 |     3   (0)| 00:00:01 |
--------------------------------------------------------------------------------------
                                                                                      
Predicate Information (identified by operation id):                                   

Predicate Information (identified by operation id):                                   
---------------------------------------------------                                   
                                                                                      
   2 - access("PAYMENT_DATE"<:V_DATE)                                                 


--------------------------------------------------------------------------------------
| Id  | Operation                   | Name           | Rows  | Cost (%CPU)| Time     |
--------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT            |                |  2500 |   455   (0)| 00:00:06 |
|   1 |  TABLE ACCESS BY INDEX ROWID| PAYMENT        |  2500 |   453   (0)| 00:00:06 |
|*  2 |   INDEX RANGE SCAN          | I_PAYMENT_DATE |   450 |     3   (0)| 00:00:01 |
|   3 |    FAST DUAL                |                |     1 |     2   (0)| 00:00:01 |
--------------------------------------------------------------------------------------
                                                                                      

Predicate Information (identified by operation id):                                   
---------------------------------------------------                                   
                                                                                      
   2 - access("PAYMENT_DATE"< (SELECT CAST(SYSDATE@!-:V_RANGE AS DATE) FROM           
              "SYS"."DUAL" "DUAL"))

The only exception is that in the 3rd plan, we have this FAST DUAL operation which hints at scalar subquery caching kicking in. Let’s compare actual results, though! (as always, I’m not publishing the real results, just qualitative numbers in a fictive unit of measurement to comply with Oracle legal blah blah)

-- Run 1
Statement 1 : 1.98 wombos
Statement 2 : 1.17 wombos
Statement 3 : 0.80 wombos

-- Run 2
Statement 1 : 1.49 blorfs
Statement 2 : 1.19 blorfs
Statement 3 : 0.78 blorfs

-- Run 3
Statement 1 : 1.46 gnarls
Statement 2 : 1.20 gnarls
Statement 3 : 0.75 gnarls

Wow! As you can see, using a bind variable certainly helps, but when you apply scalar subquery caching, THIS is where you get the most benefits. Now, this query is just a very simple example. The actual customer query was immensely more complex, and trust me – the performance improvement was 10x (I couldn’t believe it myself at first, and I still don’t know exactly why!).

Conclusion (for Oracle 11g)

Modern optimisers “recognise” a lot of expressions to be constant. For instance, in most databases, it doesn’t matter if you’re writing COUNT(1) or COUNT(*), they’re both translated to the same thing. In this particular case, however, I was quite disappointed by the fact that there is a significant difference between these three perfectly equivalent queries as far as I’m concerned, and the least intuitive solution using scalar subquery caching performed the best on Oracle 11g.

But wait (Oracle 12c)

I’ve tested the same behaviour on my Oracle 12c on Docker instance:

Statement 1 : 1.23 xmfs
Statement 2 : 1.11 xmfs
Statement 3 : 1.14 xmfs

Statement 1 : 1.27 xlorgs
Statement 2 : 1.07 xlorgs
Statement 3 : 1.30 xlorgs

Statement 1 : 1.23 glrls (I can invent units of measurement all day!)
Statement 2 : 1.00 glrls
Statement 3 : 1.39 glrls

… where now there seems to be a “regression” in the scalar subquery solution (it’s now the same as without the scalar subquery) and the bind variable solution now seems to be the fastest. If anything at all, this just proves that with SQL, you will always have to measure stuff on your end, because your setup and database versions may differ. The bottom line is: If performance of a single statement matters to you, chances are that you can improve things by 2-digit percentages with just some simple tricks, and in a batch job or under heavy load, this definitely matters! Certainly, you should not use any expressions of the form SYSDATE - some_value in your predicates.

Can you try your experiment using this slightly modified version, which uses TRUNC around the sysdate, so that the three SQL’s would process the same amount of data (because otherwise with 1 and 3, as sysdate moves on, the queries are slightly different each time).

This is what I got on 11.2.0.4

SQL> SET SERVEROUTPUT ON
SQL> DECLARE
  2    v_ts TIMESTAMP;
  3    v_repeat CONSTANT NUMBER := 1000;
  4    v_range CONSTANT NUMBER := 470;
  5    v_date CONSTANT DATE := trunc(SYSDATE) - v_range;
  6  BEGIN
  7    v_ts := SYSTIMESTAMP;
  8
  9    -- Original query with inline expression
 10    FOR i IN 1..v_repeat LOOP
 11      FOR rec IN (
 12        SELECT *
 13        FROM payment
 14        WHERE payment_date < trunc(SYSDATE) - v_range
 15      ) LOOP
 16        NULL;
 17      END LOOP;
 18    END LOOP;
 19
 20    dbms_output.put_line('Statement 1 : ' || (SYSTIMESTAMP - v_ts));
 21    v_ts := SYSTIMESTAMP;
 22
 23    -- Pre-calculated PL/SQL local variable
 24    FOR i IN 1..v_repeat LOOP
 25      FOR rec IN (
 26        SELECT *
 27        FROM payment
 28        WHERE payment_date < v_date
 29      ) LOOP
 30        NULL;
 31      END LOOP;
 32    END LOOP;
 33
 34    dbms_output.put_line('Statement 2 : ' || (SYSTIMESTAMP - v_ts));
 35    v_ts := SYSTIMESTAMP;
 36
 37    -- Magical 11g scalar subquery caching
 38    FOR i IN 1..v_repeat LOOP
 39      FOR rec IN (
 40        SELECT *
 41        FROM payment
 42        WHERE payment_date < (SELECT trunc(SYSDATE) - v_range FROM dual)
 43      ) LOOP
 44        NULL;
 45      END LOOP;
 46    END LOOP;
 47
 48    dbms_output.put_line('Statement 3 : ' || (SYSTIMESTAMP - v_ts));
 49  END;
 50  /
Statement 1 : +000000000 00:00:10.035000000
Statement 2 : +000000000 00:00:08.524000000
Statement 3 : +000000000 00:00:09.955000000

PL/SQL procedure successfully completed.

which is pretty much what I would have expected, namely:

bind fastest
scalar slightly slower
the remaining one being slowest, because of the cost of evaluating of "sysdate-val" every time

3 thoughts on “Why You Should (Sometimes) Avoid Expressions in SQL Predicates”

lukaseder says:

November 2, 2016 at 10:09

Thanks a lot for following up. Good point about truncation. Will repeat soon and provide feedback.

Loading...

Interesting. I still get the same result:

Statement 1 : +000000000 00:00:12.297000000
Statement 2 : +000000000 00:00:11.119000000
Statement 3 : +000000000 00:00:06.348000000

Statement 1 : +000000000 00:00:12.116000000
Statement 2 : +000000000 00:00:09.999000000
Statement 3 : +000000000 00:00:06.293000000

Statement 1 : +000000000 00:00:13.321000000
Statement 2 : +000000000 00:00:10.877000000
Statement 3 : +000000000 00:00:06.231000000

I’m running:

SELECT * FROM v$version;

Oracle Database 11g Express Edition Release 11.2.0.2.0 - 64bit Production
PL/SQL Release 11.2.0.2.0 - Production
"CORE	11.2.0.2.0	Production"
TNS for 64-bit Windows: Version 11.2.0.2.0 - Production
NLSRTL Version 11.2.0.2.0 - Production

… but that particular customer is running 11.2.0.4 Enterprise Edition (if I remember correctly) and they also had the TRUNC(SYSDATE) call and they also had the most drastic improvement through scalar subquery caching…

Why You Should (Sometimes) Avoid Expressions in SQL Predicates

1. Index usage

2. SYSDATE being constant

So, what’s the problem?

Conclusion (for Oracle 11g)

But wait (Oracle 12c)

Like this:

Published by lukaseder

3 thoughts on “Why You Should (Sometimes) Avoid Expressions in SQL Predicates”

Leave a ReplyCancel reply

1. Index usage

2. SYSDATE being constant

So, what’s the problem?

Conclusion (for Oracle 11g)

But wait (Oracle 12c)

Like this:

Published by lukaseder

3 thoughts on “Why You Should (Sometimes) Avoid Expressions in SQL Predicates”

Leave a ReplyCancel reply

Discover more from Java, SQL and jOOQ.