How to Avoid Excessive Sorts in Window Functions

Usually, this blog is 100% pro window functions and advocates using them at every occasion. But like any tool, window functions come at a price, and we must carefully evaluate whether that's a price we're willing to pay. That price can be a sort operation. And as we all know, sort operations are expensive: they follow O(n log n) complexity, which should be avoided at all costs for large data sets.

In a previous post, I’ve described how to calculate a running total with window functions (among other ways). In this post, we’re going to calculate the cumulative revenue at each payment in our Sakila database.

SELECT
  customer_id,
  payment_date,
  amount,
  SUM(amount) OVER (
    PARTITION BY customer_id
    ORDER BY payment_date, payment_id
  ) cumulative_amount
FROM payment
ORDER BY customer_id, payment_date, payment_id;

The above will yield something like this:

customer_id |payment_date        |amount |cumulative_amount 
------------|--------------------|-------|------------------
1           |2005-05-25 11:30:37 |2.99   |2.99              
1           |2005-05-28 10:35:23 |0.99   |3.98              
1           |2005-06-15 00:54:12 |5.99   |9.97              
1           |2005-06-15 18:02:53 |0.99   |10.96             
1           |2005-06-15 21:08:46 |9.99   |20.95             
1           |2005-06-16 15:18:57 |4.99   |25.94             
...

As can be seen, in spreadsheet notation, cumulative_amount[N] = cumulative_amount[N-1] + amount[N].

Reusing this calculation in several queries

As in any other language, we don’t want to repeat ourselves, so the SQL way of doing DRY is to create a view or a table valued function. Let’s create a view, first. Something like this:

CREATE VIEW payment_with_revenue AS
SELECT
  customer_id,
  payment_date,
  amount,
  SUM(amount) OVER (
    PARTITION BY customer_id
    ORDER BY payment_date, payment_id
  ) cumulative_amount
FROM payment

Now, we can do nice things like this:

SELECT 
  customer_id,
  payment_date,
  amount,
  cumulative_amount
FROM payment_with_revenue
WHERE customer_id IN (1, 2, 3)
AND payment_date 
  BETWEEN DATE '2005-05-25'
  AND     DATE '2005-05-29'
ORDER BY customer_id, payment_date

yielding:

customer_id |payment_date        |amount |cumulative_amount 
------------|--------------------|-------|------------------
1           |2005-05-25 11:30:37 |2.99   |2.99              
1           |2005-05-28 10:35:23 |0.99   |3.98              
2           |2005-05-27 00:09:24 |4.99   |4.99              
3           |2005-05-27 17:17:09 |1.99   |1.99              

What about performance?

Now, if we have an index on (CUSTOMER_ID, PAYMENT_DATE), we'd expect to be able to use it, right? Because our predicate seems quite selective, as this query shows:

SELECT 
  count(*),
  count(*) FILTER (
    WHERE customer_id IN (1, 2, 3)
  ),
  count(*) FILTER (
    WHERE customer_id IN (1, 2, 3)
    AND payment_date < DATE '2005-05-29'
  ) 
FROM payment;

yielding:

count |count |count 
------|------|-----
16049 |85    |4     

(To learn more about the cool FILTER clause, read this article here)
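
The composite index referenced above doesn't ship with Sakila out of the box (the plans below also show a pre-existing foreign key index on CUSTOMER_ID only). Assuming the name that appears as IDX_PAYMENT_I1 in the execution plans further down, it could be created with something like this:

CREATE INDEX idx_payment_i1 ON payment (customer_id, payment_date);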

How could we best use the index? Let's look again at our original query, but this time with the view inlined as a derived table (aliased "inlined"):

SELECT 
  customer_id,
  payment_date,
  amount,
  cumulative_amount
FROM (
  SELECT
    customer_id,
    payment_date,
    amount,
    SUM(amount) OVER (
      PARTITION BY customer_id
      ORDER BY payment_date, payment_id
    ) cumulative_amount
  FROM payment
) inlined
WHERE customer_id IN (1, 2, 3)
AND payment_date 
  BETWEEN DATE '2005-05-25'
  AND     DATE '2005-05-29'
ORDER BY customer_id, payment_date;

We should be able to apply two transformations that would allow the index to be used:

CUSTOMER_ID IN (1, 2, 3) predicate

The CUSTOMER_ID IN (1, 2, 3) predicate should be pushed down into the view, "past" the window function, because it does not affect the window function calculation, which partitions the data set by CUSTOMER_ID. By pushing it "past" the window function, I mean applying it before the window function is calculated – window functions are evaluated late in the logical order of SELECT clause operations.

This means that our original query should be equivalent to this one:

SELECT 
  customer_id,
  payment_date,
  amount,
  cumulative_amount
FROM (
  SELECT
    customer_id,
    payment_date,
    amount,
    SUM(amount) OVER (
      PARTITION BY customer_id
      ORDER BY payment_date, payment_id
    ) cumulative_amount
  FROM payment
  WHERE customer_id IN (1, 2, 3) -- Pushed down
) inlined
WHERE payment_date 
  BETWEEN DATE '2005-05-25'
  AND     DATE '2005-05-29'
ORDER BY customer_id, payment_date;

The PAYMENT_DATE predicate

The PAYMENT_DATE predicate is a bit trickier. It cannot be pushed "past" the window function completely, because that would alter the semantics of the window function, which calculates the cumulative amount over the frame RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW (the default frame, if we do not specify one).
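
Spelled out explicitly, the window specification from the view is therefore equivalent to this (a redundant but clarifying formulation of the same default):

  SUM(amount) OVER (
    PARTITION BY customer_id
    ORDER BY payment_date, payment_id
    RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW
  ) cumulative_amount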

But intuitively (and if you want to spend the time: formally as well), we can show that we can at least push the upper bound of our range predicate into the view, like this:

SELECT 
  customer_id,
  payment_date,
  amount,
  cumulative_amount
FROM (
  SELECT
    customer_id,
    payment_date,
    amount,
    SUM(amount) OVER (
      PARTITION BY customer_id
      ORDER BY payment_date, payment_id
    ) cumulative_amount
  FROM payment
  WHERE customer_id IN (1, 2, 3)
  AND payment_date <= DATE '2005-05-29' -- Pushed down
) inlined
WHERE payment_date >= DATE '2005-05-25'
ORDER BY customer_id, payment_date;

And now, we can profit from the index very easily! But is this transformation being done by any database? Unfortunately not. Some databases manage to push down the “more obvious” CUSTOMER_ID predicate past the window function, but none can do the same with the “less obvious” range predicate on PAYMENT_DATE:

DB2 LUW 10.5

The CUSTOMER_ID predicate is pushed down into the view, which generates an index scan (blue) on the pre-existing foreign key index (which doesn't contain the PAYMENT_DATE column), but the PAYMENT_DATE predicate itself is only applied much later, using an in-memory filter (red):

Explain Plan                                                       
-------------------------------------------------------------------
ID | Operation                       |                  Rows | Cost
 1 | RETURN                          |                       |   40
 2 |  FILTER                         |     4 of 80 (  5.00%) |   40
 3 |   TBSCAN                        |    80 of 80 (100.00%) |   40
 4 |    SORT                         |    80 of 80 (100.00%) |   40
 5 |     NLJOIN                      |               80 of 3 |   40
 6 |      TBSCAN GENROW              |      3 of 3 (100.00%) |    0
 7 |      FETCH PAYMENT              |    27 of 27 (100.00%) |   13
 8 |       IXSCAN IDX_FK_CUSTOMER_ID | 27 of 16049 (   .17%) |    6
                                                                   
Predicate Information                                              
 2 - RESID (Q5.PAYMENT_DATE <= '2005-05-29')                       
     RESID ('2005-05-25' <= Q5.PAYMENT_DATE)                       
 5 - JOIN (Q3.CUSTOMER_ID = Q2.$C0)                                
 8 - START (Q3.CUSTOMER_ID = Q2.$C0)                               
      STOP (Q3.CUSTOMER_ID = Q2.$C0)                               

Compare this with the plan of the manually optimised query:

Explain Plan                                                  
--------------------------------------------------------------
ID | Operation                   |                 Rows | Cost
 1 | RETURN                      |                      |   40
 2 |  FILTER                     |     4 of 4 (100.00%) |   40
 3 |   TBSCAN                    |     4 of 4 (100.00%) |   40
 4 |    SORT                     |     4 of 4 (100.00%) |   40
 5 |     NLJOIN                  |               4 of 1 |   40
 6 |      TBSCAN GENROW          |     3 of 3 (100.00%) |    0
 7 |      FETCH PAYMENT          |     1 of 1 (100.00%) |   13
 8 |       IXSCAN IDX_PAYMENT_I1 | 1 of 16049 (   .01%) |    6
                                                              
Predicate Information                                         
 2 - RESID ('2005-05-25' <= Q5.PAYMENT_DATE)                  
 5 - JOIN (Q3.CUSTOMER_ID = Q2.$C0)                           
 8 - START (Q3.CUSTOMER_ID = Q2.$C0)                          
      STOP (Q3.CUSTOMER_ID = Q2.$C0)                          
      STOP (Q3.PAYMENT_DATE <= '2005-05-29')                  

This is certainly a better plan.

MySQL 8.0.2

MySQL, very regrettably, doesn't seem to make any effort at all to optimise this. We're accessing the entire payment table to get this result.

id   table        type  rows    filtered    Extra
-----------------------------------------------------------------------
1    <derived2>   ALL   16086    3.33       Using where
2    payment      ALL   16086  100.00       Using filesort

Here’s the manually optimised plan:

id   table        type  key             rows  filtered    Extra
-------------------------------------------------------------------------------
1    <derived2>   ALL                   4     3.33        Using where
2    payment      range idx_payment_i1  4      100.00     Using index condition

Oracle 12.2.0.1

Oracle also cannot do this beyond pushing the more obvious CUSTOMER_ID predicate into the view:

-------------------------------------------------------------------------------
| Id  | Operation                              | Name                 | Rows  |
-------------------------------------------------------------------------------
|   0 | SELECT STATEMENT                       |                      |       |
|*  1 |  VIEW                                  | PAYMENT_WITH_REVENUE |    80 |
|   2 |   WINDOW SORT                          |                      |    80 |
|   3 |    INLIST ITERATOR                     |                      |       |
|   4 |     TABLE ACCESS BY INDEX ROWID BATCHED| PAYMENT              |    80 |
|*  5 |      INDEX RANGE SCAN                  | IDX_FK_CUSTOMER_ID   |    80 |
-------------------------------------------------------------------------------
 
Predicate Information (identified by operation id):
---------------------------------------------------
 
   1 - filter(("PAYMENT_DATE">=TO_DATE('2005-05-25 00:00:00') AND 
              "PAYMENT_DATE"<=TO_DATE('2005-05-29 00:00:00')))
   5 - access(("CUSTOMER_ID"=1 OR "CUSTOMER_ID"=2 OR "CUSTOMER_ID"=3))

The manually optimised plan looks better:

-------------------------------------------------------------------------
| Id  | Operation                              | Name           | Rows  |
-------------------------------------------------------------------------
|   0 | SELECT STATEMENT                       |                |       |
|*  1 |  VIEW                                  |                |     1 |
|   2 |   WINDOW SORT                          |                |     1 |
|   3 |    INLIST ITERATOR                     |                |       |
|   4 |     TABLE ACCESS BY INDEX ROWID BATCHED| PAYMENT        |     1 |
|*  5 |      INDEX RANGE SCAN                  | IDX_PAYMENT_I1 |     1 |
-------------------------------------------------------------------------
 
Predicate Information (identified by operation id):
---------------------------------------------------
 
   1 - filter("PAYMENT_DATE">=TO_DATE('2005-05-25 00:00:00'))
   5 - access(("CUSTOMER_ID" IN (1, 2, 3)) AND 
              "PAYMENT_DATE"<=TO_DATE('2005-05-29 00:00:00'))

Much better cardinality estimates!

PostgreSQL 10

PostgreSQL’s version of the Sakila database uses a partitioned payment table, but that should be irrelevant for this analysis. The CUSTOMER_ID predicate could be pushed down…

QUERY PLAN                                                                                          
---------------------------------------------------------------------------------------------------
Subquery Scan on payment_with_revenue  (cost=117.06..124.45 rows=8 width=52)                       
  Filter: ((payment_date >= '2005-05-25') AND (payment_date <= '2005-05-29'))
-> WindowAgg  (cost=117.06..121.49 rows=197 width=56)                                               
   -> Sort  (cost=117.06..117.55 rows=197 width=24)                                              
      Sort Key: payment.customer_id, payment.payment_date, payment.payment_id                  
      -> Result  (cost=0.29..109.55 rows=197 width=24)                                        
         -> Append  (cost=0.29..107.58 rows=197 width=24)                                  
            -> Index Scan using idx_fk.. on payment  (cost=0.29..18.21 rows=77 width=20)
               Index Cond: (customer_id = ANY ('{1,2,3}'::integer[]))
            -> Bitmap Heap Scan on payment_p2007_01  (cost=4.62..14.90 rows=20 width=26)
               Recheck Cond: (customer_id = ANY ('{1,2,3}'::integer[]))
               -> Bitmap Index Scan on idx_fk..  (cost=0.00..4.61 rows=20 width=0)
                  Index Cond: (customer_id = ANY ('{1,2,3}'::integer[]))
            -> Bitmap Heap Scan on payment_p2007_02  (cost=4.62..14.90 rows=20 width=26)
               Recheck Cond: (customer_id = ANY ('{1,2,3}'::integer[]))
               -> Bitmap Index Scan on idx_fk..  (cost=0.00..4.61 rows=20 width=0)
                  Index Cond: (customer_id = ANY ('{1,2,3}'::integer[]))
            ...

But manual optimisation is required to get better behaviour for the date range:

QUERY PLAN                                                                                           
-----------------------------------------------------------------------------------------------------
Subquery Scan on inlined  (cost=18.46..18.56 rows=3 width=48)                                        
  Filter: (inlined.payment_date >= '2005-05-25'::date)                    
-> WindowAgg  (cost=18.46..18.52 rows=3 width=52)                                                 
   -> Sort  (cost=18.46..18.46 rows=3 width=20)                                                
      Sort Key: payment.customer_id, payment.payment_date, payment.payment_id                
      -> Result  (cost=0.29..18.43 rows=3 width=20)                                         
         -> Append  (cost=0.29..18.40 rows=3 width=20)                                   
            -> Index Scan using idx_fk.. on payment  (cost=0.29..18.40 rows=3 width=20)
                Index Cond: (customer_id = ANY ('{1,2,3}'::integer[]))
                Filter: (payment_date <= '2005-05-29'::date)

Interestingly, the index still isn't used optimally on both columns, which has nothing to do with the current discussion on window functions. PostgreSQL seems unable to treat the IN predicate as an equality predicate. See also this article about other optimisations (such as predicate merging) that are not possible (yet) in PostgreSQL.

But still, this is much better, as it brings down the estimated cardinalities (in case this query is a subquery in a more sophisticated context), and more importantly, it filters out many, many rows prior to calculating the window function.

SQL Server 2014

Another database that cannot push down this predicate past the window function optimally. Only the “obvious” part is pushed down:

|--Sort(ORDER BY:([payment_date] ASC))
   |--Filter(WHERE:([payment_date]>='2005-05-25' AND [payment_date]<='2005-05-29'))
      |--Compute Scalar(DEFINE:([Expr1003]=CASE WHEN [Expr1004]=(0) THEN NULL ELSE [Expr1005] END))
         |--Stream Aggregate(GROUP BY:([WindowCount1009]) DEFINE:(..))
            |--Window Spool(RANGE BETWEEN:(UNBOUNDED, [[payment_date], [payment_id]]))
               |--Segment
                  |--Segment
                     |--Sort(ORDER BY:([customer_id] ASC, [payment_date] ASC, [payment_id] ASC))
                        |--Table Scan(OBJECT:([payment]), WHERE:([customer_id] IN (1, 2, 3)))

Interestingly, this doesn’t even use the index at all, but at least the data is filtered out prior to the calculation that relies on sorting. With the manual optimisation, again the same, much better effect:

|--Filter(WHERE:([payment_date]>='2005-05-25'))
   |--Compute Scalar(DEFINE:([Expr1003]=CASE WHEN [Expr1004]=(0) THEN NULL ELSE [Expr1005] END))
      |--Stream Aggregate(GROUP BY:([WindowCount1011]) DEFINE:(..))
         |--Window Spool(RANGE BETWEEN:(UNBOUNDED, [[payment_date], [payment_id]]))
            |--Segment
               |--Segment
                  |--Sort(ORDER BY:([payment_date] ASC, [payment_id] ASC))
                     |--Nested Loops(Inner Join, OUTER REFERENCES:([Bmk1000]))
                        |--Nested Loops(Inner Join, OUTER REFERENCES:([Expr1007], [Expr1008], [Expr1006]))
                        |  |--Compute Scalar(DEFINE:(([Expr1007],[Expr1008],[Expr1006])=GetRangeWithMismatchedTypes(NULL,'2005-05-29',(42))))
                        |  |  |--Constant Scan
                        |  |--Index Seek(OBJECT:([idx_payment_i1]), SEEK:([customer_id] IN (1, 2, 3) AND [payment_date] > [Expr1007] AND [payment_date] < [Expr1008]))
                        |--RID Lookup(OBJECT:([payment]))

Certainly, this is a bit cryptic to read but it really means the same thing as always: The manual optimisation worked and we got a better plan.

Meh, does it matter?

I hope so! Let’s benchmark these things against each other! Some info about our benchmarking technique in our previous post and on this page here. Specifically, we don’t publish actual execution times, only relative times within the benchmark as we do not want to compare databases against each other but only against themselves.

DB2 LUW 10.5

RUN |STMT |RATIO  |
----|-----|-------|
1   |1    |3.0890 |
1   |2    |1.2272 |
2   |1    |3.0624 |
2   |2    |1.0100 |
3   |1    |3.0389 |
3   |2    |1.0000 |
4   |1    |3.1566 |
4   |2    |1.0948 |
5   |1    |3.1817 |
5   |2    |1.0905 |

The manually optimised statement is 3x faster in our benchmark. Do bear in mind that we're operating on a rather small data set of only a few thousand rows! This gets worse with larger data sets.

MySQL 8.0.2

The difference is devastating in MySQL 8.0.2, which just recently introduced window functions. Surely, the MySQL team will be able to apply some further optimisations prior to GA – I’ve filed an issue for review:

RUN |STMT |RATIO    |
----|-----|---------|
0   |1    |431.1905 |
0   |2    |  1.0000 |
1   |1    |372.4286 |
1   |2    |  1.0000 |
2   |1    |413.4762 |
2   |2    |  1.0000 |
3   |1    |381.2857 |
3   |2    |  1.0000 |
4   |1    |400.1429 |
4   |2    |  1.2857 |

Oracle 12.2.0.1

Another factor 4x can be observed in Oracle:

Run 1, Statement 1 : 4.58751
Run 1, Statement 2 : 1.37639
Run 2, Statement 1 : 4.71833
Run 2, Statement 2 : 1.03693
Run 3, Statement 1 : 4.05729
Run 3, Statement 2 : 1.04719
Run 4, Statement 1 : 3.86653
Run 4, Statement 2 : 1
Run 5, Statement 1 : 3.99603
Run 5, Statement 2 : 1.0212

PostgreSQL 10

PostgreSQL is quite bad here, too. A factor of 7x can be observed:

RUN 1, Statement 1: 7.23373
RUN 1, Statement 2: 1.01438
RUN 2, Statement 1: 6.62028
RUN 2, Statement 2: 1.26183
RUN 3, Statement 1: 8.40322
RUN 3, Statement 2: 1.04074
RUN 4, Statement 1: 6.33401
RUN 4, Statement 2: 1.06750
RUN 5, Statement 1: 6.41649
RUN 5, Statement 2: 1.00000

SQL Server 2014

Another very significant penalty in SQL Server for the unoptimised version:

Run 1, Statement 1: 29.50000
Run 1, Statement 2: 1.07500
Run 2, Statement 1: 28.15000
Run 2, Statement 2: 1.00000
Run 3, Statement 1: 28.00000
Run 3, Statement 2: 1.00000
Run 4, Statement 1: 28.00000
Run 4, Statement 2: 1.00000
Run 5, Statement 1: 31.07500
Run 5, Statement 2: 1.00000

Bad news for views. Is there a better solution?

This is rather bad news for window functions inside of reusable views. None of the databases, not even DB2 or Oracle, can push down range predicates past a derived table's window function if the column in the range predicate doesn't correspond to the window function's PARTITION BY clause.

The problem described above can be easily fixed when the query is written manually, expanding all possible views into their calling SQL, but that kind of sucks – we’d love to make our code reusable. There’s one solution in databases that support inline table valued functions. Among the tested databases, these include:

  • DB2
  • PostgreSQL
  • SQL Server

MySQL doesn’t have table valued functions, and Oracle’s (very regrettably) are not inlineable because they have to be written in PL/SQL.

Here’s how to write these functions:

DB2

Function definition:

CREATE OR REPLACE FUNCTION f_payment_with_revenue (
  p_customer_id BIGINT,
  p_from_date DATE,
  p_to_date DATE
)
RETURNS TABLE (
  customer_id BIGINT,
  payment_date DATE,
  amount DECIMAL(10, 2),
  cumulative_amount DECIMAL(10, 2)
)
LANGUAGE SQL
RETURN
SELECT *
FROM (
  SELECT
    customer_id,
    payment_date,
    amount,
    SUM(amount) OVER (
      PARTITION BY customer_id
      ORDER BY payment_date, payment_id
    ) cumulative_amount
  FROM payment
  WHERE customer_id = p_customer_id
  AND payment_date <= p_to_date
) t
WHERE payment_date >= p_from_date;

Function call:

SELECT 
  payment_date,
  amount,
  cumulative_amount
FROM (
  SELECT customer_id FROM customer WHERE customer_id IN (1, 2, 3)
) c(customer_id),
TABLE(sakila.f_payment_with_revenue(
  c.customer_id,
  CAST('2005-05-25' AS DATE),
  CAST('2005-05-29' AS DATE)
))
ORDER BY payment_date;

Execution plan:

Explain Plan                                                    
----------------------------------------------------------------
ID | Operation                     |                 Rows | Cost
 1 | RETURN                        |                      |   33
 2 |  TBSCAN                       |     4 of 4 (100.00%) |   33
 3 |   SORT                        |     4 of 4 (100.00%) |   33
 4 |    NLJOIN                     |               4 of 1 |   33
 5 |     NLJOIN                    |               3 of 1 |   20
 6 |      TBSCAN GENROW            |     3 of 3 (100.00%) |    0
 7 |      IXSCAN PK_CUSTOMER       |   1 of 599 (   .17%) |    6
 8 |     FILTER                    |     1 of 1 (100.00%) |   13
 9 |      TBSCAN                   |     1 of 1 (100.00%) |   13
10 |       SORT                    |     1 of 1 (100.00%) |   13
11 |        FETCH PAYMENT          |     1 of 1 (100.00%) |   13
12 |         IXSCAN IDX_PAYMENT_I1 | 1 of 16049 (   .01%) |    6
                                                                
Predicate Information                                           
  5 - JOIN (Q3.CUSTOMER_ID = Q2.$C0)                            
  7 - START (Q3.CUSTOMER_ID = Q2.$C0)                           
       STOP (Q3.CUSTOMER_ID = Q2.$C0)                           
  8 - RESID ('2005-05-25' <= Q6.PAYMENT_DATE)                   
 12 - START (Q4.CUSTOMER_ID = Q3.CUSTOMER_ID)                   
       STOP (Q4.CUSTOMER_ID = Q3.CUSTOMER_ID)                   
       STOP (Q4.PAYMENT_DATE <= '2005-05-29')                   

Much better!

Benchmark result (Statement 1 = function call, Statement 2 = manually optimised):

RUN |STMT |RATIO  |
----|-----|-------|
1   |1    |1.5945 |
1   |2    |1.0080 |
2   |1    |1.6310 |
2   |2    |1.0768 |
3   |1    |1.5827 |
3   |2    |1.0090 |
4   |1    |1.5486 |
4   |2    |1.0084 |
5   |1    |1.5569 |
5   |2    |1.0000 |

Definitely a huge improvement. The comparison might not be entirely fair, because:

  • CROSS APPLY / LATERAL unnesting tends to generate nested loops that could be written more optimally with a classic join
  • We have an additional auxiliary customer table access (which could probably be tuned away with another rewrite)

PostgreSQL

Function definition:

CREATE OR REPLACE FUNCTION f_payment_with_revenue (
  p_customer_id BIGINT,
  p_from_date DATE,
  p_to_date DATE
)
RETURNS TABLE (
  customer_id SMALLINT,
  payment_date TIMESTAMP,
  amount DECIMAL(10, 2),
  cumulative_amount DECIMAL(10, 2)
)
AS $$
SELECT *
FROM (
  SELECT
    customer_id,
    payment_date,
    amount,
    SUM(amount) OVER (
      PARTITION BY customer_id
      ORDER BY payment_date, payment_id
    ) cumulative_amount
  FROM payment
  WHERE customer_id = p_customer_id
  AND payment_date <= p_to_date
) t
WHERE payment_date >= p_from_date
$$ LANGUAGE SQL;

Function call:

SELECT 
  payment_date,
  amount,
  cumulative_amount
FROM (
  SELECT customer_id FROM customer WHERE customer_id IN (1, 2, 3)
) c(customer_id)
CROSS JOIN LATERAL f_payment_with_revenue(
  c.customer_id,
  CAST('2005-05-25' AS DATE),
  CAST('2005-05-29' AS DATE)
)
ORDER BY payment_date;

Execution plan:

QUERY PLAN                                                                                    
----------------------------------------------------------------------------------------------
Sort  (cost=250.39..257.89 rows=3000 width=72)                                                
  Sort Key: f_payment_with_revenue.payment_date                                               
  ->  Nested Loop  (cost=0.53..77.13 rows=3000 width=72)                                      
        ->  Index Only Scan using customer_pkey on customer  (cost=0.28..16.88 rows=3 width=4)
              Index Cond: (customer_id = ANY ('{1,2,3}'::integer[]))                          
        ->  Function Scan on f_payment_with_revenue  (cost=0.25..10.25 rows=1000 width=72)    

Oops, no unnesting of the function is happening. The cardinality defaults to 1000. That’s bad news!

Benchmark result (Statement 1 = function call, Statement 2 = manually optimised):

RUN 1, Statement 1: 25.77538
RUN 1, Statement 2: 1.00000
RUN 2, Statement 1: 27.55197
RUN 2, Statement 2: 1.11581
RUN 3, Statement 1: 27.99331
RUN 3, Statement 2: 1.16463
RUN 4, Statement 1: 29.11022
RUN 4, Statement 2: 1.01159
RUN 5, Statement 1: 26.65781
RUN 5, Statement 2: 1.01654

Rats. This has gotten much worse than with the view. Not surprising, though. Table valued functions are not that good of an idea when they cannot be inlined! Oracle would have had a similar result if I weren't too lazy to translate my function to an ordinary PL/SQL table valued function, or a pipelined function.
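
For completeness, such an Oracle pipelined table function might look roughly like this – an untested sketch, with made-up object and collection type names, and, being PL/SQL, it would not be inlined either:

-- A sketch only: type and function bodies are illustrative, not benchmarked here
CREATE TYPE payment_with_revenue_t AS OBJECT (
  customer_id       NUMBER,
  payment_date      DATE,
  amount            NUMBER(10, 2),
  cumulative_amount NUMBER(10, 2)
);
/

CREATE TYPE payment_with_revenue_tt AS TABLE OF payment_with_revenue_t;
/

CREATE OR REPLACE FUNCTION f_payment_with_revenue (
  p_customer_id NUMBER,
  p_from_date   DATE,
  p_to_date     DATE
)
RETURN payment_with_revenue_tt PIPELINED
IS
BEGIN
  FOR r IN (
    SELECT customer_id, payment_date, amount, cumulative_amount
    FROM (
      SELECT
        customer_id,
        payment_date,
        amount,
        SUM(amount) OVER (
          PARTITION BY customer_id
          ORDER BY payment_date, payment_id
        ) cumulative_amount
      FROM payment
      WHERE customer_id = p_customer_id
      AND payment_date <= p_to_date
    )
    WHERE payment_date >= p_from_date
  ) LOOP
    -- Pipe each pre-filtered row back to the caller
    PIPE ROW (payment_with_revenue_t(
      r.customer_id, r.payment_date, r.amount, r.cumulative_amount
    ));
  END LOOP;
  RETURN;
END;
/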

SQL Server

Function definition:

CREATE FUNCTION f_payment_with_revenue (
  @customer_id BIGINT,
  @from_date DATE,
  @to_date DATE
)
RETURNS TABLE
AS RETURN
SELECT *
FROM (
  SELECT
    customer_id,
    payment_date,
    amount,
    SUM(amount) OVER (
      PARTITION BY customer_id
      ORDER BY payment_date, payment_id
    ) cumulative_amount
  FROM payment
  WHERE customer_id = @customer_id
  AND payment_date <= @to_date
) t
WHERE payment_date >= @from_date;

Function call:

SELECT 
  payment_date,
  amount,
  cumulative_amount
FROM (
  SELECT customer_id FROM customer WHERE customer_id IN (1, 2, 3)
) AS c(customer_id)
CROSS APPLY f_payment_with_revenue(
  c.customer_id,
  CAST('2005-05-25' AS DATE),
  CAST('2005-05-29' AS DATE)
)
ORDER BY payment_date;

Execution plan

|--Sort(ORDER BY:([payment_date] ASC))
   |--Nested Loops(Inner Join, OUTER REFERENCES:([customer_id]))
      |--Index Seek(OBJECT:([PK__customer__CD65CB84E826462D]), SEEK:([customer_id] IN (1, 2, 3))
      |--Filter(WHERE:([payment_date]>='2005-05-25'))
         |--Compute Scalar(DEFINE:([Expr1006]=CASE WHEN [Expr1007]=(0) THEN NULL ELSE [Expr1008] END))
            |--Stream Aggregate(GROUP BY:([WindowCount1014]) DEFINE:(..)))
               |--Window Spool(RANGE BETWEEN:(UNBOUNDED, [[payment_date], [payment_id]]))
                  |--Segment
                     |--Segment
                        |--Sort(ORDER BY:([payment_date] ASC, [payment_id] ASC))
                           |--Nested Loops(Inner Join, OUTER REFERENCES:([Bmk1003]))
                              |--Nested Loops(Inner Join, OUTER REFERENCES:([Expr1010], [Expr1011], [Expr1009]))
                              |  |--Compute Scalar(DEFINE:(([Expr1010],[Expr1011],[Expr1009])=GetRangeWithMismatchedTypes(NULL,'2005-05-29',(42))))
                              |  |  |--Constant Scan
                              |  |--Index Seek(OBJECT:([idx_payment_i1]), SEEK:([customer_id]=CONVERT_IMPLICIT(bigint,[customer_id],0) AND [payment_date] > [Expr1010] AND [payment_date] < [Expr1011]))
                              |--RID Lookup(OBJECT:([payment]), SEEK:([Bmk1003]=[Bmk1003]))

Again, super unreadable IMO, but after looking a bit more closely, we can see that the plan is almost the same as the manually optimised one, and the predicate is applied early on, where it belongs.

Benchmark result (Statement 1 = function call, Statement 2 = manually optimised):

Run 1, Statement 1: 2.50000
Run 1, Statement 2: 1.27778
Run 2, Statement 1: 2.11111
Run 2, Statement 2: 1.27778
Run 3, Statement 1: 2.11111
Run 3, Statement 2: 1.00000
Run 4, Statement 1: 2.22222
Run 4, Statement 2: 1.11111
Run 5, Statement 1: 2.02778
Run 5, Statement 2: 1.19444

Conclusion

Window functions are super cool and powerful. But they come at a price. They sort your data. Normally, when we write complex queries and reuse parts in views, we can profit from predicate push down operations into derived tables and views, which is something that most databases support (see also our previous blog post about such optimisations).

But when it comes to using window functions, they act like a "fence", past which only few predicates can be pushed automatically. It's not that it wouldn't be possible; it simply isn't done very well by most databases (and in the case of MySQL, not at all as of 8.0.2).

Inline table valued functions can be a remedy to avoid manual building of complex queries, such that at least some parts of your logic can be reused among queries. Unfortunately, they rely on CROSS APPLY or LATERAL JOIN, which can also cause performance issues in more complex setups. Besides, among the databases covered in this article, only DB2 and SQL Server support inline table valued functions. Oracle doesn’t support SQL functions at all, and PostgreSQL’s SQL functions are not inlinable (yet), which means that in these databases, in order to tune such queries, you might not be able to reuse the parts that use window functions in views or stored functions.

However, as always, do measure. Perhaps, a 4x waste of performance for a particular query is OK.

10 Cool SQL Optimisations That do not Depend on the Cost Model

Cost Based Optimisation is the de facto standard way to optimise SQL queries in most modern databases. It is the reason why it is really, really hard to implement a complex, hand-written algorithm in a 3GL (third generation programming language) such as Java that outperforms a dynamically calculated database execution plan generated by a modern optimiser. I've recently delivered a talk about that topic.

Today, we don’t want to talk about cost based optimisation, i.e. optimisations that depend on a database’s cost model. We’ll look into much simpler optimisations that can be implemented purely based on meta data (e.g. constraints) and the query itself. They’re usually no-brainers for a database to optimise, because the optimisation will always lead to a better execution plan, independently of whether there are any indexes, or how much data you have, or how skewed your data distribution is.

So, they’re not no-brainers in the sense whether they’re easy for the optimiser teams to implement, but they’re no-brainers in the sense whether they should be done.

These optimisations remove needless, optional work (as opposed to needless, mandatory work, which I've blogged about before).

Where do these optimisations apply?

Most of these optimisations are applied to:

  • Fix mistakes in queries
  • Allow for reusing complex views without actually executing the entire logic from the view

In the first case, you could claim: “Well, then fix the stupid SQL already”, but then again, who never makes any mistakes, right?

Specifically, the second case is really cool, as these optimisations allow us to build complex libraries of views and table valued functions, which we can reuse in several layers.

Databases being used

This post will evaluate 10 SQL optimisations on the 5 most popular RDBMS (according to the db-engines ranking):

  • Oracle 12.2
  • MySQL 8.0.2
  • SQL Server 2014
  • PostgreSQL 9.6
  • DB2 LUW 10.5

Throughout this article, I will be using queries against the Sakila database – as always.

Sakila database

These will be the 10 optimisation types:

  1. Transitive Closure
  2. Impossible Predicates and Unneeded Table Accesses
  3. JOIN Elimination
  4. Removing “Silly” Predicates
  5. Projections in EXISTS Subqueries
  6. Predicate Merging
  7. Provably Empty Sets
  8. CHECK Constraints
  9. Unneeded Self JOIN
  10. Predicate Pushdown

One final note before you move on: Many of the following examples might be too simple. Some databases (e.g. SQL Server) might not apply a specific optimisation on a query that is “too trivial”. See also the comments for details.

1. Transitive Closure

Let’s start with something simple: transitive closure. It’s a really trivial concept that applies to a variety of maths operations, e.g. to the equality operator. It can be said that if:

  • A = B and…
  • B = C

then:

  • A = C

Duh, right? But this has some nice implications on SQL optimisers.

Let’s look at an example. Let’s get all films for ACTOR_ID = 1:

SELECT first_name, last_name, film_id
FROM actor a
JOIN film_actor fa ON a.actor_id = fa.actor_id
WHERE a.actor_id = 1;

The result being:

FIRST_NAME      LAST_NAME  FILM_ID
PENELOPE        GUINESS    1
PENELOPE        GUINESS    23
PENELOPE        GUINESS    25
PENELOPE        GUINESS    106
PENELOPE        GUINESS    140
PENELOPE        GUINESS    166
...

Now, observe the execution plan if we run this query in Oracle:

--------------------------------------------------------------
| Id  | Operation                    | Name          | Rows  |
--------------------------------------------------------------
|   0 | SELECT STATEMENT             |               |       |
|   1 |  NESTED LOOPS                |               |    19 |
|   2 |   TABLE ACCESS BY INDEX ROWID| ACTOR         |     1 |
|*  3 |    INDEX UNIQUE SCAN         | PK_ACTOR      |     1 |
|*  4 |   INDEX RANGE SCAN           | PK_FILM_ACTOR |    19 |
--------------------------------------------------------------
 
Predicate Information (identified by operation id):
---------------------------------------------------
 
   3 - access("A"."ACTOR_ID"=1)
   4 - access("FA"."ACTOR_ID"=1)

Specifically the predicate section is really interesting. The predicate ACTOR_ID = 1 is applied to both the ACTOR and FILM_ACTOR tables because of transitive closure. If:

  • A.ACTOR_ID = 1 (from the WHERE predicate) and…
  • A.ACTOR_ID = FA.ACTOR_ID (from the ON predicate)

Then:

  • FA.ACTOR_ID = 1

In other words, the query is rewritten to this:

SELECT first_name, last_name, film_id
FROM actor a
JOIN film_actor fa ON a.actor_id = fa.actor_id
WHERE a.actor_id = 1
AND fa.actor_id = 1;

Or in this particular case, even this, as the A.ACTOR_ID = 1 predicate ensures a single row from the ACTOR table, so a cross join might do as well (at least that's what the plan indicates):

SELECT first_name, last_name, film_id
FROM actor a
JOIN film_actor fa ON fa.actor_id = 1
WHERE a.actor_id = 1;

This has a few nice effects on more complex queries. In particular, the cardinality estimates will be much more precise this way, as we can pick the estimate based on a concrete, constant predicate value, rather than e.g. the average number of films per actor as in this query (which returns the same result):

SELECT first_name, last_name, film_id
FROM actor a
JOIN film_actor fa ON a.actor_id = fa.actor_id
WHERE first_name = 'PENELOPE'
AND last_name = 'GUINESS'

The plan being:

----------------------------------------------------------------------------
| Id  | Operation                            | Name                | Rows  |
----------------------------------------------------------------------------
|   0 | SELECT STATEMENT                     |                     |       |
|   1 |  NESTED LOOPS                        |                     |     2 |
|*  2 |   TABLE ACCESS BY INDEX ROWID BATCHED| ACTOR               |     1 |
|*  3 |    INDEX RANGE SCAN                  | IDX_ACTOR_LAST_NAME |     3 |
|*  4 |   INDEX RANGE SCAN                   | PK_FILM_ACTOR       |    27 |
----------------------------------------------------------------------------
 
Predicate Information (identified by operation id):
---------------------------------------------------
 
   2 - filter("A"."FIRST_NAME"='PENELOPE')
   3 - access("A"."LAST_NAME"='GUINESS')
   4 - access("A"."ACTOR_ID"="FA"."ACTOR_ID")

As you can see, the estimate for the number of FILM_ACTOR rows is too high, and the estimate for the NESTED LOOP result is too low. Here are some interesting numbers:

SELECT count(*) FROM film_actor WHERE actor_id = 1;

SELECT avg(c) FROM (
  SELECT count(*) c FROM film_actor GROUP BY actor_id
);

Resulting in:

19
27.315

That’s where those estimates come from. If the database knows we’re dealing with ACTOR_ID = 1, it can pick the statistics on the number of films for that actor. If it doesn’t know this (because our standard statistics don’t correlate FIRST_NAME / LAST_NAME with ACTOR_ID), then we get the average number of films for any actor. Simple, insignificant error in this particular case, but when that error propagates in a complex query, it can add up and lead to the wrong choice of JOIN down the line (or up the plan).

So, when you can, always design JOIN and ordinary predicates to profit from transitive closure.

What other databases support this?

DB2

Yes

Explain Plan                                               
-----------------------------------------------------------
ID | Operation              |                 Rows | Cost  
 1 | RETURN                 |                      |   13  
 2 |  NLJOIN                |              27 of 1 |   13  
 3 |   FETCH ACTOR          |     1 of 1 (100.00%) |    6  
 4 |    IXSCAN PK_ACTOR     |   1 of 200 (   .50%) |    0  
 5 |   IXSCAN PK_FILM_ACTOR | 27 of 5462 (   .49%) |    6  
                                                           
Predicate Information                                      
 4 - START (Q2.ACTOR_ID = 1)                               
      STOP (Q2.ACTOR_ID = 1)                               
 5 - START (1 = Q1.ACTOR_ID)                               
      STOP (1 = Q1.ACTOR_ID)                               

Btw, want cool execution plans like the above on DB2 LUW? Go visit Markus Winand’s script:
http://use-the-index-luke.com/s/last_explained

MySQL

Unfortunately, MySQL explain plans are not very useful for such analyses. We don’t really see the predicate itself in this output:

ID  SELECT TYPE  TABLE  TYPE   REF    ROWS
------------------------------------------
1   SIMPLE       a      const  const  1 
1   SIMPLE       fa     ref    const  19

But the fact that the REF column is two times “const” indicates that we’re scanning for a constant value in both tables. Conversely, the plan that queries for FIRST_NAME / LAST_NAME looks like this:

ID  SELECT TYPE  TABLE  TYPE   REF         ROWS
-----------------------------------------------
1   SIMPLE       a      ref    const       3 
1   SIMPLE       fa     ref    a.actor_id  27

And you can see that REF has now switched to a column reference from the JOIN predicate. The cardinality estimate is now almost the same as in Oracle.

So, yes, MySQL supports transitive closure, too.

PostgreSQL

Yes

QUERY PLAN                                                                          
------------------------------------------------------------------------------------
Nested Loop  (cost=4.49..40.24 rows=27 width=15)                                    
  ->  Seq Scan on actor a  (cost=0.00..4.50 rows=1 width=17)                        
        Filter: (actor_id = 1)                                                      
  ->  Bitmap Heap Scan on film_actor fa  (cost=4.49..35.47 rows=27 width=4)         
        Recheck Cond: (actor_id = 1)                                                
        ->  Bitmap Index Scan on film_actor_pkey  (cost=0.00..4.48 rows=27 width=0) 
              Index Cond: (actor_id = 1)                                            

SQL Server

Yes

  |--Nested Loops(Inner Join)
       |--Nested Loops(Inner Join)
       |    |--Index Seek (SEEK:([a].[actor_id]=(1)))
       |    |--RID Lookup
       |--Index Seek (SEEK:([fa].[actor_id]=(1)))

Summary

All databases can do transitive closure:

Database        |Transitive closure
----------------|------------------
DB2 LUW 10.5    |Yep
MySQL 8.0.2     |Yep
Oracle 12.2.0.1 |Yep
PostgreSQL 9.6  |Yep
SQL Server 2014 |Yep

Stay tuned, though, for #6. There are more complex cases of transitive closure, where not all databases get it right.

2. Impossible Predicates and Unneeded Table Accesses

This optimisation is really silly, but hey, why not. If users write impossible predicates, then why even execute them? Here are some examples:

-- "Obvious"
SELECT * FROM actor WHERE 1 = 0

-- "Subtle"
SELECT * FROM actor WHERE NULL = NULL

The first query should obviously never return any results, but the same is true for the second one, because while NULL IS NULL always yields TRUE, NULL = NULL evaluates to NULL, which has the same effect as FALSE according to three-valued logic.
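
A quick way to convince yourself of the NULL = NULL behaviour is a query along these lines (Oracle syntax here; adapt the dummy table for the other dialects, as in the EXISTS examples further down):

SELECT
  CASE WHEN NULL = NULL  THEN 'TRUE' ELSE 'FALSE or NULL' END AS null_eq_null,
  CASE WHEN NULL IS NULL THEN 'TRUE' ELSE 'FALSE or NULL' END AS null_is_null
FROM dual;

-- null_eq_null: FALSE or NULL
-- null_is_null: TRUE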

This doesn’t need much explanation, so let’s immediately jump to see which databases optimise this:

DB2

Yes

Explain Plan                       
-----------------------------------
ID | Operation      |   Rows | Cost
 1 | RETURN         |        |    0
 2 |  TBSCAN GENROW | 0 of 0 |    0

As you can see, the table access to the ACTOR table is completely eliminated from the plan. There’s only a GENROW operation, which generates zero rows. Perfect.

MySQL

Yes

ID  SELECT TYPE  TABLE   EXTRAS
-----------------------------------------
1   SIMPLE         Impossible WHERE

This time, MySQL has been so kind as to indicate that the WHERE clause is impossible. Thanks, that's helpful when analysing – more so than the other databases.

Oracle

Yes

---------------------------------------------------------------
| Id  | Operation          | Name  | Starts | E-Rows | A-Rows |
---------------------------------------------------------------
|   0 | SELECT STATEMENT   |       |      1 |        |      0 |
|*  1 |  FILTER            |       |      1 |        |      0 |
|   2 |   TABLE ACCESS FULL| ACTOR |      0 |    200 |      0 |
---------------------------------------------------------------
 
Predicate Information (identified by operation id):
---------------------------------------------------
 
   1 - filter(NULL IS NOT NULL)

Now, observe that the plan still shows the table access to the ACTOR table, and the estimated number of rows is still 200, but there’s a FILTER operation on Id=1, which can never be true. Because Oracle really really doesn’t like the SQL standard BOOLEAN type, they display NULL IS NOT NULL in the plan, rather than simply FALSE. Oh well 🙂

But seriously, do check for this predicate. I’ve been debugging 1000-line-long execution plan subtrees with super high costs before noticing that the entire subtree was “cut off” by NULL IS NOT NULL. A bit misleading, if you ask me.

PostgreSQL

Yes

QUERY PLAN                                 
-------------------------------------------
Result  (cost=0.00..0.00 rows=0 width=228) 
  One-Time Filter: false                   

That’s nicer. No noisy ACTOR access and a nice little FALSE predicate.

SQL Server

Yes

  |--Constant Scan

SQL Server calls this a “constant scan”, i.e. a scan where nothing happens – just like DB2.

All databases can eliminate impossible predicates:

Database        |Impossible predicates |Unneeded table access
----------------|----------------------|---------------------
DB2 LUW 10.5    |Yep                   |Yep
MySQL 8.0.2     |Yep                   |Yep
Oracle 12.2.0.1 |Yep                   |Yep
PostgreSQL 9.6  |Yep                   |Yep
SQL Server 2014 |Yep                   |Yep

3. JOIN Elimination

In the previous section, we’ve seen unneeded table access for single table queries. But what happens if one out of several table accesses is unneeded in a JOIN?

I’ve already blogged about JOIN elimination in a previous blog post. SQL engines can determine, based on the way a query is written, and based on the presence of PRIMARY KEYs and FOREIGN KEYs, whether any given JOIN is really required in a query, or whether it could be eliminated without affecting the semantics of the query.

In all of the following three examples, the JOIN is unnecessary:

to-one INNER JOINs can be removed if there’s a NOT NULL FOREIGN KEY

Instead of this:

SELECT first_name, last_name
FROM customer c
JOIN address a ON c.address_id = a.address_id

The database can run this:

SELECT first_name, last_name
FROM customer c
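
For this rewrite to be legal, the database needs to know that every CUSTOMER row references exactly one ADDRESS row. In the Sakila database, that guarantee comes from constraints roughly like the following (a sketch; the actual constraint names may differ, and CUSTOMER.ADDRESS_ID is declared NOT NULL):

-- ADDRESS_ID is the primary key of the referenced table
ALTER TABLE address
  ADD CONSTRAINT pk_address PRIMARY KEY (address_id);

-- Every CUSTOMER.ADDRESS_ID value must exist in ADDRESS
ALTER TABLE customer
  ADD CONSTRAINT fk_customer_address
  FOREIGN KEY (address_id) REFERENCES address (address_id);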

to-one INNER JOINs can be replaced if there’s a nullable FOREIGN KEY

The above works if there’s also a NOT NULL constraint on the FOREIGN KEY. If there isn’t, e.g. as in this query:

SELECT title
FROM film f
JOIN language l ON f.original_language_id = l.language_id

The JOIN can still be eliminated, but there needs to be a replacement NOT NULL predicate, as such:

SELECT title
FROM film
WHERE original_language_id IS NOT NULL

to-one OUTER JOINs can be removed if there’s a UNIQUE KEY

Instead of this:

SELECT first_name, last_name
FROM customer c
LEFT JOIN address a ON c.address_id = a.address_id

The database can again run this:

SELECT first_name, last_name
FROM customer c

… even if there is no FOREIGN KEY on CUSTOMER.ADDRESS_ID.

to-many DISTINCT OUTER JOINs can be removed

Instead of this:

SELECT DISTINCT first_name, last_name
FROM actor a
LEFT JOIN film_actor fa ON a.actor_id = fa.actor_id

The database can run this:

SELECT DISTINCT first_name, last_name
FROM actor a

All of these examples are explained in detail in the previous article, so I'm not going to repeat them here. Here's a summary of what each database can eliminate:

Database        |INNER JOIN to-one |INNER JOIN nullable to-one |OUTER JOIN to-one |OUTER JOIN DISTINCT to-many
----------------|------------------|---------------------------|------------------|----------------------------
DB2 LUW 10.5    |Yep               |Yep                        |Yep               |Yep
MySQL 8.0.2     |Nope              |Nope                       |Nope              |Nope
Oracle 12.2.0.1 |Yep               |Yep                        |Yep               |Nope
PostgreSQL 9.6  |Nope              |Nope                       |Yep               |Nope
SQL Server 2014 |Yep               |Nope                       |Yep               |Yep

Unfortunately, not all databases can eliminate all joins. DB2 and SQL Server are the clear winners here!

4. Removing “Silly” Predicates

Equally silly are predicates that are (almost) always true. As you can imagine, if you search for

SELECT * FROM actor WHERE 1 = 1;

Then, databases will not actually evaluate the predicate but ignore it. This was a recent Stack Overflow question that I’ve answered, which actually gave me the idea to write this blog post.

I’ll leave it to you to check this, but what happens if the predicate is just slightly less silly, e.g.:

SELECT * FROM film WHERE release_year = release_year;

Do we actually have to compare the value with itself on each row? No, there’s no value where this can be FALSE, right? Right. But we still have to do a check. While the predicate can never be FALSE, it can totally be NULL, again because of three valued logic. The RELEASE_YEAR column is a nullable column, and if RELEASE_YEAR IS NULL for any given row, then NULL = NULL yields NULL, and the row must be excluded.

So, the query is transformed into this:

SELECT * FROM film WHERE release_year IS NOT NULL;

Which databases do this?

DB2

Yes

Explain Plan                                     
-------------------------------------------------
ID | Operation    |                   Rows | Cost
 1 | RETURN       |                        |   49
 2 |  TBSCAN FILM | 1000 of 1000 (100.00%) |   49
                                                 
Predicate Information                            
 2 - SARG Q1.RELEASE_YEAR IS NOT NULL            

MySQL

Very regrettably, again, because MySQL doesn't display predicates in its execution plans, it's a bit hard to find out whether MySQL performs this particular optimisation. We could benchmark things and see whether some really big string comparisons are executed or not. Or, we add an index:

CREATE INDEX i_release_year ON film (release_year);

And get the plans for these queries instead:

SELECT * FROM film WHERE release_year = release_year;
SELECT * FROM film WHERE release_year IS NOT NULL;

If the optimisation works, then both queries should produce exactly the same plan. But they don’t in this case:

ID  TABLE  POSSIBLE_KEYS   ROWS  FILTERED  EXTRA
------------------------------------------------------
1   film             1000  10.00           Using where

ID  TABLE  POSSIBLE_KEYS   ROWS  FILTERED  EXTRA
------------------------------------------------------
1   film   i_release_year  1000  100.00    Using where

As you can see, the two queries differ substantially in that the POSSIBLE_KEYS and FILTERED columns yield different values. I'm making an educated guess and saying that MySQL does not optimise this.
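
As an alternative to guessing from the tabular output, MySQL's EXPLAIN FORMAT=JSON includes an attached_condition element that shows the predicate as attached to the table access – something along these lines (a suggestion for how to check, not output I've verified for this case):

EXPLAIN FORMAT=JSON
SELECT * FROM film WHERE release_year = release_year;

If MySQL applied the rewrite, we'd expect the attached_condition to read release_year IS NOT NULL rather than the original self-comparison.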

Oracle

Yes

----------------------------------------------------
| Id  | Operation         | Name | Starts | E-Rows |
----------------------------------------------------
|   0 | SELECT STATEMENT  |      |      1 |        |
|*  1 |  TABLE ACCESS FULL| FILM |      1 |   1000 |
----------------------------------------------------
 
Predicate Information (identified by operation id):
---------------------------------------------------
 
   1 - filter("RELEASE_YEAR" IS NOT NULL)

PostgreSQL

Disappointingly, no!

QUERY PLAN                                                    
--------------------------------------------------------------
Seq Scan on film  (cost=0.00..67.50 rows=5 width=386)         
  Filter: ((release_year)::integer = (release_year)::integer) 

The plans and the costs are different. Specifically, observe the cardinality estimate, which is totally off, whereas this predicate:

SELECT * FROM film WHERE release_year IS NOT NULL;

… yields much better results

QUERY PLAN                                               
---------------------------------------------------------
Seq Scan on film  (cost=0.00..65.00 rows=1000 width=386) 
  Filter: (release_year IS NOT NULL)                     

Bummer!

SQL Server

Surprisingly, also SQL Server doesn’t seem to do this:

  |--Table Scan(OBJECT:([film]), WHERE:([release_year]=[release_year]))

However, the cardinality estimate is correct when looking at the visual plan, and the costs are all correct as well. From what I've seen in the past on SQL Server, though, I'm going to say that in this case, the optimisation is not taking place, because SQL Server would display the actually executed predicate in the plan (look at the CHECK constraint examples below to see why).

What about “silly” predicates on NOT NULL columns?

The above transformation was only needed, because RELEASE_YEAR is a nullable column. What if we did the same silly query with e.g. FILM_ID?

SELECT * FROM film WHERE film_id = film_id

This is now the same as not putting a predicate at all. Or at least it should be. Is it, though?

DB2

Yes!

Explain Plan                                     
-------------------------------------------------
ID | Operation    |                   Rows | Cost
 1 | RETURN       |                        |   49
 2 |  TBSCAN FILM | 1000 of 1000 (100.00%) |   49

No predicate is applied at all, and we’re selecting all the films.

MySQL

Yes (educated guess, again)

ID  TABLE  POSSIBLE_KEYS   ROWS  FILTERED  EXTRA
------------------------------------------------------
1   film                   1000  100.00

Observe how now the EXTRA column is empty as if we didn’t have any WHERE clause!

Oracle

Yes

----------------------------------------------------
| Id  | Operation         | Name | Starts | E-Rows |
----------------------------------------------------
|   0 | SELECT STATEMENT  |      |      1 |        |
|   1 |  TABLE ACCESS FULL| FILM |      1 |   1000 |
----------------------------------------------------

Again, no predicates are applied.

PostgreSQL

Gee, still no!

QUERY PLAN                                            
------------------------------------------------------
Seq Scan on film  (cost=0.00..67.50 rows=5 width=386) 
  Filter: (film_id = film_id)                         

The filter is applied and the cardinality estimate is still 5. Bummer!

SQL Server

Also, still no!

  |--Table Scan(OBJECT:([film]), WHERE:([film_id]=[film_id]))

Summary

This appears to be a simple optimisation, but it is not applied in all databases – surprisingly, not even in SQL Server!

Database        |Silly but needed predicates (NULL semantics) |Silly unneeded predicates (no NULL semantics)
----------------|---------------------------------------------|----------------------------------------------
DB2 LUW 10.5    |Yep                                          |Yep
MySQL 8.0.2     |Nope                                         |Yep
Oracle 12.2.0.1 |Yep                                          |Yep
PostgreSQL 9.6  |Nope                                         |Nope
SQL Server 2014 |Nope                                         |Nope

5. Projections in EXISTS Subqueries

Interestingly, I get asked about this one all the time in my SQL Masterclass, where I advocate that SELECT * is mostly bad.

The question then is, is it OK to use SELECT * in an EXISTS subquery? For instance, if we wanted to find actors who have played in films:

SELECT first_name, last_name
FROM actor a
WHERE EXISTS (
  SELECT * -- Is this OK?
  FROM film_actor fa
  WHERE a.actor_id = fa.actor_id
)

And the answer is: Yes it is OK. The asterisk has no impact on the query. How can we “prove” this? Consider the following query:

-- DB2
SELECT 1 / 0 FROM sysibm.dual

-- Oracle
SELECT 1 / 0 FROM dual

-- PostgreSQL, SQL Server
SELECT 1 / 0

-- MySQL
SELECT pow(-1, 0.5);

All databases report a division by zero error. Note that interestingly, in MySQL, dividing by zero yields NULL, not an error, so we’re doing something else that’s illegal.

Now, what happens if we do this, instead?

-- DB2
SELECT CASE WHEN EXISTS (
  SELECT 1 / 0 FROM sysibm.dual
) THEN 1 ELSE 0 END
FROM sysibm.dual

-- Oracle
SELECT CASE WHEN EXISTS (
  SELECT 1 / 0 FROM dual
) THEN 1 ELSE 0 END
FROM dual

-- PostgreSQL
SELECT EXISTS (SELECT 1 / 0)

-- SQL Server
SELECT CASE WHEN EXISTS (
  SELECT 1 / 0
) THEN 1 ELSE 0 END

-- MySQL
SELECT EXISTS (SELECT pow(-1, 0.5));

Now, none of the databases fail the query. All of them return TRUE or 1. This means that none of the databases actually evaluated the projection (i.e. the SELECT clause) of the EXISTS subquery.

SQL Server, for instance, shows the following plan:

  |--Constant Scan(VALUES:((CASE WHEN (1) THEN (1) ELSE (0) END)))

As you can see, the CASE expression was transformed to a constant, the subquery has been eliminated. Other databases still have the subquery in their plan and don’t mention anything about a projection, so let’s again look at the original query’s plan in Oracle:

SELECT first_name, last_name
FROM actor a
WHERE EXISTS (
  SELECT *
  FROM film_actor fa
  WHERE a.actor_id = fa.actor_id
)

The plan for the above is:

------------------------------------------------------------------
| Id  | Operation             | Name                    | E-Rows |
------------------------------------------------------------------
|   0 | SELECT STATEMENT      |                         |        |
|*  1 |  HASH JOIN SEMI       |                         |    200 |
|   2 |   TABLE ACCESS FULL   | ACTOR                   |    200 |
|   3 |   INDEX FAST FULL SCAN| IDX_FK_FILM_ACTOR_ACTOR |   5462 |
------------------------------------------------------------------
 
Predicate Information (identified by operation id):
---------------------------------------------------
 
   1 - access("A"."ACTOR_ID"="FA"."ACTOR_ID")
 
Column Projection Information (identified by operation id):
-----------------------------------------------------------
 
   1 - (#keys=1) LAST_NAME, FIRST_NAME
   2 - (rowset=256) A.ACTOR_ID, FIRST_NAME, LAST_NAME
   3 - FA.ACTOR_ID

Observe the projection information on the FILM_ACTOR access in Id=3. In fact, we're not even accessing the FILM_ACTOR table, because we don't have to. The EXISTS predicate can be executed using the foreign key index on the ACTOR_ID column only – that's all we need for this query, despite our having written SELECT *.

Summary

Luckily, all databases can remove the projection in EXISTS subqueries:

Database        |EXISTS projection
----------------|-----------------
DB2 LUW 10.5    |Yep
MySQL 8.0.2     |Yep
Oracle 12.2.0.1 |Yep
PostgreSQL 9.6  |Yep
SQL Server 2014 |Yep

6. Predicate Merging

This one is interesting and has bitten me in the past when I erroneously assumed that a given database could do it.

Consider the following query:

SELECT * 
FROM actor
WHERE actor_id IN (2, 3, 4)
AND actor_id IN (1, 2, 3);

Obviously, the two predicates overlap and can be merged. I would expect the database to transform the above into:

SELECT * 
FROM actor
WHERE actor_id IN (2, 3);

Looks obvious, right? It is a more sophisticated case of transitive closure. Another case would be merging two ranges. When running the following query:

SELECT * 
FROM film
WHERE film_id BETWEEN 1 AND 100
AND film_id BETWEEN 99 AND 200

We’d hope for the database to rewrite the query to this:

SELECT * 
FROM film
WHERE film_id BETWEEN 99 AND 100

The cardinality of the latter predicate is 2 rows, but with the original, combined ranges, the estimate might not reflect that, and the database might choose a full table scan when it should use the index.

Which database can do these optimisations?

DB2

Merging IN predicates

Yes

Explain Plan                                      
--------------------------------------------------
ID | Operation         |               Rows | Cost
 1 | RETURN            |                    |   11
 2 |  FETCH ACTOR      |   2 of 2 (100.00%) |   11
 3 |   IXSCAN PK_ACTOR | 2 of 200 (  1.00%) |    0
                                                  
Predicate Information                             
 3 - SARG Q3.ACTOR_ID IN (2, 3)                   

Merging range predicates

Yes (but don’t be fooled by the plan!)

Explain Plan                                      
--------------------------------------------------
ID | Operation        |                Rows | Cost
 1 | RETURN           |                     |   13
 2 |  FETCH FILM      |    2 of 2 (100.00%) |   13
 3 |   IXSCAN PK_FILM | 2 of 1000 (   .20%) |    6
                                                  
Predicate Information                             
 3 - START (99 <= Q1.FILM_ID)                     
      STOP (Q1.FILM_ID <= 100)                    
      SARG (Q1.FILM_ID <= 200)                    
      SARG (1 <= Q1.FILM_ID)                      

As you can see, the predicate was not optimised away entirely. There’s still a filter (SARG) that checks for the overall upper and lower bounds of the combined range, but the important bits are the START and STOP operations, which indicate fast index access. Besides, the cardinality is also correct.

If you want to be sure, just run this impossible predicate here:

SELECT * 
FROM film
WHERE film_id BETWEEN 1 AND 2
AND film_id BETWEEN 199 AND 200;

… which yields the correct plan:

Explain Plan                       
-----------------------------------
ID | Operation      |   Rows | Cost
 1 | RETURN         |        |    0
 2 |  TBSCAN GENROW | 0 of 0 |    0
                                   
Predicate Information              
 2 - RESID (1 = 0)                 

MySQL

Merging IN predicates

Again, unfortunately, MySQL doesn’t display the predicate information very nicely. We get the same plan for both queries:

ID  TABLE  TYPE   KEY      ROWS  FILTERED  EXTRA
------------------------------------------------------
1   actor  range  PRIMARY  2     100.00    Using where

Both plans show the same cardinalities and the same “Using where”, with no indication of what exactly is being done inside that “where”, but given the cardinality, we can assume that the transformation happened correctly. We can look at it from a different angle. Let’s try this query:

SELECT * FROM actor
WHERE actor_id IN (3, 4, 5)
AND actor_id IN (1, 2, 3);

Which should be transformed into this one:

SELECT * FROM actor
WHERE actor_id = 3;

And indeed, it happens:

ID  TABLE  TYPE   KEY      ROWS  FILTERED  EXTRA
------------------------------------------------------
1   actor  const  PRIMARY  1     100.00

Observe how TYPE=range changed to TYPE=const.

So, we can conclude that yes, MySQL implements this optimisation.

Merging range predicates

Again, the plan is not helpful at all:

ID  TABLE  TYPE   KEY      ROWS  FILTERED  EXTRA
------------------------------------------------------
1   film   range  PRIMARY  2     100.00    Using where

But we can again prove that the optimisation is being done by creating an “impossible” predicate, like this:

SELECT * 
FROM film
WHERE film_id BETWEEN 1 AND 2
AND film_id BETWEEN 199 AND 200

… in which case the plan switches to:

ID  TABLE  EXTRA
-----------------------------------------
1          no matching row in const table

So, again, good news for MySQL.

Oracle

Merging IN predicates

Yes

----------------------------------------------------------
| Id  | Operation                    | Name     | E-Rows |
----------------------------------------------------------
|   0 | SELECT STATEMENT             |          |        |
|   1 |  INLIST ITERATOR             |          |        |
|   2 |   TABLE ACCESS BY INDEX ROWID| ACTOR    |      2 |
|*  3 |    INDEX UNIQUE SCAN         | PK_ACTOR |      2 |
----------------------------------------------------------
 
Predicate Information (identified by operation id):
---------------------------------------------------
 
   3 - access(("ACTOR_ID"=2 OR "ACTOR_ID"=3))

The predicate being applied only includes the values 2 and 3, so the transformation has worked out correctly.

Merging range predicates

Again, yes:

----------------------------------------------------------------
| Id  | Operation                           | Name    | E-Rows |
----------------------------------------------------------------
|   0 | SELECT STATEMENT                    |         |        |
|   1 |  TABLE ACCESS BY INDEX ROWID BATCHED| FILM    |      2 |
|*  2 |   INDEX RANGE SCAN                  | PK_FILM |      2 |
----------------------------------------------------------------
 
Predicate Information (identified by operation id):
---------------------------------------------------
 
   2 - access("FILM_ID">=99 AND "FILM_ID"<=100)

PostgreSQL

Merging IN predicates

Regrettably, no, this is not optimised!

QUERY PLAN                                                                                     
-----------------------------------------------------------------------------------------------
Seq Scan on actor  (cost=0.00..5.50 rows=1 width=25)                                           
  Filter: ((actor_id = ANY ('{2,3,4}'::integer[])) AND (actor_id = ANY ('{1,2,3}'::integer[])))

Both predicates are still present in the execution plan, and the cardinality estimate is wrong – it should be 2, not 1. If I manually transform the query, I’m getting this plan instead:

QUERY PLAN                                           
-----------------------------------------------------
Seq Scan on actor  (cost=0.00..4.50 rows=2 width=25) 
  Filter: (actor_id = ANY ('{2,3}'::integer[]))      

In particular, we can see the wrong plan if the two predicates do not overlap, in which case an impossible predicate is formed:

SELECT * 
FROM actor
WHERE actor_id IN (2, 3, 4)
AND actor_id IN (7, 8, 9)

Still, this yields a “wrong” plan:

QUERY PLAN                                                                                     
-----------------------------------------------------------------------------------------------
Seq Scan on actor  (cost=0.00..5.50 rows=1 width=25)                                           
  Filter: ((actor_id = ANY ('{2,3,4}'::integer[])) AND (actor_id = ANY ('{7,8,9}'::integer[])))

Bummer!

Merging range predicates

This doesn’t look any better:

QUERY PLAN                                                                                  
--------------------------------------------------------------------------------------------
Index Scan using film_pkey on film  (cost=0.28..8.30 rows=1 width=386)                      
  Index Cond: ((film_id >= 1) AND (film_id <= 100) AND (film_id >= 99) AND (film_id <= 200))

Now, it’s hard to say whether this worked or not. Ultimately, we have gotten the correct plan with a reasonable cardinality as before, and it might just work out as it did on DB2. But what happens if we again create an impossible predicate?

SELECT * 
FROM film
WHERE film_id BETWEEN 1 AND 2
AND film_id BETWEEN 199 AND 200;

The plan got worse:

QUERY PLAN                                                                                 
-------------------------------------------------------------------------------------------
Index Scan using film_pkey on film  (cost=0.28..8.42 rows=5 width=386)                     
  Index Cond: ((film_id >= 1) AND (film_id <= 2) AND (film_id >= 199) AND (film_id <= 200))

The cardinality increased instead of decreasing! And after all, this query should not have to be run at all. No points for PostgreSQL.

SQL Server

Merging IN predicates

Yes, this works:

  |--Nested Loops(Inner Join)
       |--Index Seek(SEEK:([actor_id]=(2) OR [actor_id]=(3)))
       |--RID Lookup(OBJECT:([actor]))

Merging range predicates

This again looks like the DB2 case:

  |--Nested Loops(Inner Join)
       |--Index Seek(SEEK:([film_id] >= (1) AND [film_id] <= (100)), WHERE:([film_id]>=(99) AND [film_id]<=(200)))
       |--RID Lookup(OBJECT:([film]))

Unfortunately, note the distinction between SEEK and WHERE. We want the range [99, 100] in the SEEK predicate (as DB2 had it), because SEEK is the fast O(log N) index access, whereas WHERE is applied linearly in O(N) time. Bummer!

This looks like a bug to me, because the impossible predicate yields a more reasonable:

  |--Constant Scan

Summary

Note that there are many different kinds of predicates that might be merged in one database but not in the other. If in doubt, do check your execution plans!

Database         Merging IN  Merging ranges
DB2 LUW 10.5     Yep         Yep
MySQL 8.0.2      Yep         Yep
Oracle 12.2.0.1  Yep         Yep
PostgreSQL 9.6   Nope        Nope
SQL Server 2014  Yep         Nope

7. Provably Empty Sets

This one is really cool. We’ve seen Impossible predicates and unneeded table accesses before. What if we do this again, but this time with a JOIN? Can JOIN elimination kick in, too?

We’re trying these queries:

IS NULL on NOT NULL column

The predicate in the WHERE clause cannot be TRUE, because we have a NOT NULL constraint on the FILM_ID column.

SELECT first_name, last_name
FROM actor a
JOIN (
  SELECT *
  FROM film_actor
  WHERE film_id IS NULL
) fa ON a.actor_id = fa.actor_id;

The derived table FA cannot return any rows, because of that NOT NULL constraint on the FA.FILM_ID column, so it is provably empty. Because an INNER JOIN with an empty table cannot produce any rows either, this should save us from accessing the ACTOR table, so the above query should be rewritten to something like this:

SELECT NULL AS first_name, NULL AS last_name
WHERE 1 = 0;

I.e. the predicate is never evaluated and the JOIN is eliminated.

INTERSECT NULL and NOT NULL columns

In principle, this is the same as the previous example, but using a bit more sophisticated syntax:

SELECT *
FROM actor a
JOIN (
  SELECT actor_id, film_id
  FROM film_actor
  INTERSECT
  SELECT NULL, NULL
  FROM dual
) fa ON a.actor_id = fa.actor_id;

Because of the NOT NULL constraints on both FA.ACTOR_ID and FA.FILM_ID, an INTERSECT operation with a (NULL, NULL) tuple should not yield any results, and thus the derived table is provably empty, and thus the INNER JOIN can be eliminated.

Funky, but why not?

Let’s repeat, with EXISTS

Finally, let’s repeat the above type of query, but this time with a SEMI JOIN instead of an INNER JOIN. First, with an impossible predicate:

SELECT *
FROM actor a
WHERE a.actor_id IN (
  SELECT actor_id
  FROM film_actor
  WHERE actor_id IS NULL
);

… then again with an intersection.

SELECT *
FROM actor a
WHERE a.actor_id IN (
  SELECT actor_id
  FROM film_actor
  INTERSECT
  SELECT NULL
  FROM sysibm.dual
)

Let’s go. Which database can do which optimisation?

DB2

Joining a provably empty set (IS NULL predicate):

Explain Plan                       
-----------------------------------
ID | Operation      |   Rows | Cost
 1 | RETURN         |        |    0
 2 |  TBSCAN GENROW | 0 of 0 |    0
                                   
Predicate Information              
 2 - RESID (1 = 0)                 

Joining a provably empty set (INTERSECT):

Explain Plan                       
-----------------------------------
ID | Operation      |   Rows | Cost
 1 | RETURN         |        |    0
 2 |  TBSCAN GENROW | 0 of 0 |    0
                                   
Predicate Information              
 2 - RESID (1 = 0)                 

Semi joining a provably empty set (IS NULL predicate):

Explain Plan                       
-----------------------------------
ID | Operation      |   Rows | Cost
 1 | RETURN         |        |    0
 2 |  TBSCAN GENROW | 0 of 0 |    0
                                   
Predicate Information              
 2 - RESID (1 = 0)                 

Semi joining a provably empty set (INTERSECT):

Explain Plan                       
-----------------------------------
ID | Operation      |   Rows | Cost
 1 | RETURN         |        |    0
 2 |  TBSCAN GENROW | 0 of 0 |    0
                                   
Predicate Information              
 2 - RESID (1 = 0)                 

Wow, cool! Looks like a winner!

MySQL

Joining a provably empty set (IS NULL predicate):

ID  TABLE   EXTRA
----------------------------
1           Impossible WHERE

Cool! I didn’t expect this!

Joining a provably empty set (INTERSECT):

MySQL doesn’t support INTERSECT, regrettably.

Semi joining a provably empty set (IS NULL predicate):

ID  TABLE   EXTRA
----------------------------
1           Impossible WHERE

Semi joining a provably empty set (INTERSECT):

MySQL doesn’t support INTERSECT, regrettably.

But still, that’s a great result for MySQL!

Oracle

Joining a provably empty set (IS NULL predicate):

---------------------------------------------------------------------------
| Id  | Operation              | Name          | Starts | E-Rows | A-Rows |
---------------------------------------------------------------------------
|   0 | SELECT STATEMENT       |               |      1 |        |      0 |
|*  1 |  FILTER                |               |      1 |        |      0 |
|*  2 |   HASH JOIN            |               |      0 |   5462 |      0 |
|   3 |    TABLE ACCESS FULL   | ACTOR         |      0 |    200 |      0 |
|   4 |    INDEX FAST FULL SCAN| PK_FILM_ACTOR |      0 |   5462 |      0 |
---------------------------------------------------------------------------
 
Predicate Information (identified by operation id):
---------------------------------------------------
 
   1 - filter(NULL IS NOT NULL)
   2 - access("A"."ACTOR_ID"="FILM_ACTOR"."ACTOR_ID")

Again, a very confusing execution plan in Oracle, but the NULL IS NOT NULL filter is there, and it happens before all the other operations, which are not executed.

Joining a provably empty set (INTERSECT):

---------------------------------------------------------------------------------
| Id  | Operation                    | Name          | Starts | E-Rows | A-Rows |
---------------------------------------------------------------------------------
|   0 | SELECT STATEMENT             |               |      1 |        |      0 |
|   1 |  NESTED LOOPS                |               |      1 |      1 |      0 |
|   2 |   NESTED LOOPS               |               |      1 |      1 |      0 |
|   3 |    VIEW                      |               |      1 |      1 |      0 |
|   4 |     INTERSECTION             |               |      1 |        |      0 |
|   5 |      SORT UNIQUE             |               |      1 |   5462 |   5463 |
|   6 |       INDEX FAST FULL SCAN   | PK_FILM_ACTOR |      1 |   5462 |   5463 |
|   7 |      FAST DUAL               |               |      1 |      1 |      1 |
|*  8 |    INDEX UNIQUE SCAN         | PK_ACTOR      |      0 |      1 |      0 |
|   9 |   TABLE ACCESS BY INDEX ROWID| ACTOR         |      0 |      1 |      0 |
---------------------------------------------------------------------------------
 
Predicate Information (identified by operation id):
---------------------------------------------------
 
   8 - access("A"."ACTOR_ID"="FA"."ACTOR_ID")

Interesting. This plan will indeed access the entire FILM_ACTOR primary key. It can save accesses to the ACTOR table and primary key index, because it evaluates the derived table first (which yields no rows), but still, operations Id=5 and Id=6 should not be there. Bummer!

Semi joining a provably empty set (IS NULL predicate):

This works again:

-------------------------------------------------------------------------------------
| Id  | Operation              | Name                    | Starts | E-Rows | A-Rows |
-------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT       |                         |      1 |        |      0 |
|*  1 |  FILTER                |                         |      1 |        |      0 |
|*  2 |   HASH JOIN SEMI       |                         |      0 |    200 |      0 |
|   3 |    TABLE ACCESS FULL   | ACTOR                   |      0 |    200 |      0 |
|   4 |    INDEX FAST FULL SCAN| IDX_FK_FILM_ACTOR_ACTOR |      0 |   5462 |      0 |
-------------------------------------------------------------------------------------
 
Predicate Information (identified by operation id):
---------------------------------------------------
 
   1 - filter(NULL IS NOT NULL)
   2 - access("A"."ACTOR_ID"="ACTOR_ID")

… with the same confusing plan that keeps around the unexecuted subtree.

Semi joining a provably empty set (INTERSECT):

Again, no optimisation here:

-------------------------------------------------------------------------------------------
| Id  | Operation                    | Name                    | Starts | E-Rows | A-Rows |
-------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT             |                         |      1 |        |      0 |
|   1 |  NESTED LOOPS                |                         |      1 |      1 |      0 |
|   2 |   NESTED LOOPS               |                         |      1 |      1 |      0 |
|   3 |    VIEW                      | VW_NSO_1                |      1 |      1 |      0 |
|   4 |     INTERSECTION             |                         |      1 |        |      0 |
|   5 |      SORT UNIQUE             |                         |      1 |   5462 |    200 |
|   6 |       INDEX FAST FULL SCAN   | IDX_FK_FILM_ACTOR_ACTOR |      1 |   5462 |   5463 |
|   7 |      FAST DUAL               |                         |      1 |      1 |      1 |
|*  8 |    INDEX UNIQUE SCAN         | PK_ACTOR                |      0 |      1 |      0 |
|   9 |   TABLE ACCESS BY INDEX ROWID| ACTOR                   |      0 |      1 |      0 |
-------------------------------------------------------------------------------------------
 
Predicate Information (identified by operation id):
---------------------------------------------------
 
   8 - access("A"."ACTOR_ID"="ACTOR_ID")

Not so good!

PostgreSQL

Disappointingly, PostgreSQL doesn’t fare well in this experiment!

Joining a provably empty set (IS NULL predicate):

Nope:

QUERY PLAN                                                                                  
--------------------------------------------------------------------------------------------
Hash Join  (cost=8.31..13.07 rows=1 width=13)                                               
  Hash Cond: (a.actor_id = film_actor.actor_id)                                             
  ->  Seq Scan on actor a  (cost=0.00..4.00 rows=200 width=17)                              
  ->  Hash  (cost=8.30..8.30 rows=1 width=2)                                                
        ->  Index Scan using idx_fk_film_id on film_actor  (cost=0.28..8.30 rows=1 width=2) 
              Index Cond: (film_id IS NULL)                                                 

Joining a provably empty set (INTERSECT):

Even worse:

QUERY PLAN                                                                                         
---------------------------------------------------------------------------------------------------
Hash Join  (cost=166.60..171.36 rows=1 width=29)                                                   
  Hash Cond: (a.actor_id = fa.actor_id)                                                            
  ->  Seq Scan on actor a  (cost=0.00..4.00 rows=200 width=25)                                     
  ->  Hash  (cost=166.59..166.59 rows=1 width=4)                                                   
        ->  Subquery Scan on fa  (cost=0.00..166.59 rows=1 width=4)                                
              ->  HashSetOp Intersect  (cost=0.00..166.58 rows=1 width=8)                          
                    ->  Append  (cost=0.00..139.26 rows=5463 width=8)                              
                          ->  Subquery Scan on "*SELECT* 2"  (cost=0.00..0.02 rows=1 width=8)      
                                ->  Result  (cost=0.00..0.01 rows=1 width=4)                       
                          ->  Subquery Scan on "*SELECT* 1"  (cost=0.00..139.24 rows=5462 width=8) 
                                ->  Seq Scan on film_actor  (cost=0.00..84.62 rows=5462 width=4)   

Semi joining a provably empty set (IS NULL predicate):

Same as inner join:

QUERY PLAN                                                                                       
-------------------------------------------------------------------------------------------------
Hash Semi Join  (cost=6.06..10.60 rows=1 width=25)                                               
  Hash Cond: (a.actor_id = film_actor.actor_id)                                                  
  ->  Seq Scan on actor a  (cost=0.00..4.00 rows=200 width=25)                                   
  ->  Hash  (cost=6.05..6.05 rows=1 width=2)                                                     
        ->  Index Only Scan using film_actor_pkey on film_actor  (cost=0.28..6.05 rows=1 width=2)
              Index Cond: (actor_id IS NULL)                                                     

Semi joining a provably empty set (INTERSECT):

Unsurprisingly:

QUERY PLAN                                                                                        
--------------------------------------------------------------------------------------------------
Hash Semi Join  (cost=152.94..157.48 rows=1 width=25)                                             
  Hash Cond: (a.actor_id = "ANY_subquery".actor_id)                                               
  ->  Seq Scan on actor a  (cost=0.00..4.00 rows=200 width=25)                                    
  ->  Hash  (cost=152.93..152.93 rows=1 width=2)                                                  
        ->  Subquery Scan on "ANY_subquery"  (cost=0.00..152.93 rows=1 width=2)                   
              ->  HashSetOp Intersect  (cost=0.00..152.92 rows=1 width=6)                         
                    ->  Append  (cost=0.00..139.26 rows=5463 width=6)                             
                          ->  Subquery Scan on "*SELECT* 2"  (cost=0.00..0.02 rows=1 width=6)     
                                ->  Result  (cost=0.00..0.01 rows=1 width=2)                      
                          ->  Subquery Scan on "*SELECT* 1"  (cost=0.00..139.24 rows=5462 width=6)
                                ->  Seq Scan on film_actor  (cost=0.00..84.62 rows=5462 width=2)  

SQL Server

SQL Server shines, like DB2:

Joining a provably empty set (IS NULL predicate):

  |--Constant Scan

Joining a provably empty set (INTERSECT):

  |--Constant Scan

Semi joining a provably empty set (IS NULL predicate):

  |--Constant Scan

Semi joining a provably empty set (INTERSECT):

  |--Constant Scan

Summary

Database         JOIN / NULL  JOIN / INTERSECT  SEMI JOIN / NULL  SEMI JOIN / INTERSECT
DB2 LUW 10.5     Yep          Yep               Yep               Yep
MySQL 8.0.2      Yep          Not supported     Yep               Not supported
Oracle 12.2.0.1  Yep          Nope              Yep               Nope
PostgreSQL 9.6   Nope         Nope              Nope              Nope
SQL Server 2014  Yep          Yep               Yep               Yep

On a side note, this could be done in thousands of other ways. Feel free to comment with your own ideas on how to create “provably empty sets” to see if this is optimised by any of the databases.

8. CHECK Constraints

Oh, this is cool! Our Sakila database has a CHECK constraint on the FILM.RATING column:

CREATE TABLE film (
  ..
  RATING varchar(10) DEFAULT 'G',
  ..
  CONSTRAINT check_special_rating 
    CHECK (rating IN ('G','PG','PG-13','R','NC-17')),
  ..
);

Seriously, use CHECK constraints for data integrity. The cost of adding them is very low – much lower than that of other constraints like PRIMARY KEY, UNIQUE, and FOREIGN KEY constraints, as they do not need an index to be enforced, so you get them almost for “free”.
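
If a table doesn’t have such a constraint yet, adding one after the fact is a single DDL statement – a sketch for illustration only, since Sakila already ships with the constraint above:

ALTER TABLE film
  ADD CONSTRAINT check_special_rating
  CHECK (rating IN ('G','PG','PG-13','R','NC-17'));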

But there’s also an interesting optimisation aspect here! Check out these queries:

Impossible predicate

We’ve seen impossible predicates before, even with NOT NULL constraints (which are special types of CHECK constraints, in fact), but this is even more powerful:

SELECT *
FROM film
WHERE rating = 'N/A';

There can be no such film, because the CHECK constraint prevents its insertion (or update). This should again be transformed into a NOOP. Now, what about this?

CREATE INDEX idx_film_rating ON film (rating);

SELECT count(*)
FROM film
WHERE rating NOT IN ('G','PG','PG-13','R');

With the above index, we should probably simply run a quick index scan to count all the films with rating = ‘NC-17’, because that’s the only remaining rating. So the query should be rewritten to this:

SELECT count(*)
FROM film
WHERE rating = 'NC-17';

It should be, regardless of the index, because comparing the column with a single value is faster than comparing it with 4 values.

So, which database can do these things?

DB2

Impossible predicate (rating = ‘N/A’)

Cool!

Explain Plan                       
-----------------------------------
ID | Operation      |   Rows | Cost
 1 | RETURN         |        |    0
 2 |  TBSCAN GENROW | 0 of 0 |    0
                                   
Predicate Information              
 2 - RESID (1 = 0)                 

Inverse predicate (rating = ‘NC-17’)

Nope…

Explain Plan                                                
------------------------------------------------------------
ID | Operation                |                  Rows | Cost
 1 | RETURN                   |                       |   34
 2 |  GRPBY (COMPLETE)        |    1 of 210 (   .48%) |   34
 3 |   IXSCAN IDX_FILM_RATING | 210 of 1000 ( 21.00%) |   34
                                                            
Predicate Information                                       
 3 - SARG  NOT(Q1.RATING IN ('G', 'PG', 'PG-13', 'R'))      

While the index is used on ID=3 and the cardinalities are correct, it is scanned entirely, as we do not have a range predicate but a “SARG” predicate. For more details, see Markus Winand’s overview here.

We can also show this by manually inverting the predicate to get:

Explain Plan                                                
------------------------------------------------------------
ID | Operation                |                  Rows | Cost
 1 | RETURN                   |                       |    7
 2 |  GRPBY (COMPLETE)        |    1 of 210 (   .48%) |    7
 3 |   IXSCAN IDX_FILM_RATING | 210 of 1000 ( 21.00%) |    7
                                                            
Predicate Information                                       
 3 - START (Q1.RATING = 'NC-17')                            
      STOP (Q1.RATING = 'NC-17')                            

Now, we’re getting the desired range predicate.

MySQL

MySQL supports the CHECK constraint syntax but doesn’t enforce it for whatever reason. Try this:

CREATE TABLE x (a INT CHECK (a != 0));
INSERT INTO x VALUES (0);
SELECT * FROM x;

You’ll get:

A
-
0

Zero points for MySQL (really, why not just support CHECK constraints?)

Oracle

Impossible predicate (rating = ‘N/A’)

--------------------------------------------------------------
| Id  | Operation          | Name | Starts | E-Rows | A-Rows |
--------------------------------------------------------------
|   0 | SELECT STATEMENT   |      |      1 |        |      0 |
|*  1 |  FILTER            |      |      1 |        |      0 |
|*  2 |   TABLE ACCESS FULL| FILM |      0 |     89 |      0 |
--------------------------------------------------------------
 
Predicate Information (identified by operation id):
---------------------------------------------------
 
   1 - filter(NULL IS NOT NULL)
   2 - filter("RATING"='N/A')

Again, the super confusing NULL IS NOT NULL filter that cuts off the FULL TABLE SCAN, which might as well be removed entirely from the plan. But at least it works!

Inverse predicate (rating = ‘NC-17’)

Ooops:

----------------------------------------------------------------------------
| Id  | Operation             | Name            | Starts | E-Rows | A-Rows |
----------------------------------------------------------------------------
|   0 | SELECT STATEMENT      |                 |      1 |        |      1 |
|   1 |  SORT AGGREGATE       |                 |      1 |      1 |      1 |
|*  2 |   INDEX FAST FULL SCAN| IDX_FILM_RATING |      1 |    415 |    210 |
----------------------------------------------------------------------------
 
Predicate Information (identified by operation id):
---------------------------------------------------
 
   2 - filter(("RATING"<>'PG-13' AND "RATING"<>'R' AND "RATING"<>'PG' AND "RATING"<>'G'))

The predicate could not be inverted: we get a cardinality estimate that is way off, an INDEX FAST FULL SCAN instead of an INDEX RANGE SCAN, and a filter predicate rather than an access predicate. Here’s what we should have gotten, e.g. when manually inverting the predicate:

------------------------------------------------------------------------
| Id  | Operation         | Name            | Starts | E-Rows | A-Rows |
------------------------------------------------------------------------
|   0 | SELECT STATEMENT  |                 |      1 |        |      1 |
|   1 |  SORT AGGREGATE   |                 |      1 |      1 |      1 |
|*  2 |   INDEX RANGE SCAN| IDX_FILM_RATING |      1 |    210 |    210 |
------------------------------------------------------------------------
 
Predicate Information (identified by operation id):
---------------------------------------------------
 
   2 - access("RATING"='NC-17')

Bummer!

PostgreSQL

Note that the Sakila database in its PostgreSQL version uses an ENUM type instead of a CHECK constraint on the RATING column. I’ve duplicated the table to use a CHECK constraint instead.
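
The exact DDL isn’t reproduced here, but a minimal sketch might look like this (the FILM2 name matches the plans below; the column list and constraint name are assumptions):

-- Duplicate the table with a plain text rating column instead of the ENUM:
CREATE TABLE film2 AS
SELECT film_id, title, length, rating::text AS rating
FROM film;

ALTER TABLE film2
  ADD CONSTRAINT check_special_rating
  CHECK (rating IN ('G','PG','PG-13','R','NC-17'));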

Impossible predicate (rating = ‘N/A’)

Doesn’t work:

QUERY PLAN                                            
------------------------------------------------------
Seq Scan on film2  (cost=0.00..67.50 rows=1 width=385)
  Filter: ((rating)::text = 'N/A'::text)              

Inverse predicate (rating = ‘NC-17’)

Also nope:

QUERY PLAN                                                        
------------------------------------------------------------------
Aggregate  (cost=70.53..70.54 rows=1 width=8)                     
  ->  Seq Scan on film2  (cost=0.00..70.00 rows=210 width=0)      
        Filter: ((rating)::text <> ALL ('{G,PG,PG-13,R}'::text[]))

Too bad!

NOTE: As was kindly pointed out by David Rowley in the comments, this optimisation can be opted into by specifying:

SET constraint_exclusion TO on;

SQL Server

Impossible predicate (rating = ‘N/A’)

Yes!

  |--Constant Scan

Inverse predicate (rating = ‘NC-17’)

Also yes!

  |--Compute Scalar
       |--Stream Aggregate
            |--Index Seek(OBJECT:([idx_film_rating]), SEEK:([rating]='NC-17'))

Summary

Database         Impossible predicate  Inverse predicate
DB2 LUW 10.5     Yep                   Nope
MySQL 8.0.2      Not supported         Not supported
Oracle 12.2.0.1  Yep                   Nope
PostgreSQL 9.6   Nope                  Nope
SQL Server 2014  Yep                   Yep

9. Unneeded Self JOIN

When your queries get more complex, it might well happen that you’re going to self JOIN a table based on its primary key. Trust me, this is common practice when you build complex views and JOIN them to each other, so a database that notices this is performing a crucial step in optimising complex SQL. I won’t show a complex example here, but a simple one, e.g.

SELECT a1.first_name, a1.last_name
FROM actor a1
JOIN actor a2 ON a1.actor_id = a2.actor_id;

This could be considered a special case of JOIN elimination, as we don’t really need the JOIN of A2 – we can do everything with A1 alone. Now, INNER JOIN elimination normally works in the presence of a FOREIGN KEY only, which we don’t have here. But because of the PRIMARY KEY on ACTOR_ID, we can prove that in fact A1 = A2. In a way, this is transitive closure all over again.
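
For contrast, here’s a quick sketch of the classic, FOREIGN KEY based JOIN elimination case (not part of this self-join experiment): FILM_ACTOR.ACTOR_ID is a NOT NULL foreign key referencing ACTOR’s primary key, so the JOIN neither filters nor duplicates FILM_ACTOR rows and can be dropped as long as no ACTOR columns are projected.

SELECT fa.film_id, fa.actor_id
FROM film_actor fa
JOIN actor a ON fa.actor_id = a.actor_id;

-- ... which a database implementing FK-based JOIN elimination may treat as:
SELECT fa.film_id, fa.actor_id
FROM film_actor fa;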

We can take this one step further and use columns from both A1 and A2:

SELECT a1.first_name, a2.last_name
FROM actor a1
JOIN actor a2 ON a1.actor_id = a2.actor_id;

In the classic JOIN elimination case, we could no longer eliminate the JOIN because we’re projecting from both tables. But since we’ve already proven that A1 = A2, we can use them interchangeably, so the expectation is for this query to be transformed into:

SELECT first_name, last_name
FROM actor;

Who can do this?

DB2

Projecting from A1 only

Yes:

Explain Plan                                    
------------------------------------------------
ID | Operation     |                 Rows | Cost
 1 | RETURN        |                      |   20
 2 |  TBSCAN ACTOR | 200 of 200 (100.00%) |   20

Projecting from A1 and A2

… and yes:

Explain Plan                                    
------------------------------------------------
ID | Operation     |                 Rows | Cost
 1 | RETURN        |                      |   20
 2 |  TBSCAN ACTOR | 200 of 200 (100.00%) |   20

MySQL

Projecting from A1 only

Nope

ID  TABLE  REF          EXTRA
-----------------------------------
1   a1
1   a2     a1.actor_id  Using index

Projecting from A1 and A2

… and nope

ID  TABLE  REF          EXTRA
-----------------------------------
1   a1
1   a2     a1.actor_id  

That’s disappointing…

Oracle

Projecting from A1 only

Yes

--------------------------------------------
| Id  | Operation         | Name  | E-Rows |
--------------------------------------------
|   0 | SELECT STATEMENT  |       |        |
|   1 |  TABLE ACCESS FULL| ACTOR |    200 |
--------------------------------------------

Projecting from A1 and A2

And yes

--------------------------------------------
| Id  | Operation         | Name  | E-Rows |
--------------------------------------------
|   0 | SELECT STATEMENT  |       |        |
|   1 |  TABLE ACCESS FULL| ACTOR |    200 |
--------------------------------------------

PostgreSQL

Projecting from A1 only

Nope:

QUERY PLAN                                                          
--------------------------------------------------------------------
Hash Join  (cost=6.50..13.25 rows=200 width=13)                     
  Hash Cond: (a1.actor_id = a2.actor_id)                            
  ->  Seq Scan on actor a1  (cost=0.00..4.00 rows=200 width=17)     
  ->  Hash  (cost=4.00..4.00 rows=200 width=4)                      
        ->  Seq Scan on actor a2  (cost=0.00..4.00 rows=200 width=4)

Projecting from A1 and A2

And nope:

QUERY PLAN                                                           
---------------------------------------------------------------------
Hash Join  (cost=6.50..13.25 rows=200 width=13)                      
  Hash Cond: (a1.actor_id = a2.actor_id)                             
  ->  Seq Scan on actor a1  (cost=0.00..4.00 rows=200 width=10)      
  ->  Hash  (cost=4.00..4.00 rows=200 width=11)                      
        ->  Seq Scan on actor a2  (cost=0.00..4.00 rows=200 width=11)

SQL Server

Projecting from A1 only

Surprisingly, no! (But remember, this is SQL Server 2014, maybe this got fixed in a more recent version. I should definitely upgrade!)

  |--Merge Join(Inner Join, MERGE:([a2].[actor_id])=([a1].[actor_id]))
       |--Index Scan(OBJECT:([a2]))
       |--Sort(ORDER BY:([a1].[actor_id] ASC))
            |--Table Scan(OBJECT:([a1]))

Projecting from A1 and A2

Also no, and even with a different, worse plan:

  |--Hash Match(Inner Join, HASH:([a1].[actor_id])=([a2].[actor_id]))
       |--Table Scan(OBJECT:([sakila].[dbo].[actor] AS [a1]))
       |--Table Scan(OBJECT:([sakila].[dbo].[actor] AS [a2]))

Summary

I would have frankly expected this to work on all databases, but I was proven very wrong, which is a shame. Along with JOIN elimination, this is one of the most crucial optimisations for enabling the building of huge SQL queries from reusable parts, such as views and table-valued functions. Unfortunately, this is not supported in 3 out of the 5 most popular databases.

Database         Self-join elimination,    Self-join elimination,
                 single table projection   complete projection
DB2 LUW 10.5     Yep                       Yep
MySQL 8.0.2      Nope                      Nope
Oracle 12.2.0.1  Yep                       Yep
PostgreSQL 9.6   Nope                      Nope
SQL Server 2014  Nope                      Nope

10. Predicate Pushdown

This optimisation doesn’t belong here 100%, because it is not entirely true that this transformation is never cost based. But since I cannot think of a single obvious reason why an optimiser should not push down predicates into derived tables, I’m listing it here along with the other, non-cost-based optimisations.

Consider this query:

SELECT *
FROM (
  SELECT *
  FROM actor
) a
WHERE a.actor_id = 1;

The derived table has absolutely no value in this query and it should be eliminated as well, by unnesting it. But let’s ignore that for a moment.

We’d expect the database to perform this query instead:

SELECT *
FROM (
  SELECT *
  FROM actor
  WHERE actor_id = 1
) a;

And then again, possibly, unnest the derived table entirely.

A more sophisticated example would be when using UNION:

SELECT *
FROM (
  SELECT first_name, last_name, 'actor' type
  FROM actor
  UNION ALL
  SELECT first_name, last_name, 'customer' type
  FROM customer
) people
WHERE people.last_name = 'DAVIS';

The result of this query is:

FIRST_NAME  LAST_NAME  TYPE
----------------------------
JENNIFER    DAVIS      actor
SUSAN       DAVIS      actor
SUSAN       DAVIS      actor
JENNIFER    DAVIS      customer

Now, we’d love the database optimiser to run this statement instead:

SELECT *
FROM (
  SELECT first_name, last_name, 'actor' type
  FROM actor
  WHERE last_name = 'DAVIS'
  UNION ALL
  SELECT first_name, last_name, 'customer' type
  FROM customer
  WHERE last_name = 'DAVIS'
) people;

I.e. pushing down the predicate into the derived table, and from there on into the two UNION ALL subqueries, because after all, we have indexes on both ACTOR.LAST_NAME and CUSTOMER.LAST_NAME columns.

Again, this transformation might be motivated based on costs in most databases, but I still think it’s a no-brainer to do anyway, because it’s almost always better to reduce the number of processed tuples as early as possible in any algorithm. If you know a case where this transformation is a bad idea, please comment! I’d be very curious.

So, which databases can do this? (And please, this is so basic, yet important, let the answer be: all)

DB2

Simple derived table

Yes

Explain Plan                                      
--------------------------------------------------
ID | Operation         |               Rows | Cost
 1 | RETURN            |                    |    6
 2 |  FETCH ACTOR      |   1 of 1 (100.00%) |    6
 3 |   IXSCAN PK_ACTOR | 1 of 200 (   .50%) |    0
                                                  
Predicate Information                             
 3 - START (Q1.ACTOR_ID = 1)                      
      STOP (Q1.ACTOR_ID = 1)                      

UNION derived table

Yes, again:

Explain Plan                                                     
-----------------------------------------------------------------
ID | Operation                        |               Rows | Cost
 1 | RETURN                           |                    |   20
 2 |  UNION                           |             2 of 1 |   20
 3 |   FETCH CUSTOMER                 |   1 of 1 (100.00%) |   13
 4 |    IXSCAN IDX_CUSTOMER_LAST_NAME | 1 of 599 (   .17%) |    6
 5 |   FETCH ACTOR                    |   1 of 1 (100.00%) |    6
 6 |    IXSCAN IDX_ACTOR_LAST_NAME    | 1 of 200 (   .50%) |    0
                                                                 
Predicate Information                                            
 4 - START (Q1.LAST_NAME = 'DAVIS')                              
      STOP (Q1.LAST_NAME = 'DAVIS')                              
 6 - START (Q3.LAST_NAME = 'DAVIS')                              
      STOP (Q3.LAST_NAME = 'DAVIS')                              

Also, in both cases, the derived table (view) was removed from the plan as it is not really necessary.

MySQL

Simple derived table

Yes

ID  TABLE  TYPE   KEY      REF    EXTRA
---------------------------------------
1   actor  const  PRIMARY  const

The usual PRIMARY KEY access by a constant value is applied.

UNION derived table

Oops, nope

ID  SELECT_TYPE  TABLE       TYPE  KEY          REF    ROWS  EXTRA
------------------------------------------------------------------
1   PRIMARY     <derived2>   ref   <auto_key0>  const  10
2   DERIVED      actor       ALL                       200
3   UNION        customer    ALL                       599

The manual transformation would yield:

ID  SELECT_TYPE  TABLE       TYPE  KEY                  REF    ROWS  EXTRA
--------------------------------------------------------------------------
1   PRIMARY     <derived2>   ALL                               5
2   DERIVED      actor       ref   idx_actor_last_name  const  3
3   UNION        customer    ref   idx_last_name        const  1

That’s really a problem if you want to nest complex queries in MySQL!

Oracle

Simple derived table

Yes, works

---------------------------------------------------------------------------
| Id  | Operation                   | Name     | Starts | E-Rows | A-Rows |
---------------------------------------------------------------------------
|   0 | SELECT STATEMENT            |          |      1 |        |      1 |
|   1 |  TABLE ACCESS BY INDEX ROWID| ACTOR    |      1 |      1 |      1 |
|*  2 |   INDEX UNIQUE SCAN         | PK_ACTOR |      1 |      1 |      1 |
---------------------------------------------------------------------------
 
Predicate Information (identified by operation id):
---------------------------------------------------
 
   2 - access("ACTOR"."ACTOR_ID"=1)

The derived table has been unnested, too.

UNION derived table

Works as well:

---------------------------------------------------------------------------------
| Id  | Operation                             | Name                   | E-Rows |
---------------------------------------------------------------------------------
|   0 | SELECT STATEMENT                      |                        |        |
|   1 |  VIEW                                 |                        |      4 |
|   2 |   UNION-ALL                           |                        |        |
|   3 |    TABLE ACCESS BY INDEX ROWID BATCHED| ACTOR                  |      3 |
|*  4 |     INDEX RANGE SCAN                  | IDX_ACTOR_LAST_NAME    |      3 |
|   5 |    TABLE ACCESS BY INDEX ROWID BATCHED| CUSTOMER               |      1 |
|*  6 |     INDEX RANGE SCAN                  | IDX_CUSTOMER_LAST_NAME |      1 |
---------------------------------------------------------------------------------
 
Predicate Information (identified by operation id):
---------------------------------------------------
 
   4 - access("LAST_NAME"='DAVIS')
   6 - access("LAST_NAME"='DAVIS')

However, the derived table was not unnested. The Id=1 “VIEW” operation indicates that it’s still there. This isn’t a problem in this case – just, perhaps, a bit of cosmetic overhead.

PostgreSQL

Simple derived table

Yes, it works:

QUERY PLAN                                          
----------------------------------------------------
Seq Scan on actor  (cost=0.00..4.50 rows=1 width=25)
  Filter: (actor_id = 1)                            

Note, interestingly, PostgreSQL sometimes doesn’t even use the PRIMARY KEY for a single row lookup but scans the entire table. In this case, 200 rows x 25 bytes per row (“width”) fit in a single block, so why bother reading the index and generating more I/O for this small table access?
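
If you want to see the pushed-down predicate as an index condition anyway, you can temporarily discourage sequential scans for the session – a debugging sketch using PostgreSQL’s enable_seqscan planner parameter, not a production recommendation:

SET enable_seqscan = off;

EXPLAIN
SELECT *
FROM (
  SELECT *
  FROM actor
) a
WHERE a.actor_id = 1;

RESET enable_seqscan;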

UNION derived table

Yes, this works as well:

QUERY PLAN                                                                         
-----------------------------------------------------------------------------------
Append  (cost=0.00..12.83 rows=4 width=45)                                         
  ->  Seq Scan on actor  (cost=0.00..4.50 rows=3 width=45)                         
        Filter: ((last_name)::text = 'DAVIS'::text)                                
  ->  Index Scan using idx_last_name on customer  (cost=0.28..8.29 rows=1 width=45)
        Index Cond: ((last_name)::text = 'DAVIS'::text)                            

Again, the index on ACTOR.LAST_NAME is not used, but the one on CUSTOMER.LAST_NAME is, as the CUSTOMER table is quite a bit larger.

SQL Server

Simple derived table

Yep, works

  |--Nested Loops(Inner Join)
       |--Index Seek(SEEK:([actor_id]=(1)))
       |--RID Lookup(OBJECT:([actor]))

UNION derived table

Works as well.

  |--Concatenation
       |--Compute Scalar(DEFINE:([Expr1003]='actor'))
       |    |--Nested Loops(Inner Join)
       |         |--Index Seek(SEEK:([actor].[last_name]='DAVIS'))
       |         |--RID Lookup(OBJECT:([actor]))
       |--Compute Scalar(DEFINE:([Expr1007]='customer'))
            |--Nested Loops(Inner Join)
                 |--Index Seek(SEEK:([customer].[last_name]='DAVIS'))
                 |--RID Lookup(OBJECT:([customer]))

Summary

My hopes were shattered. MySQL 8.0.2 doesn’t support this simple optimisation completely yet. All others do, however:

Database         Simple derived table pushdown  UNION derived table pushdown
DB2 LUW 10.5     Yep                            Yep
MySQL 8.0.2      Yep                            Nope
Oracle 12.2.0.1  Yep                            Yep
PostgreSQL 9.6   Yep                            Yep
SQL Server 2014  Yep                            Yep

Conclusion

The list presented here is far from complete. There are many more of these simple SQL transformations that are (or should be) a no-brainer for a database to implement, even before the cost-based optimiser kicks in. They remove what I call unnecessary, optional work (as opposed to unnecessary, mandatory work). They are essential tools for:

  • Preventing silly mistakes from affecting SQL performance. Everyone makes mistakes, and as projects grow larger and SQL queries grow more complex, these mistakes might accumulate, yet hopefully, without effect
  • Enabling the reuse of complex building blocks, such as views and table-valued functions, which can be inlined into parent SQL queries, transformed, and parts removed or rewritten

These features are essential especially for the second bullet point. Without them, it is very difficult to build 4000 LOC SQL queries that still perform decently, based on a library of reusable SQL components.

Unfortunately for users of PostgreSQL and MySQL, these two popular Open Source databases still lag quite a bit behind their commercial counterparts DB2, Oracle, and SQL Server – with DB2 faring best in this article, and Oracle and SQL Server being roughly on par.

SQL is a wonderful language, because it is declarative and any statement can be rewritten to something simpler or more sophisticated, which performs much better than what the author has written. If you have liked this article, you may also like:

How to Execute a SQL Query Only if Another SQL Query has no Results

I stumbled upon an interesting question on Stack Overflow recently. A user wanted to query a table for a given predicate. If that predicate returns no rows, they wanted to run another query using a different predicate. Preferably in a single query.

Challenge accepted!

Canonical Idea: Use a Common Table Expression

We’re querying the Sakila database and we’re trying to find films of length 120 minutes. If there are no such films, then let’s find films of length 130 minutes. The following query is formally correct and runs without any adaptations on all of Oracle, PostgreSQL and SQL Server (and probably on other DBs too, as it’s pretty standard):

WITH r AS (
  SELECT * FROM film WHERE length = 120
)
SELECT * FROM r
UNION ALL
SELECT * FROM film
WHERE length = 130
AND NOT EXISTS (
  SELECT * FROM r
)

How does it work?

The common table expression (WITH clause) wraps the first query that we want to execute no matter what. We then select from the first query, and use UNION ALL to combine the result with the result of the second query, which we’re executing only if the first query didn’t yield any results (through NOT EXISTS). We’re hoping here that the database will be smart enough to run the existence check on a pre-calculated set from the first subquery, in order to be able to avoid running the second subquery.

Let’s see, which database actually does this.

PostgreSQL

Running EXPLAIN ANALYZE

EXPLAIN ANALYZE
WITH r AS (
  SELECT * FROM film WHERE length = 120
)
SELECT * FROM r
UNION ALL
SELECT * FROM film
WHERE length = 130
AND NOT EXISTS (
  SELECT * FROM r
)

… we can see the following plan:

Append  (cost=68.50..137.26 rows=15 width=561) (actual time=0.052..0.300 rows=9 loops=1)
  CTE r
    ->  Seq Scan on film film_1  (cost=0.00..68.50 rows=9 width=394) (actual time=0.047..0.289 rows=9 loops=1)
          Filter: (length = 120)
          Rows Removed by Filter: 991
  ->  CTE Scan on r  (cost=0.00..0.18 rows=9 width=672) (actual time=0.051..0.297 rows=9 loops=1)
  ->  Result  (cost=0.02..68.52 rows=6 width=394) (actual time=0.002..0.002 rows=0 loops=1)
        One-Time Filter: (NOT $1)
        InitPlan 2 (returns $1)
          ->  CTE Scan on r r_1  (cost=0.00..0.18 rows=9 width=0) (actual time=0.000..0.000 rows=1 loops=1)
        ->  Seq Scan on film  (cost=0.00..68.50 rows=6 width=394) (never executed)
              Filter: (length = 130)
Planning time: 0.952 ms
Execution time: 0.391 ms

So, indeed, the database seems to be smart enough to avoid the second query, because the first one does yield 9 rows.

Can we see this in a benchmark as well? In principle, the complete query should take about as much time in a benchmark as the Common Table Expression alone. Here’s the benchmark logic:

DO $$
DECLARE
  v_ts TIMESTAMP;
  v_repeat CONSTANT INT := 2000;
  rec RECORD;
BEGIN

  -- Repeat benchmark several times to avoid warmup penalty
  FOR r IN 1..5 LOOP
    v_ts := clock_timestamp();

    FOR i IN 1..v_repeat LOOP
      FOR rec IN (
        SELECT * FROM film WHERE length = 120
      ) LOOP
        NULL;
      END LOOP;
    END LOOP;

    RAISE INFO 'Run %, Statement 1: %', r, 
      (clock_timestamp() - v_ts); 
    v_ts := clock_timestamp();

    FOR i IN 1..v_repeat LOOP
      FOR rec IN (
        WITH r AS (
          SELECT * FROM film WHERE length = 120
        )
        SELECT * FROM r
        UNION ALL
        SELECT * FROM film
        WHERE length = 130
        AND NOT EXISTS (
          SELECT * FROM r
        )
      ) LOOP
        NULL;
      END LOOP;
    END LOOP;

    RAISE INFO 'Run %, Statement 2: %', r, 
      (clock_timestamp() - v_ts); 
    RAISE INFO '';
  END LOOP;
END$$;

The result is:

INFO:  Run 1, Statement 1: 00:00:00.310325
INFO:  Run 1, Statement 2: 00:00:00.427744

INFO:  Run 2, Statement 1: 00:00:00.303202
INFO:  Run 2, Statement 2: 00:00:00.33568

INFO:  Run 3, Statement 1: 00:00:00.323699
INFO:  Run 3, Statement 2: 00:00:00.339835

INFO:  Run 4, Statement 1: 00:00:00.301084
INFO:  Run 4, Statement 2: 00:00:00.343838

INFO:  Run 5, Statement 1: 00:00:00.356343
INFO:  Run 5, Statement 2: 00:00:00.359891

As you can see, the second statement is consistently slower by around 5% – 10%. So we can safely say that the second subquery looking for length = 130 is not executed, but there’s still some overhead compared to making the decision in a client application to avoid that second subquery entirely. My guess is that this is due to PostgreSQL’s Common Table Expressions (CTE) being “optimisation fences”, i.e. the CTE is materialised every time. See also:
https://blog.2ndquadrant.com/postgresql-ctes-are-optimization-fences/

What about the inverse case?

In the above benchmark, we’ve measured how much time it takes when the first query succeeds (and the second query should be avoided). What about the inverse case, where the first query doesn’t match any rows and we have to run another query?

Benchmark time!

DO $$
DECLARE
  v_ts TIMESTAMP;
  v_repeat CONSTANT INT := 2000;
  rec RECORD;
BEGIN

  -- Repeat benchmark several times to avoid warmup penalty
  FOR r IN 1..5 LOOP
    v_ts := clock_timestamp();

    FOR i IN 1..v_repeat LOOP
      FOR rec IN (
        SELECT * FROM film WHERE length = 1200
      ) LOOP
        NULL;
      END LOOP;
      FOR rec IN (
        SELECT * FROM film WHERE length = 130
      ) LOOP
        NULL;
      END LOOP;
    END LOOP;

    RAISE INFO 'Run %, Statement 1: %', r, 
      (clock_timestamp() - v_ts); 
    v_ts := clock_timestamp();

    FOR i IN 1..v_repeat LOOP
      FOR rec IN (
        WITH r AS (
          SELECT * FROM film WHERE length = 1200
        )
        SELECT * FROM r
        UNION ALL
        SELECT * FROM film
        WHERE length = 130
        AND NOT EXISTS (
          SELECT * FROM r
        )
      ) LOOP
        NULL;
      END LOOP;
    END LOOP;

    RAISE INFO 'Run %, Statement 2: %', r, 
      (clock_timestamp() - v_ts); 
    RAISE INFO '';
  END LOOP;
END$$;

The result is roughly the same:

INFO:  Run 1, Statement 1: 00:00:00.680222
INFO:  Run 1, Statement 2: 00:00:00.696036

INFO:  Run 2, Statement 1: 00:00:00.673141
INFO:  Run 2, Statement 2: 00:00:00.709034

INFO:  Run 3, Statement 1: 00:00:00.626873
INFO:  Run 3, Statement 2: 00:00:00.679469

INFO:  Run 4, Statement 1: 00:00:00.619584
INFO:  Run 4, Statement 2: 00:00:00.639092

INFO:  Run 5, Statement 1: 00:00:00.616275
INFO:  Run 5, Statement 2: 00:00:00.675317

Again, a slight overhead in the combined, single-query case.

But what’s this? We didn’t even have an index on the LENGTH column. Let’s add one!
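
For example (the index name is my own choice, not from the original text):

CREATE INDEX idx_film_length ON film (length);
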

Now, the result is very different. Query 1 succeeds:

INFO:  Run 1, Statement 1: 00:00:00.055835
INFO:  Run 1, Statement 2: 00:00:00.093982

INFO:  Run 2, Statement 1: 00:00:00.038817
INFO:  Run 2, Statement 2: 00:00:00.084092

INFO:  Run 3, Statement 1: 00:00:00.041911
INFO:  Run 3, Statement 2: 00:00:00.078062

INFO:  Run 4, Statement 1: 00:00:00.039367
INFO:  Run 4, Statement 2: 00:00:00.081752

INFO:  Run 5, Statement 1: 00:00:00.039983
INFO:  Run 5, Statement 2: 00:00:00.081227

Query 1 fails:

INFO:  Run 1, Statement 1: 00:00:00.075469
INFO:  Run 1, Statement 2: 00:00:00.081766

INFO:  Run 2, Statement 1: 00:00:00.058276
INFO:  Run 2, Statement 2: 00:00:00.079613

INFO:  Run 3, Statement 1: 00:00:00.060492
INFO:  Run 3, Statement 2: 00:00:00.080672

INFO:  Run 4, Statement 1: 00:00:00.05877
INFO:  Run 4, Statement 2: 00:00:00.07936

INFO:  Run 5, Statement 1: 00:00:00.057584
INFO:  Run 5, Statement 2: 00:00:00.085798

Oracle

In Oracle, I couldn’t find any difference in execution speed (see below). The plan of a combined query also contains an element that prevents the execution of the second subquery. In this case, I’m using the /*+GATHER_PLAN_STATISTICS*/ hint to make sure we get actual execution values / times in our execution plan:

WITH r AS (
  SELECT * FROM film WHERE length = 120
)
SELECT /*+GATHER_PLAN_STATISTICS*/ * FROM r
UNION ALL
SELECT * FROM film
WHERE length = 130
AND NOT EXISTS (
  SELECT * FROM r
);

SELECT p.*
FROM (
  SELECT *
  FROM v$sql
  WHERE upper(sql_text) LIKE '%LENGTH = 120%'
  ORDER BY last_active_time DESC
  FETCH NEXT 1 ROW ONLY
) s 
CROSS APPLY TABLE(dbms_xplan.display_cursor(
  sql_id => s.sql_id, 
  format => 'ALLSTATS LAST'
)) p;
---------------------------------------------------------------
| Id  | Operation           | Name | Starts | E-Rows | A-Rows |
---------------------------------------------------------------
|   0 | SELECT STATEMENT    |      |      1 |        |      9 |
|   1 |  UNION-ALL          |      |      1 |        |      9 |
|*  2 |   TABLE ACCESS FULL | FILM |      1 |      7 |      9 |
|*  3 |   FILTER            |      |      1 |        |      0 |
|*  4 |    TABLE ACCESS FULL| FILM |      0 |      7 |      0 |
|*  5 |    TABLE ACCESS FULL| FILM |      1 |      2 |      1 |
---------------------------------------------------------------
 
Predicate Information (identified by operation id):
---------------------------------------------------
 
   2 - filter("LENGTH"=120)
   3 - filter( IS NULL)
   4 - filter("LENGTH"=130)
   5 - filter("LENGTH"=120)

While the estimates are off just as in PostgreSQL (an error that can propagate, see conclusion), the actual row count for the second subquery is zero, and the second subquery is run zero times (“Starts”), because we don’t really have to access it at all. Excellent. Exactly what we expected!

Here, I’ve finally created a benchmark that anonymises the results properly by normalising them, in order to comply with Oracle’s licence terms, which forbid publishing benchmark results. The fastest execution time is simply 1, and the other execution times are expressed as multiples of that value:

SET SERVEROUTPUT ON
CREATE TABLE results (
  run     NUMBER(2),
  stmt    NUMBER(2),
  elapsed NUMBER
);

DECLARE
  v_ts TIMESTAMP WITH TIME ZONE;
  v_repeat CONSTANT NUMBER := 2000;
BEGIN

  -- Repeat benchmark several times to avoid warmup penalty
  FOR r IN 1..5 LOOP
    v_ts := SYSTIMESTAMP;
      
    FOR i IN 1..v_repeat LOOP
      FOR rec IN (
        SELECT * FROM film WHERE length = 120
      ) LOOP
        NULL;
      END LOOP;
    END LOOP;
  
    INSERT INTO results VALUES (r, 1, 
      SYSDATE + ((SYSTIMESTAMP - v_ts) * 86400) - SYSDATE);
    v_ts := SYSTIMESTAMP;
      
    FOR i IN 1..v_repeat LOOP
      FOR rec IN (
        WITH r AS (
          SELECT * FROM film WHERE length = 120
        )
        SELECT * FROM r
        UNION ALL
        SELECT * FROM film
        WHERE length = 130
        AND NOT EXISTS (
          SELECT * FROM r
        )
      ) LOOP
        NULL;
      END LOOP;
    END LOOP;
      
    INSERT INTO results VALUES (r, 2, 
      SYSDATE + ((SYSTIMESTAMP - v_ts) * 86400) - SYSDATE);
  END LOOP;
  
  FOR rec IN (
    SELECT 
      run, stmt, 
      CAST(elapsed / MIN(elapsed) OVER() AS NUMBER(5, 4)) ratio 
    FROM results
  )
  LOOP
    dbms_output.put_line('Run ' || rec.run || 
      ', Statement ' || rec.stmt || 
      ' : ' || rec.ratio);
  END LOOP;
END;
/

DROP TABLE results;

The result being (query 1 succeeds, no index):

Run 1, Statement 1 : 1
Run 1, Statement 2 : 1.26901

Run 2, Statement 1 : 1.10218
Run 2, Statement 2 : 1.08792

Run 3, Statement 1 : 1.26038
Run 3, Statement 2 : 1.09426

Run 4, Statement 1 : 1.2245
Run 4, Statement 2 : 1.10829

Run 5, Statement 1 : 1.07164
Run 5, Statement 2 : 1.18562

Or in the inverse case (query 1 fails, no index):

Run 1, Statement 1 : 1
Run 1, Statement 2 : 1.17871

Run 2, Statement 1 : 1.07377
Run 2, Statement 2 : 1.12489

Run 3, Statement 1 : 1.05745
Run 3, Statement 2 : 1.13711

Run 4, Statement 1 : 1.11118
Run 4, Statement 2 : 1.23508

Run 5, Statement 1 : 1.08535
Run 5, Statement 2 : 1.11271

Adding an index doesn’t change much (query 1 succeeds):

Run 1, Statement 1 : 1.20699
Run 1, Statement 2 : 1.28221

Run 2, Statement 1 : 1
Run 2, Statement 2 : 1.21174

Run 3, Statement 1 : 1.0054
Run 3, Statement 2 : 1.2643

Run 4, Statement 1 : 1.0491
Run 4, Statement 2 : 1.31103

Run 5, Statement 1 : 1.02547
Run 5, Statement 2 : 1.23192

Yet, when query 1 fails:

Run 1, Statement 1 : 1.56287
Run 1, Statement 2 : 1.09471

Run 2, Statement 1 : 1.22219
Run 2, Statement 2 : 1.11227

Run 3, Statement 1 : 1.19739
Run 3, Statement 2 : 1.03929

Run 4, Statement 1 : 1.13503
Run 4, Statement 2 : 1

Run 5, Statement 1 : 1.14289
Run 5, Statement 2 : 1.01919

This time, the combined query is a bit faster!

As can be seen, both queries are executed in roughly the same time on Oracle 12c, although again the combined single-query solution seems to be a little bit slower, but not always. Which is an important reminder to do benchmarking properly! Meaning:

  • Repeat benchmarks several times
  • Beware of warmup penalties (the first run is often the slowest)
  • Beware of excessive caching effects in benchmarks
  • Don’t trust performance differences that aren’t significant
  • Don’t compile any Scala code or chat on Slack while benchmarking. Your system should be otherwise idle
  • Remember to benchmark the right data set. We only have 600 films in this table. What would happen with 60 million films?

SQL Server

Same exercise again:

DECLARE @ts DATETIME;
DECLARE @repeat INT = 2000;
DECLARE @r INT;
DECLARE @i INT;
DECLARE @dummy VARCHAR;

DECLARE @s1 CURSOR;
DECLARE @s2 CURSOR;

DECLARE @results TABLE (
  run     INT,
  stmt    INT,
  elapsed DECIMAL
);

SET @r = 0;
WHILE @r < 5
BEGIN
  SET @r = @r + 1

  SET @s1 = CURSOR FOR 
    SELECT title FROM film WHERE length = 120;

  SET @s2 = CURSOR FOR 
    WITH r AS (
      SELECT * FROM film WHERE length = 120
    )
    SELECT title FROM r
    UNION ALL
    SELECT title FROM film
    WHERE length = 130
    AND NOT EXISTS (
      SELECT * FROM r
    );

  SET @ts = current_timestamp;
  SET @i = 0;
  WHILE @i < @repeat
  BEGIN
    SET @i = @i + 1

    OPEN @s1;
    FETCH NEXT FROM @s1 INTO @dummy;
    WHILE @@FETCH_STATUS = 0
    BEGIN
      FETCH NEXT FROM @s1 INTO @dummy;
    END;

    CLOSE @s1;
  END;

  DEALLOCATE @s1;
  INSERT INTO @results VALUES (@r, 1, DATEDIFF(ms, @ts, current_timestamp));

  SET @ts = current_timestamp;
  SET @i = 0;
  WHILE @i < @repeat
  BEGIN
    SET @i = @i + 1

    OPEN @s2;
    FETCH NEXT FROM @s2 INTO @dummy;
    WHILE @@FETCH_STATUS = 0
    BEGIN
      FETCH NEXT FROM @s2 INTO @dummy;
    END;

    CLOSE @s2;
  END;

  DEALLOCATE @s2;
  INSERT INTO @results VALUES (@r, 2, DATEDIFF(ms, @ts, current_timestamp));
END;

SELECT 'Run ' + CAST(run AS VARCHAR) + 
  ', Statement ' + CAST(stmt AS VARCHAR) + 
  ': ' + CAST(CAST(elapsed / MIN(elapsed) OVER() AS DECIMAL(10, 5)) AS VARCHAR)
FROM @results;

The result, this time, is more drastic (no index, query 1 succeeds):

Run 1, Statement 1: 1.07292
Run 1, Statement 2: 1.35000

Run 2, Statement 1: 1.07604
Run 2, Statement 2: 1.40625

Run 3, Statement 1: 1.08333
Run 3, Statement 2: 1.40208

Run 4, Statement 1: 1.09375
Run 4, Statement 2: 1.34375

Run 5, Statement 1: 1.00000
Run 5, Statement 2: 1.46458

There is a 30% – 40% overhead for the CTE solution over the two query solution. If we don’t find any rows in the first query (no index):

Run 1, Statement 1: 1.08256
Run 1, Statement 2: 1.27546

Run 2, Statement 1: 1.16512
Run 2, Statement 2: 1.27778

Run 3, Statement 1: 1.00000
Run 3, Statement 2: 1.26235

Run 4, Statement 1: 1.04167
Run 4, Statement 2: 1.26003

Run 5, Statement 1: 1.05401
Run 5, Statement 2: 1.34259

… then the difference is slightly less drastic, but still clear. The reason is that SQL Server doesn’t avoid running the unnecessary second subquery.

Too bad! (Note that I was using SQL Server 2014. Perhaps this optimisation is implemented in SQL Server 2016.)

Note, you can trust me that adding an index doesn’t change much in this case.

Conclusion

We’ve seen that we can easily solve the original problem with SQL only: Select some data from a table using predicate A, and if we don’t find any data for predicate A, then try finding data using predicate B from the same table.

Oracle and PostgreSQL can both optimise away the unnecessary query 2 by inserting a “probe” in their execution plans that knows whether the query 2 needs to be executed or not. In Oracle, we’ve even seen a situation where the combined query outperforms two individual queries. SQL Server 2014 surprisingly does not have such an optimisation.

While the performance impact was negligible in all benchmarks (even in SQL Server), we should be careful with these kinds of queries and not entirely rely on the optimiser to “get it right”. In all three databases, the cardinality estimates were off. We’re working with small data sets, but if data sets grow larger, and queries like the above are embedded in more complex queries, then the wrong cardinality estimates can easily produce wrong execution plans (e.g. favouring hash join over nested loop joins because of a high number of estimated rows). An example of this was given in a previous blog post.

Nevertheless, we can get quite far with SQL alone, without resorting to procedural client languages. And had I conducted the benchmark with a JDBC client instead of procedural blocks running directly inside of the database, the single combined query might well have outperformed the two-query approach, at least in those cases where query 1 yielded no rows and query 2 would have had to be executed in an extra round trip from a remote client. Probably in Oracle.

Ultimately, I can only repeat myself. Measure! Measure! Measure! There’s no point in guessing. Truth can only be found by measuring actual executions.

The Difficulty of Tuning Queries Over a Database Link – Or How I Learned to Stop Worrying and Love the DUAL@LINK Table

A large-ish customer in banking (largest tables on that particular system: ~1 billion rows) once decided to separate the OLTP database from the “log database” in order to better use resources and prevent contention on some tables, as the append-only log database is used heavily for analytic querying of all sorts. That seems to make perfect sense. Except that sometimes, joins need to be done between “main database” and “log database” tables. This is when things get really hard to tune in Oracle – and probably in other databases too.

In this article, however, I’d like to focus on a much simpler example. One that seems to cause no trouble to the optimiser because all joined tables are from the “log database” only. Let’s use the following setup:

-- This is the database link
CREATE PUBLIC DATABASE LINK LOOPBACK 
CONNECT TO TEST IDENTIFIED BY TEST 
USING 'ORCLCDB';

-- Just making sure we get all statistics in execution plans
ALTER SESSION SET statistics_level = ALL;

And then, create this schema:

CREATE TABLE t (
  a INT NOT NULL,
  b INT NOT NULL,
  CONSTRAINT pk_t PRIMARY KEY (a)
);
CREATE TABLE u (
  a INT NOT NULL,
  b INT NOT NULL,
  CONSTRAINT pk_u PRIMARY KEY (a)
);

INSERT INTO t
SELECT
  level,
  level
FROM dual
CONNECT BY level <= 500000;

INSERT INTO u
SELECT
  level,
  level
FROM dual
CONNECT BY level <= 500000;

CREATE INDEX i_t ON t(b);

ALTER TABLE u ADD CONSTRAINT fk_u FOREIGN KEY (a) REFERENCES t;

EXEC dbms_stats.gather_table_stats('TEST', 'T');
EXEC dbms_stats.gather_table_stats('TEST', 'U');

It’s a really boring emulation of the real schema, and it doesn’t have nearly as many columns / rows. But the essence is:

  • There are (at least) two tables
  • Both have quite a few rows (that’s important here. I’ll show why, later)
  • We’ll use an index for searching rows
  • We’ll join by a one-to-many relationship

There may be other setups to reproduce the same issue, of course.

Now, let’s consider the following query (not using the database link yet).

SELECT CASE WHEN EXISTS (
  SELECT *
  FROM t 
  JOIN u USING (a)
  WHERE t.b BETWEEN 0 AND 1000
) THEN 1 ELSE 0 END
FROM dual

Unfortunately, Oracle doesn’t support boolean types and always requires a FROM clause. Otherwise, we could be writing this more concise version, as in PostgreSQL:

SELECT EXISTS (
  SELECT *
  FROM t 
  JOIN u USING (a)
  WHERE t.b BETWEEN 0 AND 1000
)

We’re checking for the existence of rows in both tables, given a predicate that runs on the previously created index.

As shown in a previous article, it’s much better to use EXISTS rather than COUNT(*), in pretty much all databases. The algorithm is optimal, because the usage of the EXISTS predicate hints to the optimiser that a SEMI JOIN can be used instead of an INNER JOIN:

--------------------------------------------------------------------------
| Operation                            | Name | Starts | E-Rows | A-Rows |
--------------------------------------------------------------------------
| SELECT STATEMENT                     |      |      1 |        |      1 |
|  NESTED LOOPS SEMI                   |      |      1 |      4 |      1 |
|   TABLE ACCESS BY INDEX ROWID BATCHED| T    |      1 |   1000 |      1 |
|    INDEX RANGE SCAN                  | I_T  |      1 |   1000 |      1 |
|   INDEX UNIQUE SCAN                  | PK_U |      1 |    333K|      1 |
|  FAST DUAL                           |      |      1 |      1 |      1 |
--------------------------------------------------------------------------

Some observations:

  • The optimiser chose a NESTED LOOPS SEMI join, so it can stop scanning as soon as the first match is found
  • Although 1000 rows are estimated for the range scan on I_T, only a single row is actually read (A-Rows = 1)
  • The unique index PK_U is probed exactly once, despite the wildly off estimate of 333K rows

So, this is optimal (until I learn a new trick, of course).
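
In case you want to reproduce the actual-statistics plans shown in this article yourself, with statistics_level = ALL set as above, running something along these lines right after each query should display the last plan including the A-Rows column (standard dbms_xplan usage, nothing specific to this article):

SELECT *
FROM TABLE(dbms_xplan.display_cursor(format => 'ALLSTATS LAST'));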

Let’s bring in database links

Assuming that these two tables are on a remote database, we might naively proceed with writing this query:

SELECT CASE WHEN EXISTS (
  SELECT *
  FROM t@loopback 
  JOIN u@loopback USING (a)
  WHERE t.b BETWEEN 0 AND 1000
) THEN 1 ELSE 0 END
FROM dual;

So now, we’re selecting from T@LOOPBACK and U@LOOPBACK, but the rest is exactly the same. For the sake of simplicity, I’m running this reproduction on the same instance, thus “LOOPBACK”. The logical impact is the same, though.

------------------------------------------------------
| Operation        | Name | Starts | E-Rows | A-Rows |
------------------------------------------------------
| SELECT STATEMENT |      |      1 |        |      1 |
|  REMOTE          |      |      1 |        |      1 |
|  FAST DUAL       |      |      1 |      1 |      1 |
------------------------------------------------------

Interesting. Or rather: Not too interesting. Sure, our own database knows the correct estimate: 1 row that comes out of the EXISTS() predicate. But the interesting thing happens at the remote database. Let’s look at the plan there. The query being executed on the remote database is this:

SQL_ID  80fczs4r1c9yd, child number 0
-------------------------------------

SELECT 0 FROM "T" "A2","U" "A1" WHERE "A2"."B">=0 AND "A2"."B"=0

So, the EXISTS() predicate is not propagated to the remote database. Thus, the plan:

Plan hash value: 165433672
 
--------------------------------------------------------------------------
| Operation                            | Name | Starts | E-Rows | A-Rows |
--------------------------------------------------------------------------
| SELECT STATEMENT                     |      |      1 |        |      1 |
|  HASH JOIN                           |      |      1 |   1000 |      1 |
|   TABLE ACCESS BY INDEX ROWID BATCHED| T    |      1 |   1000 |   1000 |
|    INDEX RANGE SCAN                  | I_T  |      1 |   1000 |   1000 |
|   INDEX FAST FULL SCAN               | PK_U |      1 |    500K|      1 |
--------------------------------------------------------------------------

Oops. Observations:

  • We’re now running a hash join (as expected, given the query that the remote database knows of)
  • We’re materialising the expected 1000 rows from the predicate on T.B
  • But we’re still not fetching all the expected 500,000 rows from the U table because the database that calls this query will abort as soon as it finds a single row

Huh. Bummer. So while we’re not running into a major catastrophe (of materialising all the rows from U), this is still far from optimal. The remote database has no knowledge at all of the fact that we’re going to be selecting 0 or 1 rows only, and that it thus should always run a SEMI JOIN.

You can try adding a /*+FIRST_ROWS(1)*/ hint, but that doesn’t work. It won’t make it to the remote database.
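
For completeness, that attempt would look something like this (it is the same query as before, just with the hint, which, as described, does not make it across the database link):

SELECT /*+FIRST_ROWS(1)*/ CASE WHEN EXISTS (
  SELECT *
  FROM t@loopback 
  JOIN u@loopback USING (a)
  WHERE t.b BETWEEN 0 AND 1000
) THEN 1 ELSE 0 END
FROM dual;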

Arcane DUAL@LINK to the rescue

This is when I had an idea. The problem might just be the fact that Oracle always needs a FROM clause, even if it doesn’t make any sense here. So what if we use DUAL@LOOPBACK instead of DUAL? The plain DUAL table, technically, is a table on our own database, so even if it looks as though the entire query could be run on the remote database, that isn’t true: the local DUAL prevents it. If every table in the query, including DUAL, lives on the remote database, the whole query can be shipped there. So let’s try this:

SELECT CASE WHEN EXISTS (
  SELECT *
  FROM t@loopback 
  JOIN u@loopback USING (a)
  WHERE t.b BETWEEN 0 AND 1000
) THEN 1 ELSE 0 END
FROM dual@loopback; -- Subtle difference here!

As I hoped, this subtle change leads to the EXISTS() predicate being sent to the remote database. The query executed on the remote database is now:

SQL_ID  9bz87xw0zc23c, child number 0
-------------------------------------
SELECT CASE  WHEN  EXISTS (SELECT 0 FROM "T" "A3","U" "A2" WHERE 
"A3"."B">=0 AND "A3"."B"<=1000 AND "A3"."A"="A2"."A") THEN 1 ELSE 0 END 
 FROM "DUAL" "A1"

And the plan, now again including the desired SEMI JOIN:

Plan hash value: 1561559448
 
--------------------------------------------------------------------------
| Operation                            | Name | Starts | E-Rows | A-Rows |
--------------------------------------------------------------------------
| SELECT STATEMENT                     |      |      1 |        |      1 |
|  NESTED LOOPS SEMI                   |      |      1 |      4 |      1 |
|   TABLE ACCESS BY INDEX ROWID BATCHED| T    |      1 |   1000 |      1 |
|    INDEX RANGE SCAN                  | I_T  |      1 |   1000 |      1 |
|   INDEX UNIQUE SCAN                  | PK_U |      1 |    333K|      1 |
|  FAST DUAL                           |      |      1 |      1 |      1 |
--------------------------------------------------------------------------

Excellent!

Benchmark time

Plans and estimates are one thing. What ultimately counts for the business is wall clock time. So, let’s try this again using a benchmark:

SET SERVEROUTPUT ON
DECLARE
  v_ts TIMESTAMP WITH TIME ZONE;
  v_repeat CONSTANT NUMBER := 100;
BEGIN

  -- Repeat benchmark several times to avoid warmup penalty
  FOR r IN 1..5 LOOP
    v_ts := SYSTIMESTAMP;
      
    FOR i IN 1..v_repeat LOOP
      FOR rec IN (
        SELECT CASE WHEN EXISTS (
          SELECT *
          FROM t 
          JOIN u USING (a)
          WHERE t.b BETWEEN 0 AND 1000
        ) THEN 1 ELSE 0 END
        FROM dual
      ) LOOP
        NULL;
      END LOOP;
    END LOOP;
      
    dbms_output.put_line('Run ' || r ||', Statement 1 : ' 
      || (SYSTIMESTAMP - v_ts));
    v_ts := SYSTIMESTAMP;
      
    FOR i IN 1..v_repeat LOOP
      FOR rec IN (
        SELECT CASE WHEN EXISTS (
          SELECT *
          FROM t@loopback 
          JOIN u@loopback USING (a)
          WHERE t.b BETWEEN 0 AND 1000
        ) THEN 1 ELSE 0 END
        FROM dual
      ) LOOP
        NULL;
      END LOOP;
    END LOOP;
      
    dbms_output.put_line('Run ' || r ||', Statement 2 : ' 
      || (SYSTIMESTAMP - v_ts));
    v_ts := SYSTIMESTAMP;
      
    FOR i IN 1..v_repeat LOOP
      FOR rec IN (
        SELECT CASE WHEN EXISTS (
          SELECT *
          FROM t@loopback 
          JOIN u@loopback USING (a)
          WHERE t.b BETWEEN 0 AND 1000
        ) THEN 1 ELSE 0 END
        FROM dual@loopback
      ) LOOP
        NULL;
      END LOOP;
    END LOOP;
      
    dbms_output.put_line('Run ' || r ||', Statement 3 : ' 
      || (SYSTIMESTAMP - v_ts));
    dbms_output.put_line('');
  END LOOP;
END;
/

And here are the resulting times:

Run 1, Statement 1 : +000000000 00:00:00.008110000
Run 1, Statement 2 : +000000000 00:00:00.213404000
Run 1, Statement 3 : +000000000 00:00:00.043044000

Run 2, Statement 1 : +000000000 00:00:00.003466000
Run 2, Statement 2 : +000000000 00:00:00.198487000
Run 2, Statement 3 : +000000000 00:00:00.042717000

Run 3, Statement 1 : +000000000 00:00:00.003077000
Run 3, Statement 2 : +000000000 00:00:00.191802000
Run 3, Statement 3 : +000000000 00:00:00.048740000

Run 4, Statement 1 : +000000000 00:00:00.005008000
Run 4, Statement 2 : +000000000 00:00:00.192828000
Run 4, Statement 3 : +000000000 00:00:00.043461000

Run 5, Statement 1 : +000000000 00:00:00.002970000
Run 5, Statement 2 : +000000000 00:00:00.190786000
Run 5, Statement 3 : +000000000 00:00:00.043910000

Clearly, not using the database link is always the fastest, roughly by a factor of 10 compared to the DUAL@LOOPBACK solution. But due to the system design, we don’t have this choice. Nonetheless, you can still see that DUAL@LOOPBACK consistently outperforms DUAL by another factor of around 5 as it still prevents the HASH JOIN!

Caveat: Small data != Big data

There, I said it. “Big Data”. Before, we had a predicate that matched 1,000 rows in a 500,000-row table. Our customer had millions of rows. But what happens if you query small data sets? Let’s reduce the predicate to this:

WHERE t.b BETWEEN 0 AND 10

The benchmark result is now completely different:

Run 1, Statement 1 : +000000000 00:00:00.007093000
Run 1, Statement 2 : +000000000 00:00:00.047539000
Run 1, Statement 3 : +000000000 00:00:00.071546000

Run 2, Statement 1 : +000000000 00:00:00.003023000
Run 2, Statement 2 : +000000000 00:00:00.041259000
Run 2, Statement 3 : +000000000 00:00:00.052132000

Run 3, Statement 1 : +000000000 00:00:00.002767000
Run 3, Statement 2 : +000000000 00:00:00.034190000
Run 3, Statement 3 : +000000000 00:00:00.054023000

Run 4, Statement 1 : +000000000 00:00:00.003468000
Run 4, Statement 2 : +000000000 00:00:00.026141000
Run 4, Statement 3 : +000000000 00:00:00.047415000

Run 5, Statement 1 : +000000000 00:00:00.002818000
Run 5, Statement 2 : +000000000 00:00:00.026100000
Run 5, Statement 3 : +000000000 00:00:00.046875000

And as you can see, the DUAL@LOOPBACK solution actually worsens performance for these queries. The reason for this is that we’re now running, again, a NESTED LOOP JOIN (but not SEMI JOIN) rather than a HASH JOIN on the remote database:

Query on remote database:

SQL_ID  7349t2363uc9m, child number 0
-------------------------------------
SELECT 0 FROM "T" "A2","U" "A1" WHERE "A2"."B">=0 AND "A2"."B"=0

Plan on remote database:

Plan hash value: 2558931407
 
--------------------------------------------------------------------------
| Operation                            | Name | Starts | E-Rows | A-Rows |
--------------------------------------------------------------------------
| SELECT STATEMENT                     |      |      1 |        |      1 |
|  NESTED LOOPS                        |      |      1 |     10 |      1 |
|   TABLE ACCESS BY INDEX ROWID BATCHED| T    |      1 |     10 |      1 |
|    INDEX RANGE SCAN                  | I_T  |      1 |     10 |      1 |
|   INDEX UNIQUE SCAN                  | PK_U |      1 |      1 |      1 |
--------------------------------------------------------------------------

I haven’t analysed what the reason for this difference is, as the difference is not significant enough, compared to the improvement for large data sets.

Conclusion

Tuning queries over database links is hard. Much much harder than tuning “ordinary” queries. Ideally, you’ll simply avoid database links and run all queries on a single instance. But sometimes that’s not possible.

In that case, the best solution is to move the logic to the remote query completely and collect only the result. Ideally, this is done using a stored procedure on the remote database and calculating this 1/0 result completely remotely. I think, hipsters these days call this a Microservice, or better, a Lambda:

CREATE FUNCTION t_u RETURN NUMBER IS
  v_result NUMBER;
BEGIN
  SELECT CASE WHEN EXISTS (
    SELECT *
    FROM t 
    JOIN u USING (a)
    WHERE t.b BETWEEN 0 AND 1000
  ) THEN 1 ELSE 0 END
  INTO v_result
  FROM dual;
  
  RETURN v_result;
END t_u;
/
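
The fourth benchmark statement then simply calls this function over the database link, presumably along these lines (the exact call is an assumption, but remote function calls in SQL use this syntax):

SELECT t_u@loopback
FROM dual;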

Comparing the benchmark call with the other options:

T.B BETWEEN 0 AND 10

Statement 1 : +000000000 00:00:00.003022000 -- Local query
Statement 2 : +000000000 00:00:00.027416000 -- Local DUAL
Statement 3 : +000000000 00:00:00.043823000 -- Remote DUAL
Statement 4 : +000000000 00:00:00.022181000 -- Remote stored procedure

T.B BETWEEN 0 AND 1000

Statement 1 : +000000000 00:00:00.002877000 -- Local query
Statement 2 : +000000000 00:00:00.188588000 -- Local DUAL
Statement 3 : +000000000 00:00:00.050163000 -- Remote DUAL
Statement 4 : +000000000 00:00:00.018736000 -- Remote stored procedure

But that, too, might not be possible as you may not have the required rights to create stored procedures on that database. You could call DBMS_SQL on the remote database and run a PL/SQL block dynamically on the remote database (didn’t try that in my benchmark).

Or, you simply use an occasional DUAL@LINK, which might already do the trick with a minimal change to the original query.

How to Calculate Multiple Aggregate Functions in a Single Query

At a customer site, I’ve recently encountered a report where a programmer needed to count quite a bit of stuff from a single table. The counts all differed in the way they used specific predicates. The report looked roughly like this (as always, I’m using the Sakila database for illustration):

-- Total number of films
SELECT count(*)
FROM film

-- Number of films with a given length
SELECT count(*)
FROM film
WHERE length BETWEEN 120 AND 150

-- Number of films with a given language
SELECT count(*)
FROM film
WHERE language_id = 1

-- Number of films for a given rating
SELECT count(*)
FROM film
WHERE rating = 'PG'

And then, unsurprisingly, combinations of these predicates were needed as well, i.e.

-- Number of films with a given length / language_id
SELECT count(*)
FROM film
WHERE length BETWEEN 120 AND 150
AND language_id = 1

-- Number of films with a given length / rating
SELECT count(*)
FROM film
WHERE length BETWEEN 120 AND 150
AND rating = 'PG'

-- Number of films with a given language_id / rating
SELECT count(*)
FROM film
WHERE language_id = 1
AND rating = 'PG'

-- Number of films with a given length / language_id / rating
SELECT count(*)
FROM film
WHERE length BETWEEN 120 AND 150
AND language_id = 1
AND rating = 'PG'

In the end, there were 32 queries in total (or 8 in my example) with all the possible combinations of predicates. Needless to say that running them all took quite a while, because the table had around 200M records and only one predicate could profit from an index.

But in fact, the improvement is really easy. There are several options to calculate all these counts in a single query:

Simplest solution works in all databases: Filtered aggregate functions (or manual pivot)

This solution allows for calculating all results in a single query by using 8 different, explicit, filtered aggregate functions and no GROUP BY clause (none in this example; more complex cases where a GROUP BY persists are still imaginable).

This is how it works on all databases:

SELECT 
  count(*),
  count(length),
  count(language_id),
  count(rating),
  count(length + language_id),
  count(length + rating),
  count(language_id + rating),
  count(length + language_id + rating)
FROM (
  SELECT
    CASE WHEN length BETWEEN 120 AND 150 THEN 1 END length,
    CASE WHEN language_id = 1            THEN 1 END language_id,
    CASE WHEN rating = 'PG'              THEN 1 END rating
  FROM film
) film

Which yields:

col1  col2  col3  col4  col5  col6  col7  col8
1000  224   1000  194   224   43    194   43

How to read the above query?

Instead of evaluating the three different predicates in a WHERE clause, we pre-calculate them in a derived table (a subquery in the FROM clause) and translate each predicate into an arbitrary non-NULL value (e.g. 1) if TRUE, and into NULL if FALSE. Note that I omitted the ELSE clause from the CASE expressions, which means that we get NULL by default. Running the nested select on its own…

SELECT
  CASE WHEN length BETWEEN 120 AND 150 THEN 1 END length,
  CASE WHEN language_id = 1            THEN 1 END language_id,
  CASE WHEN rating = 'PG'              THEN 1 END rating
FROM film

… yields something along the lines of:

length  language_id  rating
---------------------------
NULL    1            1
NULL    1            NULL
NULL    1            NULL
NULL    1            NULL
1       1            NULL
NULL    1            1
NULL    1            NULL
...

(Note, of course, we could have used actual BOOLEAN types, e.g. in PostgreSQL, but that wouldn’t work on all databases)

Now, in the outer query, we’re using COUNT(*) once, which simply counts all the rows regardless of any predicates in the CASE expressions. The other COUNT(expr) aggregate functions do something that surprisingly few people are aware of (yet a lot of people use this form “by accident”): they count only the rows for which the expression is non-NULL. For instance:

SELECT 
  ...
  count(length),
  ...
FROM (
  SELECT
    CASE WHEN length BETWEEN 120 AND 150 THEN 1 END length,
    ...
  FROM film
) film

Or also:

SELECT 
  count(CASE WHEN length BETWEEN 120 AND 150 THEN 1 END)
FROM
  film

These queries will count those films whose length is BETWEEN 120 AND 150 (because those rows produce the value 1, which is non-NULL, and thus counted), whereas all the other films are not being counted.

Finally, I used a little trick to combine the individual predicates: adding the nullable values yields a non-NULL result only if all of them are non-NULL:

SELECT 
  ...
  count(length + language_id),
  ...
FROM (
  SELECT
    CASE WHEN length BETWEEN 120 AND 150 THEN 1 END length,
    CASE WHEN language_id = 1            THEN 1 END language_id,
    ...
  FROM film
) film

This counts those rows whose length BETWEEN 120 AND 150 and whose language_id = 1, because if either predicate was FALSE, the number would be NULL and thus the sum is NULL as well.
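
If you want to see this NULL propagation and the COUNT(expr) behaviour in isolation, here’s a quick throwaway query (PostgreSQL syntax, not part of the original report):

SELECT
  count(*)     AS all_rows,      -- 3: counts every row
  count(a)     AS a_non_null,    -- 2: counts only rows where a is non-NULL
  count(a + b) AS both_non_null  -- 1: a + b is NULL as soon as either operand is NULL
FROM (VALUES (1, 1), (1, NULL), (NULL, NULL)) AS t(a, b);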

PostgreSQL and HSQLDB variant: FILTER

In PostgreSQL and HSQLDB (and in the SQL standard), there’s a special syntax for this. We can use the FILTER clause instead of encoding values in NULL / non-NULL like this:

SELECT 
  count(*),
  count(*) FILTER (WHERE length IS NOT NULL),
  count(*) FILTER (WHERE language_id IS NOT NULL),
  count(*) FILTER (WHERE rating IS NOT NULL),
  count(*) FILTER (WHERE length + language_id IS NOT NULL),
  count(*) FILTER (WHERE length + rating IS NOT NULL),
  count(*) FILTER (WHERE language_id + rating IS NOT NULL),
  count(*) FILTER (
    WHERE length + language_id + rating IS NOT NULL)
FROM (
  SELECT
    CASE WHEN length BETWEEN 120 AND 150 THEN 1 END length,
    CASE WHEN language_id = 1            THEN 1 END language_id,
    CASE WHEN rating = 'PG'              THEN 1 END rating
  FROM film
) film

Or even, writing out the entire predicates again:

SELECT 
  count(*),
  count(*) FILTER (WHERE length BETWEEN 120 AND 150),
  count(*) FILTER (WHERE language_id = 1),
  count(*) FILTER (WHERE rating = 'PG'),
  count(*) FILTER (
    WHERE length BETWEEN 120 AND 150 AND language_id = 1),
  count(*) FILTER (
    WHERE length BETWEEN 120 AND 150 AND rating = 'PG'),
  count(*) FILTER (
    WHERE language_id = 1 AND rating = 'PG'),
  count(*) FILTER (
    WHERE length BETWEEN 120 AND 150 
    AND language_id = 1 AND rating = 'PG')
FROM film

Usually, the FILTER clause is more convenient, but both approaches are equivalent, and we’re running only a single query!

I also call this “manual PIVOT”, because it really works like a PIVOT table. And the good news is… there is a PIVOT syntax!

A more fancy solution: PIVOT

This solution is vendor-specific and only works in Oracle, and with fewer features in SQL Server. Here’s the Oracle version:

SELECT 
  a + b + c + d + e + f + g + h,
                  e + f + g + h,
          c + d         + g + h,
      b     + d     + f     + h,
                          g + h,
                      f     + h,
              d             + h,
                              h
FROM (
  SELECT
    CASE WHEN length BETWEEN 120 AND 150 
         THEN 1 ELSE 0 END length,
    CASE WHEN language_id = 1            
         THEN 1 ELSE 0 END language_id,
    CASE WHEN rating = 'PG'              
         THEN 1 ELSE 0 END rating
  FROM film
) film
PIVOT (
  count(*) FOR (length, language_id, rating) IN (
    (0, 0, 0) AS a,
    (0, 0, 1) AS b,
    (0, 1, 0) AS c,
    (0, 1, 1) AS d,
    (1, 0, 0) AS e,
    (1, 0, 1) AS f,
    (1, 1, 0) AS g,
    (1, 1, 1) AS h
  )
)

How to read this solution? There are 3 steps:

Step 1: The derived table

As in the previous example, we’re translating the desired predicates for our report into three columns that produce values 1 and 0. That’s understood so I won’t repeat the explanation.

Step 2: The PIVOT clause

The PIVOT clause can be applied to a table expression to “pivot” it, in a similar way to what we know from Microsoft Excel’s powerful pivot tables. It takes three parts:

  • A list of aggregate functions
  • An expression (FOR clause)
  • A list of expected values (IN clause)

The resulting table expression groups the PIVOT’s input table by all the remaining columns (i.e. all the columns that are not part of the FOR clause; in our example, there are none), and computes all the aggregate functions (in our case, only one) for all the values in the IN list.

If we SELECT * from this PIVOT table:

SELECT *
FROM (
  SELECT
    CASE WHEN length BETWEEN 120 AND 150 
         THEN 1 ELSE 0 END length,
    CASE WHEN language_id = 1            
         THEN 1 ELSE 0 END language_id,
    CASE WHEN rating = 'PG'              
         THEN 1 ELSE 0 END rating
  FROM film
) film
PIVOT (
  count(*) FOR (length, language_id, rating) IN (
    (0, 0, 0) AS a,
    (0, 0, 1) AS b,
    (0, 1, 0) AS c,
    (0, 1, 1) AS d,
    (1, 0, 0) AS e,
    (1, 0, 1) AS f,
    (1, 1, 0) AS g,
    (1, 1, 1) AS h
  )
)

… we’ll get these values:

a    b    c    d    e    f    g    h
------------------------------------
0    0  625  151    0    0  181   43

As you can see, the column names are generated from the IN list of expected values and the values contained in these columns are aggregations for the different predicates. These aggregations are not exactly the ones we wanted. For instance, column G is all the films whose length BETWEEN 120 AND 150 and whose language_id = 1 and whose RATING != 'PG'.
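
To double-check a single cell, we can run the corresponding predicates directly. For column G, for instance, something like this should reproduce the pivoted value (assuming there are no NULL ratings in the table, since the CASE expression maps those to 0 as well):

SELECT count(*)
FROM film
WHERE length BETWEEN 120 AND 150
AND language_id = 1
AND rating != 'PG';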

Step 3: Summing the count values

So, in order to get the expected results, we have to sum all the partial counts as such:

SELECT 
  a + b + c + d + e + f + g + h,
                  e + f + g + h,
          c + d         + g + h,
      b     + d     + f     + h,
                          g + h,
                      f     + h,
              d             + h,
                              h
FROM 
  ...

The result is now the same.

A more fancy solution: GROUPING SETS

GROUPING SETS are a SQL standard and they’re supported in at least:

  • DB2
  • HANA
  • Oracle
  • PostgreSQL
  • SQL Server
  • Sybase SQL Anywhere

Simply put, GROUPING SETS allow for grouping a table several times and creating a UNION of all the results. For example, the following two queries are the same, conceptually, although the GROUPING SETS one is usually faster:

-- Grouping once by language_id, then by rating
SELECT language_id, rating, count(*)
FROM film
GROUP BY GROUPING SETS (
  (language_id),
  (rating)
)

-- Grouping first by language_id
SELECT language_id, NULL, count(*)
FROM film
GROUP BY language_id
UNION ALL
SELECT NULL, rating, count(*)
FROM film
GROUP BY rating

Both queries yield:

language_id   rating   count
          1             1000 -- First grouping set / union subquery
              G          178 \
              PG         194  |
              PG-13      223  | Second grouping set / union subquery
              R          195  |
              NC-17      210 /

Clearly, the GROUPING SETS variant is more concise. Let’s imagine, we’d like to add more combinations of grouping columns, e.g.

SELECT language_id, rating, count(*)
FROM film
GROUP BY GROUPING SETS (
  (),
  (language_id),
  (rating),
  (language_id, rating)
)

Now, we’re grouping by all the combinations of columns, and the result is:

language_id   rating   count
                        1000 -- First grouping set: ()
          1             1000 -- Second grouping set: (language_id)
              G          178 \
              PG         194  |
              PG-13      223  | Third grouping set: (rating)
              R          195  |
              NC-17      210 /
          1   G          178 \
          1   PG         194  |
          1   PG-13      223  | Fourth grouping set: (language_id, rating)
          1   R          195  |
          1   NC-17      210 /

Of course, this would all be more impressive if we had more than one language in the system…

So, how do we solve the original problem with GROUPING SETS? Here’s how:

SELECT 
  GROUPING_ID (length, language_id, rating),
  length,
  language_id,
  rating,
  count(*)
FROM (
  SELECT
    CASE WHEN length BETWEEN 120 AND 150 
         THEN 1 ELSE 0 END length,
    CASE WHEN language_id = 1            
         THEN 1 ELSE 0 END language_id,
    CASE WHEN rating = 'PG'              
         THEN 1 ELSE 0 END rating
  FROM film
) film
GROUP BY GROUPING SETS (
  (),
  (length),
  (language_id),
  (rating),
  (length, language_id),
  (length, rating),
  (rating, language_id),
  (length, language_id, rating)
)
HAVING COALESCE (length, 1) != 0 
AND COALESCE (language_id, 1) != 0 
AND COALESCE (rating, 1) != 0
ORDER BY GROUPING_ID (length, language_id, rating) DESC

Wow. How to read this? In 4 steps:

Step 1: Again, the derived table

This time, we’ll encode FALSE as 0, not NULL, because NULL already has a different meaning in GROUPING SETS. It means that for a given GROUPING SET, we didn’t group by that column. We’ll see that in step 3.

Step 2: The GROUPING SETS

In this section, we’re just listing all the possible combinations of GROUP BY columns that we want to use, which produces 8 distinct GROUPING SETS. I’ve already explained this in the previous introduction to GROUPING SETS, so this is no different.

Step 3: Filter out unwanted groupings

Just like in the PIVOT example, we’re also getting results for which the predicates are FALSE, but we don’t want those in the result. So we’re filtering them out in the HAVING clause:

SELECT 
  ...
HAVING COALESCE (length, 1) != 0 
AND COALESCE (language_id, 1) != 0 
AND COALESCE (rating, 1) != 0
...

How to read this? E.g. LENGTH can be any of:

  • 1: The length predicate was TRUE
  • 0: The length predicate was FALSE
  • NULL: The length column is not considered for a given GROUPING SET, e.g. () or (rating, language_id)

So, using COALESCE, we’re making sure that we include only 1 and NULL lengths, not 0 lengths.
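
As a tiny illustration of how the COALESCE trick behaves for the three cases (a throwaway check in PostgreSQL syntax, not part of the solution):

SELECT
  COALESCE(1, 1)    != 0 AS keep_when_true,      -- predicate was TRUE: group is kept
  COALESCE(0, 1)    != 0 AS drop_when_false,     -- predicate was FALSE: group is filtered out
  COALESCE(NULL, 1) != 0 AS keep_when_ungrouped; -- column not part of this grouping set: group is kept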

Step 4: Ordering the results

This is optional, but in order to get the same output order as before, we can use the special GROUPING_ID() (or GROUPING() depending on the DB) function which returns an ID for each GROUPING SET. The output is:

grouping   length   language_id   rating   count
------------------------------------------------
       7     NULL          NULL     NULL    1000
       6     NULL          NULL        1     194
       5     NULL             1     NULL    1000
       4     NULL             1        1     194
       3        1          NULL     NULL     224
       2        1          NULL        1      43
       1        1             1     NULL     224
       0        1             1        1      43

Excellent! And hey, there’s even syntax sugar for “special” GROUPING SETS configurations like ours, where we list all the possible column combinations. In this case, we can use CUBE()!

SELECT 
  GROUPING_ID (length, language_id, rating),
  length,
  language_id,
  rating,
  count(*)
FROM (
  SELECT
    CASE WHEN length BETWEEN 120 AND 150 
         THEN 1 ELSE 0 END length,
    CASE WHEN language_id = 1            
         THEN 1 ELSE 0 END language_id,
    CASE WHEN rating = 'PG'              
         THEN 1 ELSE 0 END rating
  FROM film
) film
GROUP BY CUBE (length, language_id, rating)
HAVING COALESCE(length, 1) != 0 
AND COALESCE(language_id, 1) != 0 
AND COALESCE(rating, 1) != 0
ORDER BY GROUPING_ID (length, language_id, rating) DESC

Performance

Such a comparison blog post wouldn’t be complete without a performance benchmark. This time, I’ll be benchmarking only on Oracle, as PostgreSQL doesn’t support PIVOT and SQL Server’s PIVOT is more limited than Oracle’s.

Here’s the complete benchmark:

SET SERVEROUTPUT ON
DECLARE
  v_ts TIMESTAMP WITH TIME ZONE;
  v_repeat CONSTANT NUMBER := 2000;
BEGIN

  -- Repeat the whole benchmark several times to avoid warmup penalty
  FOR r IN 1..5 LOOP

    -- Individual statements
    v_ts := SYSTIMESTAMP;
      
    FOR i IN 1..v_repeat LOOP
      FOR rec IN (
        SELECT count(*) FROM film
      ) LOOP
        NULL;
      END LOOP;

      FOR rec IN (
        SELECT count(*) FROM film 
        WHERE length BETWEEN 120 AND 150
      ) LOOP
        NULL;
      END LOOP;

      FOR rec IN (
        SELECT count(*) FROM film 
        WHERE language_id = 1
      ) LOOP
        NULL;
      END LOOP;

      FOR rec IN (
        SELECT count(*) FROM film 
        WHERE rating = 'PG'
      ) LOOP
        NULL;
      END LOOP;

      FOR rec IN (
        SELECT count(*) FROM film 
        WHERE length BETWEEN 120 AND 150
        AND language_id = 1
      ) LOOP
        NULL;
      END LOOP;

      FOR rec IN (
        SELECT count(*) FROM film 
        WHERE length BETWEEN 120 AND 150
        AND rating = 'PG'
      ) LOOP
        NULL;
      END LOOP;

      FOR rec IN (
        SELECT count(*) FROM film 
        WHERE language_id = 1
        AND rating = 'PG'
      ) LOOP
        NULL;
      END LOOP;

      FOR rec IN (
        SELECT count(*) FROM film 
        WHERE length BETWEEN 120 AND 150
        AND language_id = 1
        AND rating = 'PG'
      ) LOOP
        NULL;
      END LOOP;
    END LOOP;
      
    dbms_output.put_line('Run ' || r ||', Statement 1 : ' || (SYSTIMESTAMP - v_ts));

    -- Manual PIVOT
    v_ts := SYSTIMESTAMP;
      
    FOR i IN 1..v_repeat LOOP
      FOR rec IN (
        SELECT
          count(*),
          count(length),
          count(language_id),
          count(rating),
          count(length + language_id),
          count(length + rating),
          count(language_id + rating),
          count(length + language_id + rating)
        FROM (
          SELECT
            CASE WHEN length BETWEEN 120 AND 150 THEN 1 END length,
            CASE WHEN language_id = 1            THEN 1 END language_id,
            CASE WHEN rating = 'PG'              THEN 1 END rating
          FROM film
        ) film
      ) LOOP
        NULL;
      END LOOP;
    END LOOP;
      
    dbms_output.put_line('Run ' || r ||', Statement 2 : ' || (SYSTIMESTAMP - v_ts));
    
    -- PIVOT
    v_ts := SYSTIMESTAMP;
      
    FOR i IN 1..v_repeat LOOP
      FOR rec IN (
        SELECT 
          a + b + c + d + e + f + g + h,
                          e + f + g + h,
                  c + d         + g + h,
              b     + d     + f     + h,
                                  g + h,
                              f     + h,
                      d             + h,
                                      h
        FROM (
          SELECT
            CASE WHEN length BETWEEN 120 AND 150 THEN 1 ELSE 0 END length,
            CASE WHEN language_id = 1            THEN 1 ELSE 0 END language_id,
            CASE WHEN rating = 'PG'              THEN 1 ELSE 0 END rating
          FROM film
        ) film
        PIVOT (
          count(*) FOR (length, language_id, rating) IN (
            (0, 0, 0) AS a,
            (0, 0, 1) AS b,
            (0, 1, 0) AS c,
            (0, 1, 1) AS d,
            (1, 0, 0) AS e,
            (1, 0, 1) AS f,
            (1, 1, 0) AS g,
            (1, 1, 1) AS h
          )
        )
      ) LOOP
        NULL;
      END LOOP;
    END LOOP;
      
    dbms_output.put_line('Run ' || r ||', Statement 3 : ' || (SYSTIMESTAMP - v_ts));

    -- GROUPING SETS
    v_ts := SYSTIMESTAMP;
      
    FOR i IN 1..v_repeat LOOP
      FOR rec IN (
        SELECT 
          GROUPING_ID (length, language_id, rating),
          length,
          language_id,
          rating,
          count(*)
        FROM (
          SELECT
            CASE WHEN length BETWEEN 120 AND 150 THEN 1 ELSE 0 END length,
            CASE WHEN language_id = 1            THEN 1 ELSE 0 END language_id,
            CASE WHEN rating = 'PG'              THEN 1 ELSE 0 END rating
          FROM film
        ) film
        GROUP BY CUBE (length, language_id, rating)
        HAVING COALESCE (length, 1) != 0 
        AND COALESCE (language_id, 1) != 0 
        AND COALESCE (rating, 1) != 0
        ORDER BY GROUPING_ID (length, language_id, rating) DESC
      ) LOOP
        NULL;
      END LOOP;
    END LOOP;
      
    dbms_output.put_line('Run ' || r ||', Statement 4 : ' || (SYSTIMESTAMP - v_ts));
  END LOOP;
END;
/

And the results:

Run 1, Statement 1 : +000000000 00:00:01.928497000
Run 1, Statement 2 : +000000000 00:00:01.136341000
Run 1, Statement 3 : +000000000 00:00:02.751679000
Run 1, Statement 4 : +000000000 00:00:00.797529000

Run 2, Statement 1 : +000000000 00:00:01.695543000
Run 2, Statement 2 : +000000000 00:00:01.004073000
Run 2, Statement 3 : +000000000 00:00:02.490895000
Run 2, Statement 4 : +000000000 00:00:00.838979000

Run 3, Statement 1 : +000000000 00:00:01.634047000
Run 3, Statement 2 : +000000000 00:00:01.016266000
Run 3, Statement 3 : +000000000 00:00:02.566895000
Run 3, Statement 4 : +000000000 00:00:00.790159000

Run 4, Statement 1 : +000000000 00:00:01.669844000
Run 4, Statement 2 : +000000000 00:00:01.015502000
Run 4, Statement 3 : +000000000 00:00:02.574646000
Run 4, Statement 4 : +000000000 00:00:00.807804000

Run 5, Statement 1 : +000000000 00:00:01.653498000
Run 5, Statement 2 : +000000000 00:00:00.980375000
Run 5, Statement 3 : +000000000 00:00:02.556186000
Run 5, Statement 4 : +000000000 00:00:00.890283000

Very disappointingly, the PIVOT solution is the slowest every time. I’m assuming there’s some substantial temporary object overhead which wouldn’t be as severe if the table were much larger, but clearly, the manual PIVOT solution (COUNT(CASE ...)) and the GROUPING SETS solution heavily outperform the initial attempt, where we calculate 8 counts individually.

To get back to the original report where 32 counts were calculated: the report ran roughly 20x as fast with the manual PIVOT approach on 200M rows. And imagine if you also need to JOIN: you definitely want to avoid those 32 individual queries and calculate everything in one go.

Cheers!

SQL IN Predicate: With IN List or With Array? Which is Faster?

Hah! Got nerd-sniped again:

http://stackoverflow.com/questions/43099226/how-to-make-jooq-to-use-arrays-in-the-in-clause/43102102

A jOOQ user was wondering why jOOQ would generate an IN list for a predicate like this:

Java

COLUMN.in(1, 2, 3, 4)

SQL

COLUMN in (?, ?, ?, ?)

… when in fact there could have been the following predicate being generated, instead:

COLUMN = any(?::int[])

In the second case, there would have been only one single bind variable instead of 4, and the SQL generation and parsing work would have been “much” less (maybe not for the IN list of size 4, but let’s imagine a list of 50 values).

A disclaimer

First off, a disclaimer: In databases that have a cursor cache / plan cache (e.g. Oracle or SQL Server), you should be careful with long IN lists, because they will probably trigger a hard parse every time you run them, as by the time you run the exact same predicate (with 371 elements in the list) again, the execution plan will have been purged from the cache. So, you cannot really profit from the cache.

I’m aware of this problem, and it will be topic of another blog post, soon. Let’s stick to PostgreSQL whose “plan cache” isn’t really that sophisticated.

Measure, don’t guess

The question was about improving the speed of parsing a SQL statement. Parsers are really fast, so parsing shouldn’t be a problem. Generating an execution plan certainly does cost more time, but again, since PostgreSQL’s plan cache isn’t very sophisticated, this won’t play into the issue here. So the question is really:

Is an IN list really that bad in PostgreSQL?

Would an array bind variable be much better?

Since our recent post about benchmarking, we now know that we shall never guess, but always measure. I’m using again the Sakila database to run these two queries:

-- IN list
SELECT * 
FROM film 
JOIN film_actor USING (film_id) 
JOIN actor USING (actor_id) 
WHERE film_id IN (?, ?, ?, ?)

-- Array
SELECT * 
FROM film 
JOIN film_actor USING (film_id) 
JOIN actor USING (actor_id) 
WHERE film_id = ANY(?)
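
When experimenting interactively, i.e. without bind variables, the array variant can also be fed an array literal, for instance (PostgreSQL syntax, the values matching the benchmark below):

SELECT * 
FROM film 
JOIN film_actor USING (film_id) 
JOIN actor USING (actor_id) 
WHERE film_id = ANY(ARRAY[1, 2, 4, 8]);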

Let’s try lists of length 4, first. The benchmark is here:

DO $$
DECLARE
  v_ts TIMESTAMP;
  v_repeat CONSTANT INT := 1000;
  rec RECORD;
  v_e1 INT := 1;
  v_e2 INT := 2;
  v_e3 INT := 4;
  v_e4 INT := 8;
  v_any_arr INT[] := ARRAY[v_e1, v_e2, v_e3, v_e4];
BEGIN
  FOR r IN 1..5 LOOP
    v_ts := clock_timestamp();

    FOR i IN 1..v_repeat LOOP
      FOR rec IN (
        SELECT * 
        FROM film 
        JOIN film_actor USING (film_id) 
        JOIN actor USING (actor_id) 
        WHERE film_id IN (v_e1, v_e2, v_e3, v_e4)
      ) LOOP
        NULL;
      END LOOP;
    END LOOP;

    RAISE INFO 'Run %, Statement 1: %', 
      r, (clock_timestamp() - v_ts); 
    v_ts := clock_timestamp();

    FOR i IN 1..v_repeat LOOP
      FOR rec IN (
        SELECT * 
        FROM film 
        JOIN film_actor USING (film_id) 
        JOIN actor USING (actor_id) 
        WHERE film_id = ANY(v_any_arr)
      ) LOOP
        NULL;
      END LOOP;
    END LOOP;

    RAISE INFO 'Run %, Statement 2: %', 
      r, (clock_timestamp() - v_ts); 
  END LOOP;
END$$;

The result being:

INFO:  Run 1, Statement 1: 00:00:00.112195
INFO:  Run 1, Statement 2: 00:00:00.450461
INFO:  Run 2, Statement 1: 00:00:00.109792
INFO:  Run 2, Statement 2: 00:00:00.446518
INFO:  Run 3, Statement 1: 00:00:00.105413
INFO:  Run 3, Statement 2: 00:00:00.44298
INFO:  Run 4, Statement 1: 00:00:00.108249
INFO:  Run 4, Statement 2: 00:00:00.476527
INFO:  Run 5, Statement 1: 00:00:00.120229
INFO:  Run 5, Statement 2: 00:00:00.448214

Interesting. So, the IN list outperforms the array bind variable every time by a factor of 4 (which is the size of the array / list!) So, let’s try 8 values, then. Here are the values and the adapted query 1:

-- values
  v_e1 INT := 1;
  v_e2 INT := 2;
  v_e3 INT := 4;
  v_e4 INT := 8;
  v_e5 INT := 16;
  v_e6 INT := 32;
  v_e7 INT := 64;
  v_e8 INT := 128;
  v_any_arr INT[] := ARRAY[v_e1, v_e2, v_e3, v_e4, v_e5, v_e6, v_e7, v_e8];

-- adapted query 1 ...
        WHERE film_id IN (v_e1, v_e2, v_e3, v_e4, v_e5, v_e6, v_e7, v_e8)
-- ...

The result is still impressive:

INFO:  Run 1, Statement 1: 00:00:00.182646
INFO:  Run 1, Statement 2: 00:00:00.63624
INFO:  Run 2, Statement 1: 00:00:00.184814
INFO:  Run 2, Statement 2: 00:00:00.685976
INFO:  Run 3, Statement 1: 00:00:00.188108
INFO:  Run 3, Statement 2: 00:00:00.634903
INFO:  Run 4, Statement 1: 00:00:00.184933
INFO:  Run 4, Statement 2: 00:00:00.626616
INFO:  Run 5, Statement 1: 00:00:00.185879
INFO:  Run 5, Statement 2: 00:00:00.636723

The IN list query now takes almost 2x as long (but not quite 2x), whereas the array query now takes around 1.5x as long. It looks as though arrays become the better choice when their size increases. So, let’s do this! With 32 bind variables in the IN list, or 32 array elements respectively:

INFO:  Run 1, Statement 1: 00:00:00.905064
INFO:  Run 1, Statement 2: 00:00:00.752819
INFO:  Run 2, Statement 1: 00:00:00.760475
INFO:  Run 2, Statement 2: 00:00:00.758247
INFO:  Run 3, Statement 1: 00:00:00.777667
INFO:  Run 3, Statement 2: 00:00:00.895875
INFO:  Run 4, Statement 1: 00:00:01.308167
INFO:  Run 4, Statement 2: 00:00:00.789537
INFO:  Run 5, Statement 1: 00:00:00.788606
INFO:  Run 5, Statement 2: 00:00:00.776159

Both are about equally fast. 64 bind values!

INFO:  Run 1, Statement 1: 00:00:00.915069
INFO:  Run 1, Statement 2: 00:00:01.058966
INFO:  Run 2, Statement 1: 00:00:00.951488
INFO:  Run 2, Statement 2: 00:00:00.906285
INFO:  Run 3, Statement 1: 00:00:00.907489
INFO:  Run 3, Statement 2: 00:00:00.892393
INFO:  Run 4, Statement 1: 00:00:00.900424
INFO:  Run 4, Statement 2: 00:00:00.903447
INFO:  Run 5, Statement 1: 00:00:00.961805
INFO:  Run 5, Statement 2: 00:00:00.951697

Still about the same. OK… INTERN! Get over here. I need you to “generate” 128 bind values on this query.

Yep, as expected. Finally, arrays start to outperform IN lists:

INFO:  Run 1, Statement 1: 00:00:01.122866
INFO:  Run 1, Statement 2: 00:00:01.083816
INFO:  Run 2, Statement 1: 00:00:01.416469
INFO:  Run 2, Statement 2: 00:00:01.134882
INFO:  Run 3, Statement 1: 00:00:01.122723
INFO:  Run 3, Statement 2: 00:00:01.087755
INFO:  Run 4, Statement 1: 00:00:01.143148
INFO:  Run 4, Statement 2: 00:00:01.124902
INFO:  Run 5, Statement 1: 00:00:01.236722
INFO:  Run 5, Statement 2: 00:00:01.113741

Using Oracle

Oracle also has array types (although you have to declare them as nominal types first, but that’s not a problem here).
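
I won’t reproduce the entire Oracle benchmark here, but the nominal type and the array-style predicate might look something like this (the type name and the exact shape of the predicate are assumptions for illustration, not necessarily what my benchmark used):

CREATE TYPE numbers_t AS TABLE OF NUMBER;
/

SELECT *
FROM film
JOIN film_actor USING (film_id)
JOIN actor USING (actor_id)
WHERE film_id IN (
  -- TABLE() unnests the collection; COLUMN_VALUE is the implicit column name
  SELECT column_value
  FROM TABLE(numbers_t(1, 2, 4, 8))
);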

Here are some benchmark results (as always, not actual benchmark results, but anonymised units of measurement. I.e. these aren’t seconds but… Larrys):

4 bind values

Run 1, Statement 1 : 01.911000000
Run 1, Statement 2 : 02.852000000
Run 2, Statement 1 : 01.659000000
Run 2, Statement 2 : 02.680000000
Run 3, Statement 1 : 01.628000000
Run 3, Statement 2 : 02.664000000
Run 4, Statement 1 : 01.629000000
Run 4, Statement 2 : 02.657000000
Run 5, Statement 1 : 01.636000000
Run 5, Statement 2 : 02.688000000

128 bind values

Run 1, Statement 1 : 04.010000000
Run 1, Statement 2 : 06.275000000
Run 2, Statement 1 : 03.749000000
Run 2, Statement 2 : 05.440000000
Run 3, Statement 1 : 03.985000000
Run 3, Statement 2 : 05.387000000
Run 4, Statement 1 : 03.807000000
Run 4, Statement 2 : 05.688000000
Run 5, Statement 1 : 03.782000000
Run 5, Statement 2 : 05.803000000

The number of bind values doesn’t really seem to matter. There’s always a constant overhead of using the array bind variable compared to the IN list, but that might as well be a benchmarking error. For instance, when I added the /*+GATHER_PLAN_STATISTICS*/ hint to both queries, interestingly, the one with the array got significantly faster, whereas the IN list one was not affected… Weird?

Conclusion

This article doesn’t go into why there’s such a big difference for small lists when the benefit is only apparent for quite large lists.

But it has once again shown that we must not optimise prematurely in SQL, but measure, measure, measure things. IN lists in dynamic SQL queries can be a big issue in production when they lead to cursor cache / plan cache saturation and a lot of “hard parsing”. So, the benefit of using the array is much more drastic when the content is big, as we can recycle execution plans much more often than with IN lists.

But chances are that IN lists may be faster for single executions.

In any case: choose carefully when following advice that you find somewhere on the Internet, including this advice. I ran the benchmark on PostgreSQL 9.5 and Oracle 11gR2 XE. Neither is the latest database version. Try to measure things again on your side, to be sure that your “improvement” is really an actual improvement! And if in doubt, don’t optimise until you’re sure you actually have a problem.

How to Benchmark Alternative SQL Queries to Find the Fastest Query

Tuning SQL isn’t always easy, and it takes a lot of practice to recognise how any given query can be optimised. One of the most important slides of my SQL training is the one summarising “how to be fast”:

How to be fast with SQL. Find out with the Data Geekery SQL Training

Some of these bullets were already covered on this blog. For instance, avoiding needless, mandatory work: when client code runs queries or parts of queries that aren’t really necessary (e.g. selecting too many columns: “needless”), but the database cannot prove they’re needless, they become “mandatory” for the database to execute.

But as with many other performance related topics, one key message is not to guess, but to measure! Or, in other words, not to optimise prematurely, but to optimise actual problems.

SQL is full of myths

SQL is a 4GL (Fourth-generation programming language) and as such, has always been a cool, convenient way to express data related constraints and queries. But the declarative nature of the language also often meant that programmers are really looking into a crystal ball. A lot of people have blogged about a lot of half-true discoveries that might have been correct in some context and at some point of time (this blog is no exception).

For instance:

  • Are correlated subqueries slower than their LEFT JOIN equivalents?
  • Are derived tables faster than views or common table expressions?
  • Is COUNT(*) faster than COUNT(1)?

Tons of myths!

Measure your queries

To bust a myth, if you have good reasons to think that a differently written, but semantically equivalent query might be faster (on your database), you should measure. Don’t even trust any execution plan, because ultimately, what really counts is the wall clock time in your production system.

If you can measure your queries in production, that’s perfect. But often, you cannot – but you don’t always have to. One way to compare two queries with each other is to benchmark them by executing each query hundreds or even thousands of times in a row.

As any technique, benchmarking has pros and cons. Here is a non-exhaustive list:

Pros

  • Easy to do (see examples below)
  • Easy to reproduce, also on different environments
  • Easy to quickly get an idea in terms of orders of magnitude difference

Cons

  • Not actually measuring productive situations (no one runs the same query thousands of times in a row, without any other queries in parallel)
  • Queries may profit from unrealistic caching due to heavy repetition
  • “Real query” might be dynamic, so the “same query” might really manifest itself in dozens of different productive queries

But if you’re fine with the cons above, the pros might outweigh, for instance, if you want to find out whether a correlated subquery is slower than its LEFT JOIN equivalent for a given query. Note my using italics here, because even if you find out it’s slower for that given query it might be faster for other queries. Never jump to generalised rules before measuring again! (More info and scripts about benchmarks here)

For instance, consider these two equivalent queries that run on the Sakila database. Both versions try to find those actors whose last name starts with the letter A and counts their corresponding films:

LEFT JOIN

SELECT first_name, last_name, count(fa.actor_id) AS c
FROM actor a
LEFT JOIN film_actor fa
ON a.actor_id = fa.actor_id
WHERE last_name LIKE 'A%'
GROUP BY a.actor_id, first_name, last_name
ORDER BY c DESC

Correlated subquery

SELECT first_name, last_name, (
  SELECT count(*)
  FROM film_actor fa
  WHERE a.actor_id = fa.actor_id
) AS c
FROM actor a
WHERE last_name LIKE 'A%' 
ORDER BY c DESC

The result is always the same for both queries.

The queries have different execution plans on PostgreSQL, Oracle, SQL Server as can be seen below:

PostgreSQL LEFT JOIN

(Plan looks “better”)

PostgreSQL correlated subquery

(Plan looks “worse”)

Oracle LEFT JOIN

(Plan looks “more complicated”)

Oracle correlated subquery

(Plan looks “simpler”)

SQL Server LEFT JOIN

(Plan looks “reasonable”)

SQL Server correlated subquery

(Plan looks… geez, where’s my correlated subquery? It’s been transformed to a LEFT JOIN!)

Huh, as you can see, in SQL Server, both queries produce the exact same plan (as they should, because the queries are really equivalent). But not all databases recognise this and/or optimise this. At least, that’s what the estimated plans suggest.
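
If you want to look at those estimated plans yourself, obtaining them is cheap. Here is a minimal sketch for PostgreSQL (assuming the Sakila schema is installed; Oracle offers EXPLAIN PLAN FOR together with DBMS_XPLAN, and SQL Server Management Studio can display an estimated plan for any query):

-- Estimated plan only, the query is not executed
EXPLAIN
SELECT first_name, last_name, count(fa.actor_id) AS c
FROM actor a
LEFT JOIN film_actor fa
ON a.actor_id = fa.actor_id
WHERE last_name LIKE 'A%'
GROUP BY a.actor_id, first_name, last_name
ORDER BY c DESC;

-- Actual plan including run time statistics, the query is executed
EXPLAIN ANALYZE
SELECT first_name, last_name, (
  SELECT count(*)
  FROM film_actor fa
  WHERE a.actor_id = fa.actor_id
) AS c
FROM actor a
WHERE last_name LIKE 'A%'
ORDER BY c DESC;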

Also, don’t jump to the conclusion that if the cost of one plan is lower then it’s a better plan than an alternative. Costs can only really be compared when comparing alternative plans for the same query, e.g. in the Oracle example, we had both HASH JOIN and NESTED LOOP JOIN in a single plan, because Oracle 12c may collect runtime statistics and switch plans in flight thanks to the Oracle 12c Adaptive Query Optimization features.

But let’s ignore all of this and look at actual execution times, instead:

Benchmarking the alternatives

As always, disclaimer: Some commercial databases do not allow for publishing benchmark results without prior written consent. As I never ask for permission, but always ask for forgiveness, I do not have consent, and I’m thus not publishing actual benchmark results.

I have anonymized the benchmark results by introducing hypothetical, non-comparable units of measurement, so you cannot see that PostgreSQL is totally slower than Oracle and/or SQL Server. And you cannot see that SQL Server’s procedural language is totally uglier than PostgreSQL’s and/or Oracle’s.

Legal people.

Solving problems we wouldn’t have without legal people, in the first place

Enough ranting. Some important considerations:

  • Ideally, you’ll run benchmarks directly in the database using a procedural language, rather than e.g. over JDBC, to avoid the network latency incurred by JDBC calls, and other undesired side-effects.
  • Repeat the benchmarks several times to account for warmup effects and other random noise, as your OS / file system may be busy with accidental Scala compilation, or Slack UI refreshes
  • Be sure to actually consume the entire result set of each query in a loop, rather than just executing the query. Some databases may optimise for lazy cursor consumption (and possibly early cursor abortion). It would be unfair not to consume the entire result set

PostgreSQL

DO $$
DECLARE
  v_ts TIMESTAMP;
  v_repeat CONSTANT INT := 10000;
  rec RECORD;
BEGIN

  -- Repeat the whole benchmark several times to avoid warmup penalty
  FOR r IN 1..5 LOOP
    v_ts := clock_timestamp();

    FOR i IN 1..v_repeat LOOP
      FOR rec IN (
        SELECT first_name, last_name, count(fa.actor_id) AS c
        FROM actor a
        LEFT JOIN film_actor fa
        ON a.actor_id = fa.actor_id
        WHERE last_name LIKE 'A%'
        GROUP BY a.actor_id, first_name, last_name
        ORDER BY c DESC
      ) LOOP
        NULL;
      END LOOP;
    END LOOP;

    RAISE INFO 'Run %, Statement 1: %', r, (clock_timestamp() - v_ts);
    v_ts := clock_timestamp();

    FOR i IN 1..v_repeat LOOP
      FOR rec IN (
        SELECT first_name, last_name, (
          SELECT count(*)
          FROM film_actor fa
          WHERE a.actor_id = fa.actor_id
        ) AS c
        FROM actor a
        WHERE last_name LIKE 'A%' 
        ORDER BY c DESC
      ) LOOP
        NULL;
      END LOOP;
    END LOOP;

    RAISE INFO 'Run %, Statement 2: %', r, (clock_timestamp() - v_ts);
  END LOOP;
END$$;

The result is:

INFO:  Run 1, Statement 1: 00:00:01.708257
INFO:  Run 1, Statement 2: 00:00:01.252012
INFO:  Run 2, Statement 1: 00:00:02.33151  -- Slack message received here
INFO:  Run 2, Statement 2: 00:00:01.064007
INFO:  Run 3, Statement 1: 00:00:01.638518
INFO:  Run 3, Statement 2: 00:00:01.149005
INFO:  Run 4, Statement 1: 00:00:01.670045
INFO:  Run 4, Statement 2: 00:00:01.230755
INFO:  Run 5, Statement 1: 00:00:01.81718
INFO:  Run 5, Statement 2: 00:00:01.166089

As you can see, in all 5 benchmark runs, the version with the correlated subquery outperformed the version with the LEFT JOIN in this case: ignoring the Slack-interrupted run 2, the LEFT JOIN version took roughly 1.4 – 1.6 times as long. As this is PostgreSQL and open source, benchmark results are in actual seconds for 10000 query executions. Neat. Let’s move on to…

Oracle

SET SERVEROUTPUT ON
DECLARE
  v_ts TIMESTAMP WITH TIME ZONE;
  v_repeat CONSTANT NUMBER := 10000;
BEGIN

  -- Repeat the whole benchmark several times to avoid warmup penalty
  FOR r IN 1..5 LOOP
    v_ts := SYSTIMESTAMP;
      
    FOR i IN 1..v_repeat LOOP
      FOR rec IN (
        SELECT first_name, last_name, count(fa.actor_id) AS c
        FROM actor a
        LEFT JOIN film_actor fa
        ON a.actor_id = fa.actor_id
        WHERE last_name LIKE 'A%'
        GROUP BY a.actor_id, first_name, last_name
        ORDER BY c DESC
      ) LOOP
        NULL;
      END LOOP;
    END LOOP;
      
    dbms_output.put_line('Run ' || r || ', Statement 1 : ' || (SYSTIMESTAMP - v_ts));
    v_ts := SYSTIMESTAMP;
      
    FOR i IN 1..v_repeat LOOP
      FOR rec IN (
        SELECT first_name, last_name, (
          SELECT count(*)
          FROM film_actor fa
          WHERE a.actor_id = fa.actor_id
        ) AS c
        FROM actor a
        WHERE last_name LIKE 'A%' 
        ORDER BY c DESC
      ) LOOP
        NULL;
      END LOOP;
    END LOOP;
      
    dbms_output.put_line('Run ' || r || ', Statement 2 : ' || (SYSTIMESTAMP - v_ts));
  END LOOP;
END;
/

Gee, check out the difference now (and remember, these are totally not seconds, but a hypothetical unit of measurement, let’s call them Newtons. Or Larrys. Let’s call them Larrys (great idea, Axel)):

Run 1, Statement 1 : 07.721731000
Run 1, Statement 2 : 00.622992000
Run 2, Statement 1 : 08.077535000
Run 2, Statement 2 : 00.666481000
Run 3, Statement 1 : 07.756182000
Run 3, Statement 2 : 00.640541000
Run 4, Statement 1 : 07.495021000
Run 4, Statement 2 : 00.731321000
Run 5, Statement 1 : 07.809564000
Run 5, Statement 2 : 00.632615000

Wow, the correlated subquery totally outperformed the LEFT JOIN query by an order of magnitude. This is totally insane. Now, check out…

SQL Server

… beautiful procedural language in SQL Server: Transact-SQL. With nice features like:

  • Needing to cast INT values to VARCHAR when concatenating them (but see the aside after this list)
  • No indexed FOR loop, only WHILE loops
  • No implicit cursor loops (instead: DEALLOCATE!)
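
As a hedged aside: on SQL Server 2012 and later, CONCAT() performs these conversions implicitly, so the casting noise could be avoided, roughly like this (using the @r and @ts variables declared in the benchmark below):

-- CONCAT() implicitly converts its non-string arguments (SQL Server 2012+)
PRINT CONCAT('Run ', @r, ', Statement 1: ', DATEDIFF(ms, @ts, current_timestamp), 'ms');

The benchmark below sticks to CAST, which works on older versions, too.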

Oh well. It’s just for a benchmark. So here goes:

DECLARE @ts DATETIME;
DECLARE @repeat INT = 10000;
DECLARE @r INT;
DECLARE @i INT;
DECLARE @dummy1 VARCHAR;
DECLARE @dummy2 VARCHAR;
DECLARE @dummy3 INT;

DECLARE @s1 CURSOR;
DECLARE @s2 CURSOR;

SET @r = 0;
WHILE @r < 5
BEGIN
  SET @r = @r + 1

  SET @s1 = CURSOR FOR 
    SELECT first_name, last_name, count(fa.actor_id) AS c
    FROM actor a
    LEFT JOIN film_actor fa
    ON a.actor_id = fa.actor_id
    WHERE last_name LIKE 'A%'
    GROUP BY a.actor_id, first_name, last_name
    ORDER BY c DESC

  SET @s2 = CURSOR FOR 
    SELECT first_name, last_name, (
      SELECT count(*)
      FROM film_actor fa
      WHERE a.actor_id = fa.actor_id
    ) AS c
    FROM actor a
    WHERE last_name LIKE 'A%' 
    ORDER BY c DESC

  SET @ts = current_timestamp;
  SET @i = 0;
  WHILE @i < @repeat
  BEGIN
    SET @i = @i + 1

    OPEN @s1;
    FETCH NEXT FROM @s1 INTO @dummy1, @dummy2, @dummy3;
    WHILE @@FETCH_STATUS = 0
    BEGIN
      FETCH NEXT FROM @s1 INTO @dummy1, @dummy2, @dummy3;
    END;

    CLOSE @s1;
  END;

  DEALLOCATE @s1;
  PRINT 'Run ' + CAST(@r AS VARCHAR) + ', Statement 1: ' + CAST(DATEDIFF(ms, @ts, current_timestamp) AS VARCHAR) + 'ms';

  SET @ts = current_timestamp;
  SET @i = 0;
  WHILE @i < @repeat
  BEGIN
    SET @i = @i + 1

    OPEN @s2;
    FETCH NEXT FROM @s2 INTO @dummy1, @dummy2, @dummy3;
    WHILE @@FETCH_STATUS = 0
    BEGIN
      FETCH NEXT FROM @s2 INTO @dummy1, @dummy2, @dummy3;
    END;

    CLOSE @s2;
  END;

  DEALLOCATE @s2;
  PRINT 'Run ' + CAST(@r AS VARCHAR) + ', Statement 2: ' + CAST(DATEDIFF(ms, @ts, current_timestamp) AS VARCHAR) + 'ms';
END;

And again, remember, these aren’t seconds. Really. They’re … Kilowatts. Yeah, let’s settle with kilowatts.

Run 1, Statement 1:  2626
Run 1, Statement 2: 20340
Run 2, Statement 1:  2450
Run 2, Statement 2: 17910
Run 3, Statement 1:  2706
Run 3, Statement 2: 18396
Run 4, Statement 1:  2696
Run 4, Statement 2: 19103
Run 5, Statement 1:  2716
Run 5, Statement 2: 20453

Oh my… Wait a second. Now suddenly, the correlated subquery is roughly a factor of 7… more energy consuming (remember: kilowatts). Who would have thought?

Conclusion

This article won’t explain the differences in execution time between the different databases. There are a lot of reasons why a given execution plan will outperform another. There are also a lot of reasons why the same plan (or at least what looks like the same plan) really isn’t the same, because a plan is only a description of an algorithm, and each plan operation can still contain other operations that might differ.

In summary, we can say that in this case (I can’t stress this enough: this isn’t a general rule, it only explains what happens in this case. Don’t create the next SQL myth!), the correlated subquery and the LEFT JOIN performed in the same order of magnitude on PostgreSQL (the subquery being a bit faster), the correlated subquery drastically outperformed the LEFT JOIN in Oracle, whereas the LEFT JOIN drastically outperformed the correlated subquery in SQL Server (despite the estimated plans having been the same!)

This means:

  • Don’t trust your initial judgment
  • Don’t trust any historic blog posts saying A) is faster than B)
  • Don’t trust execution plans
  • Don’t trust this blog post here, because it is using non-comparable units of measurement (seconds vs. Larrys vs. kilowatts)
  • Don’t fully trust your own benchmarks, because you’re not measuring things as they happen in production

And sadly:

  • Even for such a simple query, there’s no optimal query for all databases

(and I haven’t even included MySQL in the benchmarks)

BUT

by measuring two alternative, equivalent queries, you may just get an idea of what might perform better on your system, in case you do have a slow query somewhere. Perhaps this helps.

And now that you’re all hot on the subject, go book our 2 day SQL training, where we have tons of other interesting, myth busting content!