Projects and presto SQL query formatter

4 minute read

Yay, I have added the Projects section to this blog. What is more, it contains the very first project - Presto SQL files formatter.

This section is going to collect all the utility tools which can become handy in regards to work with Presto. It is going to have things like:

  • utility tools work with Presto, like this fresh sql query formatter. Their aim is to help you work with Presto.

  • example implementations of things which can be injected to Presto through plugin mechanism (SPI) like: connector, ...

    Read more...

Unnecessary cross join elimination

5 minute read

Cross join has a bad reputation. It is not that nobody likes it all the time. For example It is OK to use it from time to time. There are even some queries where there is no other way. All of it is totally acceptable, and nobody would complain if it would be only like that. However, cross join has a habit to occur at the least appropriate moment. And once it comes, nothing remains the same. Query usually becomes an order of magnitude slower, and this not something any of you would like to dream of.

Since the 0.162 and 0.157t (Teradata) version of Presto, there is a feature called unnecessary cross join elimination. Si...

Read more...

SQL-on-anything - talk at Warsaw Hadoop User Group

1 minute read

Just in case any of you were close to Warsaw (Poland) and you do not have any plans for Tuesday (January 24) evening. There will be Presto related talk at Warsaw Hadoop User Group (WHUG) meetup. Go to WHUG website to find more information.

One of the key differences between Presto and Hive, also a crucial functional requirement Facebook made when launching this new SQL engine project, was to have the opportunity to query different kinds of data sources via a uniform ANSI SQL interface. Presto, an open source distributed analytical SQL engi...

Read more...

The fundamentals: join algorithms

9 minute read

In previous post I explained how join works from the user point of view. Now it is the right time to go one step deeper and learn how things are actually calculated. This a very broad topic, so today we are going to just touch upon every join algorithms used in Presto. To understand what they do and when they are used

You know already that on one axis join can be INNER, OUTER, SEMI etc. Join execution is an independent axis. It means that t...

Read more...

2016 Q4 Teradata Presto relase - 157t

1 minute read

Recently Teradata released the new version of Presto - 157t (Teradata’s certified version of open source Presto). It is based on 0.157 regular version of Presto and contains features which are not yet available there.

Read more...