One can say that Presto is an MPP (Massively Parallel Processing) application. In fact, I have never seen a data warehouse that did not follow this approach. Teradata, Netezza, Vertica, even Hive, and many more all belong to this class of software. The approach is typical not only of data warehouses, but of any distributed application that processes vast amounts of data and performs non-trivial, very costly computation on it.
This section is going to collect all the utility tools that can come in handy when working with Presto. It is going to have things like:
utility tools that work with Presto, like this fresh SQL query formatter. Their aim is to help you work with Presto.
example implementations of things that can be injected into Presto through the plugin mechanism (SPI), like: connector, ...
Cross join has a bad reputation. It is not that nobody ever likes it. It is OK to use it from time to time, and there are even some queries where there is no other way. All of that is totally acceptable, and nobody would complain if that were all. However, cross join has a habit of showing up at the least appropriate moment. And once it does, nothing remains the same. The query usually becomes an order of magnitude slower, and this is not something any of you would dream of.
Since versions 0.162 and 0.157t (Teradata) of Presto, there is a feature called unnecessary cross join elimination. Si...
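To make the idea of eliminating an unnecessary cross join concrete, here is a minimal Python sketch. It is not Presto's implementation; the function name `order_without_cross_joins` and the representation of a query as table names plus pairs of joined tables are assumptions made for illustration. Given `FROM a, b, c WHERE a.x = c.x AND b.y = c.y`, joining in the written order would cross join `a` with `b`; reordering so that every newly added table shares a condition with an already joined one avoids that.

```python
def order_without_cross_joins(tables, join_conditions):
    """Reorder tables so each newly joined table shares a join condition
    with a table that has already been joined.

    Hypothetical helper for illustration only. Returns the original order
    unchanged when the join graph is not connected, because in that case a
    cross join cannot be avoided by reordering.
    """
    # Build an undirected graph: which tables are linked by a join condition.
    neighbours = {table: set() for table in tables}
    for left, right in join_conditions:
        neighbours[left].add(right)
        neighbours[right].add(left)

    ordered = [tables[0]]
    remaining = list(tables[1:])
    while remaining:
        # Pick the first remaining table connected to the tables joined so far.
        candidate = next(
            (t for t in remaining if neighbours[t] & set(ordered)), None
        )
        if candidate is None:
            return list(tables)
        ordered.append(candidate)
        remaining.remove(candidate)
    return ordered


# FROM a, b, c WHERE a.x = c.x AND b.y = c.y
new_order = order_without_cross_joins(["a", "b", "c"], [("a", "c"), ("b", "c")])
print(new_order)
```

The sketch keeps the original order whenever the tables cannot all be connected through join conditions, since such a query genuinely needs a cross join.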
Just in case any of you are near Warsaw (Poland) and do not have any plans for Tuesday evening (January 24): there will be a Presto-related talk at the Warsaw Hadoop User Group (WHUG) meetup. Go to the WHUG website to find more information.
One of the key differences between Presto and Hive, and a crucial functional requirement Facebook set when launching this new SQL engine project, was the ability to query different kinds of data sources through a uniform ANSI SQL interface. Presto, an open source distributed analytical SQL engi...
In the previous post I explained how join works from the user's point of view. Now is the right time to go one step deeper and learn how things are actually calculated. This is a very broad topic, so today we are just going to touch upon each join algorithm used in Presto, to understand what they do and when they are used.
You know already that on one axis join can be
Join execution is an independent axis. It means that t...
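As a rough illustration of what a join algorithm does, the Python sketch below contrasts two textbook strategies: a nested loop join, which compares every pair of rows, and a hash join, which builds a lookup table on one side and probes it with the other. It is a simplified single-machine model, not Presto's code; the function names and the sample `orders` and `customers` rows are assumptions made up for this example.

```python
from collections import defaultdict


def nested_loop_join(left, right, predicate):
    """Compare every left row with every right row: O(len(left) * len(right))."""
    return [(l, r) for l in left for r in right if predicate(l, r)]


def hash_join(probe, build, probe_key, build_key):
    """Equi-join: index the build side by key, then look up each probe row."""
    table = defaultdict(list)
    for row in build:
        table[build_key(row)].append(row)
    return [(p, b) for p in probe for b in table.get(probe_key(p), [])]


# Hypothetical sample data: (order_id, customer_id) and (customer_id, name).
orders = [(1, 10), (2, 20), (3, 10), (4, 30)]
customers = [(10, "alice"), (20, "bob")]

by_loop = nested_loop_join(orders, customers, lambda o, c: o[1] == c[0])
by_hash = hash_join(orders, customers, lambda o: o[1], lambda c: c[0])
print(by_loop == by_hash)
```

Both functions return the same pairs for an equality condition, but the hash join touches each row a constant number of times, while the nested loop also works for arbitrary predicates at the cost of comparing all pairs.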