There is a cool new feature in Presto that speeds up some inequality joins.
For the past couple of months, Presto has had a new, highly efficient connector. It works by storing all data in memory on Presto worker nodes, which allows for extremely fast access times and high throughput while keeping CPU overhead to a bare minimum.
I have wanted to experiment with Java for a long time to find out whether or not it can take advantage of Single Instruction, Multiple Data (SIMD) instructions to speed up CPU-intensive computations. I found very little information while I was researching this, so I decided to share my own findings.
One could say that Presto is an MPP (Massively Parallel Processing) application. Then again, I have never seen a data warehouse that did not follow this approach: Teradata, Netezza, Vertica, even Hive, and many more all belong to this class of software. The approach is typical not only of data warehouses but of any distributed application that processes vast amounts of data and performs non-trivial, very costly computations on it.
This section collects utility tools that come in handy when working with Presto. It will include things like:
utility tools for working with Presto, like this new SQL query formatter. Their aim is to make working with Presto easier.
example implementations of things that can be injected into Presto through the plugin mechanism (SPI), such as a connector, ...