Apache CarbonData Dev Mailing List archive

Re: Presto+CarbonData optimization work discussion

Posted by bhavya411 on
URL: http://apache-carbondata-dev-mailing-list-archive.168.s1.nabble.com/Presto-CarbonData-optimization-work-discussion-tp18509p18736.html

I have created a pull request 1190 for Presto Optimization where we have done following changes to improve the performance

1. Removed unnecessary loops from the integration code to make it more efficient.

2. Implemented Lazy Blocks as is being used in case of ORC.

3. Improved dictionary decoding to have better results.

I have run this on my local machine for 2 GB data and results are attached with this email, we see an improvement in almost all TPCH queries that we have run.

Thanks and regards

Bhavya

On Thu, Jul 20, 2017 at 12:21 PM, rui qin <[hidden email]> wrote:

For -- 6) spark has the vectorized feature,but not in presto.How to implement
it？

--
View this message in context: http://apache-carbondata-dev-mailing-list-archive.1130556.n5.nabble.com/Presto-CarbonData-optimization-work-discussion-tp18509p18548.html

Sent from the Apache CarbonData Dev Mailing List archive mailing list archive at Nabble.com.

PrestoQueryResults.xlsx (8K) Download Attachment