Re: Improve carbondata CDC performance

Posted by akashrn5 on Feb 24, 2021; 7:31am
URL: http://apache-carbondata-dev-mailing-list-archive.168.s1.nabble.com/Improve-carbondata-CDC-performance-tp106093p106440.html

Hi Venu,

Thanks for your review.

I have replied to the same points in the document.
You are right.

1. This is taken care of: the extended blocklets are grouped by split path,
and the min-max is computed at block level.
2. We need to group by the file path to avoid duplicates in the
dataframe output. I have updated the doc accordingly; please have a look.
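
To illustrate both points, here is a rough sketch in plain Python, not the actual CarbonData code. The function name, the (path, min, max) tuple layout and the file names are all made up for the example; it only shows how grouping blocklet entries by file path yields one block-level min-max per file and removes duplicate paths at the same time.

```python
from collections import defaultdict


def block_level_min_max(blocklets):
    """Collapse blocklet-level (path, min, max) entries into one entry per file path.

    `blocklets` is a hypothetical list of tuples; the real extended blocklet
    objects in CarbonData carry more information than this.
    """
    grouped = defaultdict(list)
    for path, lo, hi in blocklets:
        grouped[path].append((lo, hi))
    # One entry per path: smallest min and largest max over its blocklets.
    return {
        path: (min(lo for lo, _ in ranges), max(hi for _, hi in ranges))
        for path, ranges in grouped.items()
    }


# Hypothetical input: two blocklets from the same file, one from another file.
blocklets = [
    ("part-0-0.carbondata", 10, 40),
    ("part-0-0.carbondata", 35, 90),
    ("part-0-1.carbondata", 5, 20),
]
result = block_level_min_max(blocklets)
```

Because the result is keyed by file path, a path that appears several times in the input shows up only once in the output.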

Thanks,
Akash R