Re: Improve carbondata CDC performance
Posted by
akashrn5 on
Feb 24, 2021; 7:31am
URL: http://apache-carbondata-dev-mailing-list-archive.168.s1.nabble.com/Improve-carbondata-CDC-performance-tp106093p106440.html
Hi Venu,
Thanks for your review.
I have replied the same in the document.
you are right
1. its taken care to group by extended blocklets on split path and get the
min-max on block level
2. we need to do group by on the file path to avoid the duplicates from
dataframe output. I have updated the same in the doc please have a look.
Thanks,
Akash R
--
Sent from:
http://apache-carbondata-dev-mailing-list-archive.1130556.n5.nabble.com/