Re: Improve carbondata CDC performance
Posted by akashrn5 on Feb 19, 2021; 7:08am
URL: http://apache-carbondata-dev-mailing-list-archive.168.s1.nabble.com/Improve-carbondata-CDC-performance-tp106093p106347.html
Hi David,
Thanks for your suggestion.
I ran the query you suggested locally; it executes as a
*BroadcastNestedLoopJoin*.
Because the local dataset is small, the planner chooses that join, but on a
cluster, as the data size grows, it falls back to a cartesian product again.
How about implementing our own distributed search logic using an interval
tree data structure? It should be faster and won't have much impact.
Others, please share your suggestions.
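
To make the interval tree idea concrete, here is a minimal sketch, not the
actual CarbonData design. It assumes each target block exposes a [low, high]
range of the join key and that we want every block whose range contains a
given source key. The names Node, build, stab and the "block-N" labels are
hypothetical and used only for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Node:
    low: int                      # smallest key in the block (assumed known)
    high: int                     # largest key in the block (assumed known)
    label: str                    # hypothetical block identifier
    max_high: int                 # largest `high` anywhere in this subtree
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def build(intervals: List[Tuple[int, int, str]]) -> Optional[Node]:
    """Build a balanced tree, ordered by `low`, from (low, high, label) tuples."""
    def _build(items):
        if not items:
            return None
        mid = len(items) // 2
        low, high, label = items[mid]
        node = Node(low, high, label, high)
        node.left = _build(items[:mid])
        node.right = _build(items[mid + 1:])
        # Augment each node with the maximum `high` of its subtree.
        for child in (node.left, node.right):
            if child is not None:
                node.max_high = max(node.max_high, child.max_high)
        return node
    return _build(sorted(intervals))


def stab(node: Optional[Node], key: int, out: Optional[List[str]] = None) -> List[str]:
    """Return labels of all intervals that contain `key`."""
    if out is None:
        out = []
    # Nothing in this subtree ends at or after `key`, so prune it.
    if node is None or key > node.max_high:
        return out
    stab(node.left, key, out)
    if node.low <= key <= node.high:
        out.append(node.label)
    # Every interval in the right subtree starts at or after node.low,
    # so it can only contain `key` when key >= node.low.
    if key >= node.low:
        stab(node.right, key, out)
    return out


blocks = [(0, 99, "block-0"), (50, 149, "block-1"),
          (150, 249, "block-2"), (300, 399, "block-3")]
tree = build(blocks)
print(stab(tree, 75))    # ['block-0', 'block-1']
print(stab(tree, 275))   # []
```

In a distributed setting, one option (again an assumption) is to build such a
tree once from the block ranges and ship it to every task, so each source key
is checked only against the blocks that can contain it rather than against
every row of the target.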
Thanks
Regards,
Akash R