Manish Gupta created CARBONDATA-1213:
----------------------------------------
Summary: Removed rowCountPercentage check and fixed IUD data load issue
Key: CARBONDATA-1213
URL:
https://issues.apache.org/jira/browse/CARBONDATA-1213 Project: CarbonData
Issue Type: Bug
Reporter: Manish Gupta
Assignee: Manish Gupta
Fix For: 1.2.0
Problems:
1. Row count percentage not required with high cardinality threshold check
2. IUD returning incorrect results in case of update on high cardinality column
Analysis:
1. In case a column is identified as high cardinality column still it is not getting converted to no dictionary column because of another parameter check called rowCountPercentage. Default value of rowCountPercentage is 80%. Due to this even though high cardinality column is identified, if it is less than 80% of the total number of rows it will be treated as dictionary column. This can still lead to executor lost failure due to memory constraints.
2. RLE on a column is not being set correctly and due to incorrect code design RLE applicable on a column is decided by a different part of code from the one which is actually applying the RLE on a column. Because of this Footer is getting filled with incorrect RLE information and query is failing.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)