[GitHub] [carbondata] ajantha-bhat opened a new pull request #3380: [CARBONDATA-3515] Limit local dictionary size to 10% of configured blockletsize

GitBox
ajantha-bhat opened a new pull request #3380: [CARBONDATA-3515] Limit local dictionary size to 10% of configured blockletsize
URL: https://github.com/apache/carbondata/pull/3380
 
 
   **problem:** Currently the maximum local dictionary size is 2 GB, so for varchar columns or long string columns the local dictionary can grow to 2 GB. Because the local dictionary is stored in the blocklet, the blocklet size can exceed 2 GB even though the configured maximum blocklet size is 64 MB. In some places this causes integer overflow during casting.
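
To illustrate the kind of overflow mentioned above, here is a minimal, self-contained sketch. It is not taken from the CarbonData code base: the class and method names are hypothetical, and it assumes the overflow comes from narrowing a `long` byte count to `int`, which is one common way such a cast goes wrong.

```java
// Hypothetical illustration only; not CarbonData code.
class CastOverflowDemo {

    // A plain narrowing cast keeps only the low 32 bits of the long value.
    static int narrow(long sizeInBytes) {
        return (int) sizeInBytes;
    }

    // Math.toIntExact throws ArithmeticException instead of silently wrapping.
    static boolean fitsInInt(long sizeInBytes) {
        try {
            Math.toIntExact(sizeInBytes);
            return true;
        } catch (ArithmeticException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        long threeGb = 3L * 1024 * 1024 * 1024; // 3221225472 bytes
        System.out.println(narrow(threeGb));    // wraps to a negative number
        System.out.println(fitsInInt(threeGb)); // false
    }
}
```

A size above `Integer.MAX_VALUE` (about 2 GB) cannot be represented as an `int`, so any code path that casts such a size silently produces a wrong, possibly negative, value.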
   
   **solution:** Limit the local dictionary size to 10% of the maximum allowed blocklet size.
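
As a rough sketch of the arithmetic behind this limit, the following shows how a 10% threshold could be derived from a blocklet size given in MB. The class name, method names, and the idea of passing the blocklet size as an `int` number of megabytes are assumptions made for illustration; they are not the identifiers used in the actual patch.

```java
// Hypothetical sketch of the 10% rule; names are not from the CarbonData patch.
class LocalDictionaryLimit {

    static final long BYTES_PER_MB = 1024L * 1024L;

    // Returns 10% of the configured blocklet size, in bytes.
    // The multiplication is done in long arithmetic to avoid int overflow.
    static long maxLocalDictionaryBytes(int blockletSizeInMb) {
        if (blockletSizeInMb <= 0) {
            throw new IllegalArgumentException("blocklet size must be positive");
        }
        return blockletSizeInMb * BYTES_PER_MB / 10;
    }

    // True when the dictionary has grown past the 10% threshold.
    static boolean exceedsLimit(long currentDictionaryBytes, int blockletSizeInMb) {
        return currentDictionaryBytes > maxLocalDictionaryBytes(blockletSizeInMb);
    }

    public static void main(String[] args) {
        // For a 64 MB blocklet the threshold is roughly 6.4 MB.
        System.out.println(maxLocalDictionaryBytes(64));
    }
}
```

With the 64 MB blocklet size mentioned in the problem statement, the threshold works out to about 6.4 MB, far below the previous 2 GB ceiling.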
   
   Be sure to do all of the following checklist to help us incorporate
   your contribution quickly and easily:
   
    - [ ] Any interfaces changed? NA
   
    - [ ] Any backward compatibility impacted? NA
   
    - [ ] Document update required? NA
   
    - [ ] Testing done. done
         
    - [ ] For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.  NA
   
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services