[GitHub] [carbondata] ajantha-bhat opened a new pull request #3232: [CARBONDATA-3395] Fix Exception when concurrent readers built with same split object


GitBox
ajantha-bhat opened a new pull request #3232: [CARBONDATA-3395] Fix Exception when concurrent readers built with same split object
URL: https://github.com/apache/carbondata/pull/3232
 
 
   Problem: An exception is thrown when concurrent readers are built with the same split object.
   
   Cause: In CarbonInputSplit, BlockletDetailInfo and BlockletInfo are created lazily, so BlockletInfo is prepared while the reader is being built.
   When two readers work on the same split object, each one changes the state of that object, which leads to an array out-of-bounds error.
   
   Solution:
   a) Synchronize BlockletInfo creation.
   b) Load BlockletDetailInfo inside the getSplit() API itself, before the split is passed to a reader.
   c) In the failure case, get the proper identifier to clean up the datamaps.
   d) For build_with_splits, handle filling the default projection if it is not configured.
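
   Point (a) can be illustrated with a minimal, self-contained sketch. It is not the code in this PR: SharedSplit, ConcurrentReaders, buildInfo() and buildCount are hypothetical names used only for illustration, and the int[] stands in for the real BlockletInfo. The sketch assumes the fix amounts to guarding the lazy creation so that it runs at most once per split, however many readers share it.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.atomic.AtomicInteger;

/** Hypothetical stand-in for a split whose detail info is created lazily. */
final class SharedSplit {
    // Counts how often the lazy build actually ran (demonstration only).
    final AtomicInteger buildCount = new AtomicInteger();

    // volatile so that a fully built value is visible to every thread.
    private volatile int[] blockletInfo;

    /** Returns the lazily built info, building it at most once. */
    int[] getBlockletInfo() {
        int[] local = blockletInfo;
        if (local == null) {
            synchronized (this) {
                // Re-check under the lock: another reader may have built it.
                local = blockletInfo;
                if (local == null) {
                    local = buildInfo();
                    blockletInfo = local;
                }
            }
        }
        return local;
    }

    // Placeholder for the real, more expensive preparation step.
    private int[] buildInfo() {
        buildCount.incrementAndGet();
        return new int[] {1, 2, 3};
    }
}

/** Hypothetical driver that builds several readers on one split object. */
final class ConcurrentReaders {
    /** Returns true if every reader observed the same info instance. */
    static boolean allSeeSameInfo(SharedSplit split, int readers) {
        ExecutorService pool = Executors.newFixedThreadPool(readers);
        try {
            Callable<int[]> task = split::getBlockletInfo;
            List<Future<int[]>> results = new ArrayList<>();
            for (int i = 0; i < readers; i++) {
                results.add(pool.submit(task));
            }
            int[] expected = split.getBlockletInfo();
            for (Future<int[]> result : results) {
                if (result.get() != expected) {
                    return false;
                }
            }
            return true;
        } catch (Exception e) {
            throw new IllegalStateException(e);
        } finally {
            pool.shutdown();
        }
    }
}
```

   Calling ConcurrentReaders.allSeeSameInfo(new SharedSplit(), 8) exercises eight readers against one split. Point (b) takes the complementary route of loading the detail info eagerly in getSplit(), so that readers never trigger the lazy path at all.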
   
   Be sure to do all of the following checklist to help us incorporate
   your contribution quickly and easily:
   
    - [ ] Any interfaces changed? NA
   
    - [ ] Any backward compatibility impacted? NA
   
    - [ ] Document update required? NA
   
    - [ ] Testing done. done
         
    - [ ] For large changes, please consider breaking it into sub-tasks under an umbrella JIRA. NA
   
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services