GitHub user ravipesala opened a pull request:
https://github.com/apache/carbondata/pull/1700 [CARBONDATA-1860][PARTITION] Support insertoverwrite for a specific partition.
This PR depends on
https://github.com/apache/carbondata/pull/1672 and
https://github.com/apache/carbondata/pull/1674 User should able to overwrite partition for a specific partition. Like
INSERT OVERWRITE TABLE partitioned_user
PARTITION (country = 'US')
SELECT * FROM another_user au
WHERE au.country = 'US';
In the above example, the user can overwrite only the partition(country = 'US') data. So remaining partitions data would be intact.
While overwriting a specific partition carbon should first load data to the new segment and drop that partition from all remaining segments using partition.map file.
Be sure to do all of the following checklist to help us incorporate
your contribution quickly and easily:
- [X] Any interfaces changed? NO
- [X] Any backward compatibility impacted? NO
- [X] Document update required? YES
- [X] Testing done
Tests added
- [X] For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.
You can merge this pull request into a Git repository by running:
$ git pull
https://github.com/ravipesala/incubator-carbondata partition-overwrite2
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/carbondata/pull/1700.patchTo close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #1700
----
commit 32e23c7e0d1dfb0435ae70b6d1311e68cec4c615
Author: ravipesala <ravi.pesala@...>
Date: 2017-12-19T07:49:15Z
Support insert overwrite partition
commit 9f0b7d8b1d28cd452633762057d8c7204765e816
Author: ravipesala <ravi.pesala@...>
Date: 2017-12-20T18:07:30Z
handle comments
----
---