Posted by
Akash R Nilugal (Jira) on
URL: http://apache-carbondata-dev-mailing-list-archive.168.s1.nabble.com/jira-Updated-CARBONDATA-4031-Deleted-data-on-the-partition-come-back-after-inserting-overwrite-on-thn-tp101815.html
[
https://issues.apache.org/jira/browse/CARBONDATA-4031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Kejian Li updated CARBONDATA-4031:
----------------------------------
Description:
There is a table with two partitions. User deletes some records on one of partitions and then inserts overwrite the other partition. Deleted records on the previous partition comes back.
1. CREATE TABLE s_x034_carbon (guid STRING, sales_guid STRING) PARTITIONED BY (dt STRING) STORED AS carbondata;
2. load data local inpath '/home/lizi/Workspace/carbondata_test_workspace/data/s_x034_carbon-07.csv' into table s_x034_carbon;
load data local inpath '/home/lizi/Workspace/carbondata_test_workspace/data/s_x034_carbon-08.csv' into table s_x034_carbon;
3. select count(1), dt from s_x034_carbon group by dt;
4. select * from s_x034_carbon where dt=20200907 limit 5;
5. delete from s_x034_carbon where dt= 20200907 and guid='595E1862D81A09D0E1008000AC1E0124';
delete from s_x034_carbon where dt= 20200907 and guid='005056AF06441EDA89ABF853E435A6BD';
6. select count(1), dt from s_x034_carbon group by dt;
7. insert overwrite table s_x034_carbon partition (dt=20200908)
select a.guid as guid, a.sales_guid as sales_guid from s_x034_carbon a
where dt = 20200907;
8. select count(1), dt from s_x034_carbon group by dt;
was:
There is a table with two partitions. User deletes some records on one of partitions and then inserts overwrite the other partition. Deleted records on the previous partition comes back.
1. CREATE TABLE s_x034_carbon (guid STRING, sales_guid STRING) PARTITIONED BY (dt STRING) STORED AS carbondata;
2. load data local inpath '/home/lizi/Workspace/carbondata_test_workspace/data/s_x034_carbon-07.csv' into table s_x034_carbon;
load data local inpath '/home/lizi/Workspace/carbondata_test_workspace/data/s_x034_carbon-08.csv' into table s_x034_carbon;
3. select count(1), dt from s_x034_carbon group by dt;
4. select * from s_x034_carbon where dt=20200907 limit 5;
5. delete from s_x034_carbon where dt= 20200907 and guid='595E1862D81A09D0E1008000AC1E0124';
6. select count(1), dt from s_x034_carbon group by dt;
7. insert overwrite table s_x034_carbon partition (dt=20200908)
select a.guid as guid, a.sales_guid as sales_guid from s_x034_carbon a
where dt = 20200907;
8. select count(1), dt from s_x034_carbon group by dt;
> Deleted data on the partition come back after inserting overwrite on the other partition
> ----------------------------------------------------------------------------------------
>
> Key: CARBONDATA-4031
> URL:
https://issues.apache.org/jira/browse/CARBONDATA-4031> Project: CarbonData
> Issue Type: Bug
> Components: data-query
> Affects Versions: 2.0.0
> Reporter: Kejian Li
> Priority: Critical
> Fix For: 2.1.0
>
>
> There is a table with two partitions. User deletes some records on one of partitions and then inserts overwrite the other partition. Deleted records on the previous partition comes back.
> 1. CREATE TABLE s_x034_carbon (guid STRING, sales_guid STRING) PARTITIONED BY (dt STRING) STORED AS carbondata;
> 2. load data local inpath '/home/lizi/Workspace/carbondata_test_workspace/data/s_x034_carbon-07.csv' into table s_x034_carbon;
> load data local inpath '/home/lizi/Workspace/carbondata_test_workspace/data/s_x034_carbon-08.csv' into table s_x034_carbon;
> 3. select count(1), dt from s_x034_carbon group by dt;
> 4. select * from s_x034_carbon where dt=20200907 limit 5;
> 5. delete from s_x034_carbon where dt= 20200907 and guid='595E1862D81A09D0E1008000AC1E0124';
> delete from s_x034_carbon where dt= 20200907 and guid='005056AF06441EDA89ABF853E435A6BD';
> 6. select count(1), dt from s_x034_carbon group by dt;
> 7. insert overwrite table s_x034_carbon partition (dt=20200908)
> select a.guid as guid, a.sales_guid as sales_guid from s_x034_carbon a
> where dt = 20200907;
> 8. select count(1), dt from s_x034_carbon group by dt;
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)