[ https://issues.apache.org/jira/browse/CARBONDATA-4031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17249644#comment-17249644 ] Mahesh Raju Somalaraju commented on CARBONDATA-4031: ---------------------------------------------------- hi kejian Le, I just tested the same scenario as per your steps in the latest master code. the behaviour looks to be ok. Please check and confirm. when we load the data. +--------+--------+ |count(1)| dt| +--------+--------+ | 100000|20200908| | 100000|20200907| +--------+--------+ when we delete two records in single partition[20200907]: +--------+--------+ |count(1)| dt| +--------+--------+ | 100000|20200908| | 99998|20200907| +--------+--------+ When we insert overwrite from deleted partition. the deleted entries not came. *please check it anything I missed over here ?* +--------+--------+ |count(1)| dt| +--------+--------+ | 99998|20200908| | 99998|20200907| +--------+--------+ > Query result is incorrect after Delete and Insert overwrite > ----------------------------------------------------------- > > Key: CARBONDATA-4031 > URL: https://issues.apache.org/jira/browse/CARBONDATA-4031 > Project: CarbonData > Issue Type: Bug > Components: data-query > Affects Versions: 2.0.0 > Reporter: Kejian Li > Priority: Critical > Fix For: 2.1.1 > > Attachments: s_x034_carbon-07.csv, s_x034_carbon-08.csv > > Time Spent: 6h > Remaining Estimate: 0h > > There is a table with two partitions. User deletes some records on one of partitions and then inserts overwrite the other partition. Deleted records on the previous partition comes back. > 1. CREATE TABLE s_x034_carbon (guid STRING, sales_guid STRING) PARTITIONED BY (dt STRING) STORED AS carbondata; > 2. load data local inpath '/home/lizi/Workspace/carbondata_test_workspace/data/s_x034_carbon-07.csv' into table s_x034_carbon; > load data local inpath '/home/lizi/Workspace/carbondata_test_workspace/data/s_x034_carbon-08.csv' into table s_x034_carbon; > 3. select count(1), dt from s_x034_carbon group by dt; > 4. select * from s_x034_carbon where dt=20200907 limit 5; > 5. delete from s_x034_carbon where dt= 20200907 and guid='595E1862D81A09D0E1008000AC1E0124'; > delete from s_x034_carbon where dt= 20200907 and guid='005056AF06441EDA89ABF853E435A6BD'; > 6. select count(1), dt from s_x034_carbon group by dt; > 7. insert overwrite table s_x034_carbon partition (dt=20200908) > select a.guid as guid, a.sales_guid as sales_guid from s_x034_carbon a > where dt = 20200907; > 8. select count(1), dt from s_x034_carbon group by dt; > -- This message was sent by Atlassian Jira (v8.3.4#803005) |
Free forum by Nabble | Edit this page |