http://apache-carbondata-dev-mailing-list-archive.168.s1.nabble.com/jira-Commented-CARBONDATA-4031-Query-result-is-incorrect-after-Delete-and-Insert-overwrite-tp104750.html
I just tested the same scenario as per your steps in the latest master code.
the behaviour looks to be ok. Please check and confirm.
when we load the data.
When we insert overwrite from deleted partition. the deleted entries not came.
> Query result is incorrect after Delete and Insert overwrite
> -----------------------------------------------------------
>
> Key: CARBONDATA-4031
> URL:
https://issues.apache.org/jira/browse/CARBONDATA-4031> Project: CarbonData
> Issue Type: Bug
> Components: data-query
> Affects Versions: 2.0.0
> Reporter: Kejian Li
> Priority: Critical
> Fix For: 2.1.1
>
> Attachments: s_x034_carbon-07.csv, s_x034_carbon-08.csv
>
> Time Spent: 6h
> Remaining Estimate: 0h
>
> There is a table with two partitions. User deletes some records on one of partitions and then inserts overwrite the other partition. Deleted records on the previous partition comes back.
> 1. CREATE TABLE s_x034_carbon (guid STRING, sales_guid STRING) PARTITIONED BY (dt STRING) STORED AS carbondata;
> 2. load data local inpath '/home/lizi/Workspace/carbondata_test_workspace/data/s_x034_carbon-07.csv' into table s_x034_carbon;
> load data local inpath '/home/lizi/Workspace/carbondata_test_workspace/data/s_x034_carbon-08.csv' into table s_x034_carbon;
> 3. select count(1), dt from s_x034_carbon group by dt;
> 4. select * from s_x034_carbon where dt=20200907 limit 5;
> 5. delete from s_x034_carbon where dt= 20200907 and guid='595E1862D81A09D0E1008000AC1E0124';
> delete from s_x034_carbon where dt= 20200907 and guid='005056AF06441EDA89ABF853E435A6BD';
> 6. select count(1), dt from s_x034_carbon group by dt;
> 7. insert overwrite table s_x034_carbon partition (dt=20200908)
> select a.guid as guid, a.sales_guid as sales_guid from s_x034_carbon a
> where dt = 20200907;
> 8. select count(1), dt from s_x034_carbon group by dt;
>