Login  Register

Re: CarbonData File Deletion Hotfix

Posted by BrooksLi on Sep 16, 2020; 2:55am
URL: http://apache-carbondata-dev-mailing-list-archive.168.s1.nabble.com/CarbonData-File-Deletion-Hotfix-tp100384p100452.html

Hello March,

I agree to take a hotfix for data deletion in loading and compaction flow,
+1.  

Deleting the INSERT_IN_PROGERSS and INSERT_OVERWRITE_IN_PROGRESS is a
dangerous activity, so these two kinds of segments should not be
automatically deleted.

As for MARKED_FOR_DELETE and COMPACTED status segments, these are stale
segments, but we can keep them in the file system until the user/admin calls
clean file action manually.  Since the deletion requires the precision of
the table status.

So my opinion is to remove all the automatic clean steps in
loading/compaction flow first to protect the data from being deleted
accidentally.



--
Sent from: http://apache-carbondata-dev-mailing-list-archive.1130556.n5.nabble.com/