[ https://issues.apache.org/jira/browse/CARBONDATA-4062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Cai updated CARBONDATA-4062: ---------------------------------- Description: To prevent accidental deletion of data, carbon will introduce data trash management. It will provide buffer time for accidental deletion of data to roll back the delete operation. Data trash management is a part of carbon data lifecycle management. Clean files as a data trash manager should contain the following two parts. part 1: manage metadata-indexed data trash. This data is at the original place of the table and indexed by metadata. carbon manages this data by metadata index and should avoid using listFile() interface. part 2: manage ".Trash" folder. Now ".Trash" folder is without metadata index, and the operation on it bases on timestamp and listFile() interface. In the future, carbon will index ".Trash" folder to improve data trash management. was: To prevent accidental deletion of data, carbon will introduce data trash management. It will provide buffer time for accidental deletion of data to roll back the delete operation. Data trash management is a part of carbon data lifecycle management. Clean files as a data trash manager should contain the following two parts. part 1: manage metadata-indexed data trash. this data should be at the original place part 2: manage ".Trash" folder. Now this ".Trash" folder is without metadata index, and the operation on it will depend on timestamp and listFile interface. It should be improve in the future. > Should make clean files become data trash manager > ------------------------------------------------- > > Key: CARBONDATA-4062 > URL: https://issues.apache.org/jira/browse/CARBONDATA-4062 > Project: CarbonData > Issue Type: Improvement > Reporter: David Cai > Priority: Major > > To prevent accidental deletion of data, carbon will introduce data trash management. It will provide buffer time for accidental deletion of data to roll back the delete operation. > Data trash management is a part of carbon data lifecycle management. Clean files as a data trash manager should contain the following two parts. > part 1: manage metadata-indexed data trash. > This data is at the original place of the table and indexed by metadata. carbon manages this data by metadata index and should avoid using listFile() interface. > part 2: manage ".Trash" folder. > Now ".Trash" folder is without metadata index, and the operation on it bases on timestamp and listFile() interface. In the future, carbon will index ".Trash" folder to improve data trash management. -- This message was sent by Atlassian Jira (v8.3.4#803005) |
Free forum by Nabble | Edit this page |