Jacky Li created CARBONDATA-1014:
------------------------------------
Summary: Refactor on data loading and open interface for format extension
Key: CARBONDATA-1014
URL:
https://issues.apache.org/jira/browse/CARBONDATA-1014 Project: CarbonData
Issue Type: Improvement
Reporter: Jacky Li
Fix For: 1.2.0
Refactor on current data loading flow to make it:
1. Use vectorized processing as early as possible
2. Make index build (sorting) CPU cache efficient, by using rowId and key column vector to sort
3. Open interface for format extension, including column encoding, compression, statistics.
Design doc will be posted in this JIRA soon.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)