[jira] [Created] (CARBONDATA-1014) Refactor on data loading and open interface for format extension

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (CARBONDATA-1014) Refactor on data loading and open interface for format extension

Akash R Nilugal (Jira)
Jacky Li created CARBONDATA-1014:
------------------------------------

             Summary: Refactor on data loading and open interface for format extension
                 Key: CARBONDATA-1014
                 URL: https://issues.apache.org/jira/browse/CARBONDATA-1014
             Project: CarbonData
          Issue Type: Improvement
            Reporter: Jacky Li
             Fix For: 1.2.0


Refactor on current data loading flow to make it:
1. Use vectorized processing as early as possible
2. Make index build (sorting) CPU cache efficient, by using rowId and key column vector to sort
3. Open interface for format extension, including column encoding, compression, statistics.

Design doc will be posted in this JIRA soon.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)